Twitter and the JVM
Oracle, November 5, 2012
Brandon Mitchell
Twitter Inc.
Marius Eriksen
Twitter Inc.
Brandon Mitchell
Twitter Inc.
Marius Eriksen
Twitter Inc.
The JVM is used throughout
Systems are both in Scala and Java; most are using ParNew+CMS.
We have a VM team (Brandon, Kaushik) contributing to OpenJDK.
Garbage collection, garbage collection, garbage collection, garbage collection, garbage collection, garbage collection, garbage collection, garbage collecton, garbage collection, garbage collection, garbage collection, garbage collection, garbage collection, garbage collection, garbage collection, garbage collection, garbage collection, garbage collection, garbage collecton, garbage collection, garbage collection, garbage collection, garbage collection, garbage collection, garbage collection, garbage collection, garbage collection, garbage collection, garbage collection, garbage collecton, garbage collection, garbage collection, garbage collection, garbage collection, garbage collection, garbage collection, garbage collection, garbage collection, garbage collection, garbage collection, garbage collection, garbage collection, garbage collection, garbage collecton, garbage collection, garbage collection, garbage collection, garbage collection
We have good throughput (GC overhead is usually .2-.5%)
But tail latencies are very important. Effect is compounded in distributed systems.
Most of our systems are stateless.
Google-perftools compatible tooling
Heap profilers
github.com/mariusaeriksen/heapster
CPU, contention prolfilers
github.com/twitter/util/tree/master/util-jvm
Accessible via HTTP on production systems.
Still experimental:
RPC: building in GC avoidance
Speculative execution.
To minimize tail latency effects, we run with the largest heaps possible.
But we want to share, to colocate (see: Mesos). Our cluster scheduler should be able to make latency-heap-size tradeoffs (eg. to do better binpacking or to oversubscribe).
-Xmx
)