CoreOS Fest Berlin, the Open Source Distributed Systems Conference, brings together experts from all over the world to discuss container technologies, distributed systems and open source software to help take the industry to the next level in distributed computing.

The 2016 event will bring together 500+ developers, DevOps professionals and sysadmins to explore the next steps in making container technologies successful and secure.

Learn more at coreos.com/fest 

Tuesday, May 10 • 11:00 - 11:40
Scaling fleet Clusters with gRPC

Fleet is a distributed init system and one of the main building blocks of our architecture: we use it to schedule both infrastructure and user services on the clusters of our microservices infrastructures. Fleet has seen production use for some time and is largely considered stable. Its versatility and simplicity make it a distributed orchestration system on which services, in the form of systemd units (optionally running containers), can easily be deployed on a cluster. However, fleet can run into scalability issues in large-scale infrastructures. In particular, we ran into these issues when fleet had to coordinate many nodes and/or thousands of deployed units in a cluster. Thus, we set out to improve the scalability of this distributed init system. Through the (optional) use of gRPC for communication between the fleet nodes, and by reducing etcd load through storing transient data in memory, we were able to improve fleet's performance and stability significantly. This talk goes into the details of our design decisions and the implementation of these performance-improving changes, which we are currently working on merging upstream.

Speakers
Hector Fernandez

Engineer, Giant Swarm GmbH
Héctor Fernández is a DevOps engineer at Giant Swarm, a German startup that offers simple microservices infrastructures for hosting containerized microservices. At Giant Swarm, Héctor focuses on monitoring, performance, and improving Giant Swarm's infrastructure. Over the last years, he worked...


Tuesday May 10, 2016 11:00 - 11:40 CEST
B09