Loading…
Monday, August 17 • 12:22 - 12:28
Zero-Downtime Multi-Cluster Kubernetes Platform Upgrades - Asaf Erlich & Jonathan Alaimo, Groupon

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

Groupon runs Kubernetes clusters on thousands of hosts for its engineers, auditors, and automated systems. Traffic from millions of Groupon customers flows through Kubernetes. Taking downtime for any reason, especially self-inflicted, is detrimental to the business' brand and bottom line. To provide zero downtime during platform upgrades, a new cluster is created, validated, and traffic is migrated slowly to it. In addition to Kubernetes releases, platform upgrades include any change made to key software components, etcd, or the underlying cloud infrastructure it runs on. Join Asaf and Jonathan as they discuss how Groupon architects for and executes zero-downtime multi-cluster platform upgrades. Other topics include trade-offs with single-cluster solutions and lessons learned along the way with open-source tools and cloud providers.

Speakers
avatar for Jonathan Alaimo

Jonathan Alaimo

Software Engineer, Groupon
Jonathan is a software engineer at Groupon working on container orchestration, cloud security, and infrastructure automation. He has more than 15 years of industry experience in e-commerce, IOT services, and consumer electronics. Before Groupon, he worked on the Bluzone cloud at Bluvision... Read More →
avatar for Asaf Erlich

Asaf Erlich

Software Engineer, Groupon
Asaf is a software engineer with 9 years of experience, most of which were on platforms or tools that aid other software engineers to deliver software. He currently works for Groupon helping to automate and maintain multiple Kubernetes clusters built on top of AWS infrastructure... Read More →


Monday August 17, 2020 12:22 - 12:28 BST