Capacity Planning
Monday, July 11
 

14:20 IST

Flash Sale Engineering
From stores with ads in the Super Bowl to selling Kanye’s latest album, Shopify has built a name for itself handling some of the world’s largest flash sales. These high profile events generate write-heavy traffic that can be four times our platform’s baseline throughput and don’t lend themselves to off-the-shelf solutions.

This talk is the story of how we engineered our platform to survive large bursts of traffic. Since it’s not financially sound for Shopify to have the required capacity always running, we built queueing and page caching layers into our Nginx load balancers with Lua. To guarantee these solutions worked, we tested them with a purpose-built load testing service.

Although flash sales are unique to commerce platforms, the lessons we learn from them apply to any service that experiences bursts of traffic.

Speakers
Emil Stolarsky

Production Engineer, Shopify
Emil is a production engineer at Shopify where he works on performance, scriptable load balancers, and DNS tooling. When he's not trying to make Shopify's global performance heat map green, he's shivering over a spiked cup of coffee in the great Canadian north.


Monday July 11, 2016 14:20 - 14:40 IST
Lansdowne

14:40 IST

Managing Up and Sideways as an SRE
Ever have a bad manager? Or have a project go off the rails but feel powerless to stop the trainwreck? I'll talk about why knowing a little bit about management can help you as an individual contributor or tech lead, and about a few ways that you can help yourself and your SRE team without ever formally becoming a manager yourself.

Speakers
Liz Fong-Jones

Developer Advocate, Activist, and Site Reliability Engineer, Google
Liz is a Staff Site Reliability Engineer at Google and works on the Google Cloud Customer Reliability Engineering team in New York. She lives with her wife, metamour, and two Samoyeds in Brooklyn. In her spare time, she plays classical piano, leads an EVE Online alliance, and advocates...


Monday July 11, 2016 14:40 - 15:00 IST
Lansdowne

15:40 IST

Capacity Planning at Scale
Have you ever bought machines? What if you even need to build datacenters? How can you predict how many you are going to need two years from now? How can you make efficient use of all the resources you suddenly got? What if you are missing some resources? Can we automate all of this and integrate it with our continuous delivery?

These are just a few of the questions that anyone planning a large computer fleet always asks. This talk will cover some of the approaches and tooling that can be used to effectively plan for the demand of services and how to cover it in the most efficient manner.

Speakers
Ramón Medrano Llamas

Senior Site Reliability Engineer, Google


Monday July 11, 2016 15:40 - 16:20 IST
Lansdowne

16:20 IST

Load Shedding—Approaches, Principles, Experiences, and Impact in Service Management
This talk covers the experience gained in developing load-shedding solutions and their impact on service management at large scale.

Speakers
Acacio Cruz

Director - Frameworks & Production Platforms, Google
Acacio has been an SRE manager since 2007, and manager of Google's Load-shedding & Traffic Management team since 2009. He is now a SWE Director in Frameworks and Software Infrastructure.


Monday July 11, 2016 16:20 - 17:00 IST
Lansdowne
 