This event has ended. Visit the official site or create your own event on Sched.
View analytic
Tuesday, July 12 • 09:00 - 09:40
Incident Response @ FB, Facebook's SEV process

Sign up or log in to save this to your schedule and see who's attending!

Facebook is famous for our MOVE FAST AND BREAK THINGS motto. An important part of MOVING FAST while sustaining reliable systems is to FAIL FAST. This talk presents Facebook's strategy for Incident Response & Root Cause Analysis called the *Site Event (SEV) Process*. We'll describe everything from Incident Triage to Remediation paying special attention our desire fix things quickly and working to avoid having the same outage twice.

avatar for Gareth Eason

Gareth Eason

Technical Program Manager, Facebook
Gareth works as a Technical Program Manager with Facebook, focusing on designing and building their growing global network and content delivery infrastructure. Combining experience of systems architecture with networking and telecoms, Gareth has worked with Nokia, Cable & Wireless, HEAnet and Google. Come ask about Linux systems, care and feeding of large systems, CDN infrastructure or the successful use of Raspberry Pis for things they were... Read More →

Tuesday July 12, 2016 09:00 - 09:40

Attendees (32)