Loading…
This event has ended. Visit the official site or create your own event on Sched.
Workshop [clear filter]
Monday, July 11
 

11:00 IST

Data Center Networks: The Rip van Winkle Edition
Limited Capacity seats available

If Rip Van Winkle had gone to sleep around 2006 and woken up 10 years later, he'd find the world a strange brew of the new and the old. He'd be amazed that phones had grown a brain, dismayed that a most excellent rendition of the Dark Knight had wandered back to the wasteland as most Dark Knight capers do. People had warmed upto electric cars, but not to climate change. And, if Ol' Rip were a network operations guy at some of the large webscale companies, he might think he'd died and woken up in heaven. Networks were no longer slow as molasses: to deploy, manage and upgrade. He'd find some things had stayed the same (IPv4 still ruled the roost), and some others not so much. He would be puzzled by the terminology and the discussions as he wandered the hallways. SDN, Open networking, Openflow, microservices, Ansible, Puppet, Kubernetes, and so on.

This tutorial is an attempt to bring folks up to speed on whats happened with networking in the past 10 years or so, especially in the data center, concluding with some thoughts on why exciting times lie ahead. The talk will be roughly divided into the following sections:

  1. Who Moved My Network ? What's causing all this turmoil in networking
  2. Solutions: Requirements, Terminology, Pros and Cons
  3. Changing Landscape: Network Topologies
  4. Changing Foundation: Network Protocols
  5. Changing Operations: Modern Operations
  6. Changing Residents: Modern applications and their implications on networks
  7. Reading Tea Leaves

The tutorial will include demos and hands on work with some modern tools.

The audience is expected to be aware of basic networking (bridging, routing, broadcast, multicast etc.).

The key takeways from this talk will be:

  • An understanding of the forces behind the changes in data center networking
  • The morphology an physiology of modern DC networks
  • What these changes presage of the future

Some preliminary ideas for hands on work:

  • Build multi-host container network
  • Build and configure a nxm CLOS topology with BGP
  • Design a CLOS for x number of servers given certain box specifications

Speakers

Monday July 11, 2016 11:00 - 17:00 IST
Ulster

11:00 IST

Staring into the eBPF Abyss
Limited Capacity seats available

eBPF (extended Berkeley Packet Filters) is a modern kernel technology that can be used to introduce dynamic tracing into a system that wasn't prepared or instrumented in any way. The tracing programs run in the kernel, are guaranteed to never crash or hang your system, and can probe every module and function -- from the kernel to user-space frameworks such as Node and Ruby.

In this workshop, you will experiment with Linux dynamic tracing first-hand. First, you will explore BCC, the BPF Compiler Collection, which is a set of tools and libraries for dynamic tracing. Many of your tracing needs will be answered by BCC, and you will experiment with memory leak analysis, generic function tracing, kernel tracepoints, static tracepoints in user-space programs, and the "baked" tools for file I/O, network, and CPU analysis. You'll be able to choose between working on a set of hands-on labs prepared by the instructors, or trying the tools out on your own test system.

Next, you will hack on some of the bleeding edge tools in the BCC toolkit, and build a couple of simple tools of your own. You'll be able to pick from a curated list of GitHub issues for the BCC project, a set of hands-on labs with known "school solutions", and an open-ended list of problems that need tools for effective analysis. At the end of this workshop, you will be equipped with a toolbox for diagnosing issues in the field, as well as a framework for building your own tools when the generic ones do not suffice.


Speakers
avatar for Sasha Goldshtein

Sasha Goldshtein

CTO, Sela Group
Sasha Goldshtein is the CTO of Sela Group, a Microsoft Regional Director and MVP, Pluralsight and O’Reilly author, and international consultant and trainer. Sasha is the author of two books and multiple online courses, and a prolific blogger. He is also an active open source contributor... Read More →


Monday July 11, 2016 11:00 - 17:00 IST
Munster
 
Tuesday, July 12
 

09:00 IST

Accident Models in Post Mortems
Limited Capacity seats available

Many organizations want to learn from failures. Postmortem debriefings and documents are a part of that learning process. In this two part session, we will cover the theory and fundamentals on complex systems failure and “human error”, as well as techniques for facilitating an adverse event debriefing. Attendees should walk away with a more evolved sense of accident/outage investigation and a model to explore in their own organizations.

Speakers
avatar for Will Gallego

Will Gallego

Software Engineer, Fastly
Will Gallego is a systems engineer with 15+ years of experience in the web development field, currently as a Senior Software Engineer at Fastly. Comfortable with several parts of the stack, he focuses now on building scalable, distributed backend systems and tools to help engineers... Read More →
avatar for Miriam Lauter

Miriam Lauter

Software Engineer, Etsy
I'm a software engineer on Etsy's payments team and a summer 2014 Recurse Center alum. Outside work, I'm an avid rock climber and 99pi podcast listener.


Tuesday July 12, 2016 09:00 - 10:20 IST
Munster

09:00 IST

Statistics for Engineers
Limited Capacity seats available

Gathering telemetry data is key to operating reliable distributed systems at scale. Once you have set up your monitoring systems and recorded all relevant data, the challenge becomes to make sense of it and extract valuable information, like:
  • Is the system down?
  • Is user experience degraded for some percentage of our customers?
  • How did our query response times change with the last update?
Statistics is the art of extracting information from data. In this tutorial, we address the basic statistical knowledge that helps you at your daily work as an SRE. We will cover probabilistic models, summarizing distributions with mean values, quantiles, and histograms and their relations.

The tutorial focuses on practical aspects, and will give you hands-on knowledge of how to handle, import, analyze, and visualize telemetry data with UNIX tools and the IPython toolkit.

This tutorial has been given at several occasions over the last year and has been refined and extended since, cf. Twitter #StatsForEngineers

Speakers
avatar for Heinrich Hartmann

Heinrich Hartmann

Analytics Lead, Circonus
Heinrich Hartmann is the Analytics Lead at Circonus. He is driving the development of analytics methods that transform monitoring data into actionable information as part of the Circonus monitoring platform. In his prior life, Heinrich pursued an academic career as a mathematician... Read More →


Tuesday July 12, 2016 09:00 - 12:20 IST
Ulster

11:00 IST

Post Mortem Facilitation
Limited Capacity seats available

Speakers
avatar for Will Gallego

Will Gallego

Software Engineer, Fastly
Will Gallego is a systems engineer with 15+ years of experience in the web development field, currently as a Senior Software Engineer at Fastly. Comfortable with several parts of the stack, he focuses now on building scalable, distributed backend systems and tools to help engineers... Read More →
avatar for Miriam Lauter

Miriam Lauter

Software Engineer, Etsy
I'm a software engineer on Etsy's payments team and a summer 2014 Recurse Center alum. Outside work, I'm an avid rock climber and 99pi podcast listener.


Tuesday July 12, 2016 11:00 - 12:20 IST
Munster

13:40 IST

Effective Design Review Participation
Limited Capacity seats available

This workshop is a part of the "full lifecycle" workshop track which includes Post-Mortems, Incident Response, and Effective Design Review Participation. Using several example cases, participants in this session will learn to apply a variety of different points of view to analyze a design for issues which could affect its reliability and operability.

The sample designs and play list can be found at https://goo.gl/VIiN6i - now updated with the comments and suggestions that came in during the workshop.


Speakers
avatar for Kurt Andersen

Kurt Andersen

Program Committee, LinkedIn
Kurt Andersen was one of the co-chairs for SREcon-Americas in 2017 and 2018. He has been active in the anti-abuse community for over 20 years and is currently the senior IC for the Product SRE team at LinkedIn. He also works as one of the Program Committee Chairs for the Messaging... Read More →



Tuesday July 12, 2016 13:40 - 15:00 IST
Munster

13:40 IST

DivOps, Continuous Diversity at Scale
Limited Capacity seats available

This tutorial/workshop is aimed at management and individual contributors alike. We will work together on how to encourage and nurture a diversity culture in day-to-day ops teams. First we will discuss the concepts of 2- and 3-dimensional diversity, and the statistics around diverse teams performance. Then we will map out how to design, build, deploy and operate a diversity plan in our teams. This will include diversity goal setting and explicit cultural evolution, hiring processes, day to day communications, review process and team collaboration. Where possible we will encourage groups to break out and evaluate their own cultures and processes. 

Speakers

Tuesday July 12, 2016 13:40 - 17:00 IST
Ulster

15:40 IST

Practical Incident Response
Limited Capacity seats available

This workshop is structured as a fast-moving but fun game (think fluxx crossed with a hectic oncall shift) but the subject matter is entirely serious: we will use it to explore best practices and pitfalls for managing incidents as a team. You will work as part of a team managing a production outage: we'll go through the entire process from detection of the incident, problem diagnosis, mitigation, and resolution, finishing with the first draft of the postmortem.

Speakers
LN

Laura Nolan

Google LLC
I am a SRE and tech lead at Google, working in our ads data infrastructure. I presented a workshop and a talk at SRECon Europe 2015, and have presented in workshops at other USENIX conferences (LISA, federated conferences) and FLOSS UK.


Tuesday July 12, 2016 15:40 - 17:00 IST
Munster
 
Wednesday, July 13
 

09:00 IST

Distributed Log-Processing Design Workshop
Limited Capacity seats available

Participants will have the opportunity to try their hand on designing a reliable, distributed, multi-datacenter near-real-time log processing system.

The session will start with a short presentation on lessons learned about designing reliable distributed systems, and then participants will break out in small groups, assisted by Google facilitators, and try their hand at solving a real-world design challenge, from high-level architecture down to an estimate of the computing resources required to run the service.

The session will likely appeal to experienced engineers who want to have fun tackling a real-world design problem faced by many teams in Google.


Speakers
avatar for Andrea Spadaccini

Andrea Spadaccini

SRE Manager, Google


Wednesday July 13, 2016 09:00 - 12:40 IST
Munster

09:00 IST

Docker From Scratch
Limited Capacity seats available

Docker is very popular these days, but how many of us are really familiar with the basic building blocks of Linux containers and their implications? What's missing in the good ol’ chroot jails? What are the available Copy-on-Write options and what are their pros and cons? Which syscalls allow us to manipulate Linux namespaces and what are their limitations? How do resource limits actually work? What different behaviours do containers and VMs have?

In this hands-on workshop, we will build a small Docker-like tool from O/S level primitives in order to learn how Docker and containers actually work. Starting from a regular process, we will gradually isolate and constrain it until we have a (nearly) full container solution, pausing after each step to learn how our new constraints behave.

Speakers
avatar for Nati Cohen

Nati Cohen

HERE Mobility
Nati Cohen is a Production Engineer at Here Technologies and a Teaching Assistant at the Interdisciplinary Center Herzliya. Previous experience includes: operations consulting, software development, *nix administration and security research in the Intelligence Corps as well as in... Read More →
avatar for Avishai Ish-Shalom

Avishai Ish-Shalom

Avishai Ish-Shalom is a veteran Ops and a survivor of many production skirmishes. Avishai helps companies deal with web era operations and scale as an independent consultant. In his spare time Avishai is spreading weird ideas and conspiracy theories such as DevOps.


Wednesday July 13, 2016 09:00 - 14:40 IST
Ulster
 
Filter sessions
Apply filters to sessions.