Haley Tucker

Senior Software Engineer, Resilience Team @Netflix

Haley Tucker is a member of the Resilience Engineering team at Netflix where she is responsible for improving the reliability of the Netflix ecosystem by supporting developers and building trustable and safe tooling. Prior to that, she worked on the Playback Features team where her services filled a key role in enabling Netflix to stream amazing content to millions of members on thousands of device types worldwide. Prior to Netflix, Haley spent a few years building near-real-time command and control systems at Raytheon. She then moved into a consulting role where she built custom billing and payment solutions for cloud and telephony service providers. Haley enjoys applying new technologies to develop robust and maintainable systems and the scale at Netflix has been a unique and exciting challenge. Haley received a BS in Computer Science from Texas A&M University.

Find Haley Tucker at:

A Sticky Situation: How Netflix Gains Confidence in Changes

How do you know whether a change will affect end users in a negative way? As interactions in distributed systems grow increasingly complex, it can be challenging to get an answer to this question. 

One approach is to use a canary in which we introduce a new service into the environment, users are randomly routed to that service, and we compare the performance of that service to the current production build. However, this doesn’t really tell us anything about what the end users are experiencing -- it focuses on service-level metrics. In reality, a service may be happily serving successful requests, yet the end user is not able to use your product.

As a result, it can be useful to have a methodology which enables teams to observe the full impact of a change on end users. In this talk, I will demonstrate how Netflix uses sticky canaries to fulfill this need and I will highlight use cases where we have employed this methodology successfully. I will also cover the key platform features and tools to include when implementing sticky canaries.

Key Takeaways

  • What is a sticky canary?
  • What types of use cases may benefit from this methodology?
  • Platform and tooling investments required to make this a success.

Wednesday Nov 4 / 10:50AM PST (40 minutes )

TRACK Architecting for Confidence: Building Resilient Systems ADD TO CALENDAR Add to calendar

3 weeks of live software engineering content designed around your schedule.

Don’t miss out! Save your seat now