Sponsored Webinar + Live Q&A
Finding Harmony: Resilient Systems and Compensatory Growth
Many engineering teams are locked in the Sisyphean battle to prevent incidents while also accepting the SRE tenet that failure is normal. Often this results in burnout. What if there’s a better, more harmonious way? I’ll share how truly embracing failure can lead to more resilient systems—a symbiosis of more reliable applications and stronger, happier engineering teams.
Speaker
Jason Yee
Director of Advocacy @Gremlin
Jason Yee is Director of Advocacy at Gremlin where he helps people build more resilient systems by learning from how they fail. He also leads the internal Chaos Engineering practices to make Gremlin more reliable. Previously, he worked at Datadog, O’Reilly Media, and MongoDB. His...
Read moreFind Jason Yee at:
Session Sponsored By
Gremlin is a Chaos Engineering service on a mission to help build a more reliable internet. Their solutions turn failure into resilience by offering engineers a fully hosted SaaS platform to safely experiment on complex systems, in order to identify weaknesses before they impact customers and cause revenue loss. Existing customers include JPMorgan Chase, Mailchimp, Grubhub, Twilio, Walmart, and Workiva.
From the same track
Using Generic Mitigations and Playbooks as Code to Improve Reliability
Thursday May 27 / 02:10PM EDT
Every company lives in fear of an outage, or other production incident that wakes engineers up in the middle of the night, and results in unhappy customers trying to access services. It's more vital than ever to mitigate the customer impact of outages as fast as possible. In this...
Leonid Belkind
Co-Founder and Chief Technology Officer @StackPulse