Session + Live Q&A

Incidents, PRRs, and Psychological Safety

A Production Readiness Review (PRR) is a process that identifies the reliability needs of a service based on its specific details. Few organizations have the benefit of starting with a robust PRR process. In most instances, the PRR process came about because of a production incident. The fact is there isn’t a one-size-fits-all PRR process. What your PRR process looks like is a function of many things, including where your organization is and its business goals. In this talk, Nora discusses the context around PRRs and then provides tangible takeaways on how you can improve your production reliability. Spoiler alert: PRRs at other companies are great examples of how they measure production readiness, but you can’t drop their process into your organization without context.


Speaker

Nora Jones

Founder and CEO @jeli_io

Nora is the founder and CEO of Jeli. She is a dedicated and driven technology leader and software engineer with a passion for the intersection between how people and software work in practice in distributed systems. In November 2017 she keynoted at AWS re:Invent to share her experiences helping...

Date

Wednesday Nov 10 / 01:10PM EST (40 minutes)

Track

Production Readiness

Topics

Production Readiness, Chaos Engineering, DevOps, SRE, Infrastructure

From the same track

Session + Live Q&A Production Readiness

Production Readiness: Fighting Fires or Building Better Systems?

Wednesday Nov 10 / 11:10AM EST

In 2018 Tanya Reilly gave a talk called ‘The History of Fire Escapes’ in which she argues that we need to ‘focus on better software, not better incident response’. When I was recently asked how much time SREs should spend firefighting, that talk came to mind. The ideal...

Laura Nolan

Senior Staff Engineer @Slack, Contributor to Seeking SRE, & SRECon Steering Committee

Session + Live Q&A Production Readiness

Prod Lessons - Deployment Validation and Graceful Degradation

Wednesday Nov 10 / 12:10PM EST

Key to Site Reliability Engineering is building frameworks and “guardrails” that enable the product to be developed safely. If patterns can be identified in outages and bugs, preventing those problems systematically gives SRE unparalleled leverage to improve stability. During...

Anika Mukherji

Software Engineer @Pinterest

PANEL DISCUSSION + Live Q&A Production Readiness

Panel: Real-World Production Readiness

Wednesday Nov 10 / 02:10PM EST

What does it mean for an app to truly be ready for Production? Join Ines Sombra (Senior Director of Engineering at Fastly), Kolton Andrus (CEO of Gremlin), and Laura Nolan (Seeking SRE Contributor) as we discuss production readiness. Topics we’ll dive into range from the...

Kolton Andrus

Founder and CEO of @GremlinInc

Laura Nolan

Senior Staff Engineer @Slack, Contributor to Seeking SRE, & SRECon Steering Committee

Ines Sombra

Director of Engineering @Fastly
