The Problem With MTTR: Learning From Incident Reports

March 8, 2023


Tracking Mean Time To Restore (MTTR) is standard industry practice for incident response and analysis, but should it be?

Courtney Nash, an Internet Incident Librarian, argues that MTTR is not a reliable metric — and we think she’s got a point.

We caught up with Courtney at the DevOps Enterprise Summit in Las Vegas, where she was making her case against MTTR in favor of alternative metrics (SLOs and cost of coordination data), practices (Near Miss analysis), and mindsets (humans are the solution, not the problem) to help organizations better learn from their incidents.
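One common objection to mean-based metrics, which this excerpt does not spell out, is that a single long incident can dominate the average. The sketch below illustrates that idea only; the durations are made-up numbers, not data from The Void, and the `mttr` helper is a hypothetical name for a plain arithmetic mean.

```python
from statistics import mean, median

# Hypothetical incident durations in minutes (invented for illustration,
# not taken from any real incident database).
durations = [12, 18, 25, 30, 41, 55, 480]


def mttr(minutes):
    """Mean time to restore: the arithmetic mean of incident durations."""
    return mean(minutes)


# Compare the mean with and without the single long incident.
with_outlier = mttr(durations)
without_outlier = mttr(durations[:-1])

print(f"MTTR with the long incident:    {with_outlier:.1f} min")
print(f"MTTR without the long incident: {without_outlier:.1f} min")
print(f"Median with the long incident:  {median(durations)} min")
```

With these numbers, one 480-minute incident roughly triples the mean while the median stays at 30 minutes, which is one way a single MTTR figure can say little about how a typical incident went.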

Episode Highlights

  • (1:54) The end of MTTR?
  • (4:50) Library of incidents
  • (13:20) What is an incident?
  • (19:41) Cost of coordination
  • (22:13) Near misses
  • (24:21) Mental models
  • (28:16) Role of language in shaping public discourse
  • (29:33) Learnings from The Void

Episode Excerpt

Dan: Hey, everyone; welcome to Dev Interrupted. My name is Dan Lines, and I’m here with Courtney Nash, who has one of the coolest possibly made-up titles, but possibly real: Internet Incident Librarian.

Courtney: Yep, that’s right, yeah, you got it.

Dan: Welcome to the show.

Courtney: Thank you for having me on. 

Dan: I love that title.

Courtney: Still possibly made up, possibly, possibly…

Dan: Still possibly made up. 

Courtney: We’ll just leave that one out there for the listeners to decide.

Dan: Let everyone decide what that could possibly mean. We have a, I think, maybe a spicy show, a spicy topic. 

Courtney: It’s a hot topic show.

Dan: Hot topic, especially since we’re at DevOps Enterprise Summit, where we hear a lot about the DORA metrics, one of them being MTTR. 

Courtney: Yes. 

Dan: And you might have a hot take on that. The end of MTTR? Or how would you describe it?

Courtney: Yeah, I feel a little like the fox in the henhouse here, but Gene accepted the talk. So you know, there’s that.

Dan: So it’s on him.

Courtney: [laughing] It’s all Gene’s fault! So I have been interested in complex systems for a long time; I used to study the brain. And I got sucked down an internet rabbit hole quite a while ago. And I’ve had beliefs for a long time that I haven’t necessarily had data to back up. And we see these sort of perverted behaviors, not that kind of perverted, but where we take metrics in the industry, and then, with Goodhart’s Law, whatever metric you pick, people incentivize it, and then weird things happen. But I think we spend too little time looking at the humans in the system and a lot of time focusing on the technical aspects and the data that come out of the technical side of systems. So, I started a project about a year ago called The Void. It’s the Verica Open Incident Database, actually a real, not made-up name. And it’s the largest collection of public incident reports. So, if you all have an outage, you hopefully go and figure out and talk about what happened, and then you write that up, and that’s out in the world. So I’m not writing these, I’m curating and collecting them. I’m a librarian. So, I have about 10,000 of them now, and a bunch of metadata associated with all these incident reports.



