Michał Ostruszka: You put out the fire on production, what's next? | JDD 2023

youtube.com, 1 month ago


Shit happens: releases don't always go as planned, and production systems sometimes break. Whether it's a bug in your code, in a library you use, or an infrastructure or hardware failure, the first thing you need to do is bring the system back to life with minimal damage. But what happens next? Should we just carry on with our regular work? Have you heard of a practice called "Post Mortem Analysis"? It sounds scary, but it's actually super useful when done properly, and that's what I'd like to talk about: what it is, how to conduct such an analysis, who should and who shouldn't be involved, what to watch out for along the way, and what the ultimate goals are. All in all, you don't want to end up with the same production outage tomorrow, do you?

🚀 https://jdd.org.pl/