How would you go about writing a postmortem?

  • What was the root cause?
  • How did we identify the issue?
  • What was the blast radius (affected infra/services)?
  • What did we learn, including remediation steps (automation, monitoring and alerting, training, etc.)?
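
One way to keep every postmortem answering the same four questions is to generate the skeleton from a small template. The sketch below is only an illustration: the `Postmortem` class, its field names, and the sample incident are hypothetical, not part of any standard tool or of a real outage.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Postmortem:
    """Hypothetical container with one field per question in the list above."""
    title: str
    root_cause: str
    detection: str              # how the issue was identified
    blast_radius: List[str]     # affected infra/services
    lessons: List[str] = field(default_factory=list)  # learnings and remediation steps

    def render(self) -> str:
        """Return the postmortem as plain text, one section per question."""
        lines = [
            f"Postmortem: {self.title}",
            "",
            "Root cause:",
            f"  {self.root_cause}",
            "",
            "How we identified the issue:",
            f"  {self.detection}",
            "",
            "Blast radius:",
        ]
        lines += [f"  - {item}" for item in self.blast_radius]
        lines += ["", "What we learned / remediation:"]
        lines += [f"  - {item}" for item in self.lessons]
        return "\n".join(lines)


if __name__ == "__main__":
    # Made-up incident, used only to show the rendered layout.
    pm = Postmortem(
        title="Example outage (illustrative)",
        root_cause="Expired TLS certificate on the load balancer",
        detection="Synthetic health check alert",
        blast_radius=["public API", "web frontend"],
        lessons=["Automate certificate renewal", "Alert on certificate expiry"],
    )
    print(pm.render())
```

Making the first four fields required means a postmortem cannot be created without a root cause, a detection story, and a blast radius; only the lessons list may start empty and be filled in during the review.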

References: