KG
  • Home
  • Blog
  • Tools
  • Glossary
  • About
  1. Home
  2. ›
  3. Blog
  4. ›
  5. #Site Reliability

#Site Reliability

1 post tagged with #Site Reliability

A blue and black abstract background with lines
Cloud and DevOps

A Single Automated Process Broke Dozens of Major Services. Inside the AWS us-east-1 Outage.

On December 7, 2021, an automated scaling activity inside AWS overwhelmed their own internal network, knocked out their monitoring, and cascaded across dozens of services. Here's what actually happened.

March 12, 2026 7 min read
Read more
© 2026 Kunal Ganglani. Built with coffee and curiosity in Toronto.