At 4:02 a.m., a production node fails. Every alert fires. Every dashboard goes red. The system does exactly what it was designed to do. No one responds. An automated call goes unanswered. The backup ...