Sometimes, there is an edge case we didn’t anticipate that causes issues in production.
And sometimes, there is a common use case we didn’t test sufficiently cover that causes issues in production.
And sometimes, production goes down. For reasons yet unknown.
For some of us, there is a production support team that handles these things. They might call us, if they need us, but they handle this stuff. For the rest of us, we need to handle the incident ourselves.
The following is a very simple framework for responding to incidents.