Question
Why does recovery procedures fail?
Quick Answer
Writing recovery procedures that assume perfect conditions during the recovery itself. Your backup plan requires internet access, but the failure might be a network outage. Your rollback procedure requires a specific person's approval, but they might be on vacation. Recovery procedures must.
The most common reason recovery procedures fails: Writing recovery procedures that assume perfect conditions during the recovery itself. Your backup plan requires internet access, but the failure might be a network outage. Your rollback procedure requires a specific person's approval, but they might be on vacation. Recovery procedures must account for the degraded conditions that typically accompany failures — because failures rarely arrive alone, and the context in which you execute recovery is almost never the calm, fully-resourced environment in which you wrote the plan.
The fix: Pick one important recurring process in your life — a work deliverable, a creative routine, a financial procedure, anything where failure would cost you real time or real money. Write down the three most likely ways it could fail. For each failure mode, write a recovery procedure: the specific steps you would take to restore the process to working state, the resources you would need, and the maximum time the recovery should take. Keep the whole document to one page. You now have a recovery playbook for that process. Time limit: 30 minutes.
The underlying principle is straightforward: For every important process have a documented way to recover from common failures.
Learn more in these lessons