Runbook: [Title]
Owner: [team or individual]
Backup owner: [secondary contact]
Last validated: [YYYY-MM-DD]
Validation method: [manual drill / CI test / tabletop exercise]
Severity trigger: SEV[1-4]
Customer impact: [description of what customers experience]
Required access: [SSH, VPS, MongoDB, Redis, etc.]
Related services: [list container names from docker-compose.production.yml]
Related ADRs: [links to relevant ADRs]
When To Use This
[Describe the conditions that trigger this runbook]
Quick Triage
[2-3 commands to assess the situation before committing to a fix]
Procedure
[Step-by-step fix instructions with commands]
Rollback
[How to undo the fix if it makes things worse, or explicitly state “No rollback needed” if the fix cannot cause further harm]
Verification
[Commands to confirm the issue is resolved]
Escalation
[When and how to escalate if the procedure doesn’t work]
Customer Communication
[Template for notifying affected customers, if applicable]
Drill Log
| Date | Validator | Method | Result | Notes |
|---|---|---|---|---|
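
The Drill Log can also be checked mechanically, for example by a CI job that flags runbooks that have not been validated recently. The following is a minimal sketch, not part of the template itself. It assumes dates use the YYYY-MM-DD format shown for "Last validated", that the Result column holds the literal values `pass` or `fail` (an assumption, since the template does not define them), and that no cell contains a `|` character. The function names `parse_drill_log` and `days_since_last_pass` and the sample rows are hypothetical.

```python
from datetime import date


def parse_drill_log(table_text):
    """Parse a Drill Log pipe table into a list of {column: value} dicts."""
    rows = []
    header = None
    for line in table_text.strip().splitlines():
        line = line.strip()
        if not line.startswith("|"):
            continue  # ignore anything that is not a table row
        inner = line[1:]
        if inner.endswith("|"):
            inner = inner[:-1]
        cells = [cell.strip() for cell in inner.split("|")]
        if header is None:
            header = cells  # first table row holds the column names
            continue
        # Skip the |---|---| separator row under the header.
        if all(cell and set(cell) <= set("-:") for cell in cells):
            continue
        rows.append(dict(zip(header, cells)))
    return rows


def days_since_last_pass(rows, today):
    """Return days since the most recent passing drill, or None if there is none."""
    passed = [
        date.fromisoformat(row["Date"])
        for row in rows
        if row.get("Result", "").lower() == "pass"  # "pass" is an assumed value
    ]
    if not passed:
        return None
    return (today - max(passed)).days


# Hypothetical sample data, for illustration only.
EXAMPLE_LOG = """
| Date | Validator | Method | Result | Notes |
|---|---|---|---|---|
| 2024-01-15 | alice | tabletop exercise | pass | |
| 2024-03-02 | bob | manual drill | fail | timed out |
"""

if __name__ == "__main__":
    drills = parse_drill_log(EXAMPLE_LOG)
    print(days_since_last_pass(drills, date(2024, 4, 1)))
```

A check like this could fail the build when the returned value is `None` or exceeds whatever revalidation interval the team agrees on. The threshold is left out here because the template does not specify one.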