Skip to content

Instantly share code, notes, and snippets.

@drewr
Created December 5, 2025 02:21
Show Gist options
  • Select an option

  • Save drewr/efe1e0aed6991f2f9b43582ae6ec56f3 to your computer and use it in GitHub Desktop.

Select an option

Save drewr/efe1e0aed6991f2f9b43582ae6ec56f3 to your computer and use it in GitHub Desktop.

Production Readiness:

Background

Readiness Levels

Minimum

  • Service is defined in the Service Inventory
  • Basic alerting
  • Troubleshooting documentation
  • Preconfigured access for all oncall engineers
  • High availability
  • Logging etc.)
  • Service Dashboard
  • Deployment and rollback documentation
  • Development process documentation
  • Design documentation
  • Data backups and restore procedure (if relevant)
  • Software vulnerability identification and mitigation
  • Secrets Management
  • Customer Data or PII
  • Policy Compliance

Moderate

  • Automated testing integrated with CI and pull requests
  • Automated deployment
  • Alerting to detect degraded service
  • Infosec security review
  • Service Accounts
  • Monitoring for security fixes
  • Health checks and self healing
  • Regular capacity planning
  • Regular dependency updates scheduled
  • Disaster recovery documentation
  • Zero downtime deployment and graceful restarts
  • Structured logging
  • Error handling and backoff
  • Load testing

High

  • Full scale non-production environment for testing
  • Automated capacity planning
  • Autoscaling based on resource utilization
  • Service level metrics
  • Status Page
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment