- Service is defined in the Service Inventory
- Basic alerting
- Troubleshooting documentation
- Preconfigured access for all oncall engineers
- High availability
- Logging etc.)
- Service Dashboard
- Deployment and rollback documentation
- Development process documentation
- Design documentation
- Data backups and restore procedure (if relevant)
- Software vulnerability identification and mitigation
- Secrets Management
- Customer Data or PII
- Policy Compliance
- Automated testing integrated with CI and pull requests
- Automated deployment
- Alerting to detect degraded service
- Infosec security review
- Service Accounts
- Monitoring for security fixes
- Health checks and self healing
- Regular capacity planning
- Regular dependency updates scheduled
- Disaster recovery documentation
- Zero downtime deployment and graceful restarts
- Structured logging
- Error handling and backoff
- Load testing
- Full scale non-production environment for testing
- Automated capacity planning
- Autoscaling based on resource utilization
- Service level metrics
- Status Page