Runbooks
This section collects runbooks for responding to Openverse infrastructure alarms.
For details on what runbooks are and what they must contain, see the “Run Books” section of the ECS baseline monitoring implementation plan. The plan also includes example runbooks, which are a useful reference when writing a new one.
- Run Book: API Production HTTP 2XX responses count under threshold
- Run Book: API Production HTTP 5XX responses count above threshold
- Run Book: API Production Average Response Time above threshold
- Run Book: API Production Average Response Time anomaly
- Run Book: API Production P99 Response Time above threshold
- Run Book: API Production P99 Response Time anomaly
- Run Book: API Production Request Count anomalously high
- Run Book: API Thumbnails Production HTTP 2XX responses count under threshold
- Run Book: API Thumbnails Production HTTP 5XX responses count above threshold
- Run Book: API Thumbnails Production Request Count anomalously high
- Run Book: API Thumbnails Production Average Response Time above threshold
- Run Book: API Thumbnails Production Average Response Time anomalously high
- Run Book: API Thumbnails Production P99 Response Time above threshold
- Run Book: API Thumbnails Production P99 Response Time anomalously high
- Run Book: Nuxt 2XX responses count under threshold
- Run Book: Nuxt 5XX responses count above threshold
- Run Book: Nuxt Production Average Response Time above threshold
- Run Book: Nuxt Production Average Response Time anomalously high
- Run Book: Nuxt Production P99 Response Time above threshold
- Run Book: Nuxt Production P99 Response Time anomalously high
- Run Book: Nuxt Request Count anomalously high
- Run Book: Unhealthy hosts for ECS service