Run Book: Nuxt Production Average Response Time above threshold
Severity Guide
To identify the source of the slowdown, first check whether a recent deployment may have introduced the problem. If so, roll back to the previous version. Otherwise, check the following, in order:
Request count and general network activity. If abnormally high, refer to the traffic analysis run book to identify whether there is malicious traffic. If not, move on.
Check whether dependencies such as the API or Plausible analytics are constrained. If they are stable, move on.
To gather more information, check the log group. Use the “Logs Insights”
view to find requests that are taking longer than expected, with a CloudWatch
query similar to the one below, which can give more hints about which routes
are causing increased response times. Occasionally the /api/event endpoint
takes longer to respond (due to upstream issues with Plausible); these cases
increase our average response time without actually affecting frontend
performance for users. The following query shows the top 10 routes where the
request took longer than 0.5 seconds, ranked by the number of such requests
made to each route.
fields request, request_time, @timestamp, @message
| filter request_time > 0.5
| stats count(*) as request_count by request
| sort request_count desc
| limit 10
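To check whether slow /api/event requests alone explain the alert, it can help to rerun the same query with that route filtered out. The sketch below is a hypothetical helper, not part of any existing tooling: the function name `build_slow_routes_query` and its parameters are illustrative. It rebuilds the query above as a string and optionally adds `not like` filters. It assumes the `request` field contains the route path as text, as the query above implies.

```python
def build_slow_routes_query(threshold_seconds=0.5, limit=10, exclude_routes=()):
    """Build a Logs Insights query like the one above.

    Hypothetical helper: `exclude_routes` lists path fragments (for example
    "/api/event") whose requests should be left out of the results.
    """
    lines = [
        "fields request, request_time, @timestamp, @message",
        f"| filter request_time > {threshold_seconds}",
    ]
    for route in exclude_routes:
        # In Logs Insights, `not like "<text>"` is a substring match,
        # so this drops any request whose text contains the route.
        lines.append(f'| filter request not like "{route}"')
    lines.append("| stats count(*) as request_count by request")
    lines.append("| sort request_count desc")
    lines.append(f"| limit {limit}")
    return "\n".join(lines)


if __name__ == "__main__":
    # Print a variant of the run book query that ignores /api/event.
    print(build_slow_routes_query(exclude_routes=["/api/event"]))
```

The printed query can be pasted into the “Logs Insights” view. If the remaining routes show few slow requests, the alert is more likely a false positive of the kind described below.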
Historical false positives
2024-04-10, 22:00 UTC: Requests to the /api/event endpoint were taking longer than expected and impacted average response time, but not normal frontend traffic response time.