How to Calculate Uptime SLA from Sandglass: practical guidance for converting uptime percentages into concrete monthly downtime budgets and customer promises.
This guide focuses on converting uptime percentages into concrete monthly downtime budgets and customer promises. The goal is to make the operating decision clear before a stressful incident forces the team to improvise.
Pick the measurement window, exclude agreed maintenance periods only if the contract says so, and calculate downtime from customer-visible availability. Sandglass supports the continuous side of this work with checks, incidents, alert routing, and public status visibility.
An SLA number is a commercial promise, not just a dashboard metric. Do not promise availability your architecture and response process cannot support.
Decide which failures in this topic actually reach customers before adding any monitoring.
Match each risk to a single HTTP, content, TCP, SSL certificate, or heartbeat check instead of stacking duplicates.
Give each alert one owner and one destination — email, a Slack webhook, or a generic webhook.
Revisit intervals, thresholds, and ownership once a real incident shows what was missing.