Status Page vs Incident Page from Sandglass: practical guidance for separating the always-on health page from the timeline for one specific outage.
This guide focuses on separating the always-on health page from the timeline for one specific outage. The goal is to make the operating decision clear before a stressful incident forces the team to improvise.
Use the status page for current component state and the incident page or internal timeline for detailed investigation notes. Sandglass supports the continuous side of this work with checks, incidents, alert routing, and public status visibility.
Combining every note into the public status page can make the page harder to scan when users only need current availability.
Decide which failures in this topic actually reach customers before adding any monitoring.
Match each risk to a single HTTP, content, TCP, SSL certificate, or heartbeat check instead of stacking duplicates.
Give each alert one owner and one destination — email, a Slack webhook, or a generic webhook.
Revisit intervals, thresholds, and ownership once a real incident shows what was missing.