Service-Level Agreement
The uptime we commit to. And what you get if we miss.
Last updated 2026-04-24 · Applies to paid plans only
This SLA covers the availability of the Sutrace ingest API, dashboard, and alerting pipeline. Free-plan customers have no SLA. The SLA forms part of the main service agreement with Sutrace.
Commitments by plan
| Plan | Monthly uptime | Status page latency | Credit floor |
|---|---|---|---|
| Free | — (best effort) | — | — |
| Team | 99.9% | < 5 min | 25% / month |
| Business | 99.95% | < 5 min | 50% / month |
| Scale | 99.99% | < 5 min | 100% / month (negotiated) |
How we measure
We measure availability as the percentage of 1-minute windows in a calendar month in which at least one of the three synthetic probes we run against /health returns a 2xx response from an unprivileged caller. We publish the measurement source to customer-success on request.
What counts as an exclusion
- Scheduled maintenance announced ≥7 days in advance.
- Emergency security patches, announced at the time.
- Force majeure events (per main agreement).
- Customer-caused incidents (e.g. exceeding a plan limit, using an unsupported API version, misconfigured SSO).
- Outages of upstream providers where we have no reasonable mitigation and have already implemented documented fallback measures.
Credit calculation + claim
If monthly uptime falls below the applicable threshold, you may claim a service credit equal to the percentage in the table, applied to the next invoice. Submit a claim within 30 days of month-end to sla@sutrace.io with your workspace ID and the affected window. Credits are your sole and exclusive remedy under this SLA.
Ingest-path latency goals (informational)
- Dashboard first-byte (p95): < 800 ms at edge.
- Ingest to visible on dashboard (p95): < 10 s for metrics, < 30 s for traces/logs.
- Alert fire to first notification (p95): < 30 s.
These are operational goals, not contractual commitments.
Status page
Public at status.sutrace.io(hosted off our primary infrastructure so it stays up when we're down — see what happened to Cloudflare 18 Nov 2025). Subscribe by email or webhook from the status page itself.
Incident response
For P1 (service-affecting for >10% of customers) we publish an initial update within 15 minutes, hourly updates until resolution, and a root-cause analysis within 5 business days. For regulated-industry customers needing NIS2 or DORA-shaped incident reporting, see the DPA.