SLAs

New
adopt
First Added:May 17, 2026 Updated: June 12, 2026

SLAs

Service level agreements (SLAs) are measurable commitments between a provider and a customer: availability, response time, support windows, and remedies when targets are missed. We adopt defining SLAs (and the SLO/SLI machinery behind them) for Software as a Service and managed platforms so sales, engineering, and ops share the same numbers.

Blurb

A service-level agreement (SLA) defines the level of service expected by a customer from a supplier. It lays out the metrics by which that service is measured and the remedies or penalties, if any, should the agreed-on service levels not be achieved.

Summary

SLA vs SLO vs SLI:

TermMeaning
SLIWhat you measure (e.g. successful requests / total)
SLOInternal target (e.g. 99.9% over 30 days)
SLACustomer-facing promise, often with credits or termination rights

Why adopt:

Common SLA metrics:

MetricExample commitment
Availability99.9% monthly API uptime
Latencyp95 < 300ms for core APIs
SupportP1 response within 1 hour (business hours)
RTO/RPORecovery objectives for disaster scenarios

What to avoid: marketing “five nines” without instrumentation; SLAs you cannot measure from production telemetry; unlimited liability without error-budget policy.

Details

TopicNotes
MeasurementSLIs from real probes and RED metrics; no manual spreadsheet honesty
ExclusionsDocument maintenance windows, customer-caused outages, force majeure
CreditsTie remedies to repeatable formulas; legal review
Internal SLOSet SLO stricter than SLA so you miss the customer line rarely
IncidentsPostmortems when SLO burn rate spikes; feed back to architecture

References