Skip to main content

Quickstart

SRE Framework is an open-source SRE Ops dashboard that helps teams apply Site Reliability Engineering practices. Config-driven, Prometheus-native, ships with a zero-infra demo stack.

From DockerHub (no build needed):

docker run -p 8000:8000 ops4life/sre-framework:latest

From source:

git clone https://github.com/ops4life/sre-framework
cd sre-framework
docker compose -f demo/docker-compose.yml up --build

Open http://localhost:8080 — live SRE dashboard with synthetic metrics from three fake services (frontend, api, worker). No Traefik, no Prometheus to install.


Use your own metrics

1. Pick a provider preset

PresetWorks with
traefikTraefik reverse proxy + dockerstats + node_exporter
httpAny app exposing http_requests_total + http_request_duration_seconds_bucket

2. Write sre.yaml

provider: http # or "traefik"
default_service: api

services:
- name: api
slo_target: 99.5
labels:
service: api

- name: frontend
slo_target: 99.9
labels:
service: frontend

For the traefik preset, add a container label too:

labels:
service: devex-svc@file
container: devex

3. Run

docker compose up --build

Set PROMETHEUS_URL and SRE_CONFIG_FILE in your .env (see .env.example).


Panel tooltips & Tour

Every panel has a lightbulb icon — hover (or tap on mobile) to see what the metric means, how it's computed, and links to the SRE Concepts primer.

Click Tour in the top bar for a guided walkthrough of the full dashboard.