Sentinel provides observability for Pantheon services and infrastructure.
It aggregates health, performance, and error data into a single dashboard.
- Detect failures early
- Provide visibility into system health
- Support operational decision-making
- Metric ingestion from services and hosts
- Time-series aggregation
- Threshold-based alerting
- Historical trend analysis
- Service uptime tracking
- Latency and error rate monitoring
- Resource utilization (CPU, memory)
- Alert thresholds
- Incident history views
- Full metrics backend replacement
- Log storage (log aggregation handled elsewhere)
- Hosted dashboards
- Tiered alerting policies
Implemented a monitoring dashboard aggregating service health, latency, and error metrics with alerting and historical analysis.