Traditional SRE metrics—availability, latency, error rate—measure whether systems are up, not whether they’re useful. A 99.4% uptime dashboard once masked an AI agent returning HTTP 200s while generating unusable reports, a silent regression from a cheaper model swap. This gap between infrastructure health and task completion drives the three-layer SLO …