Monitor deployed Workloads¶
Preview
The Workload API is a preview feature, on by default.
Feature flag: Enable Access to Workload API Experimental Features
The Monitoring tab on a deployed Workload surfaces the monitoring capabilities described in Monitoring concepts. Each left-rail tab opens a focused view:
| Tab | What it shows |
|---|---|
| Track service health | Request volume, latency, errors per minute, and error rate, with an interactive timeline and chart. |
| Monitor resource utilization | Replica count and CPU and memory consumption by container over time. |
| Monitor capacity | Quota usage by entity—requests, rate-limited requests, token count, and concurrent requests against the Workload's configured limits. |
| Explore monitoring data | Free-form exploration over the Workload's monitoring data. |
| View OpenTelemetry metrics | Application metrics emitted via OpenTelemetry by your container. |
The Console also surfaces a lifecycle audit trail and log viewers; those live on the Activity log tab and are documented in View deployed Workload activity.
Draft and locked Workloads use the same Console capabilities; retention differs (24 hours vs. 30 days) as noted in Monitoring concepts.