Observability Service Health¶
This page outlines the operations, endpoints, parameters, and example requests and responses for the Observability Service Health.
GET /api/v2/deployments/{deploymentId}/batchServiceStats/¶
Retrieve all deployment service health metrics over a set of batches.
Code samples¶
curl -X GET https://app.datarobot.com/api/v2/deployments/{deploymentId}/batchServiceStats/ \
-H "Accept: application/json" \
-H "Authorization: Bearer {access-token}"
Parameters
| Name | In | Type | Required | Description |
|---|---|---|---|---|
| executionTimeQuantile | query | number | false | Quantile for executionTime metric. |
| responseTimeQuantile | query | number | false | Quantile for responseTime metric. |
| slowRequestsThreshold | query | integer | false | Threshold for slowRequests metric. |
| segmentAttribute | query | string | false | The name of a segment attribute used for segment analysis. |
| segmentValue | query | string,null | false | The value of the segmentAttribute to segment on. |
| batchId | query | any | false | The id of the batch for which metrics are being retrieved. |
| modelId | query | string | false | The id of the model for which metrics are being retrieved. |
| deploymentId | path | string | true | Unique identifier of the deployment. |
Enumerated Values¶
| Parameter | Value |
|---|---|
| segmentAttribute | [DataRobot-Consumer, DataRobot-Remote-IP, DataRobot-Host-IP] |
Example responses¶
200 Response
{
"batches": [
{
"earliestPredictionTimestamp": "2019-08-24T14:15:22Z",
"id": "string",
"latestPredictionTimestamp": "2019-08-24T14:15:22Z",
"name": "string"
}
],
"metrics": {
"cacheHitRatio": 0,
"executionTime": 0,
"numConsumers": 0,
"responseTime": 0,
"serverErrorRate": 0,
"slowRequests": 0,
"totalPredictions": 0,
"totalRequests": 0,
"userErrorRate": 0
},
"segmentAttribute": "string",
"segmentValue": ""
}
Responses¶
| Status | Meaning | Description | Schema |
|---|---|---|---|
| 200 | OK | Service health metric data retrieved. | ServiceStatsForBatchRetrieveResponse |
To perform this operation, you must be authenticated by means of one of the following methods:
BearerAuth
GET /api/v2/deployments/{deploymentId}/serviceStats/¶
Retrieve all deployment service health metrics over a single period of time.
Code samples¶
curl -X GET https://app.datarobot.com/api/v2/deployments/{deploymentId}/serviceStats/ \
-H "Accept: application/json" \
-H "Authorization: Bearer {access-token}"
Parameters
| Name | In | Type | Required | Description |
|---|---|---|---|---|
| start | query | string,null(date-time) | false | Start of the period to retrieve monitoring data, defaults to 7 days ago from the end of the period. Note: this field only accepts top of the hour RFC3339 datetime strings, for example: 2019-08-01T00:00:00Z. |
| end | query | string,null(date-time) | false | End of the period to retrieve monitoring data, defaults to the next top of the hour. Note: this field only accepts top of the hour RFC3339 datetime strings, for example: 2019-08-01T00:00:00Z. |
| executionTimeQuantile | query | number | false | Quantile for executionTime metric. |
| responseTimeQuantile | query | number | false | Quantile for responseTime metric. |
| slowRequestsThreshold | query | integer | false | Threshold for slowRequests metric. |
| segmentAttribute | query | string | false | The name of a segment attribute used for segment analysis. |
| segmentValue | query | string,null | false | The value of the segmentAttribute to segment on. |
| modelId | query | string | false | The id of the model for which metrics are being retrieved. |
| deploymentId | path | string | true | Unique identifier of the deployment. |
Enumerated Values¶
| Parameter | Value |
|---|---|
| segmentAttribute | [DataRobot-Consumer, DataRobot-Remote-IP, DataRobot-Host-IP] |
Example responses¶
200 Response
{
"metrics": {
"cacheHitRatio": 0,
"executionTime": 0,
"medianLoad": 0,
"numConsumers": 0,
"peakLoad": 0,
"responseTime": 0,
"serverErrorRate": 0,
"slowRequests": 0,
"totalPredictions": 0,
"totalRequests": 0,
"userErrorRate": 0
},
"modelId": "string",
"period": {
"end": "2019-08-24T14:15:22Z",
"start": "2019-08-24T14:15:22Z"
},
"segmentAttribute": "string",
"segmentValue": ""
}
Responses¶
| Status | Meaning | Description | Schema |
|---|---|---|---|
| 200 | OK | Service health metric data retrieved. | ServiceStatsForTimeRangeResponse |
To perform this operation, you must be authenticated by means of one of the following methods:
BearerAuth
GET /api/v2/deployments/{deploymentId}/serviceStatsOverBatch/¶
Retrieve values for one single deployment service health metric over batch.
Code samples¶
curl -X GET https://app.datarobot.com/api/v2/deployments/{deploymentId}/serviceStatsOverBatch/ \
-H "Accept: application/json" \
-H "Authorization: Bearer {access-token}"
Parameters
| Name | In | Type | Required | Description |
|---|---|---|---|---|
| segmentAttribute | query | string | false | The name of a segment attribute used for segment analysis. |
| segmentValue | query | string,null | false | The value of the segmentAttribute to segment on. |
| batchId | query | any | false | The id of the batch for which metrics are being retrieved. |
| modelId | query | string | false | The id of the model for which metrics are being retrieved. |
| metric | query | string | false | A service health metric. |
| quantile | query | number | false | Quantile for executionTime and responseTime metrics |
| threshold | query | integer | false | Threshold for slowQueries metric. |
| deploymentId | path | string | true | Unique identifier of the deployment. |
Enumerated Values¶
| Parameter | Value |
|---|---|
| segmentAttribute | [DataRobot-Consumer, DataRobot-Remote-IP, DataRobot-Host-IP] |
| metric | [totalPredictions, totalRequests, slowRequests, executionTime, responseTime, userErrorRate, serverErrorRate, numConsumers, cacheHitRatio] |
Example responses¶
200 Response
{
"buckets": [
{
"batch": {
"earliestPredictionTimestamp": "2019-08-24T14:15:22Z",
"id": "string",
"latestPredictionTimestamp": "2019-08-24T14:15:22Z",
"name": "string"
},
"value": 0
}
],
"metric": "string",
"segmentAttribute": "string",
"segmentValue": ""
}
Responses¶
| Status | Meaning | Description | Schema |
|---|---|---|---|
| 200 | OK | Service health metric data retrieved. | ServiceStatsOverBatchResponse |
To perform this operation, you must be authenticated by means of one of the following methods:
BearerAuth
GET /api/v2/deployments/{deploymentId}/serviceStatsOverTime/¶
Retrieve values for one single deployment service health metric over time.
Code samples¶
curl -X GET https://app.datarobot.com/api/v2/deployments/{deploymentId}/serviceStatsOverTime/ \
-H "Accept: application/json" \
-H "Authorization: Bearer {access-token}"
Parameters
| Name | In | Type | Required | Description |
|---|---|---|---|---|
| start | query | string,null(date-time) | false | Start of the period to retrieve monitoring data, defaults to 7 days ago from the end of the period. Note: this field only accepts top of the hour RFC3339 datetime strings, for example: 2019-08-01T00:00:00Z. |
| end | query | string,null(date-time) | false | End of the period to retrieve monitoring data, defaults to the next top of the hour. Note: this field only accepts top of the hour RFC3339 datetime strings, for example: 2019-08-01T00:00:00Z. |
| bucketSize | query | string(duration) | false | The time duration of a bucket. Needs to be multiple of one hour. Can not be longer than the total length of the period. If not set, a default value will be calculated based on the start and end time. |
| metric | query | string | false | Name of the metric. See below for a list of supported metrics. |
| quantile | query | number | false | A quantile for resulting data, used if metric is executionTime or responseTime, defaults to 0.5. |
| threshold | query | integer | false | A threshold for filtering results, used if metric is slowQueries, defaults to 1000. |
| segmentAttribute | query | string | false | The name of a segment attribute used for segment analysis. |
| segmentValue | query | string,null | false | The value of the segmentAttribute to segment on. |
| modelId | query | string | false | The id of the model for which metrics are being retrieved. |
| deploymentId | path | string | true | Unique identifier of the deployment. |
Enumerated Values¶
| Parameter | Value |
|---|---|
| metric | [totalPredictions, totalRequests, slowRequests, executionTime, responseTime, userErrorRate, serverErrorRate, numConsumers, cacheHitRatio, medianLoad, peakLoad] |
| segmentAttribute | [DataRobot-Consumer, DataRobot-Remote-IP, DataRobot-Host-IP] |
Example responses¶
200 Response
{
"buckets": [
{
"period": {
"end": "2019-08-24T14:15:22Z",
"start": "2019-08-24T14:15:22Z"
},
"value": 0
}
],
"metric": "string",
"modelId": "string",
"segmentAttribute": "string",
"segmentValue": "",
"summary": {
"period": {
"end": "2019-08-24T14:15:22Z",
"start": "2019-08-24T14:15:22Z"
},
"value": 0
}
}
Responses¶
| Status | Meaning | Description | Schema |
|---|---|---|---|
| 200 | OK | Service health metric data retrieved. | ServiceStatsOverTimeResponse |
To perform this operation, you must be authenticated by means of one of the following methods:
BearerAuth
Schemas¶
Batch
{
"earliestPredictionTimestamp": "2019-08-24T14:15:22Z",
"id": "string",
"latestPredictionTimestamp": "2019-08-24T14:15:22Z",
"name": "string"
}
Batch info.
Properties¶
| Name | Type | Required | Restrictions | Description |
|---|---|---|---|---|
| earliestPredictionTimestamp | string(date-time) | true | Earliest prediction timestamp of a batch. | |
| id | string | true | Batch ID. | |
| latestPredictionTimestamp | string,null(date-time) | true | Latest prediction timestamp of a batch. | |
| name | string | true | Batch name. |
ServiceStatsForBatchRetrieveResponse
{
"batches": [
{
"earliestPredictionTimestamp": "2019-08-24T14:15:22Z",
"id": "string",
"latestPredictionTimestamp": "2019-08-24T14:15:22Z",
"name": "string"
}
],
"metrics": {
"cacheHitRatio": 0,
"executionTime": 0,
"numConsumers": 0,
"responseTime": 0,
"serverErrorRate": 0,
"slowRequests": 0,
"totalPredictions": 0,
"totalRequests": 0,
"userErrorRate": 0
},
"segmentAttribute": "string",
"segmentValue": ""
}
Properties¶
| Name | Type | Required | Restrictions | Description |
|---|---|---|---|---|
| batches | [Batch] | true | Info of the batches the metric is aggregated on. | |
| metrics | ServiceStatsMetricsForBatch | true | Service health metrics of the deployment | |
| segmentAttribute | string,null | false | The name of the segment on which segment analysis is being performed. | |
| segmentValue | string,null | false | The value of the segmentAttribute to segment on. |
ServiceStatsForTimeRangeResponse
{
"metrics": {
"cacheHitRatio": 0,
"executionTime": 0,
"medianLoad": 0,
"numConsumers": 0,
"peakLoad": 0,
"responseTime": 0,
"serverErrorRate": 0,
"slowRequests": 0,
"totalPredictions": 0,
"totalRequests": 0,
"userErrorRate": 0
},
"modelId": "string",
"period": {
"end": "2019-08-24T14:15:22Z",
"start": "2019-08-24T14:15:22Z"
},
"segmentAttribute": "string",
"segmentValue": ""
}
Properties¶
| Name | Type | Required | Restrictions | Description |
|---|---|---|---|---|
| metrics | ServiceStatsMetricsForTimeRange | true | Service health metrics of the deployment | |
| modelId | string | false | The id of the model for which metrics are being retrieved. | |
| period | TimeRange | true | An object with the keys "start" and "end" defining the period. | |
| segmentAttribute | string,null | false | The name of the segment on which segment analysis is being performed. | |
| segmentValue | string,null | false | The value of the segmentAttribute to segment on. |
ServiceStatsMetricsForBatch
{
"cacheHitRatio": 0,
"executionTime": 0,
"numConsumers": 0,
"responseTime": 0,
"serverErrorRate": 0,
"slowRequests": 0,
"totalPredictions": 0,
"totalRequests": 0,
"userErrorRate": 0
}
Service health metrics of the deployment
Properties¶
| Name | Type | Required | Restrictions | Description |
|---|---|---|---|---|
| cacheHitRatio | number,null | true | Number of cache hits. | |
| executionTime | number,null | true | Request execution time at executionTimeQuantile (in milliseconds). | |
| numConsumers | integer,null | true | Number of unique users performing requests. | |
| responseTime | number,null | true | Request response time at responseTimeQuantile (in milliseconds). | |
| serverErrorRate | number,null | true | Ratio of server errors to the total number of requests. | |
| slowRequests | integer,null | true | Number of requests with response time greater than slowRequestsThreshold | |
| totalPredictions | integer,null | true | Total number of prediction rows. | |
| totalRequests | integer,null | true | Total number of prediction requests performed. | |
| userErrorRate | number,null | true | Ratio of user errors to the total number of requests. |
ServiceStatsMetricsForTimeRange
{
"cacheHitRatio": 0,
"executionTime": 0,
"medianLoad": 0,
"numConsumers": 0,
"peakLoad": 0,
"responseTime": 0,
"serverErrorRate": 0,
"slowRequests": 0,
"totalPredictions": 0,
"totalRequests": 0,
"userErrorRate": 0
}
Service health metrics of the deployment
Properties¶
| Name | Type | Required | Restrictions | Description |
|---|---|---|---|---|
| cacheHitRatio | number,null | true | Number of cache hits. | |
| executionTime | number,null | true | Request execution time at executionTimeQuantile (in milliseconds). | |
| medianLoad | number,null | true | Median of the request rate (in requests per minute). | |
| numConsumers | integer,null | true | Number of unique users performing requests. | |
| peakLoad | number,null | true | Maximum of the request rate (in requests per minute). | |
| responseTime | number,null | true | Request response time at responseTimeQuantile (in milliseconds). | |
| serverErrorRate | number,null | true | Ratio of server errors to the total number of requests. | |
| slowRequests | integer,null | true | Number of requests with response time greater than slowRequestsThreshold | |
| totalPredictions | integer,null | true | Total number of prediction rows. | |
| totalRequests | integer,null | true | Total number of prediction requests performed. | |
| userErrorRate | number,null | true | Ratio of user errors to the total number of requests. |
ServiceStatsOverBatchBucket
{
"batch": {
"earliestPredictionTimestamp": "2019-08-24T14:15:22Z",
"id": "string",
"latestPredictionTimestamp": "2019-08-24T14:15:22Z",
"name": "string"
},
"value": 0
}
Service health stats of the deployment over a batch.
Properties¶
| Name | Type | Required | Restrictions | Description |
|---|---|---|---|---|
| batch | Batch | true | Batch info. | |
| value | number,null | true | Value of the metric in the bucket. |
ServiceStatsOverBatchResponse
{
"buckets": [
{
"batch": {
"earliestPredictionTimestamp": "2019-08-24T14:15:22Z",
"id": "string",
"latestPredictionTimestamp": "2019-08-24T14:15:22Z",
"name": "string"
},
"value": 0
}
],
"metric": "string",
"segmentAttribute": "string",
"segmentValue": ""
}
Properties¶
| Name | Type | Required | Restrictions | Description |
|---|---|---|---|---|
| buckets | [ServiceStatsOverBatchBucket] | true | An array of buckets, representing service health stats of the deployment over selected batches. | |
| metric | string | true | Name of the metric requested. | |
| segmentAttribute | string,null | false | The name of the segment on which segment analysis is being performed. | |
| segmentValue | string,null | false | The value of the segmentAttribute to segment on. |
ServiceStatsOverTimeBucket
{
"period": {
"end": "2019-08-24T14:15:22Z",
"start": "2019-08-24T14:15:22Z"
},
"value": 0
}
A bucket object covering whole start/end time range
Properties¶
| Name | Type | Required | Restrictions | Description |
|---|---|---|---|---|
| period | TimeRange | true | An object with the keys "start" and "end" defining the period. | |
| value | number,null | true | Value of the metric in the bucket. Null if no value |
ServiceStatsOverTimeResponse
{
"buckets": [
{
"period": {
"end": "2019-08-24T14:15:22Z",
"start": "2019-08-24T14:15:22Z"
},
"value": 0
}
],
"metric": "string",
"modelId": "string",
"segmentAttribute": "string",
"segmentValue": "",
"summary": {
"period": {
"end": "2019-08-24T14:15:22Z",
"start": "2019-08-24T14:15:22Z"
},
"value": 0
}
}
Properties¶
| Name | Type | Required | Restrictions | Description |
|---|---|---|---|---|
| buckets | [ServiceStatsOverTimeBucket] | true | An array of bucket, representing service health stats of the deployment over time. | |
| metric | string | true | Name of the metric requested. | |
| modelId | string | false | The id of the model for which metrics are being retrieved. | |
| segmentAttribute | string,null | false | The name of the segment on which segment analysis is being performed. | |
| segmentValue | string,null | false | The value of the segmentAttribute to segment on. |
|
| summary | ServiceStatsOverTimeBucket | true | A bucket object covering whole start/end time range |
TimeRange
{
"end": "2019-08-24T14:15:22Z",
"start": "2019-08-24T14:15:22Z"
}
An object with the keys "start" and "end" defining the period.
Properties¶
| Name | Type | Required | Restrictions | Description |
|---|---|---|---|---|
| end | string,null(date-time) | false | End of the period to retrieve monitoring data, defaults to the next top of the hour. Note: this field only accepts top of the hour RFC3339 datetime strings, for example: 2019-08-01T00:00:00Z. |
|
| start | string,null(date-time) | false | Start of the period to retrieve monitoring data, defaults to 7 days ago from the end of the period. Note: this field only accepts top of the hour RFC3339 datetime strings, for example: 2019-08-01T00:00:00Z. |