API Reference
Get Model Metrics
GET
/
v1
/
models
/
{model_id}
/
metrics
This endpoint returns a JSON object that describes the metrics associated with a model given its ID.
Request
model_id
string
requiredSieve model ID
limit_min
number
default: "60"The duration (in minutes) of historical metrics to retrieve, ending at the current time
Response
Responses are organized by key as seen below. The value
key provides a list of timestamps and values for each metric.
model_requests
dict
The number of requests sent to a model at any given time.
queue_length
dict
The length of the queue for a model at any given time.
replica_count
dict
The number of replicas of the model that are up at any given time.
Was this page helpful?