ML model monitoring

Monitor inference volume, latency, and drift for each deployed ML model.

Before you start

Per-model usage (recommended)

You will see:

Summary cards — total inferences, requests, average response time, endpoints used
Charts — daily inference volume and response time, split by Heimdall UI vs API
Request log — sortable table with endpoint, inference count, response time, channel, user agent, and drift % when available
Filters — 7 / 30 / 90 day windows and endpoint filter

Use the request log to debug integration issues (wrong features, auth errors showing as zero traffic, latency spikes on specific routes).

Open Usage (/usage) → Data Intelligence for workspace-wide trends:

See Production monitoring for the full Usage page walkthrough.

Field	Meaning
Endpoint	REST path called (typically predict)
Inference count	Number of predictions in one request
Response time	Milliseconds to complete
Channel	`Heimdall UI` or `API`
Drift %	Performance drift indicator when enabled
User agent	Client string when present