Monitor usage¶
The Usage report (self-managed only) and Usage Explorer (SaaS only) provide administrators visibility into an organization's resource usage. Monitoring these resources over time can help indicate whether the organization has the correct number of resources to meet usage needs.
Usage report¶
Self-managed only
The Usage Report is only available for system admins on self-managed installations.
The Usage report provides system admins visibility into the organization's graphics processing unit (GPU) and central processing unit (CPU) usage across the platform.
The following services are tracked in the Usage Report:
- Modeling
- Inference
- NVIDIA AI Enterprise
- Vector Database Creation
- GenAI Playground
- Custom Models
- Moderations
- Data Management
- Predictions
To access the Usage Report, open Admin settings > Usage Report.
From here, you can view resource consumption for a given date range, as well as export the report as a .csv file.
| Element | Description | |
|---|---|---|
| 1 | Date range selector | Use the dropdown to display usage information for a specific time period. |
| 2 | CPU Usage | View CPU Usage information, including max aggregate usage for all groups and a usage over time chart. |
| 3 | GPU Usage | View GPU Usage information, including max aggregate usage for all groups and a usage over time chart. |
| 4 | Download | Download the report as a .csv file. |
CPU usage¶
The CPU Usage section provides an overview of central processing unit (CPU) usage within the organization. This allows for easy monitoring of core usage, potentially helping to identify areas for optimization or budgetary concerns.
| Element | Description |
|---|---|
| Max aggregate usage for all groups | Total core usage for all users across the platform. |
| CPU utilization chart | A chart that displays total usage amount over the specified time period, as well as your organization's current license limit for core usage. Hover to over a point on the chart to view additional information. |
GPU usage¶
The GPU Usage section reports data on GPU usage, providing general usage information. GenAI features, for example, rely on GPU hardware for a range of workloads related to training, hosting, and running inference on LLMs.
| Element | Description |
|---|---|
| Max aggregate usage for all groups | Total core usage for all users across the platform. |
| GPU utilization chart | A chart that displays total usage amount over the specified time period, as well as your organization's current license limit for core usage. Hover over a point on the chart to to view additional information. |
Usage Explorer¶
SaaS only
The Usage Explorer is only available for org admins on multi-tenant SaaS installations.
The Usage Explorer provides org admins visibility into the organization's graphics processing unit (GPU), central processing unit (CPU), and large language model (LLM) API usage across the platform, providing general usage information broken down by service or user. Monitoring these resources over time can help indicate whether the organization has the correct number of resources to meet usage needs.
The following services are tracked in the Usage Explorer:
- Modeling
- Inference
- NVIDIA AI Enterprise
- Vector Database Creation
- GenAI Playground
- Custom Models
- Moderations
- Data Management
- Predictions
Note
The services displayed in the Usage Explorer may vary depending on the type of usage being viewed.
To access the Usage Explorer, open Admin settings > Usage Explorer.
From here, you can view resource consumption by user or service for a given date range, as well as export the report as a .csv file.
| Element | Description | |
|---|---|---|
| 1 | By services/By organizations/By users | View usage broken down by specific services, organizations (self-managed only), or users. |
| 2 | Date range selector | Use the two fields to display usage information for a specific date range. |
| 3 | Export | Download the report as a .csv file. |
| 4 | Usage options | Select the usage information to view—GPU, LLM, or CPU. |
Toggling the page to display By users allows you to view usage details for individual user accounts within the organization.
Toggling the page to display By organizations allows you to view usage details for individual organizations within the cluster.
Self-managed-only
The By organizations tab is only available for system admins on self-managed installations.
GPU usage¶
The GPU Usage page reports data on GPU usage, providing general usage information broken down by service or user. GenAI features, for example, rely on GPU hardware for a range of workloads related to training, hosting, and running inference on LLMs.
To access the GPU Usage page, click GPU Usage in the Usage Explorer.
This page consists of a table that displays the name of the service using the resources, as well as the following details about each task under that service:
| Field | Summary |
|---|---|
| Service | The service that used the GPU (e.g., Notebooks, Modeling, Vector Database Creation, etc.) |
| Description | The number of GPUs, as well as the number of CPUs and amount of RAM/VRAM, used by the task. |
| Usage quantity | The amount of time each task used the resources within the specified time period. |
LLM API usage¶
The LLM API Usage page reports data on which LLM models are being used by which services, as well as how much each model is being used. The page provides general usage information broken down by service or user, tracking token consumption for each task. This detailed monitoring helps identify which services are consuming the most LLM resources across various services.
To access the LLM API Usage page, click LLM API Usage in the Usage Explorer.
This page consists of a table that displays the name of the service using the resources, as well as the following details about each task under that service:
| Field | Summary |
|---|---|
| Service | The service that used the LLM API (e.g., Custom Models, GenAI Playground, etc.) |
| Description | The name of the LLM model called via the API. |
| Usage quantity | The number of tokens used by the task. |
CPU usage¶
The CPU Usage page provides an overview of central processing unit (CPU) usage within the organization, broken down by service or user. This allows for easy monitoring of exactly which users and services are consuming the most CPU resources, potentially helping to identify areas for optimization or budgetary concerns.
To access the CPU Usage page, click CPU Usage in the Usage Explorer.
This page consists of a table that displays the name of the service using the resources, as well as the following details about each task under that service:
| Field | Summary |
|---|---|
| Service | The service that used the CPU (e.g., Custom Inference Model, Data Management, etc.) |
| Description | The number of processors and amount of RAM used by the task. |
| Usage quantity | The amount of time each task used the CPU. |










