Skip to content

On-premise users: click in-app to access the full platform documentation for your version of DataRobot.

Data

The Data Registry is a centralized hub for managing datasets in NextGen, allowing you to easily find, share, explore, and reuse data. Any dataset that you've added directly to the registry, you've linked to a Use Case, has been shared with you, or someone has added to a Use Case you are a member of, is displayed here. The Data Registry provides easy access to the data needed to answer a business problem while ensuring security, compliance, and consistency.

The Data Registry is comprised of two key functions:

  • Ingest: Data is imported into DataRobot and sanitized for use throughout the platform.
  • Storage: Reusable data assets are stored, accessed, and shared—allowing you to share data without sharing projects, decreasing risks and costs around data duplication.

The Data Registry also supports data security and governance, which reduces friction and speeds up model adoption, through selective addition to the Registry, role-based sharing, and an audit trail.

The following topics describe how to work with data in the Data page:

Topic Description
Add data Import data to DataRobot using a data connection, local file, or URL.
Manage data assets View, share, and delete data from the Data page.
Explore registry data For individual data assets, explore a dataset preview, metadata, and insights, as well as version history and related activity.

Feature considerations

The following is not supported by the Data Registry page:

  • Saving and editing blueprints for Composable ML.
  • Data preparation with Spark SQL.
  • Data preparation for time series datasets.

Updated March 5, 2025