# Data connections

> Data connections - Connect to an external data source to seamlessly browse, preview, and profile
> data, as well as initiate scalable data preparation for machine learning with push-down.

This Markdown file sits beside the HTML page at the same path (with a `.md` suffix). It summarizes the topic and lists links for tools and LLM context.

Companion generated at `2026-05-06T18:17:10.050679+00:00` (UTC).

## Primary page

- [Data connections](https://docs.datarobot.com/en/docs/workbench/nxt-workbench/dataprep/add-data/connect.html): Full documentation for this topic (HTML).

## Sections on this page

- [Connect to a data source](https://docs.datarobot.com/en/docs/workbench/nxt-workbench/dataprep/add-data/connect.html#connect-to-a-data-source): In-page section heading.
- [Configure the connection](https://docs.datarobot.com/en/docs/workbench/nxt-workbench/dataprep/add-data/connect.html#configure-the-connection): In-page section heading.
- [Select a dataset](https://docs.datarobot.com/en/docs/workbench/nxt-workbench/dataprep/add-data/connect.html#select-a-dataset): In-page section heading.
- [Edit a connection](https://docs.datarobot.com/en/docs/workbench/nxt-workbench/dataprep/add-data/connect.html#edit-a-connection): In-page section heading.
- [Connection support for Wrangling and SQL Editor](https://docs.datarobot.com/en/docs/workbench/nxt-workbench/dataprep/add-data/connect.html#connection-support-for-wrangling-and-sql-editor): In-page section heading.

## Related documentation

- [NextGen UI documentation](https://docs.datarobot.com/en/docs/workbench/index.html): Linked from this page.
- [Workbench](https://docs.datarobot.com/en/docs/workbench/nxt-workbench/index.html): Linked from this page.
- [Data preparation](https://docs.datarobot.com/en/docs/workbench/nxt-workbench/dataprep/index.html): Linked from this page.
- [Add data](https://docs.datarobot.com/en/docs/workbench/nxt-workbench/dataprep/add-data/index.html): Linked from this page.
- [data preparation capabilities](https://docs.datarobot.com/en/docs/workbench/nxt-workbench/dataprep/wrangle-data/index.html): Linked from this page.
- [considerations](https://docs.datarobot.com/en/docs/workbench/nxt-workbench/dataprep/data-faq/index.html#add-data): Linked from this page.
- [source IPs](https://docs.datarobot.com/en/docs/reference/data-ref/allowed-ips.html): Linked from this page.
- [saved credentials for the selected data source](https://docs.datarobot.com/en/docs/platform/acct-settings/stored-creds.html#credentials-management): Linked from this page.
- [supported data stores](https://docs.datarobot.com/en/docs/reference/data-ref/data-sources/index.html): Linked from this page.
- [configure automatic downsampling](https://docs.datarobot.com/en/docs/workbench/nxt-workbench/dataprep/wrangle-data/pub-recipe.html#configure-downsampling): Linked from this page.
- [Edit a connection](https://docs.datarobot.com/en/docs/platform/acct-settings/nxt-data-connect.html#edit-a-connection): Linked from this page.

## Documentation content

In Workbench, you can easily configure and reuse secure connections to predefined data sources, allowing you to interactively browse, preview, and profile your data before using DataRobot's integrated [data preparation capabilities](https://docs.datarobot.com/en/docs/workbench/nxt-workbench/dataprep/wrangle-data/index.html).

See the associated [considerations](https://docs.datarobot.com/en/docs/workbench/nxt-workbench/dataprep/data-faq/index.html#add-data) for important additional information.

**Q: Data store vs. data connection**

In DataRobot, `data store` and `data connection` are used interchangeably, and both refer to where your data is stored and managed as well as the dynamic link to that data. You will see both terms in the UI.

> [!WARNING] Source IP addresses for allowing
> Before setting up a data connection, make sure the [source IPs](https://docs.datarobot.com/en/docs/reference/data-ref/allowed-ips.html) have been allowed.

**Data connector limitation for vector databases**

Data ingested via data connectors (such as Google Drive or SharePoint) is stored in the File Registry and cannot be used as VDB metadata. All metadata files must be uploaded to Datasets storage in the Data Registry, as assets contained in Files storage are not validated by the EDA process. If a vector database requires metadata for filtering, access control, or downstream retrieval logic, ensure that the metadata is provided as a Dataset in the Data Registry rather than sourced from a data connector.

## Connect to a data source

Creating a data connection lets you explore external source data—from both connectors and JDBC drivers—and then add it to your Use Case. The Browse data modal only lists connections that support structured data.

To create a data connection:

1. From theData assetstile, clickAdd data > Browse datain the upper-right corner, opening theBrowse datamodal.
2. Click+ Add connection.
3. Choose eitherStructuredfor connections that support adding structured data, orUnstructuredfor connections that support unstructured data (only available during VDB creation). Then, select a data store. Now, you canconfigure the data connection.

### Configure the connection

> [!NOTE] Note
> When configuring your data connection, configuration types, authentication options, and required parameters are based on the selected data source. The example below shows how to configure Snowflake with OAuth using new credentials.

To configure the data connection:

1. With theConnection Configurationtab selected in theEdit Connectionmodal, choose a configuration method—eitherParametersorJDBC URL.
2. Enter the required parameters for the selected configuration method.
3. ClickNew Credentialsand select an authentication method—the available authentication methods are based on the selected connection. Saved credentialsIf you previouslysaved credentials for the selected data source, clickSaved credentialsand select the appropriate credentials from the dropdown.
4. ClickSavein the upper right corner. If your browser window is small, you may need to scroll up. If you selected OAuth as your authentication method, you will be prompted to sign in before you canselect a dataset. See the list ofsupported data storesfor more information about supported authentication methods and required parameters.

### Select a dataset

Once you've set up a data connection, you can add datasets by browsing the [database schemas](https://www.ibm.com/topics/database-schema) and tables you have access to.

To select a dataset:

1. Select the schema associated with the table you want to add.
2. Select the box to the left of the appropriate table. With a dataset selected, you can: ElementDescription1Add to Use CaseAdds the data asset to your Use Case, making it available to you and other team members.2Add from SQL queryAllows you to use SQL queries to add data.3SettingsAllows you to show, hide, and/or pin columns.4Actions menuProvides access to the following actions:Preview: Open a snapshot preview to help determine if the dataset is relevant to your Use Case and/or if it needs to be modified in either Wrangler or the SQL Editor.Open in Wrangler: Perform data preparation before adding the asset to your Use Case.Open in SQL Editor: Create a recipe comprised of SQL queries that enrich, transform, shape, and blend datasets together. Large datasetsIf you want to decrease the size of the dataset before adding it to your Use Case, click Wrangle. When you publish a recipe, you canconfigure automatic downsamplingto control the number of rows when Snowflake materializes the output dataset.
3. ClickAdd to Use Case, and then choose a snapshot policy by adding either dynamic data (Add as dynamic dataset) or a snapshot of the dataset (Add as snapshot). To go back without adding data, clickContinue browsing.

## Edit a connection

To modify an existing  data connection from the Browse data modal, hover over the connection and click the edit icon. For more information, see [Edit a connection](https://docs.datarobot.com/en/docs/platform/acct-settings/nxt-data-connect.html#edit-a-connection). From this modal, you can also [delete a connection](https://docs.datarobot.com/en/docs/platform/acct-settings/nxt-data-connect.html#delete-a-connection).

## Connection support for Wrangling and SQL Editor

You can connect to and add data from all connectors and JDBC drivers that are currently supported in DataRobot. For a full list of supported data stores, see [Supported data stores](https://docs.datarobot.com/en/docs/reference/data-ref/data-sources/index.html).

Note that Snowflake, BigQuery, and Databricks connections use pushdown wrangling—all other connections use Spark wrangling.

The table below highlights the capabilities supported by each wrangling method:

| Wrangling method | Snapshot datasets | Dynamic datasets | Live preview | Wrangling | In-source materialization |
| --- | --- | --- | --- | --- | --- |
| Pushdown wrangling: Snowflake, BigQuery, Databricks |  | ✔ | ✔ | ✔ | ✔ |
| Spark wrangling: snapshots uploaded from local files, public URLs, all supported connections | ✔ |  |  | ✔ |  |

> [!NOTE] JDBC driver capabilities
> You can only add snapshot datasets from a JDBC driver connection.
