DataRobot Connector for Data Prep¶
User Persona: Data Prep User, Data Prep Admin, or Data Source Admin
This document covers all configuration fields available during connector setup. Some fields may have already been filled out by your Administrator at an earlier step of configuration and may not be visible to you. For more information on Data Prep's connector framework, see Data Prep Connector setup. Also, your Admin may have named this connector something else in the list of Data Sources.
Configure Data Prep¶
This connector allows you to connect to DataRobot for Library imports and exports. This connection also allows you to create and access a DataRobot project directly from Data Prep.
The following fields are used to define the connection parameters.
Name: Name of the data source as it will appear to users in the UI.
Description: Description of the data source as it will appear to users in the UI.
You can connect Data Prep to multiple DataRobot accounts. Using a descriptive name can be a big help to users in identifying the appropriate data source.
- Server URL: The server URL for DataRobot. For example: https://app.datarobot.com
Authentication Type: Select the authentication type to use:
- API Key
- Key: The DataRobot API Key
Email: Email or username for authenticating with DataRobot.
Password: Password for authenticating with DataRobot.
- Note: Multi-factor authentication is not supported and will result in an error.
- API Key
Data Import & Export Information¶
- To import from the AI Catalog, select the AI Catalog option to view all available datasets and select the desired dataset to see a preview and adjust the import settings.
If during import you receive an error stating you do not have permissions to download datasets from the AI Catalog, you need to adjust your settings in DataRobot. Login to DataRobot and click the user icon at the top-right → Settings → Optional Products → check Enable AI Catalog Downloads → Save. Then come back to DataRobot Data Prep to continue.
To export from the AI Catalog, select the AI Catalog option, then click Select. Name the dataset and click Export.
To export a dataset directly to DataRobot and create a project in one step, see Create a DataRobot project.
Via SQL Query¶
What if I export the model I’ve generated in DataRobot and want to run that code where my data lives?¶
Data Prep has over 50 other Connectors and can likely still send the prepped data to the appropriate location. If Data Prep does not support Connectivity to the service/storage location you require, please reach out to your Customer Success Representative.
Why can’t I import my dataset?¶
Issue 1: Data Prep has designed the integration with DataRobot’s AI Catalog to only support the importing of “Snapshotted” datasets. The data contained in “Not snapshotted” datasets are not actually stored in DataRobot and are retrieved upon usage. In Data Prep’s case, that would mean DataRobot would fully import a dataset from the data source and only then would Data Prep begin importing that dataset. For “Not snapshotted” datasets, it’s much more efficient to pull the data directly from the data source into Data Prep. To determine if your dataset is a “Snapshot”, go to the AI Catalog, select the dataset in question and look at the “Status” in the right-hand panel.
Issue 2: If you receive an error stating you do not have permissions to download datasets from the AI Catalog, you need to adjust your settings in DataRobot. Login to DataRobot and click the user icon at the top-right > Settings > Optional Products > check Enable AI Catalog Downloads > Save. Then come back to DataRobot Data Prep to continue.
Issue 3: If you receive an error stating "Mapping for
not found, expected one of [ ]", you need to adjust your settings in DataRobot. Login to DataRobot and click the user icon at the top-right > Settings > CSV export >uncheck Include BOM > Save. Then come back to DataRobot Data Prep to continue.
When I export a new version of my dataset, does it appear as such in the AI Catalog?¶
Yes, versions of datasets with the same name will appear under the Version History tab of the AI Catalog, rather than as a new dataset.
Requirements for data exports to the AI Catalog¶
Datasets exported to DataRobot must meet the following criteria:
- At least 100 rows
- At least 2 columns
- Have valid column names