Skip to content

Click in-app to access the full platform documentation for your version of DataRobot.

MS Azure Data Lake Storage Gen2 (ADLS Gen2) Connector for Data Prep

User Persona: Data Prep User, Data Prep Admin, Data Source Admin, or IT/DevOps

Note

This document covers all configuration fields available during connector setup. Some fields may have already been filled out by your Administrator at an earlier step of configuration and may not be visible to you. For more information on Data Prep's connector framework, see Data Prep Connector setup. Also, your Admin may have named this connector something else in the list of Data Sources.

Configure Data Prep

This connector allows you to connect to Azure Data Lake Storage Gen2 for import and export. The following fields are used to define the connection parameters.

General

  • Name: Name of the data source as it will appear to users in the UI.

  • Description: Description of the data source as it will appear to users in the UI.

Tip

You can connect Data Prep to multiple Azure Data Lake Storage Gen2 accounts. Using a descriptive name can be a big help to users in identifying the appropriate data source.

Azure Data Lake Storage Gen2 Configuration

  • Data Store Root Directory: The apparent root path accessible by this connector. Use "/" to access all files in the file system.

  • Azure Storage Account Name: The Subdomain Name of your unique Azure URL. Storage account names must be between 3 and 24 characters in length and may contain numbers and lowercase letters only. Your storage account name must be unique within Azure. No two storage accounts can have the same name.

  • File System Name: The name of the file system within the storage account. This is sometimes called the "container" name.

Azure Data Lake Storage Gen2 Authentication Settings

From the drop-down, select the preferred authentication method for ADLS Gen2 storage and fill out the required fields.

  • Storage Account Access Key: Enter the Storage Account Access Key in the field. This is sometimes referred to as a “Shared Key”.
  • Active Directory Username/Password: Enter the Azure Directory username and password associated with your account.

Note

You must grant access for Data Prep to read and write data within your Microsoft account, otherwise, you will get an error while attempting to connect. To grant access, click on the ‘Test Data Source’ button in the Data Source set-up panel and follow the ‘Grant Access’ link. This will bring you to your Microsoft account where you can log in and grant access. Then, come back to Data Prep to continue.

Data Import Information

Via Browsing

The connector will present a browsable directory hierarchy starting at the location defined in the Data Store Root Directory field.

Via SQL Query

Not Supported

FAQ/Troubleshooting/Common Issues

Can we have both ADLS Gen1 and ADLS Gen2 connectors in the same Data Prep account?

Yes. The two connectors can coexist and will not interfere with each other.


Updated October 28, 2021
Back to top