Databricks (JDBC)¶
Connecting to Databricks using JDBC is currently certified through Azure.
Note
This page describes how to connect to the Databricks JDBC driver, not the native Databricks connector.
Databricks connection differences
When connecting to Databricks in DataRobot, there are three options to choose from: two JDBC drivers and a native connector.
Although DataRobot recommends using the native connector, review the table below before making a selection:
Connection type | Description | Best for | Not supported | |
---|---|---|---|---|
1 | JDBC driver | A JDBC driver certified for Databricks version 2.6.40. | Prediction write back to a remote database | Working with data in Workbench |
2 | Connector | A native connector and the recommended method of connecting to Databricks. | Browsing remote database, ingesting data, data preparation, working with data in Workbench | Prediction write back |
3 | JDBC driver | An older JDBC driver that was last certified for Databricks version 2.6.22. | Prediction write back to a remote database | Working with data in Workbench |
Supported authentication¶
- Access token
Prerequisites¶
The following is required before connecting to Databricks in DataRobot:
- A Databricks workspace in the Azure Portal app
- Data stored in an Azure Databricks database
Retrieve JDBC URL¶
In Microsoft Azure:
- Log into your Azure Databricks workspace.
- On the cluster's Configuration tab, expand Advanced options, click the JDBC/ODBC tab, and copy the JDBC URL.
See the Azure Databricks documentation.
Generate personal access token¶
In Microsoft Azure, generate a personal access token for your Databricks workspace. This token will be used to authenticate your connection to Databricks in DataRobot.
See the Azure Databricks documentation.
Set up a connection in DataRobot¶
To connect to Databricks in DataRobot:
-
Follow the instructions for creating a data connection using the appropriate Databricks driver and JDBC URL.
-
Click Test Connection, to open the Credentials window. Enter your access token and a display name to save your credentials in DataRobot.
Required parameters¶
The table below lists the minimum required fields to establish a connection with Databricks:
Required field | Description | Documentation |
---|---|---|
Connection configuration | ||
JDBC URL | Combination of authentication settings, any driver capability settings, and compute resource settings. | Azure Databricks documentation |
Credentials | ||
Access token | Token used to authenticate your connection to Databricks in DataRobot. | Azure Databricks documentation |
Troubleshooting¶
Problem | Solution | Instructions |
---|---|---|
When attempting to execute an operation in DataRobot, the firewall requests that you clear the IP address each time. | Add all allowed IPs for DataRobot. | See Allowed source IP addresses. If you've already added the allowed IPs, check the existing IPs for completeness. |