Skip to content

On-premise users: click in-app to access the full platform documentation for your version of DataRobot.

Start modeling

To build models in DataRobot, you first create a project by importing a dataset, selecting a target feature, and clicking Start to begin the modeling process. A DataRobot project contains all of the models built with the imported dataset. The following steps provide a quick overview of how to begin modeling data with DataRobot. Links within the steps point to the full documentation if you need assistance.

1: Create a new DataRobot project

Importing a dataset using any one of the methods on the new project page to create a new DataRobot project:

You can see the file type reference for information about file size limitations.

2: Configure modeling settings

To begin modeling, type the name of the target and configure the optional settings described below:

Element Description
1 What would you like to predict? Type the name of the target feature (the column in the dataset you would like to predict) or click Use as target next to the name in the feature list below.
2 No target? Click to build an unsupervised model.
3 Secondary datasets (Optional) Add a secondary dataset by clicking + Add datasets. DateRobot performs Feature Discovery and creates relationships to the datasets.
4 Feature list Displays the feature list to be used for training models.
5 Optimization Metric (Optional) Select an optimization metric to score models. DataRobot automatically selects a metric based on the target feature you select and the type of modeling project (i.e., regression, classification, multiclass, unsupervised, etc.).
6 Show advanced options Specify modeling options such as partitioning, bias and fairness, and optimization metric (click Additional).
7 Time-Aware Modeling Build time-aware models based on time features.

Scroll down to see the list of available features. (Optional) Select a Feature List to be used for model training. Click View info in the Data Quality Assessment area on the right to investigate the quality of features.

3: Start modeling

After specifying the target feature, you can select a Modeling Mode to instruct DataRobot to build more or fewer models and click Start to begin modeling:

Tip

For large datasets, see the section on early target selection.

Or, you can set a variety of advanced options to fine-tune your project's model-building process:

DataRobot prepares the project (EDA2) and starts running models. A progress indicator for running models is displayed in the Worker Queue on the right of the screen. Depending on the size of the dataset, it may take several minutes to complete the modeling process. The results of the modeling process are displayed in the model Leaderboard, with the best-performing models (based on the chosen optimization metric) at the top of the list.

4: Review model details

On the Leaderboard, click a model to display the model blueprint and access the many tabs available for investigating model information and insights.

5: Test predictions before deployment

You can test and generate predictions from any model manually without deploying to production via Predict > Make Predictions. Provide a dataset by drag-and-dropping a file onto the screen or use a method from the dropdown. Once data upload completes, click Compute Predictions to generate predictions for the new dataset and Download, when complete, to view the results in a CSV file.

Next steps

From here, you can:


Updated March 20, 2024