Remove rows

As you prepare your data in Data Prep, you may want to keep only a specific subset of it. To do so, remove the rows that don’t meet your needs.

Work with the remove tool

To access the remove tool, click remove in the project Tools bar.

The following table describes the elements you work with when you remove rows from your project:

Element | Description
Remove tool | Click remove to access the Remove rows pane and remove rows from your project.
Filters for Remove Rows pane | Filters your data down to the subset that you want to isolate for the remove operation. To access this pane, click Filters on the top right.
Data Preview pane | Displays the data in your project and how it changes as you prep it.

Remove rows

To remove rows from your data:

  1. Click the Filters link on the top right to add a Filtergram and isolate the rows you want to remove.

    The Data Preview displays the records that match your filter criteria. See Data Filtergrams for details on working with Filtergrams.

  2. Click remove in the Tools bar.

    The Filters on the Current Dataset pane becomes the Filters for Remove Rows pane.

  3. Click Save.

    The filtered rows are removed. The Data Preview is now blank because the filter is still active and the rows it matches are no longer in the dataset.

  4. To display your updated dataset, do one of the following:

    • On the Filtergram, click x clear.
    • Close the Filtergram.

    The Filtergram is cleared. The Data Preview displays the updated data.
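The steps above can be pictured as a filter followed by keeping its complement. The following is a minimal Python sketch of that logic only. It is not a DataRobot or Data Prep API: the `rows` list, the `matches_filter` function, and the sample values are hypothetical stand-ins for a project dataset and a Filtergram selection.

```python
# Hypothetical in-memory stand-in for a Data Prep project (not a DataRobot API).
rows = [
    {"id": 1, "state": "CA"},
    {"id": 2, "state": "NY"},
    {"id": 3, "state": "CA"},
]


def matches_filter(row):
    """Stand-in for a Filtergram selection (assumed here: state == 'CA')."""
    return row["state"] == "CA"


# Step 1: the filter isolates the rows you want to remove.
preview = [r for r in rows if matches_filter(r)]

# Steps 2-3: removing keeps every row the filter does NOT match.
rows_after_remove = [r for r in rows if not matches_filter(r)]

# The filter is still active, so the preview of the updated data is empty.
preview_filter_active = [r for r in rows_after_remove if matches_filter(r)]

# Step 4: clearing the filter shows the updated dataset.
preview_filter_cleared = list(rows_after_remove)
```

In this sketch, `preview_filter_active` is empty for the same reason the Data Preview is blank after you click Save: the active filter only matches rows that no longer exist.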

Tip

Publish the data from the removed rows to a separate AnswerSet for later reference. See Capture data from removed rows for details.

Note

If you update or add to your dataset after removing rows, the Remove Rows Step is applied to the new data as follows:

  • If you selected histogram ranges or individual values to remove rows, rows in the new data are removed only if they satisfy that exact condition.
  • If you selected rows with a string search or a dynamic percentile range, the tool recomputes the selection based on the new data.
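The difference between the two behaviors in this note can be illustrated with a short Python sketch. It is an analogy under stated assumptions, not how Data Prep is implemented: `remove_static`, `remove_dynamic`, and the nearest-rank `percentile` helper are hypothetical names, and the percentile definition is chosen only for simplicity.

```python
def percentile(values, pct):
    """Nearest-rank percentile (an assumed, simplified definition)."""
    ordered = sorted(values)
    # Integer ceiling of pct * n / 100, with a minimum rank of 1.
    rank = max(1, (pct * len(ordered) + 99) // 100)
    return ordered[rank - 1]


def remove_static(values, low, high):
    """Fixed range: the bounds stay the same when data is added."""
    return [v for v in values if not (low <= v <= high)]


def remove_dynamic(values, pct):
    """Percentile range: the cutoff is recomputed from the current data."""
    cutoff = percentile(values, pct)
    return [v for v in values if v < cutoff]


original = [10, 20, 30, 40, 50]
updated = original + [60, 70, 80, 90, 100]

# On the original data, both selections remove the same rows (40 and 50).
static_original = remove_static(original, 40, 50)
dynamic_original = remove_dynamic(original, 80)

# On the updated data, the fixed range still removes only 40 and 50,
# while the percentile cutoff moves from 40 to 80.
static_updated = remove_static(updated, 40, 50)
dynamic_updated = remove_dynamic(updated, 80)
```

The fixed range behaves like a histogram range or individual value selection, and the recomputed cutoff behaves like a dynamic percentile range: the same Step can remove different rows once new data arrives.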

Capture data from removed rows

To publish the rows you removed to an AnswerSet, add a lens:

  1. From Tools, click steps.

    The Steps pane appears.

  2. Click the Step prior to the Remove Rows Step.

    The Data Preview displays the data as it was before the rows were removed.

  3. Add a Filtergram to isolate the rows you removed.

    The Data Preview displays the records that match the criteria of your filter.

  4. From Tools, click lens > new lens.

  5. Click Save.

    The lens is added to the project.

  6. To publish the lens, click Publish from the lens.
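Conceptually, this procedure views the data as of the Step before the removal and applies the same filter there. The Python sketch below models that idea only. It is not a DataRobot API: `steps`, `data_at_step`, `is_removed`, and the sample rows are hypothetical, and the `removed_rows` list merely stands in for what the lens would publish.

```python
# Hypothetical model of a project's Steps (illustrative names only).
def is_removed(row):
    """Stand-in for the filter used in the Remove Rows Step."""
    return row["status"] == "inactive"


source = [
    {"id": 1, "status": "active"},
    {"id": 2, "status": "inactive"},
    {"id": 3, "status": "active"},
]

# Each Step is a (name, function) pair applied in order.
steps = [
    ("import", lambda rows: list(rows)),
    ("remove rows", lambda rows: [r for r in rows if not is_removed(r)]),
]


def data_at_step(rows, steps, index):
    """Replay steps[0..index] to get the data as of that Step."""
    for _name, apply_step in steps[: index + 1]:
        rows = apply_step(rows)
    return rows


remove_index = 1

# Step 2 of the procedure: view the data just before the removal.
before_remove = data_at_step(source, steps, remove_index - 1)

# Step 3: the same filter isolates the rows that were removed.
removed_rows = [r for r in before_remove if is_removed(r)]

# The main project output is unaffected by capturing the removed rows.
final_rows = data_at_step(source, steps, remove_index)
```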


Updated October 28, 2021