Fill columns

Use the Data Prep Fill operation to populate blank cells in a column based on the known values directly preceding or following the blanks. To fill a column, hover over the column operations menu, scroll to Fill in the drop-down, and then select the action you want. The Group and Sort by options are available for all Fill operations. Click Save to apply the filled-in values to your Project.

  • Fill Up: When a data point is missing, it will be filled in using the next available non-blank value.
  • Fill Down: When a data point is missing, it will be filled in using the most recently seen non-blank value.
  • Fill Average: When a data point is missing, it will be filled in with the average of the nearest previous and following non-blank values within a partition. Consecutive blanks will be filled with the same value because they share the same previous and following values.
  • Fill Linear Fit: When a series of data points is missing, the blanks will be filled in with values that lie on a straight line between the surrounding non-blank values. This differs from Fill Average, which assigns the same average value to every blank in the series. Fill Linear Fit instead spaces the filled values evenly between the surrounding values, according to the number of missing values.
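
The four behaviors described above can be sketched in plain Python. This is only an illustration of the definitions, not Data Prep's implementation: the function names are hypothetical, blanks are modeled as None, the whole list is treated as a single partition (no Group or Sort by), and blanks that lack a value on the required side are assumed to stay blank.

```python
from typing import List, Optional

Column = List[Optional[float]]  # None stands in for a blank cell


def fill_down(values: Column) -> Column:
    """Fill each blank with the most recently seen non-blank value."""
    out, last = [], None
    for v in values:
        if v is not None:
            last = v
        out.append(last)
    return out


def fill_up(values: Column) -> Column:
    """Fill each blank with the next available non-blank value."""
    return fill_down(values[::-1])[::-1]


def fill_average(values: Column) -> Column:
    """Fill each blank with the mean of its nearest non-blank neighbors."""
    prev, nxt = fill_down(values), fill_up(values)
    out = []
    for v, p, n in zip(values, prev, nxt):
        if v is not None:
            out.append(v)
        elif p is None or n is None:
            out.append(None)  # assumption: no neighbor on one side, stay blank
        else:
            out.append((p + n) / 2)
    return out


def fill_linear_fit(values: Column) -> Column:
    """Fill each run of blanks with evenly spaced values between its neighbors."""
    out = list(values)
    known = [i for i, v in enumerate(values) if v is not None]
    for left, right in zip(known, known[1:]):
        step = (values[right] - values[left]) / (right - left)
        for i in range(left + 1, right):
            out[i] = values[left] + step * (i - left)
    return out


column = [10.0, None, None, 40.0, None]
print(fill_down(column))        # [10.0, 10.0, 10.0, 40.0, 40.0]
print(fill_up(column))          # [10.0, 40.0, 40.0, 40.0, None]
print(fill_average(column))     # [10.0, 25.0, 25.0, 40.0, None]
print(fill_linear_fit(column))  # [10.0, 20.0, 30.0, 40.0, None]
```

In this example, Fill Average gives both interior blanks the value 25.0, while Fill Linear Fit gives them 20.0 and 30.0, which is the difference the last bullet describes.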

Note

Fill Average and Fill Linear Fit can only be applied to numeric column types.


Updated October 28, 2021