Skip to content

Wrangler

Wrangler is a graphic user interface (GUI) that allows you to clean and prepare data by building recipes comprised of one or more operations ( e.g., instructions to apply a specifed tranformation on the data).

When you start a wrangling session, DataRobot connects to your data source, pulls a live random sample, and performs exploratory data analysis on that sample. Then, as you add operations to your recipe, the transformations are applied to the sample and the exploratory data insights are recalculated, allowing you to quickly iterate on and profile your data before publishing.

Spark wrangling

For Spark wrangling, DataRobot pulls data from the ingested snapshots.

Note that when you wrangle a dataset in your Use Case, including re-wrangling the same dataset, DataRobot creates and saves a copy of the recipe in the Data assets tile regardless of whether or not you add operations to it. Each time you modify the recipe, your changes are automatically saved. Additionally, you can open saved recipes to continue making changes. All recipes created in Wrangler are preficed by Wrangling Recipe for unless manually changed while working with the recipe.

See the associated considerations for important information about wrangling data in DataRobot.

Topic Description
Build a wrangling recipe Modify wranglings settings and configure the live sample.
Add wrangling operations Build a recipe to interactively prepare data for modeling without moving it from your data source.
Time-aware wrangling Manually or automatically create a derivation plan for time series data.