# Build a wrangling recipe

> Build a wrangling recipe - Configure and analyze samples in preparation for data wrangling.

This Markdown file sits beside the HTML page at the same path (with a `.md` suffix). It summarizes the topic and lists links for tools and LLM context.

Companion generated at `2026-05-06T18:17:10.053567+00:00` (UTC).

## Primary page

- [Build a wrangling recipe](https://docs.datarobot.com/en/docs/workbench/nxt-workbench/dataprep/wrangle-data/build-recipe/build-recipe.html): Full documentation for this topic (HTML).

## Sections on this page

- [Modify wrangling settings](https://docs.datarobot.com/en/docs/workbench/nxt-workbench/dataprep/wrangle-data/build-recipe/build-recipe.html#modify-wrangling-settings): In-page section heading.
- [Edit the recipe metadata](https://docs.datarobot.com/en/docs/workbench/nxt-workbench/dataprep/wrangle-data/build-recipe/build-recipe.html#edit-the-recipe-metadata): In-page section heading.
- [Configure the live sample](https://docs.datarobot.com/en/docs/workbench/nxt-workbench/dataprep/wrangle-data/build-recipe/build-recipe.html#configure-the-live-sample): In-page section heading.
- [Analyze the live sample](https://docs.datarobot.com/en/docs/workbench/nxt-workbench/dataprep/wrangle-data/build-recipe/build-recipe.html#analyze-the-live-sample): In-page section heading.
- [Read more](https://docs.datarobot.com/en/docs/workbench/nxt-workbench/dataprep/wrangle-data/build-recipe/build-recipe.html#read-more): In-page section heading.

## Related documentation

- [NextGen UI documentation](https://docs.datarobot.com/en/docs/workbench/index.html): Linked from this page.
- [Workbench](https://docs.datarobot.com/en/docs/workbench/nxt-workbench/index.html): Linked from this page.
- [Data preparation](https://docs.datarobot.com/en/docs/workbench/nxt-workbench/dataprep/index.html): Linked from this page.
- [Prepare data](https://docs.datarobot.com/en/docs/workbench/nxt-workbench/dataprep/wrangle-data/index.html): Linked from this page.
- [Wrangler](https://docs.datarobot.com/en/docs/workbench/nxt-workbench/dataprep/wrangle-data/build-recipe/index.html): Linked from this page.
- [selecting a dataset from a data connection](https://docs.datarobot.com/en/docs/workbench/nxt-workbench/dataprep/add-data/connect.html#select-a-dataset): Linked from this page.
- [Data assetstile](https://docs.datarobot.com/en/docs/workbench/nxt-workbench/dataprep/explore-data/index.html): Linked from this page.
- [wrangling time series data](https://docs.datarobot.com/en/docs/workbench/nxt-workbench/dataprep/wrangle-data/build-recipe/ts-wrangling.html): Linked from this page.
- [summary statistics](https://docs.datarobot.com/en/docs/classic-ui/data/analyze-data/histogram.html): Linked from this page.

## Documentation content

When you click Wrangle, DataRobot pulls a uniform random sample of 10000 rows and calculates exploratory data insights on that sample, all while connected to your data source. Then, you build a recipe of operations you want to apply to the entire dataset—the transformations are first applied to the live sample to make sure it's being done correctly. When the recipe is ready to be published, it's pushed down to the data source where it's executed to materialize an output dataset.

You can launch the data wrangler from the following areas in a Use Case:

- When selecting a dataset from a data connection , click Open in Wrangler in the top-right corner.
- On the Data assetstile , from the Actions menu next to a dataset.
- On the data explore page , from the Data actions dropdown.

## Modify wrangling settings

In a recipe, you can modify the settings to make the summary information more descriptive for future use, as well as the number of rows included in the live preview.

### Edit the recipe metadata

By default, DataRobot assigns a name and description to each wrangling recipe based on the source data, however, you can modify this information to make it more applicable to your specific use case.

To edit the recipe metadata, click the Info tile on the right.

Then, click on the field you want to edit—either the title or the description. Edit the field and when you're done, you can:

- Click the check mark ✔ or outside of the field to save your changes.
- Click the X to revert your changes.

### Configure the live sample

By default, DataRobot retrieves 10000 random rows for the live sample, however, you can modify this number and sampling method in the wrangling settings. Note that the more rows you retrieve, the longer it will take to render the live sample.

To configure the live sample:

1. ClickSettingsin the right panel and openPreview sample.
2. Select aSampling method. Use the dropdown to select eitherRandom,First-N Rows, orNo sampling, or forwrangling time series data,Date/time.
3. Specify theNumber of rowsto be retrieved from the source data. Enter the number of rows (under 10000) you want to include in the live sample and clickResample. The live sample updates to display the specified number of rows.

## Analyze the live sample

During data wrangling, DataRobot performs exploratory data analysis on the live sample, generating table- and column-level [summary statistics](https://docs.datarobot.com/en/docs/classic-ui/data/analyze-data/histogram.html) and [visualizations](https://docs.datarobot.com/en/docs/classic-ui/data/analyze-data/histogram.html#histogram-chart) that help you profile the dataset and recognize data quality issues as you apply operations. For more information on interacting with the live sample, see the section on [exploratory data insights](https://docs.datarobot.com/en/docs/workbench/nxt-workbench/dataprep/explore-data/index.html#explore-data).

Note that if you select No sampling as the sampling method, DataRobot processes the full dataset during the wrangling session, which can significantly slow down preview generation.

If you choose to work with a large number of rows during the wrangling session, the total row count is displayed at the bottom of the page, however, insights are only calculated based on the first 100000 rows. Disabling sampling, as well as operations, including cartesian joins and one-to-many/many-to-many join conditions in the inner or left join can all cause large preview results.

**Speed up live sample**

To reduce the time it takes to retrieve and render the live sample, use the toggle next to Show Insights to hide the feature distribution charts.

**Q: Live sample vs. exploratory data insights on the Data assets tile**

Although both pages provide similar insights, you can specify the number of rows displayed in the live sample and it updates each time a transformation is added to your recipe.

## Read more

To learn more about the topics discussed on this page, see:

- Description of summary statistics and histograms in DataRobot Classic.
