# Wrangle large Snowflake datasets

> Wrangle large Snowflake datasets - Tips for improving interactivity and performance when wrangling
> large Snowflake datasets in Workbench.

This Markdown file sits beside the HTML page at the same path (with a `.md` suffix). It summarizes the topic and lists links for tools and LLM context.

Companion generated at `2026-05-06T18:17:09.969174+00:00` (UTC).

## Primary page

- [Wrangle large Snowflake datasets](https://docs.datarobot.com/en/docs/reference/data-ref/wrangle-snowflake.html): Full documentation for this topic (HTML).

## Sections on this page

- [Increase Snowflake warehouse size](https://docs.datarobot.com/en/docs/reference/data-ref/wrangle-snowflake.html#increase-snowflake-warehouse-size): In-page section heading.
- [Change the sampling method](https://docs.datarobot.com/en/docs/reference/data-ref/wrangle-snowflake.html#change-the-sampling-method): In-page section heading.
- [Reduce the sample size](https://docs.datarobot.com/en/docs/reference/data-ref/wrangle-snowflake.html#reduce-the-sample-size): In-page section heading.

## Related documentation

- [Reference documentation](https://docs.datarobot.com/en/docs/reference/index.html): Linked from this page.
- [Data reference](https://docs.datarobot.com/en/docs/reference/data-ref/index.html): Linked from this page.
- [choosing a sampling method](https://docs.datarobot.com/en/docs/workbench/nxt-workbench/dataprep/wrangle-data/build-recipe/build-recipe.html#configure-the-live-sample): Linked from this page.

## Documentation content

This page describes how to improve performance and interactivity when wrangling large Snowflake datasets in Workbench.

## Increase Snowflake warehouse size

Snowflake warehouse size specifies the compute resources available per cluster, therefore, increasing your warehouse size will reduce the time it takes to execute wrangling queries.

See the [Snowflake documentation on increasing warehouse size](https://docs.snowflake.com/en/user-guide/performance-query-warehouse-size).

## Change the sampling method

When generating the live wrangling preview, DataRobot, by default, retrieves a random sample from the source table. To reduce the time it takes to execute the query in Snowflake and display the preview, you can change the sampling method so DataRobot retrieves the First-N Rows instead.

For step-by-step instructions, see the documentation on [choosing a sampling method](https://docs.datarobot.com/en/docs/workbench/nxt-workbench/dataprep/wrangle-data/build-recipe/build-recipe.html#configure-the-live-sample).

## Reduce the sample size

To generate a live wrangling preview, DataRobot executes the query directly in Snowflake. By default, the preview uses 10000 random rows from the source table to generate insights, however, you can reduce the number of rows sampled to decrease the time it takes to execute the query in Snowflake.

This method is particularly helpful for wide (hundreds of features) and heavy (many long text features) datasets where 10000 rows may require significant resources and time to process.

For step-by-step instructions, see the documentation on [configuring the live sample](https://docs.datarobot.com/en/docs/workbench/nxt-workbench/dataprep/wrangle-data/build-recipe/build-recipe.html#configure-the-live-sample).
