# Data Quality Assessment

> Data Quality Assessment - The Data Quality Assessment capability automatically detects and surfaces
> common data quality issues and, often, handles them with minimal or no action on the part of the
> user.

This Markdown file sits beside the HTML page at the same path (with a `.md` suffix). It summarizes the topic and lists links for tools and LLM context.

Companion generated at `2026-05-21T05:20:22.787619+00:00` (UTC).

## Primary page

- [Data Quality Assessment](https://docs.datarobot.com/en/docs/reference/data-ref/data-assess-ref.html.md): Full documentation for this topic (Markdown sidecar).

## Sections on this page

- [Data Quality Assessment locations](https://docs.datarobot.com/en/docs/reference/data-ref/data-assess-ref.html.md#data-quality-assessment-locations): In-page section heading.
- [Explore the assessment](https://docs.datarobot.com/en/docs/reference/data-ref/data-assess-ref.html.md#explore-the-assessment): In-page section heading.
- [Isolate features with data quality issues](https://docs.datarobot.com/en/docs/reference/data-ref/data-assess-ref.html.md#isolate-features-with-data-quality-issues): In-page section heading.
- [View data quality checks](https://docs.datarobot.com/en/docs/reference/data-ref/data-assess-ref.html.md#view-data-quality-checks): In-page section heading.

## Related documentation

- [Reference documentation](https://docs.datarobot.com/en/docs/reference/index.html.md): Linked from this page.
- [Data reference](https://docs.datarobot.com/en/docs/reference/data-ref/index.html.md): Linked from this page.
- [EDA1](https://docs.datarobot.com/en/docs/reference/data-ref/eda-explained.html.md#eda1): Linked from this page.
- [Outliers](https://docs.datarobot.com/en/docs/reference/data-ref/data-quality-ref.html.md#outliers): Linked from this page.
- [Visual AI experiments](https://docs.datarobot.com/en/docs/classic-ui/modeling/special-workflows/visual-ai/index.html.md): Linked from this page.
- [Data Quality Handling Report](https://docs.datarobot.com/en/docs/classic-ui/modeling/analyze-models/describe/dq-report.html.md): Linked from this page.

## Documentation content

The Data Quality Assessment capability automatically detects and surfaces common data quality issues and, often, handles them with minimal or no action on the part of the user. The assessment not only saves time finding and addressing issues, but provides transparency into automated data processing (you can see the automated processing that has been applied). It includes a warning level to help determine issue severity.

As part of [EDA1](https://docs.datarobot.com/en/docs/reference/data-ref/eda-explained.html.md#eda1), DataRobot runs checks on features that don’t require date/time and/or target information. Once EDA2 starts, DataRobot runs additional checks. In the end, the following checks are run:

**Baseline checks:**
DataRobot always runs the following baseline data quality checks:

Outliers
Multicategorical format errors
Inliers
Excess zeros
Disguised missing values
Target leakage
Missing images
(Visual AI experiments)

**Time series checks:**
Time series experiments run all the baseline data quality checks as well as checks for:

Imputation leakage
Pre-derived lagged features
Irregular time steps
(inconsistent gaps)
Leading or trailing zeros
Infrequent negative values
New series in validation

**Visual AI checks:**
The [Visual AI experiments](https://docs.datarobot.com/en/docs/classic-ui/modeling/special-workflows/visual-ai/index.html.md) Data Quality Assessment runs the same baseline checks and an additional missing image check:

Missing images


**Related reference**

To learn more about the topics discussed on this page, see:

- EDA explained: Detailed descriptions of how DataRobot processes EDA.
- Data quality checks: Detailed descriptions of each data quality check, as well as a summary of the logic behind each one.
- Feature considerations : Important additional information about data quality.

## Data Quality Assessment locations

The Data Quality Assessment provides information about data quality issues that are relevant to your stage of model building. Initially run as part of EDA1 (data ingest), the results report on the All Features list. It runs again and updates after EDA2, displaying information for the selected feature list (or, by default, All Features). For checks that are not applicable to individual features (for example, Inconsistent Gaps), the report provides a general summary.

You can access a Data Quality Assessment from two areas in Workbench:

**Data/EDA1:**
In a Workbench Use Case, open a dataset and select either the Data preview or Features tile. Then, click Show summary. This assessment displays data quality checks surfaced during EDA1.

**Experiment/EDA2:**
In a Workbench Use Case, open an experiment and select either the Data preview or Features tile. Then, click Show summary. This assessment displays data quality checks surfaced during EDA2.

Once model building completes, you can view the [Data Quality Handling Report](https://docs.datarobot.com/en/docs/classic-ui/modeling/analyze-models/describe/dq-report.html.md) for additional imputation information.

> [!NOTE] Identify target leakage
> When EDA2 is calculated, [DataRobot checks for target leakage](https://docs.datarobot.com/en/docs/reference/data-ref/data-quality-ref.html.md#target-leakage), which refers to a feature whose value cannot be known at the time of prediction, leading to overly optimistic models. A badge is displayed next to these features so that you can easily identify and exclude them from any new feature lists.
> 
> [https://docs.datarobot.com/en/docs/images/targ-leak-badge.png](https://docs.datarobot.com/en/docs/images/targ-leak-badge.png)


## Explore the assessment

To view the Data Quality Assessment from one of the areas listed in the [previous section](https://docs.datarobot.com/en/docs/reference/data-ref/data-assess-ref.html.md#data-quality-assessment-locations), click Show summary (unless it is already open, then the button displays Hide summary).

Then, click Show details to open a detailed report.

Each data quality check provides issue status flags, a short description of the issue, and a recommendation message, if appropriate:

| Status | Description |
| --- | --- |
| Warning | Attention or action required |
| Informational | No action required |
| Passing | No issue detected |

## Isolate features with data quality issues

From within the assessment modal, you can filter by issue type to see which features triggered the checks. Toggle on Show only affected features and check boxes next to the check names to select which checks to display:

DataRobot then displays only features violating the selected data quality checks, and within the selected feature list. You can hover on an icon for more detail.

For multilabel and Visual AI experiments, Preview Log displays at the top if the assessment detects [multicategorical format errors](https://docs.datarobot.com/en/docs/reference/data-ref/data-quality-ref.html.md#multicategorical-format-errors) or [missing images](https://docs.datarobot.com/en/docs/reference/data-ref/data-quality-ref.html.md#missing-images) in the dataset. Click Preview Log to open a window with a detailed view of each error, so you can more easily find and fix them in the dataset.

## View data quality checks

To check individual features for data quality issues:

1. From the Use Case, click on the dataset or experiment you want to view.
2. Open theFeaturestile on the left. TheData qualitycolumn indicates if DataRobot detected a data quality issue with the feature.
3. Hover over the icon to learn which check failed. You can then use the exploratory data insights to correct them.

Because the results are feature-list based, it is possible that if you change the selected feature list, new checks will appear or current checks will disappear from the assessment. For example, if feature list `List 1` contains a feature `problem`, which contains outliers, the outliers check will show in the assessment. If you change lists to `List 2` which does not include `problem` (or any other feature with outliers), the outliers check will report "no issue".
