# About augmented models

> About augmented models - An overview of augmented modeling and how it supports the potential for
> smaller overall loss by improving the generalization of models on unseen data.

This Markdown file sits beside the HTML page at the same path (with a `.md` suffix). It summarizes the topic and lists links for tools and LLM context.

Companion generated at `2026-04-24T16:03:56.609132+00:00` (UTC).

## Primary page

- [About augmented models](https://docs.datarobot.com/en/docs/classic-ui/modeling/special-workflows/visual-ai/tti-augment/ttia-introduction.html): Full documentation for this topic (HTML).

## Sections on this page

- [Image augmentation](https://docs.datarobot.com/en/docs/classic-ui/modeling/special-workflows/visual-ai/tti-augment/ttia-introduction.html#image-augmentation): In-page section heading.
- [Performance](https://docs.datarobot.com/en/docs/classic-ui/modeling/special-workflows/visual-ai/tti-augment/ttia-introduction.html#performance): In-page section heading.
- [Data Drift](https://docs.datarobot.com/en/docs/classic-ui/modeling/special-workflows/visual-ai/tti-augment/ttia-introduction.html#data-drift): In-page section heading.
- [External resources](https://docs.datarobot.com/en/docs/classic-ui/modeling/special-workflows/visual-ai/tti-augment/ttia-introduction.html#external-resources): In-page section heading.

## Related documentation

- [Classic UI documentation](https://docs.datarobot.com/en/docs/classic-ui/index.html): Linked from this page.
- [Modeling](https://docs.datarobot.com/en/docs/classic-ui/modeling/index.html): Linked from this page.
- [Specialized workflows](https://docs.datarobot.com/en/docs/classic-ui/modeling/special-workflows/index.html): Linked from this page.
- [Visual AI](https://docs.datarobot.com/en/docs/classic-ui/modeling/special-workflows/visual-ai/index.html): Linked from this page.
- [Train-time image augmentation](https://docs.datarobot.com/en/docs/classic-ui/modeling/special-workflows/visual-ai/tti-augment/index.html): Linked from this page.
- [augmentation lists and transformation parameters](https://docs.datarobot.com/en/docs/classic-ui/modeling/special-workflows/visual-ai/tti-augment/ttia-lists.html): Linked from this page.
- [Before model building](https://docs.datarobot.com/en/docs/classic-ui/modeling/build-models/adv-opt/ttia.html): Linked from this page.
- [secondary dataset](https://docs.datarobot.com/en/docs/classic-ui/data/transform-data/feature-discovery/fd-overview.html): Linked from this page.
- [modify each needed blueprint](https://docs.datarobot.com/en/docs/classic-ui/modeling/special-workflows/cml/cml-blueprint-edit.html): Linked from this page.

## Documentation content

# About augmented models

Creating new training images by randomly transforming existing ones lets you build insightful projects with datasets that might otherwise be too small. In addition, any image project that uses augmentation has the potential for smaller overall loss, because augmentation improves how well models generalize to unseen data. Two terms are used throughout:

- Augmentation is the action taken on the image dataset.
- Transformations are the actions applied to an image.

After the process of augmentation, each image is transformed.
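
The distinction between augmentation (applied to the dataset) and transformations (applied to an image) can be sketched in a few lines of plain Python. This is only an illustration under stated assumptions: the images are tiny nested lists, the two flip functions stand in for a real transformation list, and the names `augment`, `new_images_per_original`, and `transformation_probability` are hypothetical rather than DataRobot's API.

```python
import random


def horizontal_flip(image):
    """Transformation: mirror each pixel row left to right."""
    return [list(reversed(row)) for row in image]


def vertical_flip(image):
    """Transformation: reverse the order of the pixel rows."""
    return [list(row) for row in reversed(image)]


# Hypothetical transformation list, for illustration only.
TRANSFORMATIONS = [horizontal_flip, vertical_flip]


def augment(dataset, new_images_per_original, transformation_probability, rng):
    """Augmentation: keep the originals and add randomly transformed copies."""
    augmented = list(dataset)
    for image in dataset:
        for _ in range(new_images_per_original):
            new_image = [list(row) for row in image]
            # Each transformation is applied independently with the given probability.
            for transform in TRANSFORMATIONS:
                if rng.random() < transformation_probability:
                    new_image = transform(new_image)
            augmented.append(new_image)
    return augmented


# Two 2x2 grayscale "images" standing in for a small image dataset.
dataset = [[[0, 1], [2, 3]], [[4, 5], [6, 7]]]
augmented = augment(
    dataset,
    new_images_per_original=1,
    transformation_probability=0.5,
    rng=random.Random(0),
)
print(len(dataset), "original images ->", len(augmented), "training images")
```

With one new image per original, the training set doubles in size while the original images are left untouched.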

For a general explanation of image augmentation, see the [albumentations](https://albumentations.ai/docs/introduction/image_augmentation/) documentation. This is the open-source library that helps power DataRobot's implementation of the augmentation feature.

This page provides a general overview of how to configure augmentation. The parameters themselves are detailed on the page about [augmentation lists and transformation parameters](https://docs.datarobot.com/en/docs/classic-ui/modeling/special-workflows/visual-ai/tti-augment/ttia-lists.html).

## Image augmentation

There are two places where you can configure the Train-Time Image Augmentation step:

- Before model building, in Advanced options.
- From the Leaderboard, after model building.

> [!NOTE] Note
> If you add a [secondary dataset](https://docs.datarobot.com/en/docs/classic-ui/data/transform-data/feature-discovery/fd-overview.html) with images to a primary tabular dataset, the augmentation options described above are not available. Instead, if you have access to Composable ML, you can [modify each needed blueprint](https://docs.datarobot.com/en/docs/classic-ui/modeling/special-workflows/cml/cml-blueprint-edit.html) by adding an image augmentation vertex directly after the raw image input (as the first vertex in the image branch) and configure augmentation from there.

### Performance

A key advantage of train-time image augmentation is that, because it is applied only during training, a model's prediction times are largely unchanged whether or not it was trained with augmentation. This lets you deploy models with better loss at no cost to prediction time.

Some performance notes:

- Benchmarking has shown that in a project where dataset rows are doubled with image augmentation, building in Autopilot will take about 50% longer.
- When image augmentation improves the LogLoss of a model, it improves it on average by approximately 10%, with a very large variance model-to-model and dataset-to-dataset.
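
As a rough worked example of the two notes above, the sketch below applies the benchmark figures to made-up baselines. It assumes the 10% LogLoss figure is a relative improvement, and the 60-minute build time and 0.50 LogLoss are hypothetical inputs. Given the large variance noted above, treat the results as ballpark figures, not predictions.

```python
# Figures from the performance notes; the baselines below are invented.
BUILD_TIME_FACTOR_WHEN_ROWS_DOUBLED = 1.5  # "about 50% longer"
AVERAGE_RELATIVE_LOGLOSS_GAIN = 0.10       # "approximately 10%", when it improves at all


def estimated_build_minutes(baseline_minutes):
    """Autopilot build time when augmentation doubles the dataset rows."""
    return baseline_minutes * BUILD_TIME_FACTOR_WHEN_ROWS_DOUBLED


def estimated_logloss(baseline_logloss):
    """LogLoss after an average-sized (assumed relative) improvement."""
    return baseline_logloss * (1 - AVERAGE_RELATIVE_LOGLOSS_GAIN)


print(estimated_build_minutes(60))        # hypothetical 60-minute baseline
print(round(estimated_logloss(0.50), 3))  # hypothetical 0.50 baseline LogLoss
```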

### Data Drift

Models trained with image augmentation are often more robust to data drift than models trained without it. Even so, do not use the transformations applied in image augmentation to anticipate future data drift. For example, suppose you are training a model to detect species of freshwater fish and you expect to apply it later to a different region with larger fish. The best approach is to collect data from that region and incorporate it into your dataset. If you instead apply the Scale transformation to your current dataset to simulate larger fish, the training images will contain larger fish, but when DataRobot scores the model against the validation or holdout partition, model performance will suffer because those partitions contain no larger fish. This makes it difficult to correctly evaluate your augmented model against other models on the Leaderboard, because your current training dataset is not representative of your future data.
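
The mismatch in the fish example can be made concrete with a toy simulation. This is a sketch under invented assumptions: fish are reduced to a single length value, the 10 to 30 cm range and the 1.0 to 1.5 scale factor are hypothetical, and no real model or DataRobot partitioning is involved.

```python
import random

rng = random.Random(42)

# Hypothetical fish lengths (cm) collected in the current region.
training_lengths = [rng.uniform(10, 30) for _ in range(1000)]
validation_lengths = [rng.uniform(10, 30) for _ in range(250)]

# Simulate "larger fish" with a Scale-like factor applied to training data only.
scaled_training = [length * rng.uniform(1.0, 1.5) for length in training_lengths]

# Validation is never augmented, so it has no examples of the simulated sizes.
largest_validation = max(validation_lengths)
unseen = sum(length > largest_validation for length in scaled_training)

share = unseen / len(scaled_training)
print(f"{share:.0%} of augmented training examples are larger than any validation fish")
```

Because the validation partition contains none of the simulated sizes, its score cannot show whether the model handles larger fish, which is why collecting data from the new region is the better approach.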

### External resources

Many research papers explain and provide evidence for the benefits of image augmentation for machine learning models, including improved performance and outcomes as well as greater robustness. Below is a sample of external resources:

- Chen, T., Kornblith, S., Norouzi, M., & Hinton, G. (2020, November). A Simple Framework for Contrastive Learning of Visual Representations.
- Krizhevsky, A., Sutskever, I., & Hinton, G. E. (2012). ImageNet Classification with Deep Convolutional Neural Networks.
- Wang, J., & Perez, L. (2017). The Effectiveness of Data Augmentation in Image Classification using Deep Learning.
- Zoph, B., Cubuk, E. D., Ghiasi, G., Lin, T. Y., Shlens, J., & Le, Q. V. (2020, August). Learning Data Augmentation Strategies for Object Detection.
