Update datasets with new data¶
Data is always changing. Even if you just imported data into Data Prep, there’s a chance the data is outdated. Updating the data in a dataset allows you to import a dataset as a new version of an existing dataset. After you update a dataset, you can use the new version in an existing project.
When updating a dataset, you can update to a completely different dataset with new values, structure, and format. Or you can update to a dataset where:
- Only the values have changed—the structure and format are the same.
- The format or structure have changed, for example, columns are added or removed.
Update a dataset with new data¶
To update a dataset with new data:
On the Library page, hover over the dataset you want to update and click More Actions.
Select add version.
Locate and select the dataset you want to import from the Select Data Source list or click Upload a local file.
Note: If a SQL statement was used during the initial import, the SQL statement is retained and can be used again to update the data in the dataset.
The dataset is added to the list in the You selected pane. Data Prep displays the Your options pane for the dataset and a preview of the dataset.
Check the preview of the dataset and adjust the import settings as necessary.
Your data is imported as a new version and is ready to be prepped in a project.