Skip to content

On-premise users: click in-app to access the full platform documentation for your version of DataRobot.

End-to-end ML workflow with AWS

Being one of the largest cloud providers in the world, AWS has multiple ways of storing data within its cloud.

You can use either of two AI accelerators that allow you to source data from S3 or Athena, build and evaluate a model using DataRobot, and send predictions from that model back to S3.

Access the AI accelerator for S3 on GitHub

Access the AI accelerator for AWS Athena on GitHub

Each AI accelerator will perform the following steps to help you integrate DataRobot with your data in AWS:

  • Import data for training:

    • In the S3 AI accelerator, you will be able to take data in the Parquet file format, assemble it, and upload it to the AI Catalog in DataRobot.

    • In the Athena AI Accelerator, you will create a JDBC data source within DataRobot to connect to Athena and then pull data in via a SQL query.

  • Using the DataRobot Python API, you will have DataRobot build up to 50 different machine learning models while also evaluating how those models perform on this dataset.


Updated August 28, 2024