Using AWS RDS PostgreSQL with DataRobot¶
はじめに¶
DataRobot supports and recommends using managed services, including AWS RDS PostgreSQL, to store and manage data. This documentation provides detailed instructions on configuring DataRobot to utilize AWS RDS PostgreSQL as a database solution.
Steps to Configure AWS RDS PostgreSQL with DataRobot¶
1. Create an AWS RDS PostgreSQL Instance¶
- Log in to the AWS Management Console.
- Navigate to the Amazon RDS service.
- Click "Create Database" and choose "PostgreSQL" as the engine.
- Configure the following settings:
- DB Instance Size: Choose an appropriate instance size based on our database requirements.
- Master username: Set
postgresas master username. - Master password: Set a master password for accessing the database.
- DB Engine Version: Select the PostgreSQL engine version 12 .
- Allocated Storage: Set the desired disk space for the database.
- Database authentication: Select "Password authentication" to use passwords for database authentication.
- Follow the on-screen instructions to set up other parameters if required
2. Obtain RDS PostgreSQL Endpoint¶
- Once the RDS instance is created, note down the Endpoint. You'll need this for configuring DataRobot.
3. Configure DataRobot to Use AWS RDS PostgreSQL¶
When PostgreSQL is configured as an external service, additional YAML override values must be provided.
postgresql-ha:
postgresql:
postgresPassword: YOUR_AWS_RDS_PASSWORD
then add to your values.yaml within the datarobot chart.
global:
postgresql:
internal: false
hostname: "YOUR_AWS_RDS_ENDPOINT"
core-integration-tasks:
jobs:
setup:
config_env_vars:
PGSQL_INIT_SCRIPT: /init-config/db
build-service:
buildService:
envApp:
secret:
POSTGRES_HOST: "YOUR_AWS_RDS_ENDPOINT"
Built-in Backup Service¶
Amazon RDS offers a built-in backup service that automatically performs backups of your databases according to a specified schedule. You can configure the backup retention period and frequency to meet your data retention requirements. Additionally, you can perform manual backups and create snapshots of specific instances at any time.
成果¶
By following these steps, you can seamlessly integrate AWS RDS PostgreSQL with DataRobot, providing a reliable and scalable database solution. This setup enhances data storage and retrieval, contributing to the overall efficiency of your DataRobot instance. If issues arise, refer to DataRobot documentation and AWS RDS documentation for troubleshooting guidance.
Note: Always ensure that you follow best practices for security and compliance when configuring external databases with DataRobot and AWS RDS.
Fresh Install or Upgrade to 10.2.3¶
AWS RDS
-
Modify the parameter group, to add
password_encryptionset toscram-sha-256. If you only have default parameter group set then create a new option group and configure your RDS to use the option group -
Once option group is added to RDS, reboot the RDS instance with failover
-
Follow guide to delete the existing secret
-
Continue with upgrade as in the admin guide