AWS S3 integration
Overview
The Amazon AWS S3 integration enables users to automatically sync activities from Ortto to an S3 bucket on a daily or weekly basis. This simplifies external data storage and opens up opportunities to connect the data with other platforms and services.
Note: The AWS S3 integration is only available on the Enterprise Plan.
Accessing AWS
To access the AWS integration page, go to CDP > Data sources > New data source > AWS.
Prerequisites
- AWS account: You need an Amazon AWS account to use this integration.
- Dedicated S3 bucket: While not required, we recommend creating a dedicated S3 bucket for Ortto.
AWS set up
The setup of this integration involves three main steps: Setup, Activities, and Fields. Follow the steps below to complete the setup.
Setup
Choose the relevant Connection method:
- Cross-Account role (recommended): This is the more modern method for accessing AWS resources in other accounts. It eliminates static credentials and makes it easier to control usage. Learn more about how to create a Cross-account role role with a custom trust policy.
- Static IAM user credentials: This is the original method for accessing AWS resources from outside AWS. You set up an AWS IAM user and share the keys with someone else. Learn more about how to create an IAM user and how to create an access key for an IAM user.
IMPORTANT: We strongly recommend using Cross-Account role instead of Static IAM user credentials for enhanced security. Cross-Account role provides temporary credentials that automatically rotate, reducing the risks associated with permanent access keys. Only our system can access these credentials, and they expire within an hour, ensuring secure, short-term access.
In the Cross-Account role section, enter the following details:
- S3 bucket: Enter the address of your dedicated S3 bucket.
- Role trust policy or Role in-line policy: [TBD]
In the Role ARN field, enter the Role ARN, which you can copy from your IAM console.
In the Frequency section, choose how often data is synced to AWS, either daily or weekly. Daily exports send data once every 24 hours, while weekly exports send data on Sundays.
Activities
On the Activities page, select the activities you'd like to automatically export to your S3 bucket once the export occurs based on the selected frequency.
The maximum number of activities on the Enterprise plan is 20 activities.
Fields
On the Fields page, select the fields you'd like to include in the automatic export. You can select up to 5 fields.
How is data exported to S3
Data is exported based on the selected frequency. Daily exports send data every 24 hours, while weekly exports are sent on Sundays. The data is exported in Parquet format, with file names similar to:
s3://your-bucket/{your-path}/[ortto-instance-id]-activity-2025-03-03--s-1.parquet
You will get one file per activity per database shard.