AWS S3 integration

Overview

The AWS S3 integration automatically syncs activities from Ortto to an S3 bucket on a daily or weekly basis. This simplifies external data storage and makes it easier to connect the data with other platforms and services.

Note: The AWS S3 integration is only available on the Enterprise Plan.

Accessing AWS

To access the AWS integration page, go to CDP > Data sources > New data source > AWS.

Prerequisites

  • AWS account: You need an AWS account to use this integration.
  • Dedicated S3 bucket: While not required, we recommend creating a dedicated S3 bucket for Ortto.

AWS setup

The setup of this integration involves three main steps: Setup, Activities, and Fields. Follow the steps below to complete the setup.

Setup

Choose the relevant Connection method:

IMPORTANT: We strongly recommend using a Cross-Account role instead of Static IAM user credentials for enhanced security. A Cross-Account role provides temporary credentials that rotate automatically, reducing the risks associated with permanent access keys. Only our system can access these credentials, and they expire within an hour, ensuring secure, short-term access.

In the Cross-Account role section, enter the following details:

  • S3 bucket: Enter the address of your dedicated S3 bucket.
  • Role trust policy or Role in-line policy: [TBD]

In the Role ARN field, enter the Role ARN, which you can copy from your IAM console.
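
Before pasting the Role ARN, it can help to check that the value has the expected shape. The following is a minimal Python sketch, not part of Ortto or AWS tooling. It assumes the standard IAM role ARN layout (arn:<partition>:iam::<12-digit account ID>:role/<role name>). The function name parse_role_arn and the example ARN are hypothetical.

```python
import re

# Assumed layout of an IAM role ARN:
#   arn:<partition>:iam::<12-digit account ID>:role/<optional path/><role name>
ROLE_ARN_RE = re.compile(
    r"^arn:(aws|aws-cn|aws-us-gov):iam::(\d{12}):role/([A-Za-z0-9_+=,.@/-]+)$"
)


def parse_role_arn(arn: str) -> dict:
    """Split a role ARN into its parts, or raise ValueError if it looks malformed."""
    match = ROLE_ARN_RE.match(arn.strip())
    if match is None:
        raise ValueError(f"Not a valid IAM role ARN: {arn!r}")
    partition, account_id, role = match.groups()
    return {"partition": partition, "account_id": account_id, "role": role}


if __name__ == "__main__":
    # Hypothetical ARN, for illustration only.
    print(parse_role_arn("arn:aws:iam::123456789012:role/ortto-export"))
```

A check like this only confirms the format. It does not confirm that the role exists or that its policies grant access to the bucket.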

In the Frequency section, choose how often data is synced to AWS, either daily or weekly. Daily exports send data once every 24 hours, while weekly exports send data on Sundays.
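
If a downstream job needs to know when to expect new files, the schedule described above can be approximated as in the Python sketch below. It works with calendar dates only. The exact time of day and time zone of the export are not stated here, so treat the result as an estimate. The function name next_export_date is hypothetical.

```python
from datetime import date, timedelta


def next_export_date(today: date, frequency: str) -> date:
    """Estimate the next calendar date on which an export is expected.

    Assumption: daily exports arrive on the following day, and weekly
    exports arrive on the next Sunday strictly after `today`.
    """
    if frequency == "daily":
        return today + timedelta(days=1)
    if frequency == "weekly":
        # date.weekday() returns 0 for Monday and 6 for Sunday.
        days_ahead = (6 - today.weekday()) % 7
        # If today is already Sunday, look ahead to the following Sunday.
        return today + timedelta(days=days_ahead or 7)
    raise ValueError(f"Unknown frequency: {frequency!r}")


if __name__ == "__main__":
    print(next_export_date(date(2025, 3, 3), "weekly"))
```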

Activities

On the Activities page, select the activities you'd like to export automatically to your S3 bucket. The selected activities are exported each time the export runs at the chosen frequency.

You can select up to 20 activities on the Enterprise plan.

Fields

On the Fields page, select the fields you'd like to include in the automatic export. You can select up to 5 fields.


How data is exported to S3

Data is exported based on the selected frequency. Daily exports send data every 24 hours, while weekly exports are sent on Sundays. The data is exported in Parquet format, with file names similar to:

s3://your-bucket/{your-path}/[ortto-instance-id]-activity-2025-03-03--s-1.parquet

You will get one file per activity per database shard.
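
When processing the exported files, it can be useful to pull the export date and shard out of each file name. The Python sketch below does this under stated assumptions: the name ends with a YYYY-MM-DD date followed by "--s-<number>.parquet", and the trailing number identifies the database shard. Everything before the date is kept as a single prefix, because the example file name above does not make clear where the instance ID ends and the activity label begins. The names ExportFile and parse_export_key, and the "acme" instance ID, are hypothetical.

```python
import re
from datetime import date
from typing import NamedTuple


class ExportFile(NamedTuple):
    prefix: str        # instance ID plus activity label, left unsplit
    export_date: date  # date embedded in the file name
    shard: int         # assumed to be the database shard number


# Assumed file name shape: <prefix>-<YYYY-MM-DD>--s-<shard>.parquet
FILE_NAME_RE = re.compile(
    r"^(?P<prefix>.+)-(?P<date>\d{4}-\d{2}-\d{2})--s-(?P<shard>\d+)\.parquet$"
)


def parse_export_key(key: str) -> ExportFile:
    """Parse an S3 key or URI that points at an exported Parquet file."""
    file_name = key.rsplit("/", 1)[-1]  # drop the bucket and path, if any
    match = FILE_NAME_RE.match(file_name)
    if match is None:
        raise ValueError(f"Unexpected export file name: {file_name!r}")
    return ExportFile(
        prefix=match["prefix"],
        export_date=date.fromisoformat(match["date"]),
        shard=int(match["shard"]),
    )


if __name__ == "__main__":
    print(parse_export_key("s3://your-bucket/exports/acme-activity-2025-03-03--s-1.parquet"))
```

Because each activity produces one file per shard, grouping the parsed results by prefix and export date gives the full set of files for one activity on one day.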