AWS

AWS S3

To set up AWS S3 as a data lake, follow these steps:

1. Navigate to Settings.

2. Click on Destination.

3. Click on Setup Data Lake.

4. Enter the data lake credentials, along with a name and description for the connection.

5. Click on Validate and then Create to save the data lake connection.

S3 Key ID: Access key ID used to access the S3 bucket
S3 Access Key: Secret access key corresponding to the access key ID
S3 Bucket Name: Name of the S3 bucket (keep the default)
S3 Bucket Path: Directory under the S3 bucket where the data will be written
S3 Bucket Region: Region of the S3 bucket
Output Format: Format of the output data
Flattening: Whether the input JSON data should be normalized in the output CSV
Compression: Whether the output files should be compressed
Endpoint: S3 endpoint URL
S3 Path Format: Format string that defines how data is organized inside the S3 bucket
S3 Filename Pattern: File name format for S3 staging files
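
Before clicking Validate, it can help to check the values locally. The following is a minimal Python sketch, not part of the product: the class `S3LakeSettings`, its field names, and the sample values are hypothetical, and treating these five fields as mandatory is an assumption. It flags blank fields and shows how the bucket name and bucket path combine into an `s3://` location.

```python
from dataclasses import dataclass, fields


@dataclass
class S3LakeSettings:
    """Hypothetical container mirroring a subset of the form fields above."""
    key_id: str
    access_key: str
    bucket_name: str
    bucket_path: str
    bucket_region: str


def blank_fields(settings: S3LakeSettings) -> list[str]:
    """Return the names of fields that are empty or whitespace-only."""
    return [
        f.name for f in fields(settings)
        if not getattr(settings, f.name).strip()
    ]


def target_uri(settings: S3LakeSettings) -> str:
    """Join the bucket name and bucket path into an s3:// URI."""
    path = settings.bucket_path.strip("/")
    base = f"s3://{settings.bucket_name}"
    return f"{base}/{path}" if path else base


# Placeholder values for illustration only; the region is left blank on purpose.
example = S3LakeSettings(
    key_id="EXAMPLE_KEY_ID",
    access_key="EXAMPLE_SECRET",
    bucket_name="my-data-lake",
    bucket_path="/raw/events/",
    bucket_region="",
)

print(blank_fields(example))
print(target_uri(example))
```

In this sketch the blank region is reported by `blank_fields`, and the leading and trailing slashes in the bucket path are trimmed before the URI is built.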

Redshift

To set up Redshift as a data warehouse, follow these steps:

1. Navigate to Settings.

2. Click on Destination.

3. Click on Setup Data Warehouse.

4. Enter the data warehouse credentials, along with a name and description for the connection.

5. Click on Validate and then Create to save the data warehouse connection.

Host: Host endpoint of the Redshift cluster
Port: Port of the database
Username: Username used to access the database
Password: Password for the corresponding username
Database: Name of the database
Default Schema: Default schema where the loaded tables are stored
JDBC URL Params: Additional properties to pass in the JDBC URL string when connecting to the database
Uploading Method: The way data will be uploaded to Redshift via AWS S3
S3 Bucket Name: Name of the S3 bucket where data will be stored
S3 Bucket Path: Path in the S3 bucket where data will be stored
S3 Bucket Region: Region of the S3 bucket
S3 Key ID: Access key ID
S3 Access Key: Secret access key
Encryption: How to encrypt staging data
File Buffer Count: Number of file buffers allocated for writing data
Purge Staging Files and Tables: Whether to delete staging files from S3 after completing the sync
SSH Tunnel Method
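
The Port and JDBC URL Params values are easy to mistype, so a quick local check before clicking Validate may save a round trip. The sketch below is illustrative only: the helper names are hypothetical, and it assumes the JDBC URL Params field takes `key=value` pairs separated by `&`, which this page does not confirm. The default port constant reflects Amazon Redshift's standard default of 5439.

```python
from urllib.parse import parse_qsl

# Amazon Redshift listens on port 5439 unless the cluster is configured otherwise.
REDSHIFT_DEFAULT_PORT = 5439


def is_valid_port(port: int) -> bool:
    """Return True when the port falls inside the valid TCP port range."""
    return 1 <= port <= 65535


def parse_url_params(raw: str) -> dict[str, str]:
    """Split a 'key=value&key2=value2' string into a dictionary.

    Assumption: parameters use '&' as the separator. Malformed input
    (a key with no '=') raises ValueError.
    """
    if not raw:
        return {}
    return dict(parse_qsl(raw, strict_parsing=True))


# Placeholder parameter names, not real driver options.
params = parse_url_params("key1=value1&key2=value2")

print(is_valid_port(REDSHIFT_DEFAULT_PORT))
print(params)
```

If the field turns out to use a different separator, only `parse_url_params` needs to change.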