Amazon Aurora PostgreSQL Setup Guide

Follow these instructions to replicate your Amazon Aurora PostgreSQL database to your destination using Fivetran.

Prerequisites

To connect your Amazon Aurora PostgreSQL database to Fivetran, you need:

PostgreSQL version 11 - 18
Your database host's IP (for example, 1.2.3.4) or domain (your.server.com)
TLS enabled on your database. Follow Amazon's TLS setup instructions to enable TLS on your database.

Aurora Serverless V2 requires PostgreSQL version 13 or later.

Setup instructions

Choose connection method

Decide on your preferred method for connecting Fivetran to your Amazon Aurora PostgreSQL database, and then configure the necessary settings for that method. This connector supports the following connection methods:

Connect directly

Fivetran connects directly to your Amazon Aurora PostgreSQL database. This is the simplest connection method to set up, requiring minimal configuration.

To connect directly, you must do the following:

Enable TLS on your Amazon Aurora PostgreSQL database. Follow AWS's instructions to enable SSL/TLS on your database.
Configure your VPC security group to allow incoming connections to your database host and port (usually 5432) from Fivetran's IPs for your database's region. For detailed instructions, see Configure security group.

Connect using SSH

Fivetran connects to a separate server in your network that provides an SSH tunnel to your Amazon Aurora PostgreSQL database. You must connect through SSH if your database resides in an inaccessible network.

To connect using an SSH tunnel, configure an SSH tunnel between Fivetran and your Amazon Aurora PostgreSQL database. For more information, see our SSH connection setup documentation.

The SSH tunnel setup requires adding Fivetran's SSH public key to the authorized_keys file on your SSH tunnel host. Copy the public key from the connector setup form, which is visible when you select Connect via an SSH tunnel in the Connection method drop-down.

Connect using AWS PrivateLink

You must have a Business Critical plan to use AWS PrivateLink.

AWS PrivateLink allows VPCs and AWS-hosted or on-premises services to communicate with one another without exposing traffic to the public internet. PrivateLink is the most secure connection method. Learn more in AWS’ PrivateLink documentation.

Follow our AWS PrivateLink setup guide to configure PrivateLink for your database.

Connect using Proxy Agent

Fivetran connects to your database through the Proxy Agent, providing secure communication between Fivetran processes and your database host. The Proxy Agent is installed in your network and creates an outbound network connection to the Fivetran-managed SaaS.

To learn more about the Proxy Agent, how to install it, and how to configure it, see our Proxy Agent documentation.

Create read replica (optional)

If you plan on using Query-Based method, you can create a read replica (also called an Aurora reader) for Fivetran's exclusive use. Using a read replica reduces the load of Fivetran's queries on your primary database, though this load is usually negligible unless your database has a table with more than 100 million rows. We recommend connecting a read replica to Fivetran; however, it’s optional.

You cannot enable logical replication on a read replica.

If you already have a read replica, want to connect Fivetran to your primary database, or want to use logical replication, skip ahead to Step 4.

In your Amazon RDS Dashboard, select the Amazon Aurora PostgreSQL database that you want to replicate.
Click Actions, then select Add reader.
On the Add Reader page, find the Settings section. Add a DB instance identifier for your read replica.
In the DB instance size section, specify the DB instance class for the read replica. It does not need to be as large as your primary instance.
In the Connectivity section, ensure that the read replica is accessible from outside your VPC.
Click Additional configuration to reveal more configuration options.
Choose a DB cluster parameter group.
Click Add reader.
The read replica's status should now be creating. It will take a few minutes for the read replica to finish being created. The status will change to available when it is done.
Set the value of the max_standby_streaming_delay parameter to 15-30 min. This ensures that import/incremental queries complete before the replica server cancels them.

Enable database access

Grant Fivetran's data processing servers access to your database server. How you grant access depends on whether or not your database instance is in a VPC.

If your instance is in a VPC, you must configure the two methods that control access: VPC security groups and network access control lists (ACLs). If your instance is not in a VPC, you only need to configure security groups.

Find endpoint and port

Find the endpoint and port for the database that you want to connect to Fivetran.

In your RDS dashboard, click on the Amazon Aurora PostgreSQL database that you want to connect to Fivetran.
In the Connectivity & security section, find the Endpoint and Port and make a note of them. You will need them to configure Fivetran.

Configure security group

These instructions assume that your database instance is in a VPC. If your database instance is not in a VPC, you can still use these instructions because configuring a non-VPC security group is an almost identical process.

In your RDS dashboard, click on the Amazon Aurora PostgreSQL database that you want to connect to Fivetran.
In the Connectivity & security section, ensure that your database's Public accessibility value is YES.
Click the link to your database's security group.
Click the security group ID.
In the Security Group panel, go to the Inbound tab, then click Edit inbound rules.
Click Add Rule. This creates a new Custom TCP Rule at the bottom of the list.
Fill in the new Custom TCP Rule.
- In the Port Range field, enter your database's port number that you found in the previous step. (The default port number is 5432.)
- What you enter in the Custom IP field depends on whether you're connecting directly or using an SSH tunnel.
  - If you're connecting directly, enter Fivetran's IPs for your database's region.
  - If you're connecting using an SSH tunnel, enter {your-ssh-tunnel-server-ip-address}/32.
- (Optional) Enter a brief description in the Description field.
Click Save rules.

Configure network ACLs (VPC only)

If your database instance is not in a VPC, skip ahead to Step 5.

Return to the RDS dashboard.
Click on your database instance.
Click the link to the instance's VPC.
Click the VPC ID.
In the Details section, click on the Network ACL.
Click the Network ACL ID.

You will see tabs for Inbound Rules and Outbound Rules. You must edit both.

Edit inbound rules

Select Inbound Rules.
If you have a default VPC that was automatically created by AWS, the settings already allow all incoming traffic. To verify that the settings allow incoming traffic, confirm that the Source value is 0.0.0.0/0 and that the ALLOW entry is listed above the DENY entry.
If your inbound rules don't include an ALL - 0.0.0.0/0 - ALLOW entry, edit the rules to allow the Source to access the port number of your database instance. (The port will be 5432 for direct connections, unless you changed the default.) For additional help, see Amazon's Network ACL documentation.
- If you're connecting directly, enter Fivetran's IPs for your database's region.
- If you're connecting using an SSH tunnel, enter {your-ssh-tunnel-server-ip-address}/32.

Edit outbound rules

Select Outbound Rules.
If your outbound rules don't include an ALL - 0.0.0.0/0 - ALLOW entry, edit the rules to allow outbound traffic to all ports 1024-65535 for destination 0.0.0.0/0. For additional help, see AWS's Network ACLs documentation.

Create user

Create a database user for Fivetran's exclusive use.

Open a connection to your primary Amazon Aurora PostgreSQL database.
Create a user for Fivetran by executing the following SQL command. Replace <username> and some-password with a username and password of your choice.

CREATE USER <username> PASSWORD 'some-password';

Grant read-only access

Grant the Fivetran user read-only access to all tables by running the following commands. To grant access to a schema other than PostgreSQL's default public schema, replace public with your schema name. If you want to grant access to multiple schemas, run these commands for each schema.

GRANT USAGE ON SCHEMA "public" TO <username>;
GRANT SELECT ON ALL TABLES IN SCHEMA "public" TO <username>;
ALTER DEFAULT PRIVILEGES IN SCHEMA "public" GRANT SELECT ON TABLES TO <username>;

The ALTER DEFAULT PRIVILEGES command ensures that future tables created in the schema are also accessible to the Fivetran user.

Restrict access to tables (optional)

You can limit Fivetran's access to specific tables by granting access only to the tables that you want to sync.

You must grant access individually for each table. You cannot grant access to all tables and then revoke access for a subset of tables.

Ensure that the Fivetran user has access to the schema that contains your table(s):
```
GRANT USAGE ON SCHEMA "your_schema" TO <username>;
```

Revoke previously granted table-level permissions:

ALTER DEFAULT PRIVILEGES IN SCHEMA "your_schema" REVOKE SELECT ON TABLES FROM <username>;
REVOKE SELECT ON ALL TABLES IN SCHEMA "your_schema" FROM <username>;

Grant access to each table:

GRANT SELECT ON "your_schema"."your_table" TO <username>;

By default, new tables created in the schema are not accessible to the Fivetran user. To grant access to new tables, run the following command:
```
ALTER DEFAULT PRIVILEGES IN SCHEMA "your_schema" GRANT SELECT ON TABLES TO <username>;
```

Restrict access to columns (optional)

You can limit access to specific columns within a table by granting permissions only to those columns.

Revoke existing table-level permissions:

REVOKE SELECT ON "your_schema"."your_table" FROM <username>;

Grant access to specific columns:
If you chose Query-Based as your incremental sync method, you must grant us access to the hidden system columns xmin and ctid. This speeds up your initial sync and enables capturing deletes. If you chose Logical replication, granting access to xmin is recommended for new connections to enable faster initial sync and re-import.
```
GRANT SELECT (xmin, ctid, some_column, other_column) ON "your_schema"."your_table" TO <username>;
```

After restricting column access, newly added columns will not be accessible automatically. To grant access to new columns, rerun the command above with the additional columns.

Configure incremental sync method

To keep your data up to date after the initial sync, Fivetran offers multiple incremental sync methods. These methods track recent data changes so Fivetran can sync only what changed since the last sync instead of copying entire tables each time. Learn more in our Updating data documentation.

We recommend using the Logical replication method when possible because it is faster and more efficient than Query-Based. For guidance on choosing the best option for your workload, see Logical replication vs Query-Based documentation.

Configure your chosen incremental sync method:

Logical replication

Logical replication is based on logical decoding of the PostgreSQL write-ahead log (WAL). Fivetran reads the WAL using the pgoutput plugin to detect any new or changed data. This plugin replicates from your custom publication without needing additional libraries.

To enable logical replication, follow these steps:

Open a connection to your primary Amazon Aurora PostgreSQL database. You cannot enable logical replication on a read replica.
Ensure that your server has ample free space for the logs. As soon as Fivetran processes a log, we delete it. However, we don't delete logs if the sync is interrupted (for example, if we lose access to your database). In this case, logs may accumulate on your server and consume additional storage. The amount of additional disk space that these logs consume is proportional to the number of changes committed on the server. If we can't resume a lost connection quickly enough and you need more disk space, you can drop the replication slot, which deletes its unconsumed logs.
In your RDS dashboard, do the following:
i. Create a new parameter group (non-default group).
ii. Set the logical_replication parameter to 1.
iii. Set wal_sender_timeout parameter to 0.
iv. For PostgreSQL 18 and later, either set the idle_replication_slot_timeout parameter to 0 to disable it or to at least 24h to reduce the risk of the replication slot being invalidated between syncs.
v. Apply the parameter group to the database.
vi. Wait until your database's parameter group status changes to pending-reboot, then reboot your database to apply the new parameter group.
Use a PostgreSQL console to log in to your primary database as a superuser (one that has the rds_superuser role).
Create a publication for your tables in your primary database. If you want, you can create a publication for only certain tables so that you add or remove tables from the publication later on. Only changes from tables in the publication are replicated to Fivetran. Each database can have multiple distinct publications. You must have CREATE privileges or above to run this command.
The publication name fivetran_pub quoted throughout this guide is used purely as an example. The actual publication name should be unique for every database and cannot start with a number.
```
CREATE PUBLICATION fivetran_pub FOR TABLE table2, table4, table8;
```
To add or remove a table from a publication, run the following command. You must have ownership rights over the table(s).
```
ALTER PUBLICATION fivetran_pub ADD/DROP TABLE table_name;
```
Alternatively, you can create a publication for all of your tables. However, you cannot remove any table from this publication later on. You must have superuser privileges to run this command.
```
 CREATE PUBLICATION fivetran_pub FOR ALL TABLES;
```
(Optional) You can choose which operations to include in the publication. For example, the following publication includes only INSERT and UPDATE operations.
```
CREATE PUBLICATION insert_only_pub FOR TABLE table1 WITH (publish = 'INSERT, UPDATE');
```
To add partitioned tables for PostgreSQL version 13 or later, run the following command to enable publish_via_partition_root.
```
CREATE PUBLICATION fivetran_pub FOR ALL TABLES WITH (publish_via_partition_root=true);
```
Create a logical replication slot for the database you want to sync by running the following command. You must use the standard output plugin pgoutput. Ensure that you are connected to the correct database when you create your replication slot, or your connection will not be able to find the slot.
You must create a unique replication slot for every connection that uses the same PostgreSQL cluster. Replication slot names cannot start with a number. (The replication slot name fivetran_pgoutput_slot quoted throughout this guide is used purely as an example.)
You need to create the replication slot after you have created the publication.
```
SELECT pg_create_logical_replication_slot('fivetran_pgoutput_slot', 'pgoutput');
```
Verify that your chosen tables are in the publication.
```
SELECT * FROM pg_publication_tables;
```
Grant the Fivetran user permission to read the replication slot.
```
GRANT rds_replication TO <username>;
```
Log in as the Fivetran user.
Verify that the Fivetran user can read the replication slot by running the following command. Replace fivetran_pgoutput_slot with your replication slot name and fivetran_pub with the publication name.
```
SELECT count(*) FROM pg_logical_slot_peek_binary_changes('fivetran_pgoutput_slot', null, null, 'proto_version', '1', 'publication_names', 'fivetran_pub');
```

If the query succeeds, then permissions are sufficient.

Query-Based

Fivetran runs SQL queries that read PostgreSQL system columns (xmin and, when applicable, ctid) to detect new and changed data during each sync.

Capture Deletes (optional)

Query-Based sync detects deleted rows by default. You can disable this behavior by turning off the Capture Deletes toggle in the connection setup form.

When Capture Deletes is enabled:

Fivetran adds an internal helper column ctid_fivetran_id to each synced table. Fivetran uses this column to track deletes by storing the row's ctid value at the end of each sync.
- Before the initial sync (new connection): Fivetran creates tables with ctid_fivetran_id during the initial sync. No additional re-sync is required.
- After data is already synced (existing connection): Fivetran runs a one-time migration sync of the selected tables to populate ctid_fivetran_id column and capture a snapshot of the source table. This migration sync does not trigger a table re-sync. It may take longer than a regular sync because Fivetran must populate the additional ctid_fivetran_id column.
This setting is permanent for the connection. After you click Save & Test in the connection setup form, it’s locked and can’t be disabled later, whether or not the connection has run its first sync.
We sync partitioned tables using child-to-child sync only.

For details and limitations, see our Capturing deletes documentation.

Query-Based sync requires full table scans to detect updates and may be slower than logical replication, especially for large tables. In high-write databases, PostgreSQL xmin freezing/wraparound can also cause older rows to be re-synced, increasing sync volume and load on your PostgreSQL source database. If possible, use logical replication.

Fivetran Teleport Sync Sunset

Fivetran Teleport Sync is a proprietary incremental sync method that can add delete capture with no additional setup other than a read-only SQL connection. Updates will be captured using the XMIN system column.

Learn more in our Fivetran Teleport Sync documentation.

If you are connecting with a standby or read replica, run the following SQL command on your primary database as the Fivetran user:

CREATE AGGREGATE BIT_XOR(IN v bigint) (SFUNC = int8xor, STYPE = bigint);

If you are not connecting with a read replica, you do not need to do any additional configuration. The aggregate that the Teleport mechanism will later use is automatically created for you.

Finish Fivetran configuration

In your connection setup form, enter a Destination schema prefix. This is used as the connection name and cannot be modified once the connection is created.
In the Destination schema names field, choose the naming convention you want Fivetran to use for the schemas, tables, and columns in your destination:
- Source naming: Preserves the original schema, table, and column names from the source system in your destination, and ignores the Destination schema prefix specified in the setup form. However, when multiple connections share the same source schema name (for example, public), Fivetran stores their tables in the same destination schema. Tables with duplicate names may lead to overwrites and data inconsistencies. Be sure to use unique table names across connections that write to the same schema.
- Fivetran naming: Standardizes the schema, table, and column names in your destination according to the Fivetran naming conventions.
If you want to modify your selection, make sure you do it before you start the initial sync.
Depending on your selection, we will either prefix the connection name to each replicated schema or use the source schema names instead.
(Optional, Private Preview only) If you want to manage your credentials outside of Fivetran, enable the Use External Secrets Manager toggle. For some connectors and destinations, this toggle only appears after you select a credential-based authentication method. See the External Secret Managers documentation for more information.
- If you have already configured External Secret Managers for your account, select one from the drop-down menu. Note that the list is filtered by the deployment model of the destination: if the destination uses SaaS Deployment, External Secret Managers configured for Hybrid Deployment won't be available, and vice versa.
- To edit the details of the selected External Secret Manager, click Edit manager details in Account Settings.
- To set up a new External Secret Manager, click Configure a new secrets manager. See the Create New External Secret Manager documentation for prerequisites and setup instructions.
- You can manage all your External Secret Managers at any time under Account Settings. See the External Secret Managers documentation for more information.
- When ESM is enabled, credential fields are replaced by ESM key fields. In each ESM key field, enter the name of the secret stored in your external secrets manager that corresponds to that credential — not the credential value itself. For more information, see External Secret Managers.
In the Host field, enter the endpoint URL that you found in Step 4. Alternatively, you can enter your database host's IP (for example, 1.2.3.4).
In the Port field, enter the port number that you found in Step 4.
In the User field, enter the Fivetran-specific user that you created in Step 5.
In the Password field, enter the password for the Fivetran-specific user that you created in Step 5.
In the Database field, enter the name of your database (for example, your_database).
(Hybrid Deployment only) If your destination is configured for Hybrid Deployment, the Hybrid Deployment Agent associated with your destination is pre-selected for the connection. To assign a different agent, click Replace agent, select the agent you want to use, and click Use Agent.
From the Connection method drop-down, select how Fivetran connects to your database and enter the required information:
- Connect directly
- Connect via an SSH tunnel
  - In the SSH Host field, enter the hostname or IP address of the SSH server. Do not use a load balancer IP address or hostname.
  - In the SSH Port field, enter the port used by the SSH server.
  - In the SSH User field, enter the username of the SSH user.
  - If you enabled TLS on your database in Step 1 - Choose connection method, keep the Require TLS through Tunnel toggle turned ON.
  Ensure that you have added Fivetran's SSH Public Key to the authorized_keys file on your SSH tunnel host when configuring your SSH tunnel in the Choose connection method step.
- Connect via private networking
  - If you enabled TLS on your database in Step 1 - Choose connection method, keep the Require TLS when using Private Networking toggle turned ON.
- Connect via Proxy Agent
  - Select an existing agent from the Proxy agents drop-down list or click + Configure a new proxy agent to set up a new agent.
  - If you enabled TLS on your database in Step 1 - Choose connection method, keep the Require TLS when using Proxy Agent toggle turned ON.
1. In the Incremental sync method tab, select the incremental sync method that you want to use:
  - Logical Replication: Enter the Replication Slot name and Publication Name that you created in the Configure incremental sync method - Logical replication step.
  - Query-Based: The Capture Deletes toggle is ON by default. Fivetran uses this to detect deleted rows. Once you save the connection, this setting cannot be changed. Turn the toggle OFF before saving if you don't want to capture deletes. For more information, see Capture deletes.
(Not applicable to Hybrid Deployment) Copy the Fivetran's IP addresses (or CIDR) that you must safelist in your firewall.
Click Save & Test. Fivetran tests and validates our connection to your Amazon Aurora PostgreSQL database. Upon successful completion of the setup tests, you can sync your data using Fivetran.

Setup tests

Fivetran performs the following tests to ensure that we can connect to your Amazon Aurora PostgreSQL database and that it is properly configured:

The Connecting to SSH Tunnel Test validates the SSH tunnel details you provided in the setup form. It then checks that we can connect to your database using the SSH Tunnel. (We skip this test if you aren't connecting using SSH.)
The Connecting to Host Test validates the database credentials you provided in the setup form. The test verifies that the host is not private and then checks the connectivity to the host.
The Validating Certificate Test generates a pop-up window where you must choose which certificate you want Fivetran to use. It then validates that certificate and checks that we can connect to your database using TLS. (We skip this test if you selected an indirect connection method and then disabled the Require TLS through Tunnel toggle.)
Aurora does not return the entire certificate chain when we query for it, so the root certificate may not be selectable during the Validating Certificate Test stage.
The Connecting to Database Test checks that we can access your database.
The Connecting to WAL Replication Slot Test confirms that the database associated with the replication slot matches the name you supplied in the setup form. It then verifies that the replication slot uses the pgoutput. Lastly, it makes sure that the Fivetran user has replication privileges. (We skip this test if you selected Query-Based as your incremental sync method)
The Checking Configuration Values Test checks a set of WAL-configured values against the recommended settings and detects if they are below the recommended range. (We skip this test if you selected Query-Based as your incremental sync method.)
The Publication Test verifies that the supplied publication name exists in your database. (We skip this test if you selected Query-Based as your incremental sync method.)