MySQL
MySQL is an open-source relational database. Fivetran's integration platform replicates data from your MySQL source database and loads it into your destination.
NOTE: Fivetran supports MySQL as both a database connector and a destination.
Fivetran offers the following solutions for replicating data from your MySQL source database:
Supported services
Fivetran supports the following MySQL database services:
- Generic MySQL
- Amazon Aurora MySQL
- Amazon Aurora Serverless v2 MySQL
- Amazon Aurora Serverless v1 MySQL
- Amazon RDS MySQL
- Azure Database for MySQL
- Google Cloud SQL for MySQL
Supported configurations
Fivetran supports the following MySQL configurations:
Supportability Category | Supported Values |
---|---|
Database versions | 5.5 - 8.4 |
Connector limit per database | No limit |
Transport Layer Security (TLS) | TLS 1.1 - 1.3 |
Which MySQL instance types we can connect to depends on your database service:
Instance Types | Supported |
---|---|
Generic MySQL | |
Primary instance | check |
Read replica | check |
Amazon Aurora MySQL | |
Primary instance | check |
Read replica | |
Amazon Aurora Serverless v2 MySQL | |
Primary instance | check |
Read replica | |
Amazon Aurora Serverless v1 MySQL | |
Aurora Serverless v1 DB cluster | check |
Amazon RDS MySQL | |
Primary instance | check |
Read replica | check |
Azure Database for MySQL | |
Primary instance | check |
Read replica | check |
Google Cloud SQL for MySQL | |
Primary instance | check |
Read replica | check |
Features
Feature Name | Supported | Notes |
---|---|---|
Capture deletes | check | |
History mode | check | |
Custom data | check | |
Data blocking | check | |
Column hashing | check | |
Re-sync | check | |
API configurable | check | API configuration |
Priority-first sync | ||
Fivetran data models | ||
Private networking | check | |
Authorization via API | check |
Setup guide
This overview will give you a general idea of the kind of work needed to set up a MySQL connector. For specific instructions on how to set up your database, see the guide for your MySQL database type:
In your primary database, you need to do the following:
- Allow access to your MySQL database using Fivetran's IP
- (Optional) Allow access to a read replica of your MySQL database if you do not want to connect Fivetran to your production instance
- Create a Fivetran-specific MySQL user with read-level and other replication permissions as mentioned in the setup guides above.
WARNING: This user must be reserved for Fivetran use and unique to your connector.
- Reserved: Create a user for Fivetran's exclusive use. At the beginning of each sync, a Fivetran user attempts to kill any zombie processes it left behind on previous syncs. If you try to run any operations as this user, they may be killed.
- Unique: If you are creating multiple connectors targeting the same source database, create a unique user for each connector. If you set up multiple connectors with the same Fivetran user, they can kill one another's connections, slowing down your sync unnecessarily. NOTE: The Fivetran MySQL connector only supports the authentication plugin
caching_sha2_password
over TLS.
- If you connect Fivetran to a multithreaded replica and plan to run your connector on history mode, configure your read replica to allow history mode to run.
Sync overview
Once Fivetran is connected to your MySQL primary database or read replica, we pull a full dump of all selected data from your database. We then connect to your binary log to pull all your new and changed data at regular intervals. The binary log is the same change tracking mechanism that MySQL uses to perform its own replication for backups. If data in the source changes (for example, you add new tables or change a data type), Fivetran automatically detects and persists these changes into your destination.
Syncing empty tables and columns
Fivetran can sync empty tables and columns for your MySQL connector. An empty table has no rows, and an empty column has no data. A table that contains zero columns is invalid and can't be synced. For more information, see our Features documentation.
Schema information
Fivetran tries to replicate the exact schema and tables from your database to your destination according to our standard database update strategies. For every schema in the MySQL database that you connect, we create a schema in your destination that maps directly to its native schema. This ensures that the data in your destination is in a familiar format to work with.
Private Preview Fivetran allows you to control how source names for tables and columns are replicated to the destination. You can choose between preserving the original source names with UTF-8 values or standardizing the naming conventions for schemas, tables, and columns in the destination.
- To standardize the naming conventions in the destination, select Fivetran naming.
- To preserve the original source names in the destination, select Source naming.
You can modify this selection at any time before the initial sync completes, offering flexibility in how your data is structured in the destination. For more information, see our naming conventions documentation.
Fivetran-generated columns
Fivetran adds the following columns to every table in your destination:
_fivetran_deleted
(BOOLEAN) marks rows that were deleted in the source database._fivetran_synced
(UTC TIMESTAMP) indicates the time when Fivetran last successfully synced the row._fivetran_index
(INTEGER) shows the order of updates for tables that do not have a primary key._fivetran_id
(STRING) is the hash of the non-Fivetran values of each row. It's a unique ID that Fivetran uses to avoid duplicate rows in tables that do not have a primary key.
We add these columns to give you insight into the state of your data and the progress of your data syncs. For more information about these columns, see our System Columns and Tables documentation.
Type transformations and mapping
As we extract your data, we match MySQL data types to types that Fivetran supports. If we don't support a certain data type, we automatically change that type to the closest supported type or, for some types, don't load that data at all. Our system automatically skips columns with data types that we don't accept or transform.
The following table illustrates how we transform your MySQL data types into Fivetran supported types:
MySQL Type | Fivetran Data Type | Fivetran Supported | Notes |
---|---|---|---|
BINARY | BINARY | True | |
BIGINT | LONG | True | |
BIT | BOOLEAN | True | BIT type with a single digit is supported |
BLOB | BINARY | True | |
CHAR | STRING | True | |
DATE | DATE | True | Invalid values will be loaded as NULL or EPOCH if the type is a primary key |
DATETIME | TIMESTAMP_NTZ | True | Invalid values will be loaded as NULL or EPOCH if the type is a primary key |
DECIMAL/ NUMERIC | BIGDECIMAL | True | |
DOUBLE | DOUBLE | True | |
ENUM | STRING | True | |
FLOAT | DOUBLE | True | |
GEOMETRY | JSON | True | |
GEOMETRYCOLLECTION | JSON | True | |
JSON | JSON | True | |
INT | INTEGER | True | |
LINESTRING | JSON | True | |
LONGBLOB | BINARY | True | |
LONGTEXT | STRING | True | |
MEDIUMBLOB | BINARY | True | |
MEDIUMINT | INTEGER | True | |
MEDIUMTEXT | STRING | True | |
MULTILINESTRING | JSON | True | |
MULTIPOINT | JSON | True | |
MULTIPOLYGON | JSON | True | |
POINT | JSON | True | |
POLYGON | JSON | True | |
SET | JSON | True | |
SMALLINT | INTEGER | True | |
TIME | STRING | True | |
TIMESTAMP | TIMESTAMP | True | MySQL always stores timestamps in UTC Invalid values will be loaded as NULL or EPOCH if the type is a primary key |
TINYBLOB | BINARY | True | |
TINYINT | BOOLEAN | True | If the source TINYINT column has a specified width of exactly 1 (i.e., TINYINT(1) ) and contains values of only 0s and 1s, the destination type will be BOOLEAN. If it contains values other than 0 or 1, the destination type will be INTEGER. |
TINYINT | INTEGER | True | In all other cases, the destination type for TINYINT columns will be INTEGER. If the width isn't specified to be exactly 1 (either no specification or a value other than 1), the destination type will be INTEGER, even if the column contains only 1s or 0s. |
TINYTEXT | STRING | True | |
UNSIGNED BIGINT | BIGDECIMAL | True | |
UNSIGNED INT | LONG | True | |
UNSIGNED SMALLINT | INTEGER | True | |
VARCHAR | STRING | True | |
VARBINARY | BINARY | True | |
YEAR | INTEGER | True |
If we are missing an important data type that you need, reach out to support.
In some cases, when loading data into your destination, we may need to convert Fivetran data types into data types that are supported by the destination. For more information, see the individual data destination pages.
Unparsable values
When we encounter an unparsable value of one of the following data types, we substitute it with a default value. Which default value we use depends on whether the unparsable value is in a primary key column or non-primary key column:
MySQL Type | Primary Key Value | Non-Primary Key Value |
---|---|---|
DATE | 1970-01-01 | null |
DATETIME | 1970-01-01T00:00:00 | null |
TIMESTAMP | 1970-01-01T00:00:00Z | null |
Although we may be able to read some values outside the supported DATE, DATETIME, and TIMESTAMP ranges as defined by MySQL's documentation, there is no guarantee. Additionally, the special zero value 0000-00-00 00:00:00
is subject to this rule.
Supported character sets
Fivetran supports the following character sets in MySQL:
MySQL Character Set | Description |
---|---|
ascii | US ASCII |
big5 | Big5 Traditional Chinese |
cp1250 | Windows Central European |
cp1251 | Windows Cyrillic |
cp1256 | Windows Arabic |
cp1257 | Windows Baltic |
cp850 | DOS West European |
cp852 | DOS Central European |
cp866 | DOS Russian |
cp932 | SJIS for Windows Japanese |
euckr | EUC-KR Korean |
gb18030 | China National Standard GB18030 |
gb2312 | GB2312 Simplified Chinese |
gbk | GBK Simplified Chinese |
greek | ISO 8859-7 Greek |
hebrew | ISO 8859-8 Hebrew |
koi8r | KOI8-R Relcom Russian |
koi8u | KOI8-U Ukrainian |
latin1 | cp1252 West European |
latin2 | ISO 8859-2 Central European |
latin5 | ISO 8859-9 Turkish |
latin7 | ISO 8859-13 Baltic |
macce | Mac Central European |
macroman | Mac West European |
sjis | Shift-JIS Japanese |
tis620 | TIS620 Thai |
ucs2 | UCS-2 Unicode |
utf16 | UTF-16 Unicode |
utf16le | UTF-16LE Unicode |
utf32 | UTF-32 Unicode |
utf8mb3 | UTF-8 Unicode |
utf8mb4 | UTF-8 Unicode |
Using an unsupported character set will cause your sync to fail.
Excluding source data
If you don’t want to sync all the data from your primary database, you can exclude schemas, tables, or columns from your syncs on your Fivetran dashboard. To do so, go to your connector details page and uncheck the objects you would like to omit from syncing. For more information, see our Data Blocking documentation.
Alternatively, you can change the permissions of the Fivetran user you created and restrict its access to certain tables or columns.
IMPORTANT: We recommend blocking or hashing data in the Fivetran dashboard instead of restricting permissions. If you exclude a column by restricting the Fivetran user's permissions, your sync logs will indicate that the number of columns in the destination does not match the number of columns in the source table - in the
reason
field of thedata
object of theforced_resync_table
log event.
Note that column permissions will not shield confidential data, such as PII (Personally Identifiable Information), from Fivetran because they do not apply to binary logs (binlogs). Using your binlogs, Fivetran has access to the full contents of any changed rows, including columns that have no SELECT permissions. However, we filter out the prohibited columns at the earliest possible stage of our syncs and do not load them into your destination.
Unsupported schemas
Fivetran does not support the following system schemas:
- performance_schema
- information_schema
- mysql
- innodb
- looker_scratch
- Percona
- tmp
Initial sync
When Fivetran connects to a new database, we first copy all rows from every table in every schema for which we have SELECT permission (except those you have excluded in your Fivetran dashboard) and add Fivetran-generated columns. Tables are copied in ascending size order (from smallest to largest). We copy rows by performing a SELECT statement on each table. For large tables, we copy a limited number of rows at a time so that we don't have to start the sync over from the beginning if our connection is lost midway.
The duration of initial syncs can vary depending on the number and size of tables to be imported. We therefore interleave incremental updates with the table imports during the initial sync.
Updating data
Fivetran performs incremental updates of any new or modified data from your source database. We use one of the following incremental sync methods for incremental updates: binary log and Fivetran Teleport Sync.
Both methods keep a record of recent data changes, which allows Fivetran to update only the data that has changed since our last sync. We recommend the binary log method because it provides the most efficient mechanism for capturing change data from the database.
WARNING: Switching the incremental sync method will require a historical re-sync to guarantee data integrity.
Binary log vs. Fivetran Teleport Sync
There are several key differences between the binary log and Fivetran Teleport Sync incremental update methods:
Feature | Binary log | Fivetran Teleport Sync | Notes |
---|---|---|---|
Syncs tables with primary keys | Yes | Yes | Includes composite primary keys |
Syncs tables without primary keys | Yes | Yes1 | |
Syncs all supported data types2 | Yes | Partially 3 | |
Can read from a replica | Yes | Yes | |
Tracks deletes | Yes | Yes | |
Tracks DDL and DML changes | Partially4 | Partially4 | |
Table limit per sync | No | 400 tables | |
Row limit per table | No | 20 million rows | |
Supports GTID mode | Yes | No | |
Supports replication filtering | Yes | No |
1Fivetran Teleport Sync supports syncing tables without primary keys in Private Preview for existing connectors. All new connectors have this ability enabled automatically.
2See which data types the MySQL connector supports in our Type transformations and mapping documentation.
3Learn which data types Fivetran Teleport Sync supports in our Teleport Sync documentation.
4Fivetran does not support certain DDL and DML commands. Learn more in our Unsupported MySQL commands documentation.
Binary log
Fivetran uses your binary logs (also called binlogs) to request only the data that has changed since our last sync. We require the binlogs to be in row format, so that they contain a separate event in each row. We pull INSERT, UPDATE, and DELETE events from your binlogs at regular intervals and upload them to your destination.
How we load UPDATE events into your destination depends on whether or not the table has a primary key. To find out which of your tables have primary keys, run this query in your source database:
SELECT
max(tc.CONSTRAINT_TYPE = 'PRIMARY KEY') IS NOT NULL
AND max(tc.CONSTRAINT_TYPE = 'PRIMARY KEY') = TRUE AS has_primary_key,
c.TABLE_SCHEMA AS table_schema,
c.TABLE_NAME AS table_name
FROM information_schema.COLUMNS c
LEFT JOIN information_schema.KEY_COLUMN_USAGE k ON k.TABLE_SCHEMA = c.TABLE_SCHEMA AND
k.TABLE_NAME = c.TABLE_NAME AND
k.COLUMN_NAME = c.COLUMN_NAME
LEFT JOIN information_schema.TABLE_CONSTRAINTS tc ON tc.TABLE_SCHEMA = k.TABLE_SCHEMA AND
tc.TABLE_NAME = k.TABLE_NAME AND
tc.CONSTRAINT_NAME = k.CONSTRAINT_NAME
LEFT JOIN information_schema.TABLES t ON t.TABLE_SCHEMA = c.TABLE_SCHEMA AND
t.TABLE_NAME = c.TABLE_NAME
WHERE t.TABLE_TYPE = 'BASE TABLE'
AND c.TABLE_SCHEMA NOT IN ('performanceschema', 'informationschema', 'mysql', 'innodb', 'lookerscratch', 'tmp')
GROUP BY c.TABLE_SCHEMA, c.TABLE_NAME ORDER BY has_primary_key, c.TABLE_SCHEMA, c.TABLE_NAME;
Setup guide
In your primary database, you need to do the following:
- Enable ROW format binary log replication
- Set the
binlog_row_image
binary logging system variable toFULL
in global scope - Set binary log expiration to a minimum of one day (24 hours). We recommend setting the log expiration for seven days.
WARNING: Do not purge the binary log files manually. If the binary log files that the connectors look for are missing, the sync will fail and you will need to re-sync to fix the issue.
Limitations
Corrupted binary log files cause incremental updates to fail. Our updates fail because we cannot parse the entry events of corrupted binary log files, which is a limitation of MySQL database technology. A binary log can be corrupted when it is manually edited, the size of the event is more than 1 GB, or the network is unreliable. To avoid corrupted binary logs, do the following:
- Do not edit the binary log files manually.
- Write binlog events that are smaller than 1 GB.
- Make sure that the size of the event is less than the max_allowed_packet.
You can retry a failed incremental sync by manually triggering a full re-sync of the table.
NOTE: Full table re-syncs increase your MAR. To learn more, see our resync pricing documentation.
GTID mode
We also support replication using global transaction identifiers (GTIDs). In GTID mode, we stream incremental updates from your database using each transaction's GTID instead of its binary log position. Using GTIDs has the following advantages over normal binary log replication:
- You do not have to re-sync your connector if you switch to another replica with identical data
- Your database can failover to another replica with identical data without a binary log mismatch
- You do not have to manually rotate logs if you have a low-activity connector where all logs expire because of low transaction activity
GTID mode still uses your binary logs, so you must follow the binary log setup steps in your MySQL connector setup guide. In addition, you must set the GTID_MODE configuration parameter to ON in your source database (or for MariaDB, set GTID_STRICT_MODE to ON). GTID mode cannot function with GTID_MODE set to ON_PERMISSIVE. If GTID_MODE isn't set to ON in our database, we fall back to using only your binary log positions as described in the binary log section.
Learn more about GTIDs in MySQL's Replication with Global Transaction Identifiers documentation.
Replication filtering
Fivetran supports MySQL replicas configured to use replication filtering. Replication filtering may be beneficial if your database produces a large volume of binary logs, but you only want to sync a small portion of those events from your database (for example, include a single schema or exclude a high-volume schema or table).
This option may not be supported on hosted MySQL services such as Google Cloud SQL for MySQL, Amazon RDS MySQL, Amazon Aurora MySQL, or Azure Database for MySQL because it requires SUPER privileges to configure. See the documentation for your host's version of MySQL for more configuration information.
For maximum compatibility with Fivetran while using replication filtering, we recommend connecting to a MySQL replica and only configuring replicate-*
options. For example, if you have multiple schemas but you want Fivetran to see and replicate only the my-example
schema, you can configure a replication filter REPLICATE_DO_DB = 'my-example'
using either the CHANGE REPLICATION FILTER
statement, the MySQL command line arguments, or the my.cnf
configuration properties for this filter. Creating a replication filter ensures the MySQL replica ignores binary log events for all other schemas in the main MySQL database. The MySQL replica still requires binary logging to be enabled, so you must follow the binary log setup steps in your MySQL connector setup guide. Once configured, the MySQL replica contains only the schemas and tables configured (or excludes the configured schemas and tables if you use one of the *-ignore-*
options). The Fivetran connector connected to the MySQL replica has access to and replicates only those schemas and tables in the MySQL replica.
Fivetran Teleport Sync
Fivetran Teleport Sync is a proprietary incremental sync method that can incrementally replicate your database with no additional setup other than a read-only SQL connection.
Use Fivetran Teleport Sync when the binary log method is not an option for the following reasons:
- You cannot or do not want to provide Fivetran access to your database's change logs
- You need to limit the tables or columns that Fivetran can read from
- Binary log syncing is slow or resource intensive due to the number of updates
- You load data into the source database using a TRUNCATE/LOAD or BACKUP/RESTORE process that invalidates change logs
- You currently use snapshot syncing without incremental updates and want a faster method
Fivetran Teleport Sync's queries perform the following operations on your database:
- Do a full table scan of each synced table
- Perform calculations on all values in each synced table's rows
- Aggregate a compressed table snapshot in the database's memory
We do not support syncing more than 400 tables or tables with more than 20 million rows with Fivetran Teleport Sync for MySQL.
For optimum Fivetran Teleport Sync performance, we recommend that you make the following resources available in your database:
- 1 GB Free RAM
- 1 Free CPU Core
- IOPS (Teleport Sync times decrease linearly with an increase of available IOPS)
Private Preview Fivetran Teleport Sync supports syncing tables without primary keys in Private Preview for existing connectors. All new connectors have this ability enabled automatically. Reach out to Fivetran Support if you're interested in trying this functionality for existing connectors.
IMPORTANT: Tables without a primary key can have many exact duplicates of a row, but duplicate rows create data discrepancy issues in Fivetran Teleport Sync. We generate a
_fivetran_id
column for tables without a primary key, so many identical source rows may share a_fivetran_id
. In that case, only one copy of the duplicate rows is synced to the destination.
Fivetran Teleport Sync for MySQL supports the following data types:
Data Type | Regular Column Support | Primary Key Support | Notes |
---|---|---|---|
BLOB | Yes | Yes | |
BIGINT | Yes | Yes | |
BINARY | Yes | Yes | |
BIT | Yes | No | |
CHAR | Yes | Yes | |
DATE | Yes | Yes | Supported in Private Preview |
DATETIME | Yes | Yes | Supported in Private Preview |
DECIMAL | Yes | Yes | |
DOUBLE | Yes | No | |
ENUM | Yes | Yes | |
FLOAT | Yes | No | |
GEOMETRY | Yes | No | |
GEOMCOLLECTION | Yes | No | |
GEOMETRYCOLLECTION | Yes | No | |
JSON | Yes | No | |
INT | Yes | Yes | |
LONGBLOB | Yes | Yes | |
LONGTEXT | Yes | Yes | |
LINESTRING | Yes | No | |
MEDIUMBLOB | Yes | Yes | |
MEDIUMINT | Yes | Yes | |
MEDIUMTEXT | Yes | Yes | |
MULTILINESTRING | Yes | No | |
MULTIPOINT | Yes | No | |
MULTIPOLYGON | Yes | No | |
POINT | Yes | No | |
POLYGON | Yes | No | |
SET | Yes | No | |
SMALLINT | Yes | Yes | |
TEXT | Yes | Yes | |
TIME | Yes | Yes | |
TIMESTAMP | Yes | No | |
TINYBLOB | Yes | Yes | |
TINYINT | Yes | Yes | Only supported if there is no type parameter or the type parameter is greater than 1 |
TINYTEXT | Yes | Yes | |
VARBINARY | Yes | Yes | |
VARCHAR | Yes | Yes | |
YEAR | Yes | Yes |
IMPORTANT: Teleport for MySQL also supports tables with composite primary keys.
Tables with a primary key
We merge changes to tables with primary keys into the corresponding tables in your destination:
- An INSERT in the source table generates a new row in the destination with
_fivetran_deleted = FALSE
. - A DELETE in the source table updates the corresponding row in the destination with
_fivetran_deleted = TRUE
. - An UPDATE in the source updates the data in the corresponding row in the destination.
Tables without a primary key
NOTE: Fivetran Teleport Sync support for tables without a primary key is currently in Private Preview for existing connectors. All new connectors have this ability enabled automatically.
Fivetran Teleport Sync performs the following steps to sync tables without a primary key:
- Copy the table to a
TEMPORARY TABLE
on the MySQL source server. This table includes a generated primary key column,_fivetran_id
. Because this value depends on the values of the other columns in the row, any duplicate rows in the original table are removed. - Use the
TEMPORARY TABLE
to sync the data to the destination. - Once the sync is complete, drop the
TEMPORARY TABLE
.
NOTE: To sync tables without a primary key, Fivetran Teleport Sync requires the Fivetran database user to have the
CREATE TEMPORARY TABLES
permission.
For the binary log method, we handle changes to tables without a primary key differently:
- An INSERT in the source table generates a new row in the destination with
_fivetran_deleted = FALSE
. - The
_fivetran_id
column helps us handle DELETE operations:- If there is a row in the destination that has a corresponding
_fivetran_id
value, that row will be updated with_fivetran_deleted = TRUE
. - If there is not a row in the destination that has a corresponding
_fivetran_id
value, a new row will be added with_fivetran_deleted = TRUE
.
- If there is a row in the destination that has a corresponding
- An UPDATE in the source table is treated as a DELETE followed by an INSERT, so it results in two rows in the destination:
- A row containing the old values with
_fivetran_deleted = TRUE
- A row containing the new values with
_fivetran_deleted = FALSE
- A row containing the old values with
As a result, one record in your source database may have several corresponding rows in your destination. For example, suppose you have a products
table in your source database with no primary key:
description | quantity |
---|---|
Shrink-ray gun | 1 |
Boogie robot | 2 |
Cookie robot | 3 |
You load this table into your destination during your initial sync, creating this destination table:
description | quantity | _fivetran_synced | _fivetran_index | _fivetran_deleted | _fivetran_id |
---|---|---|---|---|---|
Shrink-ray gun | 1 | '2000-01-01 00:00:00' | 0 | FALSE | asdf |
Cookie robot | 2 | '2000-01-01 00:00:00' | 1 | FALSE | dfdf |
Boogie robot | 3 | '2000-01-01 00:00:00' | 2 | FALSE | ewra |
You then update a row:
UPDATE products SET quantity = 4 WHERE description = 'Cookie robot';
After your UPDATE operation, your destination table will look like this:
description | quantity | _fivetran_synced | _fivetran_index | _fivetran_deleted | _fivetran_id |
---|---|---|---|---|---|
Shrink-ray gun | 1 | '2000-01-01 00:00:00' | 0 | FALSE | asdf |
Cookie robot | 2 | '2000-01-01 00:00:00' | 3 | TRUE | dfdf |
Boogie robot | 3 | '2000-01-01 00:00:00' | 2 | FALSE | ewra |
Cookie robot | 4 | '2000-01-01 00:00:00' | 4 | FALSE | zxfd |
You then delete a row:
DELETE FROM products WHERE description = 'Boogie robot';
After your DELETE operation, your destination table will look like this:
description | quantity | _fivetran_synced | _fivetran_index | _fivetran_deleted | _fivetran_id |
---|---|---|---|---|---|
Shrink-ray gun | 1 | '2000-01-01 00:00:00' | 0 | FALSE | asdf |
Cookie robot | 2 | '2000-01-01 00:00:02' | 3 | TRUE | dfdf |
Cookie robot | 4 | '2000-01-01 00:00:02' | 4 | FALSE | zxfd |
Boogie robot | 3 | '2000-01-01 00:00:02' | 5 | TRUE | ewra |
So, while there may be just one record in your source database where description = Cookie robot
, there are two in your destination - an old version where _fivetran_deleted = TRUE
, and a new version where _fivetran_deleted = FALSE
.
We also de-duplicate rows before we load them into your destination. We use the _fivetran_id
field, which is the hash of the non-Fivetran values in every row, to avoid creating multiple rows with identical contents. If, for example, you have the following table in your source:
description | quantity |
---|---|
Shrink-ray gun | 1 |
Shrink-ray gun | 1 |
Shrink-ray gun | 1 |
Then your destination table will look like this:
description | quantity | _fivetran_synced | _fivetran_index | _fivetran_deleted | _fivetran_id |
---|---|---|---|---|---|
Shrink-ray gun | 1 | '2000-01-01 00:00:00' | 0 | FALSE | asdf |
Deleted rows
We don't delete rows from the destination, though the way for how we process deletes differs for tables with primary keys and tables without a primary key. We handle deletes as part of streaming changes from the binlog. Note that we only process DELETE events from the binlog.
Deleted columns
We don't delete columns from your destination. When a column is deleted from the source table, we replace the existing values in the corresponding destination column with NULL
values.
Unsupported MySQL commands
Fivetran does not support several MySQL commands:
DROP
(and re-creating the dropped table withCREATE
)LOAD
RENAME
ON DELETE CASCADE
ON UPDATE CASCADE
If you use any of these unsupported commands to delete or update the records in a table, those records won't be replicated correctly in your destination.
Adding and dropping columns
When you add or drop a column, we attempt to automatically migrate your destination schema to the new table structure. In some cases, we won't be able to do this and instead perform an automatic re-sync of the changed table.
In the following scenarios, Fivetran will re-sync your table instead of automatically migrating it:
- Adding or dropping a column on a table without a primary key
- Changing column order
- Changing primary keys
- Modifying
ENUM
orSET
columns - Adding or dropping a column using a DDL statement that we're unable to parse properly
NOTE: For tables in History mode with a
DROP COLUMN
DDL applied, updates to the dropped column that occur during the DDL-containing sync (but before the DDL statement is processed) will appear asNULL
.
Renaming tables or columns
Renaming involves following three queries:
ALTER TABLE tbl_name RENAME [TO | AS] new_tbl_name
ALTER TABLE tbl_name CHANGE [COLUMN] old_col_name new_col_name column_definition
[FIRST | AFTER col_name]
ALTER TABLE tbl_name RENAME COLUMN old_col_name TO new_col_name
When you rename a table, we reimport the table as a new table, but we will not soft delete the old table in the destination. When you rename a column, both the old column and new column will appear in the table. We only update data in the new column moving forward.
Adding a column in Block All mode
When our connector detects a new column in a source table, it automatically performs a schema migration on that table unless you've chosen to block all newly-added tables and columns from syncing. If you excluded newly-added tables and columns from your syncs, you must manually re-sync your table whenever you add a new column.
Migrating service providers
If you want to migrate service providers, we will need to do a full re-sync of your data because the new service provider won't retain the same change tracking data as your original MySQL database.