Technical Reference

Need to get your connection up and running quickly?

Get free help to write and deploy your first Connector SDK connector. Our Professional Services experts are available to provide guidance on setup, troubleshooting, and best practices. To get started, file a support ticket.

Save time now

This section documents methods and operations used by Connector SDK as well as the Connector object that needs to be declared in your connector.

Technical details - `fivetran-connector-sdk` commands

The following CLI commands are available for the fivetran-connector-sdk PyPI package:

fivetran deploy: Deploys your code to Fivetran and creates or updates the connection. If the connection does not exist, it creates one. If a connection already exists, it prompts you to confirm whether to overwrite it. The available fivetran deploy command parameters are:
- --api-key <BASE_64_ENCODED_API_KEY>: Specifies your Fivetran base64-encoded API key which is used for deploying the connection.
- --destination <DESTINATION_NAME>: Specifies the destination name in your Fivetran account.
- --connection <CONNECTION_NAME>: Specifies the connection name in your Fivetran destination.
- --configuration (OPTIONAL): Determines the configuration values for your connection.
- --force (OPTIONAL): Automatically answers "Yes" to all confirmation prompts, and "No" to dependency validation prompts, removing the need for manual approval.
- --python or --python-version (OPTIONAL): Specifies the Python version to be used at runtime. Refer to Python version support documentation to see the list of supported versions.
- --hybrid-deployment-agent-id <NON_DEFAULT_HYBRID_DEPLOYMENT_AGENT_ID> (OPTIONAL): Specifies the hybrid deployment agent within the Fivetran system. Use this argument if you have specified a destination set up for hybrid deployment and you would like the hybrid deployment agent for your connection to be different from the default agent for that destination.

The fivetran deploy command executed in the CLI will prompt you to enter parameter values if issued without them. The command will exit after 3 invalid responses. We have also provided an easy way to set default values for parameters by using environment variables

fivetran debug: Runs and debugs the code locally. It tests and troubleshoots the connection's behavior with the source's actual data and generates a local warehouse.db file, a DuckDB representation of the data delivered by the connection to the destination. This process internally emulates Fivetran's core. The available fivetran debug command parameters are:
- --configuration (OPTIONAL): Specifies the configuration file for running the code locally.
fivetran reset: Resets the locally saved cursor and warehouse.db files, allowing you to re-run fivetran debug from scratch. Use it to quickly simulate an initial historical sync.
fivetran version: Displays the currently installed CLI version. This is required for getting support from Fivetran and troubleshooting issues.
fivetran --help: Returns a list of all possible commands and their parameters.

You can also specify your project's path along with a fivetran-connector-sdk command if your current working directory doesn't contain the connector.py file.

For example, you can use:

fivetran debug "/Users/fivetran/connector_sdk/sample_connector" 
# where current working directory is "/Users/fivetran"
# and project path is "/Users/fivetran/connector_sdk/sample_connector"

Technical details - required imports

Before you start implementing your connector, ensure you include the following imports at the top of your connector.py file:

from fivetran_connector_sdk import Connector # For supporting Connector operations like Update() and Schema()
from fivetran_connector_sdk import Operations as op # For supporting Data operations like Upsert(), Update(), Delete() and checkpoint()
from fivetran_connector_sdk import Logging as log # For enabling Logs in your connector code

Technical details - methods

Our Connector SDK supports the following methods.

`Update()`

This is a required method.

update(configuration: dict, state: dict) must contain the yield statement with operations to send your data to Fivetran.

Update() is called when the sync starts. Fivetran passes two dictionaries to the method:

The configuration dictionary contains any secrets or payloads you configure when deploying the connector. Use this dictionary to access any configuration values you need. See our configuration example to learn how to use configuration values from this dictionary.
The state dictionary is empty for the first sync or for any full re-sync. In all other cases, it contains whatever state you have chosen to checkpoint during the prior sync. In some of our more complex examples, e.g., weather, you can see how this is used to track the state of your data connection and achieve incremental syncs efficiently.

`Schema()`

This is an optional method. If not implemented, Fivetran infers the schema from the data you send us and the Schema tab on the connection setup page remains empty until the first sync is completed.

schema(configuration: dict) Fivetran passes one dictionary to the method:

The configuration dictionary contains any secrets or payloads you configure when deploying the connector. Please use this dictionary to access any configuration values you need. You can see our configuration example to refer how to use configuration values from this dictionary.

It must return a JSON object containing the following keys:

The table key is required and specifies the name of the table.
The primary_key key is optional but recommended. The value is a list of one or more primary keys. The content of the list is used as the table's primary key; a single entry means a simple primary key while multiple entries are combined to create a composite primary key for the table. We recommend that you provide primary keys for your tables. If you don't, we will use all columns for generating a unique hash to be used as a primary key and store it in a column named _fivetran_id. See our System Columns documentation for more details.
The column key is optional, it contains a dictionary of column names and data types.

The values you provide for the table, primary_key and column keys are renamed based on our renaming rules so that the corresponding names in the destination may differ from how they are set in your code. If you want the table and column names in the destination to exactly match the names you set in your code, we recommend adhering to the renaming rules ensuring the names align with the pattern and character set of transformed names. This means that names of the tables and columns in your source may not exactly match the corresponding names in the destination.

The Schema() method lets you configure the schema your connector delivers. We infer the schema for data you send us if you do not define it. However, if you want to set a primary key for a table or configure columns to have specific data types, then use this method.

If you don't provide the primary key to use in a table, Fivetran creates a surrogate primary key column named _fivetran_id which is a hashed value generated based on the row's set of values. See our system columns documentation for more details.

Our Connector SDK GitHub Repo has many examples of how to use the schema() method.

For an example of defining a schema with one table, see our weather example.
For an example with multiple tables, see our using_pd_dataframes example.
For an example of a table with multiple keys, see our records_with_no_created_at_timestamp example.
For an example of how columns in a schema response can be defined with specific data types, see our specified_types example.

If a new row is received with a different set of columns, we calculate the hash from the new row's values, including values from any new columns. This can lead to duplicate rows or data integrity issues in the destination. In this case, you may have to drop and re-sync the connection to preserve data integrity. Thus, we recommend defining primary keys for your tables to avoid unexpected behavior.

If you need to change primary key selections for a table, drop the table in your destination and then select Resync all historical data on the connection's Setup tab in your dashboard. Doing so maintains data integrity across all records.

Example of data duplication

Assume Fivetran receives the following row for a table not defined in the schema or defined without a primary key:
_id foo name _fivetran_id
1 abc John Doe 96DE69AE1728658394E4EAE664431F1A4E7857E4

_id	foo	name	_fivetran_id
1	`abc`	`John Doe`	`96DE69AE1728658394E4EAE664431F1A4E7857E4`

The generated hashed value would be from the values of the three columns.

Consider we receive the same row with an additional column:
_id foo name bar _fivetran_id
1 abc John Doe 96DE69AE1728658394E4EAE664431F1A4E7857E4
1 abc John Doe xyz 2AC47E18D9FCBC35B6DB94EA4FE4227A3A67A7F8

_id	foo	name	bar	_fivetran_id
1	`abc`	`John Doe`		`96DE69AE1728658394E4EAE664431F1A4E7857E4`
1	`abc`	`John Doe`	`xyz`	`2AC47E18D9FCBC35B6DB94EA4FE4227A3A67A7F8`

The generated hashed value would differ from the first row as the hashed value is calculated from the values of all the columns. This would cause the same row to be duplicated in the destination.

The Schema() method must return a JSON dictionary containing a list of dictionary objects. Each object represents one table.

Syncing empty tables and columns

Fivetran creates tables and columns in your destination for any column declared in the schema() method, even if there is no data sent for that column.

For more information, see our Features documentation.

Supported data types

The following data types are supported in the Fivetran Connector SDK:

BOOLEAN
SHORT
INT
LONG
DECIMAL
FLOAT
DOUBLE
NAIVE_DATE
NAIVE_DATETIME
UTC_DATETIME
BINARY
XML
STRING
JSON

If unspecified, Fivetran infers the data type automatically based on the data values. Additionally, None values are interpreted as NULL, and NaN values as FLOAT.

We cannot implicitly infer list objects as JSON. You must explicitly declare them as JSON in theSchema() method.

Technical details - required object `connector`

Our Connector SDK requires the following object to be declared in your code.

Your connector.py file must include an initialization of the Connector object as follows:

If you implement both the Update() and Schema() methods:

connector = Connector(update=update, schema=schema)

If you implement only the Update() method:
```
connector = Connector(update=update)
```

Technical details - operations

The values you provide for the table and column keys are renamed based on our renaming rules so that the corresponding names in the destination may differ from how they are set in your code. If you want the table and column names in the destination to exactly match the names you set in your code, we recommend adhering to the renaming rules ensuring the names align with the pattern and character set of transformed names. This means that names of the tables and columns in your source may not exactly match the corresponding names in the destination.

Our Connector SDK offers the following operations to deliver data to Fivetran:

`Upsert()`

upsert (table=”three”, data=data)

Writes data to the target table, using the defined primary keys of the table to either create a new row or update an existing row. Columns present in your table and not present in the data passed in the method will be populated with NULL.

The data parameter accepts a Python dictionary (similar to a JSON object) that holds key-value pairs. The key represents the name of a column in the target table, and the value represents the value to be upserted. The data type of the value can be any of the supported data types. It is crucial to ensure the data type of the value matches the data type defined in the table's schema for the corresponding column.

`Update()`

update (table=”three”, modified=data)

Writes data to the table using the primary keys to identify which row to update. This operation does not write data with new primary keys to your destination. Columns present in your table and not present in the data passed in the method will be left unchanged.

The modified parameter accepts a Python dictionary (similar to JSON object) that holds key-value pairs. The key represents the name of a column in the target table, and the value represents the value to be updated. The data type of the value can be any of the supported data types.

modified = {
    "primary_key_column": "value",
    "column_1": "value",
    "column_2": "value"
}

If there is a composite primary key containing multiple columns, all the columns must be present inside the dictionary for the correct row update.

`Delete()`

delete (table=”three”, keys=data)

Sets the _fivetran_deleted column value to TRUE for rows with the provided primary keys in the target table.

The keys parameter accepts a Python dictionary (similar to a JSON object) that specifies the rows to be marked as deleted. It contains key-value pairs where the key is the name of a primary key column in the table and the value is the value of that primary key column for the row to be deleted.

keys = {
    "primary_key_column": "value"
}

If there is a composite primary key comprising multiple columns, all the columns must be present inside the dictionary for the correct row to be marked as deleted.

`Checkpoint()`

checkpoint (state=new_state)

Updates state: dict with new_state and tells Fivetran that the data sent up until this point can be safely written to your destination. This is used to enable incremental syncs as well as safely break large syncs ensuring data is delivered to the destination periodically. Fivetran does not save any values in state automatically; only the contents of new_state are applied as they are passed. You must pass the state as a JSON string that represents a single JSON object (i.e., the decoded result must be a dictionary — not an array, string, or number).

See the following example structure and contents of a state.json file:

{
    "company_cursor": "2024-08-14T02:01:00Z",
    "department_cursor": {
        "1": "2024-08-14T03:00:00Z",
        "2": "2024-08-14T06:00:00Z"
    },
    "offset": 80
}

All but the most simple connectors need to use checkpoint() so that your connection does not reprocess data, especially when long sync fails due to any underlying reason. See our recommendations on checkpointing for large data sets.

Re-sync connection

You can run a full connection re-sync in your Fivetran dashboard.

If you want to re-sync just the affected table(s), use the REST API to modify the connections's state. Make sure you build your connector in such a way that you can modify the state to re-sync particular table(s).

Technical details - logging

We recommend using logging in your connector code, as it can help in debugging and observability. Your connector.py file must include logging as follows:

from fivetran_connector_sdk import Logging as log

Logging levels

We support logs at the following three levels in production:

INFO - for all informational logs such as status, start, pause, exit, etc.
WARNING - for less severe error conditions that could degrade the flow in the future if not addressed.
SEVERE - for error conditions and failures that cause significant issues to current flows and execution.

INFO logs have a rate limit of 1500 logs per minute. WARNING and SEVERE logs are not subject to this limit.

We additionally support one more level for debugging your code locally:

FINE - for detailed low-level logs needed while testing and building your code.
The log messages of this level (for example, log.fine("Debugging the data transformation process.")) are only generated when running the fivetran debug command when you debug your code. It is safe to leave the log.fine logs in your code, but they will not appear in the logs once you have deployed the connector.

Handling Exceptions

We expect you to throw errors from the connector.py file, so that you can see an error in the dashboard in the form of a task.

Avoid using exit() to terminate the Python code as this can cause the connector to become stuck. You can either raise an error or catch and display the error using the log.severe(), which will be displayed in your sync logs.

To fail a sync by throwing an exception, use any RuntimeError -

raise RuntimeError(f"Value not expected. Value: {value}")

To handle the error using try-catch and display the error traceback in logs -

try:
    // Some Code
catch exception as e:
    log.severe("Value not expected.", e)

Logging syntax - examples

Each logging method accepts only one argument. Refer to the following examples for your understanding:

log.fine("Debugging the data transformation process.")

log.info("Connector started successfully.")
log.info("Initial state:" + repr(state))

log.warning("Data source response time is slower than expected.")

log.severe("Failed to connect to the data source.")

You can check our weather example, which uses info and fine-level logging in the update() method for reference.

Ensure you are not adding excessive logs by accident. For example, avoid placing a log after each record, as it can increase the log volume and cause logs to be discarded by our system due to logging rate limits.

Accessing Connector SDK logs

You can access your Connector SDK connections' log events by:

Navigating to the Connector SDK logs tab of your connection details page in the Fivetran dashboard.
Using the Fivetran Platform Connector. Your connections' logs are available in the CONNECTOR_SDK_LOG table within the associated Fivetran destination.
Using external log services, previously configured for your destination.

Connector SDK logs provide in-depth event data, including timestamps, log levels, and messages. We prefix all Fivetran process logs with the Fivetran-Platform identifier, and logs from the fivetran-connector-sdk library are prefixed with the Fivetran-Connector-sdk identifier.

Logs when running the Local Tester

When running Fivetran's Local Tester, fivetran debug logs are printed from two asynchronous sources: the logs from your connector.py code and the logs from the Local Tester.

Connector SDK logs

The Fivetran Local Tester methods are not blocking calls, so your connector.py code continues to execute after we enqueue the instruction. Because the execution of your connector's code and Fivetran’s Local Tester are not synchronous, the logs related to the Local Tester are also not synchronous. The logs from your connector.py code are printed immediately whereas the logs from the Local Tester are printed when the corresponding operations have been executed by the tester. To help differentiate the Local Tester and Library logs from your connection logs, the Local Tester logs are prefixed with the Fivetran-Tester-Process identifier, and logs from the fivetran-connector-sdk library prefixed with the Fivetran-Connector-sdk identifier.

Furthermore, for connection tests with a large volume of data, the Local Tester process might take a long time to complete, longer than the connector.py code takes to complete. For this, a periodic log displaying counts of different operations completed is printed every 5 min.
See the following example of such a log:

Dec 19, 2024 01:41:52 AM Fivetran-Tester-Process: INFO: Sync Progress 
Operation       | Call Count     
----------------|------------
Upserts         | 4         
Updates         | 0         
Deletes         | 0         
Truncates       | 0         
SchemaChanges   | 2         
Checkpoints     | 3

Log example

Expand to see the log

Dec 18, 2024 10:04:19 PM Fivetran-Tester-Process: INFO: Configuration:
{} 
Dec 18, 2024 10:04:19 PM Fivetran-Tester-Process: INFO: Previous state:
{} 
Dec 18, 2024 10:04:19 PM Fivetran-Tester-Process: INFO: [SchemaChange]: tester.company 
Dec 18, 2024 10:04:19 PM Fivetran-Tester-Process: INFO: [CreateTable]: tester.company 
Dec 18, 2024 10:04:19 PM Fivetran-Tester-Process: INFO: [SchemaChange]: tester.department 
Dec 18, 2024 10:04:19 PM Fivetran-Tester-Process: INFO: [CreateTable]: tester.department 
Dec 18, 2024 10:04:19 PM WARNING: Example: Common Patterns For Connectors - Cursors - Multiple Tables With Cursors
Dec 18, 2024 10:04:19 PM INFO: Upserting
Dec 18, 2024 10:04:30 PM INFO: Checkpointing
Dec 18, 2024 10:04:35 PM INFO: Upserting
Dec 18, 2024 10:04:35 PM Fivetran-Tester-Process: INFO: Checkpoint: {"company_cursor": "2024-08-14T01:00:00Z"} 
Dec 18, 2024 10:04:45 PM INFO: Checkpointing
Dec 18, 2024 10:04:55 PM INFO: Upserting
Dec 18, 2024 10:04:55 PM Fivetran-Tester-Process: INFO: Checkpoint: {"company_cursor": "2024-08-14T01:00:00Z", "department_cursor": {"1": "2024-08-14T01:00:00Z"}} 
Dec 18, 2024 10:05:05 PM INFO: Checkpointing
Dec 18, 2024 10:05:15 PM INFO: Upserting
Dec 18, 2024 10:05:15 PM Fivetran-Tester-Process: INFO: Checkpoint: {"company_cursor": "2024-08-14T01:00:00Z", "department_cursor": {"1": "2024-08-14T02:00:00Z"}} 
Dec 18, 2024 10:05:25 PM INFO: Checkpointing
Dec 18, 2024 10:05:25 PM Fivetran-Tester-Process: INFO: Sync Progress 
Operation       | Call Count     
--------------------+------------
Upserts         | 4         
Updates         | 0         
Deletes         | 0         
Truncates       | 0         
SchemaChanges   | 2         
Checkpoints     | 3   


Dec 18, 2024 10:05:35 PM INFO: Upserting
Dec 18, 2024 10:05:35 PM Fivetran-Tester-Process: INFO: Checkpoint: {"company_cursor": "2024-08-14T01:00:00Z", "department_cursor": {"1": "2024-08-14T03:00:00Z"}} 
Dec 18, 2024 10:05:45 PM INFO: Checkpointing
Dec 18, 2024 10:05:50 PM INFO: Upserting
Dec 18, 2024 10:05:50 PM Fivetran-Tester-Process: INFO: Checkpoint: {"company_cursor": "2024-08-14T02:01:00Z", "department_cursor": {"1": "2024-08-14T03:00:00Z"}} 
Dec 18, 2024 10:06:00 PM INFO: Checkpointing
Dec 18, 2024 10:06:10 PM INFO: Upserting
Dec 18, 2024 10:06:10 PM Fivetran-Tester-Process: INFO: Checkpoint: {"company_cursor": "2024-08-14T02:01:00Z", "department_cursor": {"1": "2024-08-14T03:00:00Z", "2": "2024-08-14T01:00:00Z"}} 
Dec 18, 2024 10:06:20 PM INFO: Checkpointing
Dec 18, 2024 10:06:30 PM INFO: Upserting
Dec 18, 2024 10:06:30 PM Fivetran-Tester-Process: INFO: Checkpoint: {"company_cursor": "2024-08-14T02:01:00Z", "department_cursor": {"1": "2024-08-14T03:00:00Z", "2": "2024-08-14T02:00:00Z"}} 
Dec 18, 2024 10:06:30 PM Fivetran-Tester-Process: INFO: Sync Progress 
Operation       | Call Count     
--------------------+------------
Upserts         | 7        
Updates         | 0         
Deletes         | 0         
Truncates       | 0         
SchemaChanges   | 2         
Checkpoints     | 7   
      
Dec 18, 2024 10:06:40 PM INFO: Checkpointing
Dec 18, 2024 10:06:50 PM Fivetran-Tester-Process: INFO: Checkpoint: {"company_cursor": "2024-08-14T02:01:00Z", "department_cursor": {"1": "2024-08-14T03:00:00Z", "2": "2024-08-14T03:00:00Z"}} 
Dec 18, 2024 10:06:50 PM Fivetran-Tester-Process: INFO: Sync Progress 
Operation       | Call Count     
--------------------+------------
Upserts         | 16        
Updates         | 0         
Deletes         | 0         
Truncates       | 0         
SchemaChanges   | 2         
Checkpoints     | 16   
      

Dec 18, 2024 10:06:50 PM Fivetran-Tester-Process: INFO: Sync SUCCEEDED

Technical details - Modes

History Mode

Currently, the Connector SDK does not support History Mode. However, you can imitate the behavior by following these steps:

Identify a column or field in the data that represents the timestamp of when a record is updated.
Include the identified column as part of the primary key (composite key).

This approach ensures that updated records are not overwritten but instead stored with their changed values over time.

For reference, check out the NewsAPI example, which implements this method to mimic History Mode behavior in the Connector SDK.

Technical Reference

Technical details - fivetran-connector-sdk commands

Technical details - required imports

Technical details - methods

Update()

Schema()

Example of data duplication

Syncing empty tables and columns

Supported data types

Technical details - required object connector

Technical details - operations

Upsert()

Update()

Delete()

Checkpoint()

Re-sync connection

Technical details - logging

Logging levels

Handling Exceptions

Logging syntax - examples

Accessing Connector SDK logs

Logs when running the Local Tester

Log example

Technical details - Modes

History Mode

Technical details - `fivetran-connector-sdk` commands

`Update()`

`Schema()`

Technical details - required object `connector`

`Upsert()`

`Update()`

`Delete()`

`Checkpoint()`