HubSpot

HubSpot is an inbound marketing and sales software that helps companies attract visitors, convert leads, and close customers.

Features

Feature Name	Supported	Notes
Capture deletes		`ASSOCIATION_TYPE`, `COMPANY`, `COMPANY_LIST`, `COMPANY_LIST_MEMBER`,`CONTACT`, `CONTACT_LIST`, `CONTACT_LIST_MEMBER`, `CUSTOM_EVENT_DEFINITION`, `CUSTOM_EVENT_DEFINITION_PROPERTY`, `DEAL`, `DEAL_PIPELINE`, `DEAL_PIPELINE_STAGE`, `FEEDBACK_SUBMISSION`, `FORM`, `GOAL`, `LEAD`, `LEAD_PIPELINE`, `LEAD_PIPELINE_STAGE`, `LINE_ITEM`, `MARKETING_CAMPAIGN`, `OWNER_TEAM`, `PRODUCT`, `QUOTE`, `ROLE`, `TEAM`, `TEAM_USER`, `TICKET`, `TICKET_PIPELINE`, `TICKET_PIPELINE_STAGE`, and `USERS` tables.
History mode		`COMPANY_PROPERTY_HISTORY`, `CONTACT_PROPERTY_HISTORY`, `DEAL_PROPERTY_HISTORY`, `DEAL_STAGE`,`FEEDBACK_SUBMISSION_PROPERTY_HISTORY`, `GOAL_PROPERTY_HISTORY`,`LEAD_PROPERTY_HISTORY`, `LINE_ITEM_PROPERTY_HISTORY`, `PRODUCT_PROPERTY_HISTORY`, `QUOTE_PROPERTY_HISTORY`, `INVOICE_PROPERTY_HISTORY` and `TICKET_PROPERTY_HISTORY` tables \*.
Custom data		`COMPANY`, `CONTACT`, `DEAL`, `LINE ITEMS`, and `PRODUCT` tables only. We support custom objects.
Data blocking		Column level
Column hashing
Re-sync		Connection level and table level
API configurable		API configuration
Priority-first sync		`CUSTOM_EVENT`, `EMAIL_EVENT_BOUNCE`, `EMAIL_EVENT_CLICK`, `EMAIL_EVENT_DEFERRED`, `EMAIL_EVENT_DELIVERED`, `EMAIL_EVENT_DROPPED`, `EMAIL_EVENT_FORWARD`, `EMAIL_EVENT_OPEN`, `EMAIL_EVENT_PRINT`, `EMAIL_EVENT_SENT`, `EMAIL_EVENT_SPAM_REPORT`, `EMAIL_EVENT_STATUS_CHANGE`, `EMAIL_EVENT_SUPPRESSED`, `EVENT`, `FEEDBACK_SUBMISSION`, `FEEDBACK_SUBMISSION_CONTACT`, `FEEDBACK_SUBMISSION_PROPERTY_HISTORY`, `FEEDBACK_SUBMISSION_TICKET`, `LEAD`, `LEAD_COMPANY`, `LEAD_CONTACT`, `LEAD_PROPERTY_HISTORY`,`INVOICE`, `INVOICE_COMPANY`, `INVOICE_CONTACT` `INVOICE_DEAL`,`INVOICE_LINE_ITEM` , `INVOICE_TICKET`, and `INVOICE_PROPERTY_HISTORY` tables.
Fivetran data models		Get the models: source / transform; Get the unified RAG model; Supports Quickstart data models
Private networking
Authorization via API

The HubSpot connection doesn't support switching modes because we fetch history data directly from the APIs. We maintain two separate tables for HubSpot connection: one that reflects the current state and one for history mode. For example, on the connection's Schema tab, you will have two tables, COMPANY and COMPANY_PROPERTY_HISTORY. You can sync the history table only if you have included the current state table in your syncs.

Supported products

Product Name
HubSpot CRM
Marketing Hub
Sales Hub
Service Hub

Supported deployment models

We support the SaaS Deployment model for the connector.

Setup guide

Follow our step-by-step HubSpot setup guide to connect HubSpot with your destination.

Sync overview

Sync strategy

Sync strategy for DEAL tables

We incrementally sync the DEAL table.

The DEAL table has two child tables, DEAL_PROPERTY_HISTORY and DEAL_STAGE. We don't store the deal_stage_property in the DEAL_PROPERTY_HISTORY table. The DEAL_STAGE table stores the deal_stage_property and its historical versions.

Sync strategy for ENGAGEMENT tables

We incrementally sync the ENGAGEMENT and ENGAGEMENT_* tables according to the sync frequency you set in your Fivetran dashboard.

We sync metadata for the following engagement types into the corresponding destination tables:

NOTE
EMAIL
TASK
PUBLISHING_TASK
MEETING
CALL

We sync engagement metadata only if you select the corresponding engagement type table(s) above. The metadata is stored in the following tables:

ENGAGEMENT
ENGAGEMENT_COMPANY
ENGAGEMENT_CONTACT
ENGAGEMENT_DEAL
ENGAGEMENT_GOAL
ENGAGEMENT_LEAD
ENGAGEMENT_PROPERTY_HISTORY

If we detect a new engagement type, we skip the metadata for the new engagement.

Sync strategy for EMAIL EVENT tables

In every sync, we re-fetch data that is twenty-five hours behind the data synced in the current incremental sync for the EMAIL_EVENT_* tables you have selected. We do this to capture events we may have missed because of HubSpot's event processing delays.

The EMAIL_EVENT table syncs data for only the event types you define in the source EMAIL_EVENT_* child tables.

To sync the EMAIL_EVENT table, make sure to select all the relevant EMAIL_EVENT_* child tables in the Schema tab of your connection's dashboard. If you don't select any of these child tables, we don't sync data into the EMAIL_EVENT table.

HubSpot's Email Events API sends bot events that may increase your total event count. To exclude these bot events, query the events with filtered_event = false.

Sync strategy for PROPERTY_HISTORY tables

Enabling history mode increases your monthly active rows (MAR) consumption because every change is recorded as a new row that counts towards MAR. We do incremental syncs of the history tables.

The CONTACT_PROPERTY_HISTORY table contains different versions of each record, and syncing the table may contribute to higher MAR usage because of its size.

When deleted records are restored at source, we sync their property history changes only after the time they were restored.

Sync strategy for Web Analytics tables

To keep the analytics data updated, during incremental sync, we sync the data for the following granular analytics tables with the corresponding rollback sync durations:
- *_ANALYTICS_DAILY_REPORT - 30 days
- *_ANALYTICS_WEEKLY_REPORT - Four weeks
- *_ANALYTICS_MONTHLY_REPORT - Two months
For example, if the data for the GEOLOCATION_ANALYTICS_DAILY_REPORT table is synced till 2023-03-30 in the last sync, then in the current sync the analytics data is synced from 2023-03-01 till the current date.
This causes higher MAR usage for these tables at the beginning of each month.
Fivetran performs full table re-syncs for all the *_ANALYTICS_OVERALL_REPORT tables in every sync, as they represent the rolled up analytics from the start to the current date.
For more information about MAR usage for re-imported tables, see our Troubleshooting documentation.
For GEOLOCATION_ANALYTICS_*_REPORT tables, a breakdown column value of 7081c5b2-d128-4ec1-a9be-cba29cfc540a indicates the visits for which HubSpot is unable to capture the country code data.
For the *_ANALYTICS_WEEKLY_REPORT and *_ANALYTICS_MONTHLY_REPORT tables, the values of the date column represent the starting date of a week and month respectively. For example, if the value of date column for a *_ANALYTICS_WEEKLY_REPORT table record is 2023-08-03, then the record represents analytics data for the time period of a week starting on 2023-08-03 and ending on 2023-08-09.
For the *_ANALYTICS_OVERALL_REPORT tables, a breakdown column value of totals indicates the aggregated data of all other breakdown column values for that table.
The HubSpot connection syncs the *_ANALYTICS_*_REPORT tables depending on the HubSpot account's permissions.

Re-import tables

We re-import the following tables in every sync to ensure that you have the latest data:

For users on legacy pricing plan, re-importing the tables below will result in higher MAR usage at the beginning of each month.

ASSOCIATION_TYPE
CUSTOM_EVENT_DEFINITION
CUSTOM_EVENT_DEFINITION_ASSOCIATION
CUSTOM_EVENT_DEFINITION_PROPERTY
CUSTOM_EVENT_DEFINITION_PROPERTY_OPTION
COMPANY_LIST
CONTACT_LIST
DEAL_PIPELINE
DEAL_PIPELINE_STAGE
EMAIL_SUBSCRIPTION
FORM
LEAD_PIPELINE
LEAD_PIPELINE_STAGE
MARKETING_CAMPAIGN
MARKETING_CAMPAIGN_BUDGET_ITEMS
MARKETING_CAMPAIGN_SPEND_ITEMS
MARKETING_CAMPAIGN_CONTACT
MARKETING_CAMPAIGN_ASSET
MARKETING_EVENT
MARKETING_EVENT_CUSTOM_PROPERTY
MARKETING_EVENT_LIST
MARKETING_EVENT_PARTICIPANT
OWNER
OWNER_TEAM
PROPERTY
PROPERTY_OPTION
ROLE
SUBMISSION_RESPONSE
TEAM
TEAM_USER
USER

Sync note

If you exclude the child tables and later want to sync them, perform a table-level re-sync of parent table to backfill the data in the child tables.

Capture deletes

We use different strategies to capture deletes because the HubSpot API doesn't offer a mechanism to capture deletes for every table:

We capture deletes once a day via API call for the following tables:
- FEEDBACK_SUBMISSION
- GOAL
- LEAD
- LINE_ITEM
- PRODUCT
- QUOTE
- TICKET
If records of the FEEDBACK_SUBMISSION,GOAL, LEAD, LINE_ITEM, PRODUCT, QUOTE, and TICKET tables are permanently deleted between syncs, we don’t mark the _fivetran_deleted column as true for these records.
We infer deletes for the following tables by comparing the tables against their previous versions and capturing deletes using the _fivetran_deleted system column:
- ASSOCIATION_TYPE
- COMPANY_LIST
- CONTACT_LIST
- CUSTOM_EVENT_DEFINITION
- CUSTOM_EVENT_DEFINITION_PROPERTY
- DEAL_PIPELINE
- DEAL_PIPELINE_STAGE
- FORM
- LEAD_PIPELINE
- LEAD_PIPELINE_STAGE
- MARKETING_CAMPAIGN
- OWNER_TEAM
- ROLE
- TEAM
- TEAM_USER
- TICKET_PIPELINE
- TICKET_PIPELINE_STAGE
- USERS
We use webhooks to capture deletes for the COMPANY, CONTACT, and DEAL tables.
If you created the connection using our REST API, we don't capture deletes for the COMPANY, CONTACT, and DEAL tables, because these connections are not authorized using the Fivetran HubSpot application.
To capture deletes for the CONTACT_LIST_MEMBER and COMPANY_LIST_MEMBER tables, we perform a weekly re-sync for these tables during weekends. The MAR charged during the weekly re-sync is for incremental records only.
We don't capture deletes for the INVOICE table and its child tables because of the following reasons:
- The archived parameter is not functional for invoices. As a result, we cannot re-import the archived invoices using the crm/v3/objects/invoice?archived=true endpoint.
- HubSpot does not support the Invoices subscription type in its webhooks. This prevents the capture of delete events for invoices and their child tables.

Capture merges

In HubSpot, you can merge two records into one record. For example, when you merge two contacts, the primary contact record remains after the merge and the secondary contact is merged into the primary record. For more information, see HubSpot's documentation.

We capture merges for the COMPANY, CONTACT, and DEAL tables.

To capture merges of the CONTACT table, we have a property_hs_calculated_merged_vids column which stores data of the merged object ids.
To capture merges of the DEAL table, we have a merged_deal table in your destination. This table reflects that deal with id merged_deal_id has been merged into deal with id deal_id.
In your destination table, you may observe the missing _fivetran_deleted column.
To capture merges of the COMPANY table, we mark the company records as is_deleted = TRUE.

Identifying primary associations

We can find the primary company associated with an object. For example, if you want to identify the primary association in the DEAL_COMPANY table, use the following SQL query:

SELECT * FROM DEAL_COMPANY dc JOIN ASSOCIATION_TYPE t ON dc.type_id = t.id WHERE t.label LIKE 'Primary';

Similarly, you can query for other association labels.

Deal stage calculations

The HubSpot API returns only the date_entered value of the dealstage properties data. We sync this data to the DEAL_STAGE table.

Fivetran populates the date_entered value in the _fivetran_start column and sets the date_exited value in the _fivetran_end column. The date_exited value for a deal stage is the date_entered value of the next deal stage in chronological order.

You can calculate the time_in value as the difference between the _fivetran_start and _fivetran_end column values in the DEAL_STAGE table.

The _fivetran_active column indicates whether the deal stage is currently active.

For example, in the DEAL_STAGE table:

deal_id	deal stage	date_entered	_fivetran_start	_fivetran_end	_fivetran_active
10	123	11:00	11:00	12:00	False
10	145	12:00	12:00	13:00	False
10	157	13:00	13:00	14:00	False
10	173	14:00	14:00	MAX_TIMESTAMP_VALUE	True

For deal_id 10 in deal stage 145:
- date_entered_145 = _fivetran_start = 12:00
- date_exited_145 = date_entered_157 = _fivetran_end = 13:00
- time_in_145 = 1 hour (_fivetran_end - _fivetran_start)
- _fivetran_active = False (the deal stage has been exited)
For deal_id 10 in deal stage 173:
- date_entered_173 = _fivetran_start = 14:00
- date_exited_173 = _fivetran_end = MAX_TIMESTAMP_VALUE
- time_in_173 = MAX_TIMESTAMP_VALUE - 14:00 (_fivetran_end - _fivetran_start)
- _fivetran_active = True

Table re-syncs

When you re-sync a parent table, we re-sync its child tables along with the parent table.

To re-sync an association table (with table name in [FROM_TABLE_NAME]_[TO_TABLE_NAME] format), you must re-sync the [FROM_TABLE_NAME] table. For example, to re-sync the association table COMPANY_CONTACT, you must re-sync the COMPANY table.

Similarly, to re-sync a custom association table (with table name in [FROM_TABLE_NAME]\_TO\_[TO_TABLE_NAME] format), you must re-sync the [FROM_TABLE_NAME] table.

If you have created custom events in your HubSpot account by backdating them using the API, you must trigger a table re-sync of the CUSTOM_EVENT table to fetch backdated records, as we only perform a rollback sync of 25 hours.

Historical sync time frame

You can configure the Historical sync time frame when setting up your connection to speed up both the initial sync and subsequent re-syncs. Historical sync time frame limits the amount of data synced for the following tables:

EVENT
EMAIL_EVENT
CUSTOM_EVENT

We apply the selected start date only to these tables, as they contain data that you are less likely to reference much later.

You can modify the Historical sync time frame at any time. If you change it to an earlier date, we perform a full historical sync, syncing data back to that new date. If you set it to a later date, we do not delete any existing data. Instead, we apply the new time frame only to subsequent re-syncs.

Schema information

Marketing Hub schema

This schema is applicable for the HubSpot Marketing Hub product.

To zoom, open the ERD in a new window.

CRM and Sales Hub schema

This schema is applicable for the HubSpot CRM and Sales Hub products.

To zoom, open the ERD in a new window.

Commerce schema

This schema is applicable for the HubSpot Commerce products.

To zoom, open the ERD in a new window.

Service Hub schema

This schema is applicable for the HubSpot Service Hub product.

To zoom, open the ERD in a new window.

Web Analytics schema

This schema is applicable for the HubSpot Web Analytics API

To zoom, open the ERD in a new window.

Goals schema

This schema is applicable for the HubSpot Goals.

To zoom, open the ERD in a new window.

Custom objects

You can sync custom objects from your HubSpot account to your destination. We create a destination table for each custom object. We follow our standard table naming conventions.

We sync the properties you define for the custom objects into the PROPERTY table.

We create an association table in your destination to capture the associations between different custom objects and associations between a custom object and the following tables:

COMPANY
CONTACT
DEAL
EMAIL
ENGAGEMENT
GOAL
LINE_ITEM
PRODUCT
TICKET

We name the custom object association tables using the format, [FROM_TABLE_NAME]\_TO\_[TO_TABLE_NAME].

In HubSpot, a custom object's updatedAt time does not change for association changes. Also, HubSpot doesn't support webhook subscriptions for custom object association changes. Due to these limitations from HubSpot, we may not be able to sync some associations to the destination immediately in the incremental sync. We sync these associations whenever the updatedAt time of the custom object changes due to another property change.

Negative timestamps from HubSpot

We automatically convert negative timestamps received from HubSpot to EPOCH, for example, 1st January 1970 00:00:00 UTC.

Syncing empty tables and columns

The HubSpot connector does not support the creation of empty tables and columns in your destination.

We create a table in the destination only if we can retrieve the table data from the source. If HubSpot does not return any data for a source table, we don’t create the table in your destination.