Shopify
This page contains the setup guide and reference information for the Shopify source connector.
Prerequisites
- An active Shopify store.
- If you are syncing data from a store that you do not own, you will need to request access to your client's store (not required for account owners).
- For Airbyte Open Source users: A custom Shopify application with
read_
scopes enabled.
Setup guide
This connector supports OAuth2.0 and API Password (for private applications) authentication methods.
Set up Shopify
For existing Airbyte Cloud customers, if you are currently using the API Password authentication method, please switch to OAuth2.0, as the API Password will be deprecated shortly. This change will not affect Airbyte Open Source connections.
For Airbyte Cloud:
- Log into your Airbyte Cloud account.
- Click Sources and then click + New source.
- On the Set up the source page, select Shopify from the Source type dropdown.
- Enter a name for the Shopify connector.
Connect using OAuth2.0
- Choose a Source name.
- Click Authenticate your Shopify account.
- Click Install to install the Airbyte application. Log in to your account, if you are not already logged in. Select the store you want to sync and review the consent form. Click Install app to finish the installation.
- The Shopify Store field will be automatically filled after you authenticate your Shopify account based on the store you selected. Once populated, confirm the value is accurate.
- (Optional) You may set a Replication Start Date as the starting point for your data replication. Any data created before this date will not be synced. Defaults to January 1st, 2020.
- Click Set up source and wait for the connection test to complete.
For Airbyte Open Source:
- Navigate to the Airbyte Open Source dashboard.
- Click Sources and then click + New source.
- On the Set up the source page, select Shopify from the Source type dropdown.
- Enter a name for the Shopify connector.
Create a custom app
Authentication to the Shopify API requires a custom application. Follow these instructions to create a custom app and find your Admin API Access Token.
- Log in to your Shopify account.
- In the dashboard, navigate to Settings > App and sales channels > Develop apps > Create an app.
- Select a name for your new app.
- Select Configure Admin API scopes.
- Grant access to the following list of scopes. Only select scopes prefixed with
read_
, notwrite_
(e.g.read_locations
,read_price_rules
, etc ). - Click Install app to give this app access to your data.
- Once installed, go to API Credentials to copy the Admin API Access Token. You are now ready to set up the source in Airbyte!
Connect using API Password
- Enter a Source name.
- Enter your Shopify Store name. You can find this in your URL when logged in to Shopify or within the Store details section of your Settings.
- For API Password, enter your custom application's Admin API access token.
- (Optional) You may set a Replication Start Date as the starting point for your data replication. Any data created before this date will not be synced. Please note that this defaults to January 1st, 2020.
- Click Set up source and wait for the connection test to complete.
Custom app scopes
Add the following scopes to your custom app to ensure Airbyte can sync all available data. For more information on access scopes, see the Shopify docs.
read_analytics
read_assigned_fulfillment_orders
read_content
read_customers
read_discounts
read_draft_orders
read_fulfillments
read_gdpr_data_request
read_gift_cards
read_inventory
read_legal_policies
read_locations
read_locales
read_marketing_events
read_merchant_managed_fulfillment_orders
read_online_store_pages
read_order_edits
read_orders
read_price_rules
read_product_listings
read_products
read_publications
read_reports
read_resource_feedbacks
read_script_tags
read_shipping
read_shopify_payments_accounts
read_shopify_payments_bank_accounts
read_shopify_payments_disputes
read_shopify_payments_payouts
read_themes
read_third_party_fulfillment_orders
read_translations
Supported sync modes
The Shopify source connector supports the following sync modes:
Feature | Supported? |
---|---|
Full Refresh Sync | Yes |
Incremental Sync | Yes |
The Shopify source supports both Full Refresh and Incremental syncs. You can choose if this connector will copy only the new or updated data, or all rows in the tables and columns you set up for replication, every time a sync is run.
This source can sync data for the Shopify REST API and the Shopify GraphQL API and the Shopify GraphQL BULK API
Supported Streams
- Abandoned Checkouts
- Articles
- Balance Transactions
- Blogs
- Collects
- Collections (GraphQL)
- Countries
- Custom Collections
- Customers
- Customer Journey Summary (GraphQL)
- Customer Address (GraphQL)
- Customer Saved Search
- Draft Orders
- Discount Codes (GraphQL)
- Disputes
- Fulfillments
- Fulfillment Orders (GraphQL)
- Inventory Items (GraphQL)
- Inventory Levels (GraphQL)
- Locations
- Metafields (GraphQL)
- Order Agreements (GraphQL)
- Orders
- Order Refunds
- Order Risks (GraphQL)
- Pages
- Price Rules
- Products (GraphQL)
- Product Images (GraphQL)
- Product Variants (GraphQL)
- Shop
- Smart Collections
- Transactions
- Transactions (GraphQL)
- Tender Transactions
Entity-Relationship Diagram (ERD)
Capturing deleted records
The connector captures deletions for records in the Articles
, Blogs
, CustomCollections
, Orders
, Pages
, PriceRules
and Products
streams.
When a record is deleted, the connector outputs a record with the ID
of that record and the deleted_at
, deleted_message
, and deleted_description
fields filled out. No other fields are filled out for the deleted records.
Check the following Shopify documentation for more information about retrieving deleted records.
Marketing Attribution data
Data related to marketing attribution can be found across a few different streams. Sync these streams to understand marketing performance:
Customer Journey Summary
(firstVisit.source, firstVisit.sourcetype)Orders
(referring_site, source_name)Abandoned Checkouts
(referring_site, source_name)
Features
Feature | Supported?(Yes/No) |
---|---|
Full Refresh Sync | Yes |
Incremental - Append Sync | Yes |
Namespaces | No |
Data type map
Integration Type | Airbyte Type |
---|---|
string | string |
number | number |
array | array |
object | object |
boolean | boolean |
Limitations & Troubleshooting
Expand to see details about Shopify connector limitations and troubleshooting
Connector limitations
Rate limiting
Shopify has some rate limit restrictions. Typically, there should not be issues with throttling or exceeding the rate limits but, in some edge cases, you may encounter the following warning message:
"Caught retryable error '<some_error> or null' after <some_number> tries.
Waiting <some_number> seconds then retrying..."
This is expected when the connector hits a 429 - Rate Limit Exceeded
HTTP Error. The sync operation will continue successfully after a short backoff period.
For all Shopify GraphQL BULK
api requests these limitations are applied: https://shopify.dev/docs/api/usage/bulk-operations/queries#operation-restrictions
Troubleshooting
- If you encounter access errors while using OAuth2.0 authentication, please make sure you've followed this Shopify Article to request the access to the client's store first. Once the access is granted, you should be able to proceed with OAuth2.0 authentication.
- Check out common troubleshooting issues for the Shopify source connector on our Airbyte Forum here.
Reference
Config fields reference
Changelog
Expand to review
Version | Date | Pull Request | Subject |
---|---|---|---|
2.5.8 | 2024-10-28 | 47044 | Update dependencies |
2.5.7 | 2024-10-14 | 46552 | Add parent state tracking for BULK sub-streams |
2.5.6 | 2024-10-12 | 46844 | Update dependencies |
2.5.5 | 2024-10-05 | 46578 | Raise exception on missing stream |
2.5.4 | 2024-10-05 | 45759 | Update dependencies |
2.5.3 | 2024-09-27 | 46095 | Fixed duplicates for Product Images , Metafield Product Images and Metafield Products streams for Incremental syncs |
2.5.2 | 2024-09-17 | 45633 | Adds read_inventory as a required scope for product_variants stream |
2.5.1 | 2024-09-14 | 45255 | Update dependencies |
2.5.0 | 2024-09-06 | 45190 | Migrate to CDK v5 |
2.4.24 | 2024-09-03 | 45116 | Have message and description be nullable for custom_collections deleted events |
2.4.23 | 2024-08-31 | 44971 | Update dependencies |
2.4.22 | 2024-08-24 | 44723 | Update dependencies |
2.4.21 | 2024-08-17 | 44318 | Update dependencies |
2.4.20 | 2024-08-12 | 43834 | Update dependencies |
2.4.19 | 2024-08-10 | 43194 | Update dependencies |
2.4.18 | 2024-08-06 | 43326 | Added missing type type for customer_journey_summary field for Customer Journey Summary stream schema |
2.4.17 | 2024-08-02 | 42973 | Fixed FAILED Job handling for no-checkpointing BULK Streams, fixed STATE collision for REST Streams with Deleted Events |
2.4.16 | 2024-07-21 | 42095 | Added the Checkpointing for the BULK streams, fixed the store redirection |
2.4.15 | 2024-07-27 | 42806 | Update dependencies |
2.4.14 | 2024-07-20 | 42150 | Update dependencies |
2.4.13 | 2024-07-13 | 41809 | Update dependencies |
2.4.12 | 2024-07-10 | 41103 | Update dependencies |
2.4.11 | 2024-07-09 | 41068 | Added options field to Product Variants stream |
2.4.10 | 2024-07-09 | 41042 | Use latest CDK : 3.0.0 |
2.4.9 | 2024-07-06 | 40768 | Update dependencies |
2.4.8 | 2024-07-03 | 40707 | Fixed the bug when product_images stream emitted records with no primary_key |
2.4.7 | 2024-06-27 | 40593 | Use latest CDK version possible |
2.4.6 | 2024-06-26 | 40526 | Made BULK Job termination threshold limit adjustable from input configuration , increased the default value to 1 hour . |
2.4.5 | 2024-06-25 | 40484 | Update dependencies |
2.4.4 | 2024-06-19 | 39594 | Extended the Discount Codes , Fulfillment Orders , Inventory Items , Inventory Levels , Products , Product Variants and Transactions stream schemas |
2.4.3 | 2024-06-06 | 38084 | add resiliency on some transient errors using the HttpClient |
2.4.1 | 2024-06-20 | 39651 | Update dependencies |
2.4.0 | 2024-06-17 | 39527 | Added new stream Order Agreements |
2.3.0 | 2024-06-14 | 39487 | Added new stream Customer Journey Summary |
2.2.3 | 2024-06-06 | 38084 | add resiliency on some transient errors using the HttpClient |
2.2.2 | 2024-06-04 | 39019 | [autopull] Upgrade base image to v1.2.1 |
2.2.1 | 2024-05-30 | 38769 | Have products stream return all the tags comma separated |
2.2.0 | 2024-05-29 | 38746 | Updated countries schema |
2.1.4 | 2024-05-24 | 38610 | Updated the source API Version to 2024-04 |
2.1.3 | 2024-05-23 | 38464 | Added missing fields to Products stream |
2.1.2 | 2024-05-23 | 38352 | Migrated Order Risks stream to GraphQL BULK |
2.1.1 | 2024-05-20 | 38251 | Replace AirbyteLogger with logging.Logger |
2.1.0 | 2024-05-02 | 37767 | Migrated Products , Product Images and Product Variants to GraphQL BULK |
2.0.8 | 2024-05-02 | 37589 | Added retry for known HTTP Errors for BULK streams |
2.0.7 | 2024-04-24 | 36660 | Schema descriptions |
2.0.6 | 2024-04-22 | 37468 | Fixed one time retry for Internal Server Error for BULK streams |
2.0.5 | 2024-04-03 | 36788 | Added ability to dynamically adjust the size of the slice |
2.0.4 | 2024-03-22 | 36355 | Update CDK version to ensure Per-Stream Error Messaging and Record Counts In State (features were already there so just upping the version) |
2.0.3 | 2024-03-15 | 36170 | Fixed the STATE messages emittion frequency for the nested sub-streams |
2.0.2 | 2024-03-12 | 36000 | Fix and issue where invalid shop name causes index out of bounds error |
2.0.1 | 2024-03-11 | 35952 | Fixed the issue when start date is missing but the stream required it |
2.0.0 | 2024-02-12 | 32345 | Fixed the issue with state causing the substreams to skip the records, made metafield_* : collections, customers, draft_orders, locations, orders, product_images, product_variants, products , and fulfillment_orders, collections, discount_codes, inventory_levels, inventory_items, transactions_graphql, customer_address streams to use BULK Operations instead of REST |
1.1.8 | 2024-02-12 | 35166 | Manage dependencies with Poetry. |
1.1.7 | 2024-01-19 | 33804 | Updated documentation with list of all supported streams |
1.1.6 | 2024-01-04 | 33414 | Prepare for airbyte-lib |
1.1.5 | 2023-12-28 | 33827 | Fix GraphQL query |
1.1.4 | 2023-10-19 | 31599 | Base image migration: remove Dockerfile and use the python-connector-base image |
1.1.3 | 2023-10-17 | 31500 | Fixed the issue caused by the missing access token while setup the new source and not yet authenticated |
1.1.2 | 2023-10-13 | 31381 | Fixed the issue caused by the state presence while fetching the deleted events with pagination |
1.1.1 | 2023-09-18 | 30560 | Performance testing - include socat binary in docker image |
1.1.0 | 2023-09-07 | 30246 | Added ability to fetch destroyed records for Articles, Blogs, CustomCollections, Orders, Pages, PriceRules, Products |
1.0.0 | 2023-08-11 | 29361 | Migrate to the 2023-07 Shopify API Version |
0.6.2 | 2023-08-09 | 29302 | Handle the Internal Server Error when entity could be fetched |
0.6.1 | 2023-08-08 | 28291 | Allow shop field to accept *.myshopify.com shop names, updated OAuth Spec |
0.6.0 | 2023-08-02 | 28770 | Added Disputes stream |
0.5.1 | 2023-07-13 | 28700 | Improved error messages with more user-friendly description, refactored code |
0.5.0 | 2023-06-13 | 27732 | License Update: Elv2 |
0.4.0 | 2023-06-13 | 27083 | Added CustomerSavedSearch , CustomerAddress and Countries streams |
0.3.4 | 2023-05-10 | 25961 | Added validation for shop in input configuration (accepts non-url-like inputs) |
0.3.3 | 2023-04-12 | 25110 | Fixed issue when cursor_field is "None" , added missing properties to stream schemas, fixed access_scopes validation error |
0.3.2 | 2023-02-27 | 23473 | Fixed OOM / Memory leak issue for Airbyte Cloud |
0.3.1 | 2023-01-16 | 21461 | Added discount_applications to orders stream |
0.3.0 | 2022-11-16 | 19492 | Added support for graphql and add a graphql products stream |
0.2.0 | 2022-10-21 | 18298 | Updated API version to the 2022-10 , make stream schemas backward cpmpatible |
0.1.39 | 2022-10-13 | 17962 | Added metafield streams; support for nested list streams |
0.1.38 | 2022-10-10 | 17777 | Fixed 404 for configured streams, fix missing cursor error for old records |
0.1.37 | 2022-04-30 | 12500 | Improve input configuration copy |
0.1.36 | 2022-03-22 | 9850 | Added BalanceTransactions stream |
0.1.35 | 2022-03-07 | 10915 | Fixed a bug which caused full-refresh syncs of child REST entities configured for incremental |
0.1.34 | 2022-03-02 | 10794 | Minor specification re-order, fixed links in documentation |
0.1.33 | 2022-02-17 | 10419 | Fixed wrong field type for tax_exemptions for Abandoned_checkouts stream |
0.1.32 | 2022-02-18 | 10449 | Added tender_transactions stream |
0.1.31 | 2022-02-08 | 10175 | Fixed compatibility issues for legacy user config |
0.1.30 | 2022-01-24 | 9648 | Added permission validation before sync |
0.1.29 | 2022-01-20 | 9049 | Added shop_url to the record for all streams |
0.1.28 | 2022-01-19 | 9591 | Implemented OAuth2.0 authentication method for Airbyte Cloud |
0.1.27 | 2021-12-22 | 9049 | Updated connector fields title/description |
0.1.26 | 2021-12-14 | 8597 | Fixed mismatched number of tables for base-normalization, increased performance of order_refunds stream |
0.1.25 | 2021-12-02 | 8297 | Added Shop stream |
0.1.24 | 2021-11-30 | 7783 | Reviewed and corrected schemas for all streams |
0.1.23 | 2021-11-15 | 7973 | Added InventoryItems |
0.1.22 | 2021-10-18 | 7101 | Added FulfillmentOrders, Fulfillments streams |
0.1.21 | 2021-10-14 | 7382 | Fixed InventoryLevels primary key |
0.1.20 | 2021-10-14 | 7063 | Added Location and InventoryLevels as streams |
0.1.19 | 2021-10-11 | 6951 | Added support of OAuth 2.0 authorisation option |
0.1.18 | 2021-09-21 | 6056 | Added pre_tax_price to the orders/line_items schema |
0.1.17 | 2021-09-17 | 5244 | Created data type enforcer for converting prices into numbers |
0.1.16 | 2021-09-09 | 5965 | Fixed the connector's performance for Incremental refresh |
0.1.15 | 2021-09-02 | 5853 | Fixed amount type in order_refund schema |
0.1.14 | 2021-09-02 | 5801 | Fixed line_items/discount allocations & duties parts of orders schema |
0.1.13 | 2021-08-17 | 5470 | Fixed rate limits throttling |
0.1.12 | 2021-08-09 | 5276 | Added status property to product schema |
0.1.11 | 2021-07-23 | 4943 | Fixed products schema up to API 2021-07 |
0.1.10 | 2021-07-19 | 4830 | Fixed for streams json schemas, upgrade to API version 2021-07 |
0.1.9 | 2021-07-04 | 4472 | Incremental sync is now using updated_at instead of since_id by default |
0.1.8 | 2021-06-29 | 4121 | Added draft orders stream |
0.1.7 | 2021-06-26 | 4290 | Fixed the bug when limiting output records to 1 caused infinity loop |
0.1.6 | 2021-06-24 | 4009 | Added pages, price rules and discount codes streams |
0.1.5 | 2021-06-10 | 3973 | Added AIRBYTE_ENTRYPOINT for Kubernetes support |
0.1.4 | 2021-06-09 | 3926 | New attributes to Orders schema |
0.1.3 | 2021-06-08 | 3787 | Added Native Shopify Source Connector |