Incremental streams #40

cirotix · 2021-05-06T08:36:02Z

This PR provides incremental replication for:

products
product_family
subscriptions
customers
events
transactions

Also some schema fixes

It has been tested with the big-query target.

fixes #35

Chargify API allow 200 records per page. It is more efficient to use as many records as possible.

This is more accurate to use the id for the bookmark. If there is no state persisted, we use the start_date from the configuration. Once a state is persisted (biggest id) we filter the transactions on this id, with the since_id parameter.

Add created_at and updated_at fields

yoren · 2021-05-10T16:03:02Z

We currently use the Chargify integration on Stitch and experiencing a major data discrepancy issue that 20% of transactions are not synced from Chargify to Redshift.

After talking to @cirotix on Slack, this PR seems to be the rescue to the issue. We look forward to seeing it merged. Thanks!

chrishumphries · 2021-05-19T03:05:28Z

Hoping to see this merged as well.

cosimon

Thank you for the work on this, we are interested in merging this but first there were some API parameters that looked to be out of place.

Before we can merge this we also need logs demonstrating that the tap has been tested locally with the changes and it functions as expected

cosimon · 2021-06-02T14:52:17Z

tap_chargify/chargify.py


  def customers(self, bookmark=None):
-    for i in self.get("customers.json"):
+    for i in self.get("customers.json", sort="asc", date_field="updated_at", start_datetime=bookmark):


The docs indicate the customers.json endpoint accepts a direction but not a sort parameter:
https://reference.chargify.com/v1/customers/list-customers

cosimon · 2021-06-02T14:53:10Z

tap_chargify/chargify.py

-
  def product_families(self, bookmark=None):
-    for i in self.get("product_families.json"):
+    for i in self.get("product_families.json", sort="asc", date_field="updated_at", start_datetime=bookmark):


The docs for the product_families.json endpoint do not include the sort parameter
https://reference.chargify.com/v1/product-families/list-product-family-via-site

cosimon · 2021-06-02T14:55:51Z

tap_chargify/chargify.py

+        for j in self.get("product_families/{product_family_id}/products.json".format(product_family_id=k["product_family"]["id"]),
+                          sort="asc", date_field="updated_at", start_datetime=bookmark):


The docs for the product_families/{product_family_id}/products.json do not include a sort parameter:
https://reference.chargify.com/v1/products/list-products

cosimon · 2021-06-02T19:54:18Z

Hi @cirotix

The simplest and safest way for you to share credentials with Stitch is to create a connection in Stitch and then open a conversation with the support staff informing them that you want to share the credentials on the connection with the engineering team for testing purposes.

cirotix · 2021-06-02T19:58:32Z

Hi @cosimon
I will. Actually I have done a quick check earlier and your comments are actually accurate. I will be able to reply more in detail on Friday and will provide a patch as well

luandy64 · 2021-06-28T15:30:24Z

@cirotix Do you have any updates on this?

cirotix added 10 commits April 9, 2021 15:07

fix(subscriptions): Make replication incremental (singer-io#35)

23834c6

use 200 records per page

279f79d

Chargify API allow 200 records per page. It is more efficient to use as many records as possible.

formating

2f7383a

fix(events): incremental event replication (singer-io#35)

c02e618

fix(events): fix events schema

ff249ee

fix(customers): make incremental replication (singer-io#35)

4498242

fix(product family): make incremental replication (singer-io#35)

91481d8

fix product_families schema

165e401

Add created_at and updated_at fields

make product replication incremental (singer-io#35)

a3cd39a

cosimon requested changes Jun 2, 2021

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Incremental streams #40

Incremental streams #40

Uh oh!

cirotix commented May 6, 2021

Uh oh!

yoren commented May 10, 2021

Uh oh!

chrishumphries commented May 19, 2021

Uh oh!

cosimon left a comment •

edited

Loading

Uh oh!

cosimon Jun 2, 2021

Uh oh!

cosimon Jun 2, 2021

Uh oh!

cosimon Jun 2, 2021

Uh oh!

cosimon commented Jun 2, 2021

Uh oh!

cirotix commented Jun 2, 2021

Uh oh!

luandy64 commented Jun 28, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

		for j in self.get("product_families/{product_family_id}/products.json".format(product_family_id=k["product_family"]["id"]),
		sort="asc", date_field="updated_at", start_datetime=bookmark):

Incremental streams #40

Are you sure you want to change the base?

Incremental streams #40

Uh oh!

Conversation

cirotix commented May 6, 2021

Uh oh!

yoren commented May 10, 2021

Uh oh!

chrishumphries commented May 19, 2021

Uh oh!

cosimon left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

cosimon Jun 2, 2021

Choose a reason for hiding this comment

Uh oh!

cosimon Jun 2, 2021

Choose a reason for hiding this comment

Uh oh!

cosimon Jun 2, 2021

Choose a reason for hiding this comment

Uh oh!

cosimon commented Jun 2, 2021

Uh oh!

cirotix commented Jun 2, 2021

Uh oh!

luandy64 commented Jun 28, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

cosimon left a comment •

edited

Loading