Skip to content

Conversation

@kdbrooks
Copy link
Contributor

@kdbrooks kdbrooks commented Jul 22, 2022

This implements a PyArrow Dataset TableProvider that allows for using Datasets as tables in Datafusion.

Fixes #8 originally datafusion-contrib/datafusion-python#10

This is a rebase of datafusion-contrib/datafusion-python#59

I have updated the TableProvider trait for DataFusion 10.0.0 but have not been able to make sure this compiles yet because the master branch is not building.

Copy link
Member

@andygrove andygrove left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM. Thanks @kylebrooks-8451.

@andygrove andygrove merged commit e0cbf48 into apache:master Jul 26, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Support reading from PyArrow datasets

2 participants