-
Notifications
You must be signed in to change notification settings - Fork 14
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
parquet-go/parquet-go
vs apache/arrow/go/parquet
#252
Comments
It honestly depends. Both are released under the Apache License 2.0, so licensing isn't going to be an issue or differ between them. Ultimately it's going to be a question of Features, Performance and Maintenance. FeaturesFor example, if you need to support the parquet encryption capabilities, then you should use this library apache/arrow-go/v18/parquet as If you need to leverage bloom filters, then you'll need to use If you are already leveraging Apache Arrow itself for anything (ADBC for database interaction, Flight/FlightSQL for wire protocol, interacting with DuckDB or other Arrow-compatible/native compute engines, etc), then this library is going to be more performant and beneficial due to the direct integration it has with Arrow through the I have plans on improving the public APIs for the writers of this library to better utilize generics, while PerformanceAs far as Performance and memory usage, I haven't benchmarked anything significant between the two libraries so I can't speak to any comparison there. On this, I invite you to perform comparisons with your use case. That said, if you do find that MaintenanceBoth projects are actively maintained looking at the frequency of commits. While I can't speak to the maintainers of I hope the above helps you make a decision, or at least gives you a direction for exploration. In the end, I'm interested in what you end up going with and why. Particularly, if you do end up going with Honestly, thanks for filing this issue! |
Describe the usage question you have. Please include as many useful details as possible.
Hello, I'm writing a data archiving tool, and I expect to archive data from a trillion-row database. I'm wondering whether I should use
parquet-go/parquet-go
orapache/arrow/go/parquet
to solve my problem?Component(s)
Parquet
The text was updated successfully, but these errors were encountered: