Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat: add schema evolution to merge statement #1

Merged
merged 158 commits into from
Jan 16, 2025

Conversation

JustinRush80
Copy link
Owner

@JustinRush80 JustinRush80 commented Jan 16, 2025

Description

add schema evolution to merge statement

Related Issue(s)

Documentation

Rush and others added 30 commits January 16, 2025 00:23
Fixes a check so readerFeatures is enabled on version 3 or higher

Signed-off-by: Russell Jancewicz <[email protected]>
Signed-off-by: Rush <[email protected]>
Updates the requirements on [which](https://github.com/harryfei/which-rs) to permit the latest version.
- [Release notes](https://github.com/harryfei/which-rs/releases)
- [Changelog](https://github.com/harryfei/which-rs/blob/master/CHANGELOG.md)
- [Commits](harryfei/which-rs@6.0.0...7.0.0)

---
updated-dependencies:
- dependency-name: which
  dependency-type: direct:production
...

Signed-off-by: dependabot[bot] <[email protected]>
Signed-off-by: Rush <[email protected]>
…y partitions

(cherry picked from commit af17bb2)
Signed-off-by: Alex Wilcoxson <[email protected]>

chore: fmt
Signed-off-by: Rush <[email protected]>
Updates the requirements on [thiserror](https://github.com/dtolnay/thiserror) to permit the latest version.
- [Release notes](https://github.com/dtolnay/thiserror/releases)
- [Commits](dtolnay/thiserror@1.0.0...1.0.69)

---
updated-dependencies:
- dependency-name: thiserror
  dependency-type: direct:production
...

Signed-off-by: dependabot[bot] <[email protected]>
Signed-off-by: Rush <[email protected]>
(cherry picked from commit 12abf00)
Signed-off-by: Alex Wilcoxson <[email protected]>
Signed-off-by: Rush <[email protected]>
small correction to z_order columns argument.

Signed-off-by: Rush <[email protected]>
Signed-off-by: Thomas Frederik Hoeck <[email protected]>
Signed-off-by: Rush <[email protected]>
`object_store` invokes `get_credential` on _every_ invocation of a
get/list/put/etc. The provider invocation for environment based
credentials is practically zero-cost, so this has no/low overhead.

In the case of the AssumeRoleProvider or any provider which has _some_
cost, such as an invocation of the AWS STS APIs, this can result in
rate-limiting or service quota exhaustion.

In order to prevent this, the credentials are attempted to be cached
only so long as they have no expired, which is defined in the
`aws_credential_types::Credential` struct

Signed-off-by: R. Tyler Croy <[email protected]>
Sponsored-by: Scribd Inc
Signed-off-by: Rush <[email protected]>
This is a fix aimed to enable jsonwriter to checkpoint in accordance
with delta.checkpointInterval.  It changes the default commitbuilder to
set a post_commit_hook so that checkpointing will be done by default.
Potentially we could also expose CommitProperties as an argument to
flush_and_commit, but that would require a change to the function
signature and would be a breaking change.

Signed-off-by: Justin Jossick <[email protected]>
Signed-off-by: Rush <[email protected]>
Signed-off-by: stretchadito <[email protected]>
Signed-off-by: Rush <[email protected]>
Signed-off-by: R. Tyler Croy <[email protected]>
Signed-off-by: Rush <[email protected]>
The release of pyo3 0.22.3 compells this since we cannot otherwise
compile. The choice is between pinning 0.22.2 and upgrading our ABI, and
I think it's better to upgrade the ABI

Signed-off-by: R. Tyler Croy <[email protected]>
Signed-off-by: Rush <[email protected]>
Signed-off-by: R. Tyler Croy <[email protected]>
Signed-off-by: Rush <[email protected]>
Today the make_array function from Datafusion uses "item" as the list
element's field name. With recent changes in delta-kernel-rs we have
switched to calling it "element" which is more conventional related to
how Apache Parquet handles things

This change introduces a test which helps isolate the behavior seen in
Python tests within the core crate for easier regression testing

Signed-off-by: R. Tyler Croy <[email protected]>
Signed-off-by: Rush <[email protected]>
Signed-off-by: Vikas Sharma <[email protected]>
Signed-off-by: Rush <[email protected]>
Signed-off-by: Marko Grujic <[email protected]>
Signed-off-by: Rush <[email protected]>
This change includes some minor warnings cleanup which were pestering
me too.

Closes delta-io#3015

Signed-off-by: R. Tyler Croy <[email protected]>
Signed-off-by: Rush <[email protected]>
ion-elgreco and others added 25 commits January 16, 2025 00:25
Signed-off-by: Ion Koutsouris <[email protected]>
Signed-off-by: Rush <[email protected]>
…valid delta table

Signed-off-by: Julien Guillot <[email protected]>
Signed-off-by: Rush <[email protected]>
Signed-off-by: Julien Guillot <[email protected]>
Signed-off-by: Rush <[email protected]>
Signed-off-by: Abdullah Sabaa Allil <[email protected]>
Signed-off-by: Rush <[email protected]>
The Snapshot.files() functrion is public but cannot be possibly used
because the trait it relies upon isn't public. Oops!

Signed-off-by: R. Tyler Croy <[email protected]>
Sponsored-by: Scribd Inc
Signed-off-by: Rush <[email protected]>
Signed-off-by: Ion Koutsouris <[email protected]>
Signed-off-by: Rush <[email protected]>
Signed-off-by: Ion Koutsouris <[email protected]>
Signed-off-by: Rush <[email protected]>
Signed-off-by: Ion Koutsouris <[email protected]>
Signed-off-by: Rush <[email protected]>
Signed-off-by: Ion Koutsouris <[email protected]>
Signed-off-by: Rush <[email protected]>
Signed-off-by: Ion Koutsouris <[email protected]>
Signed-off-by: Rush <[email protected]>
Signed-off-by: Ion Koutsouris <[email protected]>
Signed-off-by: Rush <[email protected]>
Signed-off-by: Francisco Garcia Florez <[email protected]>
Signed-off-by: Rush <[email protected]>
Signed-off-by: Rush <[email protected]>
Copy link

ACTION NEEDED

delta-rs follows the Conventional Commits specification for release automation.

The PR title and description are used as the merge commit message. Please update your PR title and description to match the specification.

@JustinRush80 JustinRush80 marked this pull request as ready for review January 16, 2025 12:57
@JustinRush80 JustinRush80 changed the title Feat/merge schema upsert feat: add schema evolution to merge statement Jan 16, 2025
@JustinRush80 JustinRush80 merged commit fcc92b2 into main Jan 16, 2025
21 of 26 checks passed
@JustinRush80 JustinRush80 deleted the feat/merge_schema_upsert branch January 16, 2025 13:14
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.