creating a dataframe for a delta table between two versions to use with merge #2409
Unanswered
krishna-prasad-s
asked this question in
Q&A
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
I'm working on a merge activity where I want to replicate the checkpoint behaviour of structured spark streaming.
I am working with delta tables and I have access to table versioning.
I am thinking to apply the version number as an alternative to structured spark streaming's checkpoint.
can we create a dataframe (to be used with the merge function) buy querying between versions.
for e.g.
pub async fn get_df_from_version(&mut self, table_name: &str, version: i64) -> DataFrame { let sql = &format!("SELECT * FROM {} VERSION BETWEEN {} AND current", table_name, version); self.ctx.sql(sql).await.unwrap() }
or is there any better alternative?
Beta Was this translation helpful? Give feedback.
All reactions