[WIP][SPARK-51180][BUILD] Upgrade Arrow to 19.0.0 #49909

Open: 3 commits to be merged into base branch master

aimtsou commented Feb 12, 2025

What changes were proposed in this pull request?

This PR aims to upgrade Arrow to 19.0.0 for Apache Spark 4.1.0.

Why are the changes needed?

To bring in the latest changes from the Arrow 19.0.0 release: https://arrow.apache.org/release/19.0.0.html

Does this PR introduce any user-facing change?

No

How was this patch tested?

CI

Was this patch authored or co-authored using generative AI tooling?

No

Extra

Keeping this at 19.0.0 for the moment, until 19.0.1 is released.

@github-actions github-actions bot added the BUILD label Feb 12, 2025
@aimtsou aimtsou changed the title from "[SPARK-51180][BUILD] Upgrade Arrow to 19.0.0" to "[WIP][SPARK-51180][BUILD] Upgrade Arrow to 19.0.0" Feb 12, 2025

aimtsou commented Feb 12, 2025

For some reason, the workflow gets a 403 Forbidden response when uploading the image.

@dongjoon-hyun dongjoon-hyun left a comment

Thank you for making a PR, @aimtsou.

Apache Spark Pull Request CI is running on your fork.

Testing with GitHub Actions workflow
Apache Spark leverages GitHub Actions that enables continuous integration and a wide range of automation. Apache Spark repository provides several GitHub Actions workflows for developers to run before creating a pull request.

Could you check the settings of your fork? The 403 Forbidden error occurs in your repository, as shown in the following log.

1 warning found (use docker --debug to expand):
 - UndefinedVar: Usage of undefined variable '$R_LIBS_SITE' (line 75)
ERROR: failed to solve: failed to push ghcr.io/aimtsou/apache-spark-ci-image-docs:master-13287334536: unexpected status from POST request to https://ghcr.io/v2/aimtsou/apache-spark-ci-image-docs/blobs/uploads/: 403 Forbidden

BTW, it's too late for Apache Spark 4.0.0. Please update your PR description to 4.1.0.


aimtsou commented Feb 12, 2025

@dongjoon-hyun: I will look into the settings for the fork.

I will update the PR and the JIRA ticket, and will probably hold off until Arrow 19.0.1, since 19.0.0 is not on Maven Central and has a small issue that will be fixed in 19.0.1.

@zhengruifeng

cc @LuciferYang, who is also working on the Arrow upgrade.


LuciferYang commented Feb 13, 2025

Starting from 18.2.0, the Java version of Arrow has been moved to a separate repository and is now released independently.

Arrow Java 19.0.0 has not been released yet ...
