Implement concat_json to join JSON strings given by strings column #2457

Merged: 46 commits merged into NVIDIA:branch-24.12 from concat_json on Oct 10, 2024

ttnghia (Collaborator) commented Sep 30, 2024

This adds concat_json, which joins the JSON strings held in a strings column into one unified JSON string. Along with the concatenated data buffer, the result contains a vector indicating whether each input row is null or empty, and the delimiter character used for concatenation.
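For readers unfamiliar with the idea, here is a rough host-side sketch in Python of what such a concatenation could look like. It is not the code from this PR (which operates on GPU columns); the function name concat_json_sketch, the "{}" placeholder for null/empty rows, and the rule for choosing a delimiter that appears in no input row are all assumptions made for illustration only.

```python
from typing import List, Optional, Tuple


def concat_json_sketch(rows: List[Optional[str]]) -> Tuple[str, List[bool], str]:
    """Hypothetical illustration only; not the PR's implementation.

    Returns (concatenated buffer, null-or-empty flags, delimiter).
    """
    # One flag per input row: True when the row is null (None) or empty.
    is_null_or_empty = [r is None or r == "" for r in rows]

    # Assumption: choose a delimiter character that occurs in no input row,
    # so the joined buffer can be split back into the original rows.
    used = set()
    for r in rows:
        if r is not None:
            used.update(r)
    candidates = ["\n"] + [chr(c) for c in range(1, 32)]
    delimiter = next((ch for ch in candidates if ch not in used), None)
    if delimiter is None:
        raise ValueError("no unused delimiter character available")

    # Assumption: null/empty rows are replaced by an empty JSON object so the
    # number of delimited records still equals the number of input rows.
    parts = ["{}" if flag else r for flag, r in zip(is_null_or_empty, rows)]
    return delimiter.join(parts), is_null_or_empty, delimiter
```

Under these assumptions, splitting the returned buffer on the returned delimiter yields exactly one record per input row, and the flag vector tells the caller which of those records were placeholders rather than real input.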

karthikeyann and others added 21 commits August 29, 2024 23:35
Signed-off-by: Nghia Truong <[email protected]>
# Conflicts:
#	thirdparty/cudf
ttnghia requested a review from revans2 on September 30, 2024 23:42
ttnghia self-assigned this on Sep 30, 2024
ttnghia (Collaborator, Author) commented Sep 30, 2024

build

ttnghia (Collaborator, Author) commented Oct 1, 2024

build

Signed-off-by: Nghia Truong <[email protected]>
This reverts commit 9af88ca.
Signed-off-by: Nghia Truong <[email protected]>
ttnghia (Collaborator, Author) commented Oct 7, 2024

build

ttnghia changed the base branch from branch-24.10 to branch-24.12 on October 8, 2024 18:30
ttnghia (Collaborator, Author) commented Oct 8, 2024

build

revans2 (Collaborator) left a comment

Performance improved substantially in my tests, 24% better than the current baseline, and all JSON-related tests passed.

I need to track down what happened with some regexp tests, but they appear unrelated.

ttnghia merged commit 10fbcff into NVIDIA:branch-24.12 on Oct 10, 2024
3 checks passed
ttnghia deleted the concat_json branch on October 10, 2024 22:38