Skip to content

[#2518] fix(client): Disable fast fail on any pushing failure if partition reassign is activated #2519

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 1 commit into from
Jun 25, 2025

Conversation

zuston
Copy link
Member

@zuston zuston commented Jun 24, 2025

What changes were proposed in this pull request?

Disable fast fail on any pushing failure if partition reassign is activated

Why are the changes needed?

Before this PR, if multi-replica is disabled, the sending logic would fail fast upon any push failure.
However, when partition reassignment is enabled, this behavior could cause some block futures to be canceled, leading to the loss of block sending events.
As a result, the client might fail due to unsent remaining blocks.

for #2518

Does this PR introduce any user-facing change?

No.

How was this patch tested?

Needn't. This could be quickly validated by human.

Copy link

Test Results

 3 049 files  ±0   3 049 suites  ±0   6h 47m 58s ⏱️ -45s
 1 186 tests ±0   1 185 ✅ ±0   1 💤 ±0  0 ❌ ±0 
15 042 runs  ±0  15 027 ✅ ±0  15 💤 ±0  0 ❌ ±0 

Results for commit 61dca3c. ± Comparison against base commit 907e625.

@zuston zuston requested a review from jerqi June 24, 2025 08:11
@zuston zuston merged commit f3eaf06 into apache:master Jun 25, 2025
41 checks passed
@zuston zuston deleted the fixBlockRelease branch June 25, 2025 02:11
zuston added a commit to zuston/incubator-uniffle that referenced this pull request Jul 2, 2025
…f partition reassign is activated (apache#2519)

### What changes were proposed in this pull request?

Disable fast fail on any pushing failure if partition reassign is activated

### Why are the changes needed?

Before this PR, if multi-replica is disabled, the sending logic would fail fast upon any push failure.
However, when partition reassignment is enabled, this behavior could cause some block futures to be canceled, leading to the loss of block sending events.
As a result, the client might fail due to unsent remaining blocks.

for apache#2518

### Does this PR introduce _any_ user-facing change?

No.

### How was this patch tested?

Needn't. This could be quickly validated by human.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants