Skip to content

Conversation

@mdw771
Copy link
Collaborator

@mdw771 mdw771 commented Dec 2, 2025

Features/fixes

Fixed the inconsistency between multi-GPU and single GPU LSQML with compact batching.

The code now syncs

  • combined object update direction buffer
  • probe update direction from all object patches (for preconditioning)
  • indices for step size averaging
    across all ranks.

Checklist

Have you...

  • Formatted your code properly adhering to PEP-8? Considering using RUFF to format your code automatically.
  • Resolved all merge conflicts with the main branch?
  • Checked the diffs between your branch and the main branch to ensure that your changes are not introducing any regressions or unnecessary/irrelevant changes?

@mdw771 mdw771 merged commit 5d37add into main Dec 2, 2025
3 of 4 checks passed
@mdw771 mdw771 deleted the multigpu_fix branch December 2, 2025 22:39
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants