Speed up polysemous training with AVX-512. #4578

mulugetam · 2025-09-10T18:23:18Z

This change adds AVX-512 implementations of the compute_cost and cost_update functions used in polysemous training. Benchmarks of the training phase with benchs/bench_polysemous_sift1m.py show ~1.5x speedup over the existing scalar implementation, which relied only on partial auto-vectorization.

Once the SIMD support overhaul is complete, I hope this can be considered for integration.
cc: @mdouze @subhadeepkaran

Signed-off-by: Mulugeta Mammo <[email protected]>

bshethmeta · 2025-09-11T18:48:36Z

@mnorris11 @subhadeepkaran Do you have enough context to review this?

Signed-off-by: Mulugeta Mammo <[email protected]>

subhadeepkaran · 2025-09-15T07:28:23Z

@mnorris11 @subhadeepkaran Do you have enough context to review this?

Yep, you can assign it to me. the change can be reviewed and merged post dynamic dispatch landing

Speed up polysemous training with AVX-512.

9a9694c

Signed-off-by: Mulugeta Mammo <[email protected]>

meta-cla bot added the CLA Signed label Sep 10, 2025

mulugetam added 2 commits September 11, 2025 21:30

Make the avx-512 implementation work on systems without AVX512VPOPCNTDQ.

58da41c

Signed-off-by: Mulugeta Mammo <[email protected]>

Use intrinsics for popcnt even when __AVX512VPOPCNTDQ__ is not defined.

fe424e6

Signed-off-by: Mulugeta Mammo <[email protected]>

mnorris11 assigned subhadeepkaran Oct 6, 2025

mnorris11 added the Implementation label Oct 6, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Speed up polysemous training with AVX-512. #4578

Speed up polysemous training with AVX-512. #4578

Uh oh!

mulugetam commented Sep 10, 2025

Uh oh!

bshethmeta commented Sep 11, 2025

Uh oh!

subhadeepkaran commented Sep 15, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Speed up polysemous training with AVX-512. #4578

Are you sure you want to change the base?

Speed up polysemous training with AVX-512. #4578

Uh oh!

Conversation

mulugetam commented Sep 10, 2025

Uh oh!

bshethmeta commented Sep 11, 2025

Uh oh!

subhadeepkaran commented Sep 15, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants