-
Notifications
You must be signed in to change notification settings - Fork 592
Pull requests: pytorch/FBGEMM
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Fix Signed-Unsigned Comparison in Tensor Utils
cla signed
fb-exported
#4279
opened Jun 6, 2025 by
wenxin0319
Loading…
Migrate jagged tensor kernels to
FBGEMM_LAUNCH_KERNEL
, pt 1
cla signed
fb-exported
#4278
opened Jun 5, 2025 by
q10
Loading…
improve write performance by ~10x
cla signed
fb-exported
#4277
opened Jun 5, 2025 by
steven1327
Loading…
Update AI Codesign Cutlass to 4.0
cla signed
fb-exported
#4276
opened Jun 5, 2025 by
jwfromm
Loading…
Added triton implementation for nvfp4 quantization scheme
cla signed
fb-exported
#4275
opened Jun 5, 2025 by
Tianyu-Liang
Loading…
Deprecate float32 bias for Cutlass FP8 rowwise
cla signed
fb-exported
#4274
opened Jun 5, 2025 by
cthi
Loading…
Decouple some operator defs from operator impl
cla signed
fb-exported
#4272
opened Jun 5, 2025 by
PatriceVignola
Loading…
Skip ROCm test for MPZCH kernel
cla signed
fb-exported
module: rocm
#4271
opened Jun 5, 2025 by
lizhouyu
Loading…
Attempt to fix the operator [] error in CUDA Kernel
cla signed
fb-exported
#4269
opened Jun 5, 2025 by
lizhouyu
Loading…
Attempts to solve errors caused by undefined function "CHECK_NOTNULL" - Use TORCH_CHECK instead
cla signed
fb-exported
#4266
opened Jun 4, 2025 by
lizhouyu
Loading…
Attempts to solve errors caused by undefined function "CHECK_NOTNULL" - Move common header back to src folder
cla signed
fb-exported
#4265
opened Jun 4, 2025 by
lizhouyu
Loading…
Vectorize load/store for get_FP8_qparam_cuda_kernel
cla signed
fb-exported
#4263
opened Jun 4, 2025 by
flaviotruzzi
Loading…
Vectorize load/store for FP8 Quantization
cla signed
fb-exported
#4262
opened Jun 4, 2025 by
flaviotruzzi
Loading…
Add headers and modify the cmake
cla signed
fb-exported
#4256
opened Jun 4, 2025 by
lizhouyu
Loading…
Update the rowwise adagrad optimizer to leverage optimizer state offloading, v4, frontend
cla signed
fb-exported
#4249
opened Jun 3, 2025 by
q10
Loading…
Test outside of namespace reference
cla signed
fb-exported
#4248
opened Jun 3, 2025 by
aporialiao
Loading…
Revert D75037710: Decouple some operator defs from operator impl
ci-no-td
cla signed
fb-exported
#4244
opened Jun 3, 2025 by
PatriceVignola
Loading…
Simplify CK FP8 Kernel Launch and enable FP16 Outputs.
cla signed
fb-exported
#4233
opened Jun 1, 2025 by
jwfromm
Loading…
Added unit tests for the entire ssd offloading using rocksdb checkpoint flow
cla signed
fb-exported
#4228
opened May 30, 2025 by
Raahul46
Loading…
Previous Next
ProTip!
Type g p on any issue or pull request to go back to the pull request listing page.