Skip to content

Actions: pytorch/FBGEMM

FBGEMM_GPU-CUDA Benchmark

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
2,903 workflow runs
2,903 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

FBGEMM build changes to support integration with pytorch
FBGEMM_GPU-CUDA Benchmark #3006: Pull request #4354 synchronize by cthi
June 17, 2025 20:53 50m 43s cthi:export-D76734540
June 17, 2025 20:53 50m 43s
FBGEMM build changes to support integration with pytorch
FBGEMM_GPU-CUDA Benchmark #3005: Pull request #4354 synchronize by cthi
June 17, 2025 20:45 8m 46s cthi:export-D76734540
June 17, 2025 20:45 8m 46s
FBGEMM build changes to support integration with pytorch
FBGEMM_GPU-CUDA Benchmark #3004: Pull request #4354 synchronize by cthi
June 17, 2025 20:37 9m 12s cthi:export-D76734540
June 17, 2025 20:37 9m 12s
Enrich auto-tune shapes for OC OBA model
FBGEMM_GPU-CUDA Benchmark #3003: Pull request #4368 opened by RandySheriff
June 17, 2025 20:33 Action required RandySheriff:export-D76631650
June 17, 2025 20:33 Action required
NVFP4 quantization emulation kernels as reference
FBGEMM_GPU-CUDA Benchmark #3002: Pull request #4324 synchronize by summerdengfb
June 17, 2025 20:07 52m 36s summerdengfb:export-D76363519
June 17, 2025 20:07 52m 36s
FBGEMM build changes to support integration with pytorch
FBGEMM_GPU-CUDA Benchmark #3001: Pull request #4354 synchronize by cthi
June 17, 2025 20:03 34m 51s cthi:export-D76734540
June 17, 2025 20:03 34m 51s
Add CudaEvent Sync to Two Hop All To One Copies
FBGEMM_GPU-CUDA Benchmark #3000: Pull request #4367 opened by Jason-KChen
June 17, 2025 19:55 Action required Jason-KChen:export-D76556844
June 17, 2025 19:55 Action required
NVFP4 quantization emulation kernels as reference
FBGEMM_GPU-CUDA Benchmark #2999: Pull request #4324 synchronize by summerdengfb
June 17, 2025 19:29 10m 37s summerdengfb:export-D76363519
June 17, 2025 19:29 10m 37s
FBGEMM build changes to support integration with pytorch
FBGEMM_GPU-CUDA Benchmark #2998: Pull request #4354 synchronize by cthi
June 17, 2025 19:28 35m 26s cthi:export-D76734540
June 17, 2025 19:28 35m 26s
FBGEMM build changes to support integration with pytorch
FBGEMM_GPU-CUDA Benchmark #2997: Pull request #4354 synchronize by cthi
June 17, 2025 19:19 9m 57s cthi:export-D76734540
June 17, 2025 19:19 9m 57s
Add CudaEvents Barrier before MemCpy V33
FBGEMM_GPU-CUDA Benchmark #2996: Pull request #4348 synchronize by Jason-KChen
June 17, 2025 19:10 42m 8s Jason-KChen:export-D76478282
June 17, 2025 19:10 42m 8s
Add CudaEvents Barrier before MemCpy V33
FBGEMM_GPU-CUDA Benchmark #2995: Pull request #4348 synchronize by Jason-KChen
June 17, 2025 19:04 7m 21s Jason-KChen:export-D76478282
June 17, 2025 19:04 7m 21s
kvzch inference python operator
FBGEMM_GPU-CUDA Benchmark #2994: Pull request #4344 synchronize by chenyuzhcy
June 17, 2025 19:03 37m 42s chenyuzhcy:export-D73219651
June 17, 2025 19:03 37m 42s
[fbgemm_gpu] Upgrade CI instances
FBGEMM_GPU-CUDA Benchmark #2993: Pull request #4366 opened by q10
June 17, 2025 18:21 15m 3s q10:bm/upgrade-ci-machines
June 17, 2025 18:21 15m 3s
[fbgemm_gpu] ROCm fixes for CI
FBGEMM_GPU-CUDA Benchmark #2992: Pull request #4345 synchronize by q10
June 17, 2025 18:01 53m 42s q10:bm/rocm-fixing
June 17, 2025 18:01 53m 42s
[fbgemm_gpu] Fix CUDA 12.9 OSS compilation for HSTU
FBGEMM_GPU-CUDA Benchmark #2991: Pull request #4360 synchronize by q10
June 17, 2025 17:42 1h 0m 3s q10:bm/fix-cu129
June 17, 2025 17:42 1h 0m 3s
New DeepGemm Style Groupwise Kernel
FBGEMM_GPU-CUDA Benchmark #2990: Pull request #4365 opened by jwfromm
June 17, 2025 17:26 50m 11s jwfromm:export-D76830629
June 17, 2025 17:26 50m 11s
Add TBE data configuration reporter to TBE forward (v2)
FBGEMM_GPU-CUDA Benchmark #2989: Pull request #4364 synchronize by gchalump
June 17, 2025 16:59 45m 20s gchalump:export-D75462895
June 17, 2025 16:59 45m 20s
Add TBE data configuration reporter to TBE forward (v2)
FBGEMM_GPU-CUDA Benchmark #2988: Pull request #4364 synchronize by gchalump
June 17, 2025 16:42 16m 55s gchalump:export-D75462895
June 17, 2025 16:42 16m 55s
Add TBE data configuration reporter to TBE forward (v2)
FBGEMM_GPU-CUDA Benchmark #2987: Pull request #4364 opened by gchalump
June 17, 2025 16:31 11m 18s gchalump:export-D75462895
June 17, 2025 16:31 11m 18s
Support scale_bias_last on tbe lookup kernel
FBGEMM_GPU-CUDA Benchmark #2986: Pull request #4363 opened by jnwan
June 17, 2025 16:28 Action required jnwan:export-D76615824
June 17, 2025 16:28 Action required
kv embedding inference cache wrapper
FBGEMM_GPU-CUDA Benchmark #2985: Pull request #4343 synchronize by chenyuzhcy
June 17, 2025 16:05 49m 52s chenyuzhcy:export-D72587941
June 17, 2025 16:05 49m 52s
kv embedding inference cache wrapper
FBGEMM_GPU-CUDA Benchmark #2984: Pull request #4343 synchronize by chenyuzhcy
June 17, 2025 16:00 5m 44s chenyuzhcy:export-D72587941
June 17, 2025 16:00 5m 44s
Add CudaEvents Barrier before MemCpy V33
FBGEMM_GPU-CUDA Benchmark #2983: Pull request #4348 synchronize by Jason-KChen
June 17, 2025 15:37 51m 38s Jason-KChen:export-D76478282
June 17, 2025 15:37 51m 38s
Add CudaEvents Barrier before MemCpy V33
FBGEMM_GPU-CUDA Benchmark #2982: Pull request #4348 synchronize by Jason-KChen
June 17, 2025 15:32 5m 23s Jason-KChen:export-D76478282
June 17, 2025 15:32 5m 23s