Skip to content

Actions: pytorch/FBGEMM

FBGEMM_GPU-CUDA Benchmark

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
2,841 workflow runs
2,841 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

kv embedding inference cache wrapper
FBGEMM_GPU-CUDA Benchmark #2968: Pull request #4343 synchronize by chenyuzhcy
June 17, 2025 02:19 In progress chenyuzhcy:export-D72587941
June 17, 2025 02:19 In progress
Implement a stat library for fbgemm embedding
FBGEMM_GPU-CUDA Benchmark #2967: Pull request #4339 synchronize by Kaiweitu
June 17, 2025 00:55 54m 12s Kaiweitu:export-D76060846
June 17, 2025 00:55 54m 12s
handle inference buck gpu deps
FBGEMM_GPU-CUDA Benchmark #2966: Pull request #4358 synchronize by chenyuzhcy
June 16, 2025 23:02 58m 30s chenyuzhcy:export-D76228086
June 16, 2025 23:02 58m 30s
handle inference buck gpu deps
FBGEMM_GPU-CUDA Benchmark #2965: Pull request #4358 synchronize by chenyuzhcy
June 16, 2025 22:55 7m 35s chenyuzhcy:export-D76228086
June 16, 2025 22:55 7m 35s
tbe cpu nobag dispatch and forward pass kernel impl
FBGEMM_GPU-CUDA Benchmark #2964: Pull request #4302 synchronize by yabalaban
June 16, 2025 21:35 50m 53s yabalaban:export-D75464152
June 16, 2025 21:35 50m 53s
tbe cpu nobag dispatch and backward pass kernel impl
FBGEMM_GPU-CUDA Benchmark #2963: Pull request #4303 synchronize by yabalaban
June 16, 2025 21:34 52m 17s yabalaban:export-D75464151
June 16, 2025 21:34 52m 17s
tbe cpu nobag dispatch and backward pass kernel impl
FBGEMM_GPU-CUDA Benchmark #2962: Pull request #4303 synchronize by yabalaban
June 16, 2025 21:28 5m 18s yabalaban:export-D75464151
June 16, 2025 21:28 5m 18s
tbe cpu nobag dispatch and forward pass kernel impl
FBGEMM_GPU-CUDA Benchmark #2961: Pull request #4302 synchronize by yabalaban
June 16, 2025 21:27 8m 44s yabalaban:export-D75464152
June 16, 2025 21:27 8m 44s
Vectorize load/store for FP8 Quantization
FBGEMM_GPU-CUDA Benchmark #2960: Pull request #4262 synchronize by flaviotruzzi
June 16, 2025 20:41 49m 14s flaviotruzzi:export-D75563906
June 16, 2025 20:41 49m 14s
silu_mul API Update
FBGEMM_GPU-CUDA Benchmark #2959: Pull request #4359 opened by sunfish2010
June 16, 2025 20:01 Action required sunfish2010:export-D76395658
June 16, 2025 20:01 Action required
kvzch inference python operator
FBGEMM_GPU-CUDA Benchmark #2958: Pull request #4344 synchronize by chenyuzhcy
June 16, 2025 19:51 52m 45s chenyuzhcy:export-D73219651
June 16, 2025 19:51 52m 45s
kv embedding inference cache wrapper
FBGEMM_GPU-CUDA Benchmark #2957: Pull request #4343 synchronize by chenyuzhcy
June 16, 2025 19:48 51m 4s chenyuzhcy:export-D72587941
June 16, 2025 19:48 51m 4s
kvzch inference python operator
FBGEMM_GPU-CUDA Benchmark #2956: Pull request #4344 synchronize by chenyuzhcy
June 16, 2025 19:40 11m 41s chenyuzhcy:export-D73219651
June 16, 2025 19:40 11m 41s
kv embedding inference cache wrapper
FBGEMM_GPU-CUDA Benchmark #2955: Pull request #4343 synchronize by chenyuzhcy
June 16, 2025 19:39 10m 54s chenyuzhcy:export-D72587941
June 16, 2025 19:39 10m 54s
[fbgemm_gpu] ROCm fixes for CI
FBGEMM_GPU-CUDA Benchmark #2954: Pull request #4345 synchronize by q10
June 16, 2025 19:33 49m 38s q10:bm/rocm-fixing
June 16, 2025 19:33 49m 38s
handle inference buck gpu deps
FBGEMM_GPU-CUDA Benchmark #2953: Pull request #4358 opened by chenyuzhcy
June 16, 2025 19:27 49m 18s chenyuzhcy:export-D76228086
June 16, 2025 19:27 49m 18s
Deprecate barrier isolation macros
FBGEMM_GPU-CUDA Benchmark #2952: Pull request #4357 opened by q10
June 16, 2025 18:56 49m 25s q10:export-D76700671
June 16, 2025 18:56 49m 25s
[fbgemm_gpu] Add build support for CUDA 12.9
FBGEMM_GPU-CUDA Benchmark #2951: Pull request #4356 synchronize by q10
June 16, 2025 18:47 50m 55s q10:bm/cuda-129
June 16, 2025 18:47 50m 55s
[fbgemm_gpu] Add build support for CUDA 12.9
FBGEMM_GPU-CUDA Benchmark #2950: Pull request #4356 synchronize by q10
June 16, 2025 18:33 15m 6s q10:bm/cuda-129
June 16, 2025 18:33 15m 6s
Add CudaEvents Barrier before MemCpy V33
FBGEMM_GPU-CUDA Benchmark #2949: Pull request #4348 synchronize by Jason-KChen
June 16, 2025 18:27 52m 25s Jason-KChen:export-D76478282
June 16, 2025 18:27 52m 25s
Add CudaEvents Barrier before MemCpy V33
FBGEMM_GPU-CUDA Benchmark #2948: Pull request #4348 synchronize by Jason-KChen
June 16, 2025 18:12 16m 43s Jason-KChen:export-D76478282
June 16, 2025 18:12 16m 43s
Add CudaEvents Barrier before MemCpy V33
FBGEMM_GPU-CUDA Benchmark #2947: Pull request #4348 synchronize by Jason-KChen
June 16, 2025 18:06 7m 13s Jason-KChen:export-D76478282
June 16, 2025 18:06 7m 13s
[fbgemm_gpu] Add build support for CUDA 12.9
FBGEMM_GPU-CUDA Benchmark #2946: Pull request #4356 synchronize by q10
June 16, 2025 17:58 36m 5s q10:bm/cuda-129
June 16, 2025 17:58 36m 5s
Vectorize load/store for FP8 Quantization
FBGEMM_GPU-CUDA Benchmark #2945: Pull request #4262 synchronize by flaviotruzzi
June 16, 2025 17:53 50m 19s flaviotruzzi:export-D75563906
June 16, 2025 17:53 50m 19s
[fbgemm_gpu] Add build support for CUDA 12.9
FBGEMM_GPU-CUDA Benchmark #2944: Pull request #4356 opened by q10
June 16, 2025 17:53 6m 8s q10:bm/cuda-129
June 16, 2025 17:53 6m 8s