Skip to content

Actions: pytorch/FBGEMM

FBGEMM_GPU-CUDA Benchmark

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
2,902 workflow runs
2,902 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Add CudaEvents Barrier before MemCpy V33
FBGEMM_GPU-CUDA Benchmark #2929: Pull request #4348 synchronize by Jason-KChen
June 16, 2025 14:47 4m 53s Jason-KChen:export-D76478282
June 16, 2025 14:47 4m 53s
tbe cpu nobag dispatch and backward pass kernel impl
FBGEMM_GPU-CUDA Benchmark #2928: Pull request #4303 synchronize by yabalaban
June 16, 2025 14:44 27m 30s yabalaban:export-D75464151
June 16, 2025 14:44 27m 30s
tbe cpu nobag dispatch and forward pass kernel impl
FBGEMM_GPU-CUDA Benchmark #2927: Pull request #4302 synchronize by yabalaban
June 16, 2025 14:41 50m 55s yabalaban:export-D75464152
June 16, 2025 14:41 50m 55s
tbe cpu nobag dispatch and backward pass kernel impl
FBGEMM_GPU-CUDA Benchmark #2926: Pull request #4303 synchronize by yabalaban
June 16, 2025 14:40 4m 41s yabalaban:export-D75464151
June 16, 2025 14:40 4m 41s
tbe cpu nobag dispatch and forward pass kernel impl
FBGEMM_GPU-CUDA Benchmark #2925: Pull request #4302 synchronize by yabalaban
June 16, 2025 14:37 8m 22s yabalaban:export-D75464152
June 16, 2025 14:37 8m 22s
Support prefetch pipeline in bounds_check_indices
FBGEMM_GPU-CUDA Benchmark #2924: Pull request #4312 synchronize by sryap
June 16, 2025 06:24 50m 7s sryap:export-D72365505
June 16, 2025 06:24 50m 7s
Support prefetch pipeline in bounds_check_indices
FBGEMM_GPU-CUDA Benchmark #2923: Pull request #4312 synchronize by sryap
June 16, 2025 06:19 6m 8s sryap:export-D72365505
June 16, 2025 06:19 6m 8s
Support prefetch pipeline in bounds_check_indices
FBGEMM_GPU-CUDA Benchmark #2922: Pull request #4312 synchronize by sryap
June 16, 2025 06:16 4m 55s sryap:export-D72365505
June 16, 2025 06:16 4m 55s
tbe cpu nobag dispatch and forward pass kernel impl
FBGEMM_GPU-CUDA Benchmark #2921: Pull request #4302 synchronize by yabalaban
June 15, 2025 22:31 50m 30s yabalaban:export-D75464152
June 15, 2025 22:31 50m 30s
tbe cpu nobag dispatch and backward pass kernel impl
FBGEMM_GPU-CUDA Benchmark #2920: Pull request #4303 synchronize by yabalaban
June 15, 2025 22:30 50m 52s yabalaban:export-D75464151
June 15, 2025 22:30 50m 52s
tbe cpu nobag dispatch and forward pass kernel impl
FBGEMM_GPU-CUDA Benchmark #2919: Pull request #4302 synchronize by yabalaban
June 15, 2025 22:23 9m 31s yabalaban:export-D75464152
June 15, 2025 22:23 9m 31s
tbe cpu nobag dispatch and backward pass kernel impl
FBGEMM_GPU-CUDA Benchmark #2918: Pull request #4303 synchronize by yabalaban
June 15, 2025 22:23 8m 15s yabalaban:export-D75464151
June 15, 2025 22:23 8m 15s
Add FP32 support for routing_score dtype
FBGEMM_GPU-CUDA Benchmark #2917: Pull request #4352 synchronize by jianyuh
June 15, 2025 21:26 51m 4s jianyuh:export-D76679848
June 15, 2025 21:26 51m 4s
Add FP32 support for routing_score dtype
FBGEMM_GPU-CUDA Benchmark #2916: Pull request #4352 synchronize by jianyuh
June 15, 2025 21:22 5m 40s jianyuh:export-D76679848
June 15, 2025 21:22 5m 40s
Fix array-bounds error
FBGEMM_GPU-CUDA Benchmark #2915: Pull request #3798 synchronize by cyyever
June 15, 2025 08:48 Action required cyyever:boundary
June 15, 2025 08:48 Action required
Add FP32 support for routing_score dtype
FBGEMM_GPU-CUDA Benchmark #2914: Pull request #4352 synchronize by jianyuh
June 15, 2025 08:40 49m 48s jianyuh:export-D76679848
June 15, 2025 08:40 49m 48s
Add FP32 support for routing_score dtype
FBGEMM_GPU-CUDA Benchmark #2913: Pull request #4352 synchronize by jianyuh
June 15, 2025 08:35 5m 32s jianyuh:export-D76679848
June 15, 2025 08:35 5m 32s
Add FP32 support for routing_score dtype
FBGEMM_GPU-CUDA Benchmark #2912: Pull request #4352 synchronize by jianyuh
June 15, 2025 08:31 6m 15s jianyuh:export-D76679848
June 15, 2025 08:31 6m 15s
Build and optimize BF16 grouped GEMM on blackwell
FBGEMM_GPU-CUDA Benchmark #2911: Pull request #4353 opened by jiawenliu64
June 15, 2025 07:17 1h 24m 58s jiawenliu64:export-D76632951
June 15, 2025 07:17 1h 24m 58s
Add FP32 support for routing_score dtype
FBGEMM_GPU-CUDA Benchmark #2910: Pull request #4352 opened by jianyuh
June 15, 2025 06:37 49m 6s jianyuh:export-D76679848
June 15, 2025 06:37 49m 6s
Tune FP8 grouped GEMM for Llama4 shapes
FBGEMM_GPU-CUDA Benchmark #2909: Pull request #4326 synchronize by jiawenliu64
June 15, 2025 06:02 50m 33s jiawenliu64:export-D76460456
June 15, 2025 06:02 50m 33s
Tune FP8 grouped GEMM for Llama4 shapes
FBGEMM_GPU-CUDA Benchmark #2908: Pull request #4326 synchronize by jiawenliu64
June 15, 2025 05:52 10m 23s jiawenliu64:export-D76460456
June 15, 2025 05:52 10m 23s
[fbgemm_gpu] SSD test fix for OSS
FBGEMM_GPU-CUDA Benchmark #2907: Pull request #4351 synchronize by q10
June 14, 2025 20:57 49m 35s q10:bm/ssd-test-fix
June 14, 2025 20:57 49m 35s
[fbgemm_gpu] SSD test fix for OSS
FBGEMM_GPU-CUDA Benchmark #2906: Pull request #4351 synchronize by q10
June 14, 2025 19:23 51m 55s q10:bm/ssd-test-fix
June 14, 2025 19:23 51m 55s
[fbgemm_gpu] ROCm fixes for CI
FBGEMM_GPU-CUDA Benchmark #2905: Pull request #4345 synchronize by q10
June 14, 2025 19:23 49m 32s q10:bm/rocm-fixing
June 14, 2025 19:23 49m 32s