Pull requests: pytorch/torchtitan
fix: remove max_len from generate_permute_indices
CLA Signed
#1268
opened Jun 6, 2025 by
hann-wang
Fix lr scheduler
CLA Signed
#1261
opened Jun 4, 2025 by
CarlosGomes98
Added support for creating ROCm docker image for torchtitan & run torchtitan tests on ROCm.
CLA Signed
module: rocm
#1260
opened Jun 4, 2025 by
akashveramd
Draft
[WIP][Blackwell Kernels] Blackwell group gemm and dense gemms with Python Cutlass
CLA Signed
#1256
opened Jun 3, 2025 by
lessw2020
alternative implementation of create_indices_from_offsets_nosync compatible with torch.compile
CLA Signed
#1251
opened Jun 1, 2025 by
hann-wang
[float8] add float8 rowwise MoE prototype
CLA Signed
#1245
opened May 30, 2025 by
danielvegamyhre
Draft
Implement initial_load_path for checkpointer
CLA Signed
#1236
opened May 28, 2025 by
fegin
[cp][flex_attention] integration test trial
CLA Signed
[Flux] Add batched inference
CLA Signed
#1227
opened May 27, 2025 by
CarlosGomes98
[WIP] Implement the feature to save unsharded weights at the last step
CLA Signed
#1219
opened May 23, 2025 by
fegin
[WIP][Experimental] Activation Offloading
CLA Signed
#1218
opened May 23, 2025 by
lessw2020
[WIP][DeepSeek] DeepSeek training and component integration with Titan main components
CLA Signed
#1183
opened May 13, 2025 by
lessw2020
compile: turn off fullgraph=True to support llama4
CLA Signed
#1182
opened May 12, 2025 by
bdhirsh
[cp][flex_attention] integration test trial
CLA Signed
module: context parallel
[WIP] float8 rowwise all gather
CLA Signed
#1157
opened Apr 30, 2025 by
danielvegamyhre
Draft
[WIP] token-expert assignments and layer affinity tracking for expert placement via ILP solving
CLA Signed
#1152
opened Apr 28, 2025 by
lessw2020
Add grad_norm metrics
CLA Signed
#1143
opened Apr 25, 2025 by
yzhangcs
Enable save plan caching
CLA Signed
fb-exported
#1140
opened Apr 23, 2025 by
MeetVadakkanchery
[WIP] Llama4 Vision Encoder
CLA Signed
#1116
opened Apr 17, 2025 by
pbontrager
[WIP]Implement llama4 HF format to DCP converter
CLA Signed
#1104
opened Apr 15, 2025 by
fegin
improve reshard_after_forward logic
CLA Signed
#1094
opened Apr 11, 2025 by
tianyu-l
[Flux] load AutoencoderKL from diffusers
CLA Signed
#1085
opened Apr 10, 2025 by
kashif
[DeepSeek][kernels] index select permute, cuda
CLA Signed
#1083
opened Apr 9, 2025 by
lessw2020