Pull requests: pytorch/torchtitan

fix: remove max_len from generate_permute_indices CLA Signed
#1268 opened Jun 6, 2025 by hann-wang
Fix lr scheduler CLA Signed
#1261 opened Jun 4, 2025 by CarlosGomes98
[WIP][Blackwell Kernels] Blackwell group gemm and dense gemms with Python Cutlass CLA Signed
#1256 opened Jun 3, 2025 by lessw2020
alternative implementation of create_indices_from_offsets_nosync compatible with torch.compile CLA Signed
#1251 opened Jun 1, 2025 by hann-wang
[float8] add float8 rowwise MoE prototype CLA Signed
#1245 opened May 30, 2025 by danielvegamyhre Draft
Add AMD GPU node for integration test CLA Signed
#1241 opened May 29, 2025 by mori360 Draft
Implement initial_load_path for checkpointer CLA Signed
#1236 opened May 28, 2025 by fegin
[cp][flex_attention] integration test trial CLA Signed
#1228 opened May 27, 2025 by XilunWu Draft
[Flux] Add batched inference CLA Signed
#1227 opened May 27, 2025 by CarlosGomes98
[WIP] Implement the feature to save unsharded weights at the last step CLA Signed
#1219 opened May 23, 2025 by fegin
[WIP][Experimental] Activation Offloading CLA Signed
#1218 opened May 23, 2025 by lessw2020
[WIP][DeepSeek] DeepSeek training and component integration with Titan main components CLA Signed
#1183 opened May 13, 2025 by lessw2020
compile: turn off fullgraph=True to support llama4 CLA Signed
#1182 opened May 12, 2025 by bdhirsh
🐛 Use correct path for train_configs
#1163 opened May 2, 2025 by brianlechthaler
[cp][flex_attention] integration test trial CLA Signed module: context parallel
#1160 opened May 1, 2025 by XilunWu Draft
[WIP] float8 rowwise all gather CLA Signed
#1157 opened Apr 30, 2025 by danielvegamyhre Draft
[WIP] token-expert assignments and layer affinity tracking for expert placement via ILP solving CLA Signed
#1152 opened Apr 28, 2025 by lessw2020
Add grad_norm metrics CLA Signed
#1143 opened Apr 25, 2025 by yzhangcs
Enable save plan caching CLA Signed fb-exported
#1140 opened Apr 23, 2025 by MeetVadakkanchery
[WIP] Llama4 Vision Encoder CLA Signed
#1116 opened Apr 17, 2025 by pbontrager
[WIP]Implement llama4 HF format to DCP converter CLA Signed
#1104 opened Apr 15, 2025 by fegin
improve reshard_after_forward logic CLA Signed
#1094 opened Apr 11, 2025 by tianyu-l
[Fux] load AutoencoderKL from diffusers CLA Signed
#1085 opened Apr 10, 2025 by kashif
[DeepSeek][kernels] index select permute, cuda CLA Signed
#1083 opened Apr 9, 2025 by lessw2020