-
Notifications
You must be signed in to change notification settings - Fork 338
Pull requests: pytorch/torchtitan
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[WIP] Integrate DeepGEMM, add supporting utils and unit testing, to enable blockwise fp8 inference
CLA Signed
This label is managed by the Meta Open Source bot.
#1124
opened Apr 21, 2025 by
lessw2020
Loading…
[WIP] [FT] Support local_sgd / diloco in titan
CLA Signed
This label is managed by the Meta Open Source bot.
[WIP] Llama4 Vision Encoder
CLA Signed
This label is managed by the Meta Open Source bot.
#1116
opened Apr 17, 2025 by
pbontrager
Loading…
Keeping SDPA happy, else hits following exceptions
CLA Signed
This label is managed by the Meta Open Source bot.
#1113
opened Apr 17, 2025 by
githubsgi
Loading…
[WIP]Implement llama4 HF format to DCP converter
CLA Signed
This label is managed by the Meta Open Source bot.
#1104
opened Apr 15, 2025 by
fegin
Loading…
improve reshard_after_forward logic
CLA Signed
This label is managed by the Meta Open Source bot.
#1094
opened Apr 11, 2025 by
tianyu-l
Loading…
[CI] Re-enable async TP test
CLA Signed
This label is managed by the Meta Open Source bot.
#1090
opened Apr 11, 2025 by
kwen2501
Loading…
[Fux] load AutoencoderKL from diffusers
CLA Signed
This label is managed by the Meta Open Source bot.
#1085
opened Apr 10, 2025 by
kashif
Loading…
[DeepSeek][kernels] index select permute, cuda
CLA Signed
This label is managed by the Meta Open Source bot.
#1083
opened Apr 9, 2025 by
lessw2020
Loading…
Fast dataset resume
CLA Signed
This label is managed by the Meta Open Source bot.
#1082
opened Apr 9, 2025 by
mariosasko
Loading…
[DeepSeek][Kernels] MoE sorting - Scatter Gather kernels
CLA Signed
This label is managed by the Meta Open Source bot.
#1065
opened Apr 7, 2025 by
lessw2020
Loading…
Adding Llama 1B and 3B model.
CLA Signed
This label is managed by the Meta Open Source bot.
#1040
opened Apr 1, 2025 by
githubsgi
Loading…
[WIP][Kernels] Contiguous Group GeMM
CLA Signed
This label is managed by the Meta Open Source bot.
#1036
opened Mar 31, 2025 by
lessw2020
Loading…
[Async TP] Add back reduce_scatter_tensor to save list for per op SAC now that it's supported in core
CLA Signed
This label is managed by the Meta Open Source bot.
#1031
opened Mar 29, 2025 by
danielvegamyhre
Loading…
[DeepSeek] Potential memory bug for noaux_tc?
CLA Signed
This label is managed by the Meta Open Source bot.
#1030
opened Mar 28, 2025 by
EugenHotaj
Loading…
[DeepSeek] Move seqlen from model config to This label is managed by the Meta Open Source bot.
setup_symm_mem
CLA Signed
#1017
opened Mar 24, 2025 by
kwen2501
Loading…
[not for land] enable torchao's mxfp8 training recipe
CLA Signed
This label is managed by the Meta Open Source bot.
#1015
opened Mar 24, 2025 by
vkuzo
Loading…
Hpc setup
CLA Signed
This label is managed by the Meta Open Source bot.
#1004
opened Mar 22, 2025 by
githubsgi
Loading…
[Experimental Feature] Huggingface model training
CLA Signed
This label is managed by the Meta Open Source bot.
#919
opened Mar 3, 2025 by
junjzhang
Loading…
Configure arbitrary frozen modules via config
CLA Signed
This label is managed by the Meta Open Source bot.
#869
opened Feb 20, 2025 by
lkhphuc
Loading…
[Not for landing] piggy back on titan for scale init test
CLA Signed
This label is managed by the Meta Open Source bot.
profile with modules and stack
CLA Signed
This label is managed by the Meta Open Source bot.
#829
opened Feb 10, 2025 by
carmocca
Loading…
[cp] Add cudnn attention support to Context Parallel
CLA Signed
This label is managed by the Meta Open Source bot.
Previous Next
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.