Fetch from nvidia Megatron-LM #5

Open
wants to merge 4,916 commits into base: load-iter

RaymondLi0

No description provided.

tomlifu and others added 30 commits April 15, 2025 08:12
Fix for Enabling CUDA Graph for MMDiT and fluxSingleTransformer layer

See merge request ADLR/megatron-lm!2984
Merge branch 'ko3n1g/ci/codeless-builds' into 'main'

See merge request ADLR/megatron-lm!3109
chore: Update codeowners

See merge request ADLR/megatron-lm!3079
Co-authored-by: root <[email protected]>
Co-authored-by: root <[email protected]>
Co-authored-by: root <[email protected]>
Co-authored-by: root <[email protected]>
Co-authored-by: William Dykas <[email protected]>
Switch seq parallel utility

See merge request ADLR/megatron-lm!3011
…N8G_mcore_tp1_pp1_resume_torch_dist_dist_optimizer`
ci: Broken test `gpt3_345m_nightly_dgx_a100_1N8G_mcore_tp1_pp1_resume_torch_dist_dist_optimizer`

See merge request ADLR/megatron-lm!3110
Removed deprecated real quant configs in Modelopt

See merge request ADLR/megatron-lm!2934
ci: Fix publish notify job

See merge request ADLR/megatron-lm!3117
ci: Upload pipeline telemetrics

See merge request ADLR/megatron-lm!3106
Fix `post_training/test_get_gpt_modelopt_spec_interface`

See merge request ADLR/megatron-lm!3118
Remove legacy bert tests

See merge request ADLR/megatron-lm!3023
Co-authored-by: Ali Taghibakhshi <[email protected]>
Co-authored-by: Mcore Bot <[email protected]>
Alit/config mamba head

See merge request ADLR/megatron-lm!2601
Update CODEOWNERS to make modelopt review only for QAT.

See merge request ADLR/megatron-lm!3125
Run nemo2 tests instead of nemo1

See merge request ADLR/megatron-lm!3119
…attn for dynamic batching.

Co-authored-by: Shanmugam Ramasamy <[email protected]>
Co-authored-by: root <[email protected]>
Co-authored-by: Vijay Korthikanti <[email protected]>
Co-authored-by: Mcore Bot <[email protected]>
Co-authored-by: root <[email protected]>
Co-authored-by: root <[email protected]>
Co-authored-by: root <[email protected]>
Co-authored-by: root <[email protected]>
Co-authored-by: root <[email protected]>
Co-authored-by: root <[email protected]>
Integrating paged attention feature of flash_attn for dynamic batching.

See merge request ADLR/megatron-lm!2955
Co-authored-by: Mcore Bot <[email protected]>
Co-authored-by: yaoyu-33 <[email protected]>
Co-authored-by: Chenhan Yu <[email protected]>
add l2 norm in torch_norm.py for LLAMA-4 support

See merge request ADLR/megatron-lm!2960
sbhavani and others added 30 commits May 8, 2025 10:56
Updated setup instructions in README.md

See merge request ADLR/megatron-lm!3210
Disable cudagraphs when pipeline parallel microbatched inference is on

See merge request ADLR/megatron-lm!3151
Inference functional test: 580M Minitron

See merge request ADLR/megatron-lm!2812
Invalidate cached SSM tensors if batch size changes during inference

See merge request ADLR/megatron-lm!3277
ci: Move unit test logic to file

See merge request ADLR/megatron-lm!3291
Adapt _write_item call to new signature with 'serialization_format'

See merge request ADLR/megatron-lm!3243
Add in-process restart

See merge request ADLR/megatron-lm!2711
ci: Run on multiple clusters

See merge request ADLR/megatron-lm!3292
ci: Allow specific TE-ref

See merge request ADLR/megatron-lm!3302
ci(fix): Write logs to log_dir

See merge request ADLR/megatron-lm!3299
Address dist checkpointing PyT 24.08 failure

See merge request ADLR/megatron-lm!3253
ci(hotfix): Downstream pipeline

See merge request ADLR/megatron-lm!3307
…nal argparse flag to clear GPU...

Co-authored-by: Szymon Migacz <[email protected]>
MR feedback: added units for arguments, optional argparse flag to clear GPU...

See merge request ADLR/megatron-lm!3308