-
Notifications
You must be signed in to change notification settings - Fork 356
Issues: AI-Hypercomputer/maxtext
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Feature Request: Implement Automatic Change of Grain Data Iterators Upon Training Resumption
feature request
#1760
opened May 21, 2025 by
ianmcampbell
execute bash setup.sh take too long due to dependency version not locked.
#1744
opened May 15, 2025 by
elbertwang
Package hierarchy: should benchmarks, end_to_end, pedagogical_examples be under MaxText?
#1635
opened Apr 25, 2025 by
SamuelMarks
Make maxtext_xpk_runner support other trainers
feature request
#1552
opened Apr 9, 2025 by
lukebaumann
Please create direct conversion scripts from huggingface for Gemma3 models
#1528
opened Apr 5, 2025 by
R4ZZ3
moe_lb_loss should be divided by gradient_accumulation_steps for reporting.
#1483
opened Mar 26, 2025 by
bzantium
When using dcn-DP and dcn-FSDP together got error when saving checkpoint.
#1434
opened Mar 20, 2025 by
jiagaoxiang
The default setting of
param_scan_axis=1
hurts performance and memory consumption on GPUs
#1382
opened Mar 12, 2025 by
jaro-sevcik
MFU drops significantly when using megablox with more experts
#1256
opened Feb 9, 2025 by
rodrigo-f-nogueira
llama GPU model with dcn fsdp + ici tp + cudnn flash attention broken
#1093
opened Dec 10, 2024 by
wang2yn84
Previous Next
ProTip!
Find all open issues with in progress development work with linked:pr.