Skip to content

Pull requests: AI-Hypercomputer/maxtext

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Pass metadata to Linen for AttentionOp and L2Norm pull ready
#1882 opened Jun 26, 2025 by bvandermoon Loading…
4 tasks done
Elastic: fast-resume
#1881 opened Jun 25, 2025 by lukebaumann Draft
4 tasks
Elastic: Use pathwaysutils wait_for_slices
#1880 opened Jun 25, 2025 by lukebaumann Draft
4 tasks
Elastic Pyconfig
#1879 opened Jun 25, 2025 by lukebaumann Draft
4 tasks
Elastic refactor
#1878 opened Jun 25, 2025 by lukebaumann Draft
4 tasks
Internal change
#1877 opened Jun 25, 2025 by copybara-service bot Loading…
Reverts 61beb22933f50a9e5a4585a5dc79580526a853da
#1876 opened Jun 25, 2025 by copybara-service bot Loading…
Remove unused setup.py file
#1875 opened Jun 25, 2025 by bvandermoon Loading…
4 tasks done
update announcement for deepseek pull ready
#1873 opened Jun 25, 2025 by shuningjin Loading…
4 tasks done
Internal change
#1872 opened Jun 25, 2025 by copybara-service bot Loading…
Update MaxText's requirements.txt
#1871 opened Jun 24, 2025 by kanglant Loading…
4 tasks done
add fp8 recipe
#1870 opened Jun 24, 2025 by suexu1025 Loading…
4 tasks done
fix mmlu api
#1868 opened Jun 24, 2025 by shuningjin Loading…
4 tasks done
Elastic update
#1866 opened Jun 23, 2025 by lukebaumann Draft
4 tasks done
All Gather Once (FSDP Zero-one)
#1865 opened Jun 23, 2025 by wei879-100 Draft
4 tasks
Test notify changes
#1864 opened Jun 23, 2025 by quoctruong Draft
4 tasks done
Add DataLoader module and introduce custom StopTraining exception
#1862 opened Jun 23, 2025 by SurbhiJainUSC Loading…
4 tasks done
[DEBUGGING] Debugging flaky tests
#1861 opened Jun 22, 2025 by gobbleturk Loading…
4 tasks
[DRAFT] Qwen3 0.6b
#1858 opened Jun 21, 2025 by shralex Loading…
4 tasks
Add user guide to customize model
#1857 opened Jun 21, 2025 by RissyRan Loading…
4 tasks done
Use forked splash_attention_kernel
#1856 opened Jun 20, 2025 by copybara-service bot Loading…
Refactor: Decouple Core Transformer Blocks
#1852 opened Jun 19, 2025 by parambole Loading…
4 tasks done
ProTip! Type g p on any issue or pull request to go back to the pull request listing page.