Skip to content

how many partitios will zero stage 3 divide mode #3438

Answered by tjruwase
cccc0der asked this question in Q&A
Discussion options

You must be logged in to vote

@cccc0der, yes, the vanilla zero3 will split each parameter across the 640 GPUs. However, we are integrating zero3 improvements that split across a subset of DP GPUs. One such called miCS is already available in the 0.9.2 release. A second algorithm will be out soon, so be on the lookout.

@samadejacobs, FYI.

Replies: 3 comments 6 replies

Comment options

You must be logged in to vote
3 replies
@cccc0der
Comment options

@yunll
Comment options

@JianqiaoLu
Comment options

Answer selected by cccc0der
Comment options

You must be logged in to vote
1 reply
@cccc0der
Comment options

Comment options

You must be logged in to vote
2 replies
@learning-chip
Comment options

@JianqiaoLu
Comment options

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
6 participants