Skip to content

DeepSpeed v0.5.0

Compare
Choose a tag to compare
@jeffra jeffra released this 17 Aug 05:29
· 1997 commits to master since this release
f284324
  • Mixture of Experts (MoE) support
  • Curriculum learning