    Repositories list

    • A framework for few-shot evaluation of language models.
      Python
      2.6k100 Updated Jul 18, 2025
    • s1

      Public
      s1: Simple test-time scaling
      Python
      7576.5k643 Updated Jun 25, 2025
    • JavaScript
      0200 Updated Feb 11, 2025