    Repositories list

    • lmms-eval

      Public
      One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks
      Python
      3422.8k2423 Updated Jul 21, 2025
    • [ICCV 2025] Auto Interpretation Pipeline and many other functionalities for Multimodal SAE Analysis.
      Python
      814530 Updated Jul 11, 2025
    • sae

      Public
      A framework for applying Sparse AutoEncoders to any model
      Python
      13310 Updated Jul 11, 2025
    • MGPO

      Public
      High-Resolution Visual Reasoning via Multi-Turn Grounding-Based Reinforcement Learning
      13820 Updated Jul 9, 2025
    • MMSearch-R1 is an end-to-end RL framework that enables LMMs to perform on-demand, multi-turn search with real-world multimodal search tools.
      Python
      1126050 Updated Jul 3, 2025
    • VideoMMMU

      Public
      Python
      15000 Updated Jun 24, 2025
    • Open-source implementation of AlphaEvolve
      Python
      439200 Updated Jun 20, 2025
    • DeepEyes

      Public
      Python
      28300 Updated Jun 16, 2025
    • agent-rl

      Public
      A fork of verl that supports multi-turn tool use and other agentic tasks.
      Python
      25100 Updated Jun 14, 2025
    • Aero-1

      Public
      Python
      67630 Updated May 4, 2025
    • EgoLife

      Public
      [CVPR 2025] EgoLife: Towards Egocentric Life Assistant
      Python
      1830860 Updated Mar 19, 2025
    • LongVA

      Public
      Long Context Transfer from Language to Vision
      Python
      20384270 Updated Mar 18, 2025
    • .github

      Public
      0100 Updated Mar 7, 2025
    • A fork to add multimodal model training to open-r1
      Python
      671.3k231 Updated Feb 8, 2025
    • my-python-template

      Public template
      My template repo for setting up a new Python repo
      Python
      1000 Updated Dec 11, 2024
    • demos

      Public
      Python
      0000 Updated Sep 18, 2024
    • sglang

      Public
      SGLang is a structured generation language designed for large language models (LLMs). It makes your interaction with models faster and more controllable.
      Python
      2.4k400 Updated Sep 18, 2024
    • Otter

      Public
      🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.
      Python
      2103.3k621 Updated Mar 5, 2024
    • Relate Anything Model is capable of taking an image as input and utilizing SAM to identify the corresponding mask within the image.
      Python
      2245660 Updated Jul 4, 2023