Change the repository type filter
All
Repositories list
53 repositories
General-Reasoner
PublicMMLU-Pro
Public- This repo contains the code for "VLM2Vec: Training Vision-Language Models for Massive Multimodal Embedding Tasks" [ICLR 2025]
BrowserAgent
PublicQuickVideo
PublicQuick Long Video UnderstandingVideoScore2
PublicHierarchical-Reasoner
PublicCritique-Coder
PublicImagenHub
PublicA one-stop library to standardize the inference and evaluation of all the conditional image generation models. [ICLR 2024]VideoEval-Pro
PublicMore reliable Video Understanding EvaluationStructEval
PublicVisCoder
PublicPixelWorld
PublicOne-Shot-CFT
PublicVisualWebInstruct
PublicABC
PublicABC: Achieving Better Control of Multimodal Embeddings using VLMs [TMLR2025]Vamba
PublicTheoremExplainAgent
PublicOfficial Repo for "TheoremExplainAgent: Towards Video-based Multimodal Explanations for LLM Theorem Understanding" [ACL 2025 oral]CritiqueFineTuning
PublicCode for "Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate" [COLM 2025]ScholarCopilot
PublicMEGA-Bench
PublicThis repo contains the code for "MEGA-Bench Scaling Multimodal Evaluation to over 500 Real-World Tasks" [ICLR 2025]DisProtEdit
PublicVL-Rethinker
PublicAceCoder
Public