Change the repository type filter
All
Repositories list
83 repositories
VisionThink
Public- Official repository for VisionZip (CVPR 2025)
- Parametric Contrastive Learning (ICCV2021) & GPaCo (TPAMI 2023)
VisionReasoner
PublicThe official implement of "VisionReasoner: Unified Visual Perception and Reasoning via Reinforcement Learning"TGDPO
Public[ICML 2025] TGDPO: Harnessing Token-Level Reward Guidance for Enhancing Direct Preference OptimizationVideo-P2P
PublicVideo-P2P: Video Editing with Cross-attention ControlRL-GPT
PublicJenga
PublicMagicMirror
Public- Project Page For "Seg-Zero: Reasoning-Chain Guided Segmentation via Cognitive Reinforcement"
LLMGA
PublicThis project is the official implementation of 'LLMGA: Multimodal Large Language Model based Generation Assistant', ECCV2024 OralARPO
PublicMoTCoder
PublicThis is the official code repository of MoTCoder: Elevating Large Language Models with Modular of Thought for Challenging Programming Tasks.LSDBench
PublicA benchmark that focuses on the sampling dilemma in long-video tasks. Through well-designed tasks, it evaluates the sampling efficiency of long-video VLMs.Open-Code-Zero
PublicLISA
PublicProject Page for "LISA: Reasoning Segmentation via Large Language Model"Step-DPO
PublicLyra
PublicOfficial Implementation for "Lyra: An Efficient and Speech-Centric Framework for Omni-Cognition"LBGAT
PublicLearnable Boundary Guided Adversarial Training (ICCV2021)Mr-Ben
PublicControlNeXt
PublicTagCLIP
PublicLongLoRA
PublicCode and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)DiffComplete
PublicOfficial Codebase of "DiffComplete: Diffusion-based Generative 3D Shape Completion"PFENet
PublicLLaMA-VID
PublicPointGroup
PublicPrompt-Highlighter
Public[CVPR 2024] Prompt Highlighter: Interactive Control for Multi-Modal LLMsQ-LLM
PublicThis is the official repo of "QuickLLaMA: Query-aware Inference Acceleration for Large Language Models"