Repositories list
82 repositories
TGDPO (Public): [ICML 2025] TGDPO: Harnessing Token-Level Reward Guidance for Enhancing Direct Preference Optimization
- The official implementation of "VisionReasoner: Unified Visual Perception and Reasoning via Reinforcement Learning"
Video-P2P (Public): Video-P2P: Video Editing with Cross-attention Control
RL-GPT (Public)
Jenga (Public)
MagicMirror (Public)
Logits-Based-Finetuning (Public)
- Project Page For "Seg-Zero: Reasoning-Chain Guided Segmentation via Cognitive Reinforcement"
LLMGA (Public): This project is the official implementation of 'LLMGA: Multimodal Large Language Model based Generation Assistant', ECCV2024 Oral
ARPO (Public)
VisionZip (Public): Official repository for VisionZip (CVPR 2025)
MoTCoder (Public): This is the official code repository of MoTCoder: Elevating Large Language Models with Modular of Thought for Challenging Programming Tasks.
LSDBench (Public): A benchmark that focuses on the sampling dilemma in long-video tasks. Through well-designed tasks, it evaluates the sampling efficiency of long-video VLMs.
Open-Code-Zero (Public)
LISA (Public): Project Page for "LISA: Reasoning Segmentation via Large Language Model"
Step-DPO (Public)
Lyra (Public): Official Implementation for "Lyra: An Efficient and Speech-Centric Framework for Omni-Cognition"
LBGAT (Public): Learnable Boundary Guided Adversarial Training (ICCV2021)
Mr-Ben (Public)
- Parametric Contrastive Learning (ICCV2021) & GPaCo (TPAMI 2023)
ControlNeXt (Public)
TagCLIP (Public)
LongLoRA (Public): Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)
DiffComplete (Public): Official Codebase of "DiffComplete: Diffusion-based Generative 3D Shape Completion"
PFENet (Public)
LLaMA-VID (Public)
PointGroup (Public)
Prompt-Highlighter (Public): [CVPR 2024] Prompt Highlighter: Interactive Control for Multi-Modal LLMs
Q-LLM (Public): This is the official repo of "QuickLLaMA: Query-aware Inference Acceleration for Large Language Models"
VFIformer (Public)