Change the repository type filter
All
Repositories list
53 repositories
- [CVPR 2025] The code for "VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM"
- [ACL 2025] Analyzing LLMs' Multilingual Knowledge Boundary Cognition Across Languages Through the Lens of Internal Representations
VideoLLaMA3
PublicCMM
PublicSeaBench
PublicMMR1
PublicLongPO
PublicVideoLLaMA2
PublicInf-CLIP
Public[CVPR 2025 Highlight] The official CLIP training codebase of Inf-CL: "Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss". A super memory-efficiency CLIP training scheme.CoI-Agent
PublicLLM-R2
Publicmultilingual_analysis
PublicDiGIT
Public[NeurIPS 2024] Stabilize the Latent Space for Image Autoregressive Modeling: A Unified PerspectiveVCD
PublicAuto-Arena-LLMs
PublicWebDesignAgent
PublicLLM-argumentation
Public[ACL2024] Exploring the Potential of Large Language Models in Computational ArgumentationDAMO-SeaLLMs
PublicVideo-LLaMA
Public[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understandingchain-of-knowledge
PublicMultipurpose-Chatbot
PublicA chatbot UI for RAG, multimodal, text completion. (support Transformers, llama.cpp, MLX, vLLM)AdamergeX
PublicLLM_summeval
PublicHierEncDec
Public