Change the repository type filter
All
Repositories list
17 repositories
VLM-R1
PublicSolve Visual Understanding with Reinforced VLMsOmAgent
Public- RS5M: a large-scale vision language dataset for remote sensing [TGRS]
open-agent-leaderboard
PublicReproducible Language Agent ResearchOmChat
PublicZoomEye
PublicZoomEye: Enhancing Multimodal LLMs with Human-Like Zooming Capabilities through Tree-Based Image ExplorationOmDet
PublicReal-time and accurate open-vocabulary end-to-end object detectionVL-CheckList
PublicEvaluating Vision & Language Pretraining Models with Objects, Attributes and Relations. [EMNLP 2022]OmModel
Publicawesome-RSVLM
PublicOVDEval
PublicA Comprehensive Evaluation Benchmark for Open-Vocabulary Detection (AAAI 2024)GroundVLP
PublicGroundVLP: Harnessing Zero-shot Visual Grounding from Vision-Language Pre-training and Open-Vocabulary Object Detection (AAAI 2024)habitat-lab
Public