Skip to content
Change the repository type filter

All

    Repositories list

    • SteadyDancer: Harmonized and Coherent Human Image Animation with First-Frame Preservation
      Python
      612710Updated Nov 27, 2025Nov 27, 2025
    • SAM2-Plus

      Public
      SAM 2++: Tracking Anything at Any Granularity
      Python
      34411Updated Nov 27, 2025Nov 27, 2025
    • RGE

      Public
      Reasoning Guided Embeddings: Leveraging MLLM Reasoning for Improved Multimodal Retrieval
      Python
      0300Updated Nov 26, 2025Nov 26, 2025
    • steadydancer-web

      Public
      JavaScript
      0100Updated Nov 25, 2025Nov 25, 2025
    • MobileViCLIP

      Public
      [ICCV 2025] MobileViCLIP: An Efficient Video-Text Model for Mobile Devices
      Python
      0820Updated Nov 20, 2025Nov 20, 2025
    • UniAVGen

      Public
      HTML
      0300Updated Nov 6, 2025Nov 6, 2025
    • [NeurIPS 2025 Spotlight] StreamForest: Efficient Online Video Understanding with Persistent Event Memory
      Python
      47550Updated Nov 4, 2025Nov 4, 2025
    • JointFormer

      Public
      [TPAMI] JointFormer: A Unified Framework with Joint Modeling for Video Object Segmentation
      Python
      01000Updated Oct 21, 2025Oct 21, 2025
    • MeMOTR

      Public
      [ICCV 2023] MeMOTR: Long-Term Memory-Augmented Transformer for Multi-Object Tracking
      Python
      1720720Updated Oct 15, 2025Oct 15, 2025
    • MotionRAG

      Public
      [NeurIPS 2025] MotionRAG: Motion Retrieval-Augmented Image-to-Video Generation
      Python
      41810Updated Oct 9, 2025Oct 9, 2025
    • VideoChat-Online

      Public
      [CVPR 2025] Online Video Understanding: OVBench and VideoChat-Online
      Python
      373100Updated Oct 7, 2025Oct 7, 2025
    • ArbInterp-Web

      Public
      JavaScript
      0000Updated Oct 2, 2025Oct 2, 2025
    • PixNerd

      Public
      PixNerd: Pixel Neural Field Diffusion
      Python
      413440Updated Sep 15, 2025Sep 15, 2025
    • CycleACR

      Public
      [TPAMI-2025] CycleACR: Cycle Modeling of Actor-Context Relations for Video Action Detection
      Python
      0200Updated Sep 11, 2025Sep 11, 2025
    • DDT

      Public
      DDT: Decoupled Diffusion Transformer
      Python
      1632240Updated Aug 22, 2025Aug 22, 2025
    • MOTIP

      Public
      [CVPR 2025] Multiple Object Tracking as ID Prediction
      Python
      3141180Updated Aug 20, 2025Aug 20, 2025
    • VideoEval

      Public
      VideoEval: Comprehensive Benchmark Suite for Low-Cost Evaluation of Video Foundation Model
      Python
      01400Updated Jul 31, 2025Jul 31, 2025
    • Video-DC

      Public
      Python
      11110Updated Jul 30, 2025Jul 30, 2025
    • CaReBench

      Public
      A Fine-grained Benchmark for Video Captioning and Retrieval
      Python
      22330Updated Jul 16, 2025Jul 16, 2025
    • [ICML 2025] Differentiable Solver Search for Fast Diffusion Sampling
      Python
      02110Updated Jul 7, 2025Jul 7, 2025
    • p-MoD

      Public
      [ICCV 2025] p-MoD: Building Mixture-of-Depths MLLMs via Progressive Ratio Decay
      Python
      24310Updated Jun 26, 2025Jun 26, 2025
    • DEQDet

      Public
      [ICCV 2023] Deep Equilibrium Object Detection
      Jupyter Notebook
      12610Updated Jun 18, 2025Jun 18, 2025
    • SORCE

      Public
      Small Object Retrieval in Complex Environments (SORCE)
      Python
      1500Updated Jun 2, 2025Jun 2, 2025
    • DMM

      Public
      DMM: Building a Versatile Image Generation Model via Distillation-Based Model Merging
      Python
      44530Updated Apr 27, 2025Apr 27, 2025
    • Tra-MoE

      Public
      [CVPR 2025] Tra-MoE: Learning Trajectory Prediction Model from Multiple Domains for Adaptive Policy Conditioning
      Python
      35100Updated Apr 1, 2025Apr 1, 2025
    • TPM

      Public
      [WACV 2025 Oral] Transferring Foundation Models for Generalizable Robotic Manipulation
      Python
      02300Updated Mar 28, 2025Mar 28, 2025
    • MoG_Web

      Public
      JavaScript
      0000Updated Mar 11, 2025Mar 11, 2025
    • MoG-VFI

      Public
      Motion-Aware Generative Frame Interpolation
      Python
      34130Updated Mar 11, 2025Mar 11, 2025
    • HTML
      0100Updated Jan 13, 2025Jan 13, 2025
    • FlowDCN

      Public
      [NeurIPS 2024] Exploring DCN-like Architectures for Fast Image Generation with Arbitrary Resolution
      Python
      13400Updated Dec 23, 2024Dec 23, 2024