Change the repository type filter
All
Repositories list
89 repositories
Awesome-Video-Diffusion
PublicA curated list of recent diffusion models for video generation, editing, and various other applications.MovieBench
PublicPhotoDoodle
PublicROICtrl
PublicMovieAgent
Public- [CVPR 2025] Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use.
VLog
Public[CVPR 2025] Video Narration as Vocabulary & Video as Long Document- [ICLR 2025] Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.
GUI-Thinker
PublicEnable AI to control your PC. This repo includes the WorldGUI Benchmark and GUI-Thinker Agent Framework.MovieSeq
PublicSMS
PublicSAM-I2V
Publiccomputer_use_ootb
PublicAwesome-GUI-Agent
Public💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.VideoGUI
PublicDoraCycle
PublicLOVA3
Public(NeurIPS 2024) Official PyTorch implementation of LOVA3Impossible-Videos
PublicMakeAnything
PublicInterFeedback
PublicDiffSim
PublicwhisperV
PublicUniMoD
PublicFQGAN
PublicTune-An-Ellipse
Public