Change the repository type filter
All
Repositories list
315 repositories
- A HTML5 video player with a parser that saves traffic
- A GUI Agent application based on UI-TARS(Vision-Lanuage Model) that allows you to control your computer using natural language.
pysisyphus
PublicValley
PublicrdbStore
Public字节跳动鸿蒙生态数据库组件,支撑字节系鸿蒙应用数据库相关能力。- PaSa -- an advanced paper search agent powered by large language models. It can autonomously make a series of decisions, including invoking search tools, reading papers, and selecting relevant references, to ultimately obtain comprehensive and accurate results for complex scholarly queries.
- ParticleSfM: Exploiting Dense Point Trajectories for Localizing Moving Cameras in the Wild. ECCV 2022.
- A new multi-shot video understanding benchmark Shot2Story with comprehensive video summaries and detailed shot-level captions.