yejun688/iccv-2025-oral-papers

ICCV 2025 oral papers

👋 hello

The International Conference on Computer Vision (ICCV) is a massive conference. In 2025 alone, 11,239 papers were submitted and 2,699 were accepted, an acceptance rate of 24%. Oral presentations went to only 64 papers: the top 0.57% (64/11,239) of all submissions. I created this repository to help you search for the crème de la crème of ICCV publications. If the paper you are looking for is not on my short list, take a peek at the full list of accepted papers. Some authors have not yet posted their work on arXiv or made a GitHub repository available, so the list here may be incomplete or contain errors. We are updating it, and your patience is greatly appreciated. For the complete version, please refer to the official Oral list.
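As a quick sanity check, the percentages can be recomputed from the three counts quoted above. This is only a minimal sketch: the variable names and the `share` helper are illustrative and not part of this repository.

```python
# Counts quoted in the paragraph above.
SUBMITTED = 11_239
ACCEPTED = 2_699
ORALS = 64


def share(part: int, whole: int) -> float:
    """Return part / whole expressed as a percentage."""
    return 100.0 * part / whole


acceptance_rate = share(ACCEPTED, SUBMITTED)
oral_share_of_submissions = share(ORALS, SUBMITTED)
oral_share_of_accepted = share(ORALS, ACCEPTED)

print(f"acceptance rate:           {acceptance_rate:.2f}%")
print(f"orals among submissions:   {oral_share_of_submissions:.2f}%")
print(f"orals among accepted:      {oral_share_of_accepted:.2f}%")
```

The last figure is not stated in the text; it is included only to show how selective the oral track is relative to accepted papers.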

πŸ—žοΈ paperss

| topic | title | repository / paper |
| --- | --- | --- |
| 3D from multi-view and sensors | Multi-View 3D Point Tracking | GitHub arXiv |
| 3D from multi-view and sensors | SceneSplat: Gaussian Splatting-based Scene Understanding with Vision-Language Pretraining | GitHub arXiv |
| 3D from multi-view and sensors | WIR3D: Visually-Informed and Geometry-Aware 3D Shape Abstraction | GitHub arXiv |
| 3D from multi-view and sensors | SparseFlex: High-Resolution and Arbitrary-Topology 3D Shape Modeling | GitHub arXiv |
| 3D from multi-view and sensors | Uncalibrated Structure from Motion on a Sphere | GitHub arXiv |
| Geometric Computer Vision | Forecasting Continuous Non-Conservative Dynamical Systems in SO(3) | GitHub arXiv |
| Geometric Computer Vision | Diving into the Fusion of Monocular Priors for Generalized Stereo Matching | GitHub arXiv |
| Geometric Computer Vision | RePoseD: Efficient Relative Pose Estimation With Known Depth Information | GitHub arXiv |
| Geometric Computer Vision | Deterministic Object Pose Confidence Region Estimation | GitHub arXiv |
| 3D Human Pose Estimation | DPoser-X: Diffusion Model as Robust 3D Whole-body Human Pose Prior | GitHub arXiv |
| 3D head avatar modeling | HairCUP: Hair Compositional Universal Prior for 3D Gaussian Avatars | GitHub arXiv |
| 3D head avatar modeling | FixTalk: Taming Identity Leakage for High-Quality Talking Head Generation in Extreme Cases | GitHub arXiv |
| 3D Scene Understanding | Distilling Diffusion Models to Efficient 3D LiDAR Scene Completion | GitHub arXiv |
| 3D Scene Understanding | SuperDec: 3D Scene Decomposition with Superquadric Primitives | GitHub arXiv |
| 3D Scene Understanding | ForestFormer3D: A Unified Framework for End-to-End Segmentation of Forest LiDAR 3D Point Clouds | GitHub arXiv |
| 3D Scene Understanding | GT-Loc: Unifying When and Where in Images Through a Joint Embedding Space | GitHub arXiv |
| NeRF | Self-Ensembling Gaussian Splatting for Few-Shot Novel View Synthesis | GitHub arXiv |
| NeRF | RayZer: A Self-supervised Large View Synthesis Model | GitHub arXiv |
| Gaussian Splatting | EVER: Exact Volumetric Ellipsoid Rendering for Real-time View Synthesis | GitHub arXiv |
| Embodied AI | Learning Streaming Video Representation via Multitask Training | GitHub arXiv |
| Generative AI | TrajectoryCrafter: Redirecting Camera Trajectory for Monocular Videos via Diffusion Models | GitHub arXiv |
| Generative AI | MaskControl: Spatio-Temporal Control for Masked Motion Synthesis | GitHub arXiv |
| Generative AI | Dynamic Typography: Bringing Text to Life via Video Diffusion Prior | GitHub arXiv |
| Generative AI | Generating Physically Stable and Buildable Brick Structures from Text | GitHub arXiv |
| Generative AI | LaRender: Training-Free Occlusion Control in Image Generation via Latent Rendering | GitHub arXiv |
| Generative AI | ReCamMaster: Camera-Controlled Generative Rendering from A Single Video | GitHub arXiv |
| Generative AI | Diffusion Image Prior | GitHub arXiv |
| Generative AI | LayerTracer: Cognitive-Aligned Layered SVG Synthesis via Diffusion Transformer | GitHub arXiv |
| Generative AI | LOTS of Fashion! Multi-Conditioning for Image Generation via Sketch-Text Pairing | GitHub arXiv |
| Generative AI | MikuDance: Animating Character Art with Mixed Motion Dynamics | GitHub arXiv |
| Segmentation | CorrCLIP: Reconstructing Patch Correlations in CLIP for Open-Vocabulary Semantic Segmentation | GitHub arXiv |
| Segmentation | Easy3D: A Simple Yet Effective Method for 3D Interactive Segmentation | GitHub arXiv |
| Segmentation | E-SAM: Training-Free Segment Every Entity Model | GitHub arXiv |
| Robotics | Back on Track: Bundle Adjustment for Dynamic Scene Reconstruction | GitHub arXiv |
| Robotics | Image as an IMU: Estimating Camera Motion from a Single Motion-Blurred Image | GitHub arXiv |
| Robotics | Certifiably Optimal Anisotropic Rotation Averaging | GitHub arXiv |
| Robotics | Moto: Latent Motion Token as the Bridging Language for Learning Robot Manipulation from Videos | GitHub arXiv |
| Vision Foundation Models | LoftUp: Learning a Coordinate-Based Feature Upsampler for Vision Foundation Models | GitHub arXiv |
| Vision Foundation Models | Towards a Unified Copernicus Foundation Model for Earth Vision | GitHub arXiv |
| Vision Foundation Models | RS-vHeat: Heat Conduction Guided Efficient Remote Sensing Foundation Model | GitHub arXiv |
| Vision Language Models | Online Reasoning Video Segmentation with Just-in-Time Digital Twins | GitHub arXiv |
| Multimodal LLMs | Token Activation Map to Visually Explain Multimodal LLMs | GitHub arXiv |
| Multimodal LLMs | Scaling Laws for Native Multimodal Models | GitHub arXiv |
| Multimodal Learning | Differential-informed Sample Selection Accelerates Multimodal Contrastive Learning | GitHub arXiv |
| Multimodal Learning | Understanding Co-speech Gestures in-the-wild | GitHub arXiv |
| Multimodal Learning | Differentiable Room Acoustic Rendering with Multi-View Vision Priors | GitHub arXiv |
| Object Detection | Counting Stacked Objects | GitHub arXiv |
| Object Detection | Automated Model Evaluation for Object Detection via Prediction Consistency and Reliability | GitHub arXiv |
| Image Editing | FlowEdit: Inversion-Free Text-Based Editing Using Pre-Trained Flow Models | GitHub arXiv |
| Video Restoration & Enhancement | MIORe & VAR-MIORe: Benchmarks to Push the Boundaries of Restoration | GitHub arXiv |
| Trustworthy AI | Soft Local Completeness: Rethinking Completeness in Model Explainability | GitHub arXiv |
| Image Retrieval | Learning Visual Hierarchies in Hyperbolic Space for Image Retrieval | GitHub arXiv |
| Model Acceleration | Variance-Based Pruning for Accelerating and Compressing Trained Networks | GitHub arXiv |
| Model Acceleration | Importance-Based Token Merging for Efficient Image and Video Generation | GitHub arXiv |

🦸 contribution

We would love your help in making this repository even better! If you know of an amazing paper that isn't listed here, or if you have any suggestions for improvement, feel free to open an issue or submit a pull request.

About

😎 A curated list of ICCV 2025 oral papers. In progress.
