Vincentqyw/cv-arxiv-daily

Updated on 2025.04.28

Usage instructions: here

Table of Contents
  1. SLAM
  2. SFM
  3. Visual Localization
  4. Keypoint Detection
  5. Image Matching
  6. NeRF

SLAM

Publish Date Title Authors PDF Code
2025-04-24 BIM-Constrained Optimization for Accurate Localization and Deviation Correction in Construction Monitoring Asier Bikandi et.al. 2504.17693 null
2025-04-24 Occlusion-Aware Self-Supervised Monocular Depth Estimation for Weak-Texture Endoscopic Images Zebo Huang et.al. 2504.17582 null
2025-04-24 Bias-Eliminated PnP for Stereo Visual Odometry: Provably Consistent and Large-Scale Localization Guangyang Zeng et.al. 2504.17410 null
2025-04-24 EdgePoint2: Compact Descriptors for Superior Efficiency and Accuracy Haodi Yao et.al. 2504.17280 null
2025-04-23 ToF-Splatting: Dense SLAM using Sparse Time-of-Flight Depth and Multi-Frame Integration Andrea Conti et.al. 2504.16545 null
2025-04-22 DERD-Net: Learning Depth from Event-based Ray Densities Diego de Oliveira Hitzges et.al. 2504.15863 null
2025-04-23 SLAM-Based Navigation and Fault Resilience in a Surveillance Quadcopter with Embedded Vision Systems Abhishek Tyagi et.al. 2504.15305 null
2025-04-20 Back on Track: Bundle Adjustment for Dynamic Scene Reconstruction Weirong Chen et.al. 2504.14516 null
2025-04-20 SG-Reg: Generalizable and Efficient Scene Graph Registration Chuhao Liu et.al. 2504.14440 link
2025-04-19 Unreal Robotics Lab: A High-Fidelity Robotics Simulator with Advanced Physics and Rendering Jonathan Embley-Riches et.al. 2504.14135 null
2025-04-16 An Online Adaptation Method for Robust Depth Estimation and Visual Odometry in the Open World Xingwu Ji et.al. 2504.11698 link
2025-04-18 Doppler-SLAM: Doppler-Aided Radar-Inertial and LiDAR-Inertial Simultaneous Localization and Mapping Dong Wang et.al. 2504.11634 link
2025-04-14 Region Based SLAM-Aware Exploration: Efficient and Robust Autonomous Mapping Strategy That Can Scale Megha Maheshwari et.al. 2504.10416 null
2025-04-14 RoboCup Rescue 2025 Team Description Paper UruBots Kevin Farias et.al. 2504.09778 null
2025-04-11 FindAnything: Open-Vocabulary and Object-Centric Mapping for Robot Exploration in Any Environment Sebastián Barbas Laina et.al. 2504.08603 null
2025-04-11 PNE-SGAN: Probabilistic NDT-Enhanced Semantic Graph Attention Network for LiDAR Loop Closure Detection Xiong Li et.al. 2504.08280 null
2025-04-11 II-NVM: Enhancing Map Accuracy and Consistency with Normal Vector-Assisted Mapping Chengwei Zhao et.al. 2504.08204 link
2025-04-10 UWB Anchor Based Localization of a Planetary Rover Andreas Nüchter et.al. 2504.07658 null
2025-04-10 Event Signal Filtering via Probability Flux Estimation Jinze Chen et.al. 2504.07503 null
2025-04-07 Embracing Dynamics: Dynamics-aware 4D Gaussian Splatting SLAM Zhicong Sun et.al. 2504.04844 link
2025-04-06 SELC: Self-Supervised Efficient Local Correspondence Learning for Low Quality Images Yuqing Wang et.al. 2504.04497 null
2025-04-06 VSLAM-LAB: A Comprehensive Framework for Visual SLAM Methods and Datasets Alejandro Fontan et.al. 2504.04457 link
2025-04-05 Nonlinear Observer Design for Landmark-Inertial Simultaneous Localization and Mapping Mouaad Boughellaba et.al. 2504.04239 null
2025-04-04 WildGS-SLAM: Monocular Gaussian Splatting SLAM in Dynamic Environments Jianhao Zheng et.al. 2504.03886 null
2025-04-03 SLACK: Attacking LiDAR-based SLAM with Adversarial Point Injections Prashant Kumar et.al. 2504.03089 null
2025-04-03 Multimodal Fusion and Vision-Language Models: A Survey for Robot Vision Xiaofeng Han et.al. 2504.02477 null
2025-04-03 MonoGS++: Fast and Accurate Monocular RGB Gaussian SLAM Renwu Li et.al. 2504.02437 null
2025-04-02 A Chefs KISS -- Utilizing semantic information in both ICP and SLAM framework Sven Ochs et.al. 2504.02086 null
2025-04-01 Semantic SLAM with Rolling-Shutter Cameras and Low-Precision INS in Outdoor Environments Yuchen Zhang et.al. 2504.01997 null
2025-04-02 Strengthening Multi-Robot Systems for SAR: Co-Designing Robotics and Communication Towards 6G Juan Bravo-Arrabal et.al. 2504.01940 null
2025-04-02 Dynamic Initialization for LiDAR-inertial SLAM Jie Xu et.al. 2504.01451 link
2025-04-02 ForestVO: Enhancing Visual Odometry in Forest Environments through ForestGlue Thomas Pritchard et.al. 2504.01261 link
2025-03-31 SuperEvent: Cross-Modal Learning of Event-based Keypoint Detection Yannick Burkhardt et.al. 2504.00139 null
2025-03-30 A Visual-Inertial Motion Prior SLAM for Dynamic Environments Weilong Sun et.al. 2503.23429 null
2025-03-30 AnyCam: Learning to Recover Camera Poses and Intrinsics from Casual Videos Felix Wimbauer et.al. 2503.23282 link
2025-03-27 HS-SLAM: Hybrid Representation with Structural Supervision for Improved Dense SLAM Ziren Gong et.al. 2503.21778 null
2025-03-27 STAMICS: Splat, Track And Map with Integrated Consistency and Semantics for Dense RGB-D SLAM Yongxu Wang et.al. 2503.21425 null
2025-03-25 Scene-agnostic Pose Regression for Visual Localization Junwei Zheng et.al. 2503.19543 null
2025-03-25 First Results on UAV-aided User Localization Using ToA and OpenAirInterface in 5G NR Omid Esrafilian et.al. 2503.19529 null
2025-03-25 MM-LINS: a Multi-Map LiDAR-Inertial System for Over-Degenerate Environments Yongxin Ma et.al. 2503.19506 link
2025-03-24 Cooperative Control of Multi-Quadrotors for Transporting Cable-Suspended Payloads: Obstacle-Aware Planning and Event-Based Nonlinear Model Predictive Control Tohid Kargar Tasooji et.al. 2503.19135 null
2025-03-24 GI-SLAM: Gaussian-Inertial SLAM Xulang Liu et.al. 2503.18275 null
2025-03-22 LightLoc: Learning Outdoor LiDAR Localization at Light Speed Wen Li et.al. 2503.17814 link
2025-03-21 Autonomous Exploration-Based Precise Mapping for Mobile Robots through Stepwise and Consistent Motions Muhua Zhang et.al. 2503.17005 null
2025-03-20 4D Gaussian Splatting SLAM Yanyan Li et.al. 2503.16710 null
2025-03-20 Speeding up design and making to reduce time-to-project and time-to-market: an AI-Enhanced approach in engineering education Giovanni Adorni et.al. 2503.16307 null
2025-03-20 Loop Closure from Two Views: Revisiting PGO for Scalable Trajectory Estimation through Monocular Priors Tian Yi Lim et.al. 2503.16275 null
2025-03-19 A Sigma Point-based Low Complexity Algorithm for Multipath-based SLAM in MIMO Systems Anna Masiero et.al. 2503.15286 null
2025-03-19 ChatStitch: Visualizing Through Structures via Surround-View Unsupervised Deep Image Stitching with Collaborative LLM-Agents Hao Liang et.al. 2503.14948 null
2025-03-18 3D Densification for Multi-Map Monocular VSLAM in Endoscopy X. Anadón et.al. 2503.14346 null
2025-03-18 GeoFlow-SLAM: A Robust Tightly-Coupled RGBD-Inertial Fusion SLAM for Dynamic Legged Robotics Tingyang Xiao et.al. 2503.14247 link
2025-03-18 A-SCoRe: Attention-based Scene Coordinate Regression for wide-ranging scenarios Huy-Hoang Bui et.al. 2503.13982 link
2025-03-17 Digital Beamforming Enhanced Radar Odometry Jingqi Jiang et.al. 2503.13252 null
2025-03-17 Dynamic-Dark SLAM: RGB-Thermal Cooperative Robot Vision Strategy for Multi-Person Tracking in Both Well-Lit and Low-Light Scenes Tatsuro Sakai et.al. 2503.12768 null
2025-03-16 KISS-SLAM: A Simple, Robust, and Accurate 3D LiDAR SLAM System With Enhanced Generalization Capabilities Tiziano Guadagnino et.al. 2503.12660 null
2025-03-16 Deblur Gaussian Splatting SLAM Francesco Girlanda et.al. 2503.12572 null
2025-03-16 M2UD: A Multi-model, Multi-scenario, Uneven-terrain Dataset for Ground Robot with Localization and Mapping Evaluation Yanpeng Jia et.al. 2503.12387 null
2025-03-13 OSMa-Bench: Evaluating Open Semantic Mapping Under Varying Lighting Conditions Maxim Popov et.al. 2503.10331 null
2025-03-12 Online Language Splatting Saimouli Katragadda et.al. 2503.09447 null
2025-03-12 MonoSLAM: Robust Monocular SLAM with Global Structure Optimization Bingzheng Jiang et.al. 2503.09296 null
2025-03-11 Keypoint Detection and Description for Raw Bayer Images Jiakai Lin et.al. 2503.08673 null
2025-03-11 GigaSLAM: Large-Scale Monocular SLAM with Hierarchical Gaussian Splats Kai Deng et.al. 2503.08071 link
2025-03-10 POp-GS: Next Best View in 3D-Gaussian Splatting with P-Optimality Joey Wilson et.al. 2503.07819 null
2025-03-08 HIPPO-MAT: Decentralized Task Allocation Using GraphSAGE and Multi-Agent Deep Reinforcement Learning Lavanya Ratnabala et.al. 2503.07662 null
2025-03-10 AirSwarm: Enabling Cost-Effective Multi-UAV Research with COTS drones Xiaowei Li et.al. 2503.06890 link
2025-03-08 InfoFusion Controller: Informed TRRT Star with Mutual Information based on Fusion of Pure Pursuit and MPC for Enhanced Path Planning Seongjun Choi et.al. 2503.06010 link
2025-03-07 THE-SEAN: A Heart Rate Variation-Inspired Temporally High-Order Event-Based Visual Odometry with Self-Supervised Spiking Event Accumulation Networks Chaoran Xiong et.al. 2503.05112 null
2025-03-07 Adaptive-LIO: Enhancing Robustness and Precision through Environmental Adaptation in LiDAR Inertial Odometry Chengwei Zhao et.al. 2503.05077 link
2025-03-06 MarsLGPR: Mars Rover Localization with Ground Penetrating Radar Anja Sheppard et.al. 2503.04944 null
2025-03-06 On the Connection Between Magnetic-Field Odometry Aided Inertial Navigation and Magnetic-Field SLAM Isaac Skog et.al. 2503.04286 null
2025-03-06 Geometry-Constrained Monocular Scale Estimation Using Semantic Segmentation for Dynamic Scenes Hui Zhang et.al. 2503.04235 null
2025-03-06 DVM-SLAM: Decentralized Visual Monocular Simultaneous Localization and Mapping for Multi-Agent Systems Joshua Bird et.al. 2503.04126 null
2025-03-05 Equivariant Filter Design for Range-only SLAM Yixiao Ge et.al. 2503.03973 null
2025-03-05 Direct Sparse Odometry with Continuous 3D Gaussian Maps for Indoor Environments Jie Deng et.al. 2503.03373 link
2025-03-05 OpenGV 2.0: Motion prior-assisted calibration and SLAM with vehicle-mounted surround-view systems Kun Huang et.al. 2503.03230 null
2025-03-05 Distributed Certifiably Correct Range-Aided SLAM Alexander Thoms et.al. 2503.03192 link
2025-03-04 Introspective Loop Closure for SLAM with 4D Imaging Radar Maximilian Hilger et.al. 2503.02383 null
2025-03-04 DQO-MAP: Dual Quadrics Multi-Object mapping with Gaussian Splatting Haoyuan Li et.al. 2503.02223 link
2025-03-03 Constraint-Based Modeling of Dynamic Entities in 3D Scene Graphs for Robust SLAM Marco Giberna et.al. 2503.02050 null
2025-03-03 vS-Graphs: Integrating Visual SLAM and Situational Graphs through Multi-level Scene Understanding Ali Tourani et.al. 2503.01783 null
2025-03-03 MUSt3R: Multi-view Network for Stereo 3D Reconstruction Yohann Cabon et.al. 2503.01661 null
2025-03-03 OpenGS-SLAM: Open-Set Dense Semantic SLAM with 3D Gaussian Splatting for Object-Level Scene Understanding Dianyi Yang et.al. 2503.01646 null
2025-03-03 MLINE-VINS: Robust Monocular Visual-Inertial SLAM With Flow Manhattan and Line Features Chao Ye et.al. 2503.01571 link
2025-03-03 AI-Driven Relocation Tracking in Dynamic Kitchen Environments Arash Nasr Esfahani et.al. 2503.01547 link
2025-03-03 Exo-ViHa: A Cross-Platform Exoskeleton System with Visual and Haptic Feedback for Efficient Dexterous Skill Learning Xintao Chao et.al. 2503.01543 null
2025-03-03 RUSSO: Robust Underwater SLAM with Sonar Optimization against Visual Degradation Shu Pan et.al. 2503.01434 null
2025-02-27 BEV-DWPVO: BEV-based Differentiable Weighted Procrustes for Low Scale-drift Monocular Visual Odometry on Ground Yufei Wei et.al. 2502.20078 null
2025-02-26 Increasing the Task Flexibility of Heavy-Duty Manipulators Using Visual 6D Pose Estimation of Objects Petri Mäkinen et.al. 2502.19169 null
2025-02-26 SLAM in the Dark: Self-Supervised Learning of Pose, Depth and Loop-Closure from Thermal Images Yangfan Xu et.al. 2502.18932 null
2025-02-25 S-Graphs 2.0 -- A Hierarchical-Semantic Optimization and Loop Closure for SLAM Hriday Bavle et.al. 2502.18044 link
2025-02-25 MegaLoc: One Retrieval to Place Them All Gabriele Berton et.al. 2502.17237 link
2025-02-24 SLABIM: A SLAM-BIM Coupled Dataset in HKUST Main Building Haoming Huang et.al. 2502.16856 link
2025-02-27 Orchestrating Joint Offloading and Scheduling for Low-Latency Edge SLAM Yao Zhang et.al. 2502.16495 null
2025-02-19 Slamming: Training a Speech Language Model on One GPU in a Day Gallil Maimon et.al. 2502.15814 link
2025-02-21 RGB-Only Gaussian Splatting SLAM for Unbounded Outdoor Scenes Sicheng Yu et.al. 2502.15633 null
2025-02-20 Hier-SLAM++: Neuro-Symbolic Semantic SLAM with a Hierarchically Categorical Gaussian Splatting Boying Li et.al. 2502.14931 null
2025-02-19 3D Gaussian Splatting aided Localization for Large and Complex Indoor-Environments Vincent Ress et.al. 2502.13803 null
2025-02-19 Active Illumination for Visual Ego-Motion Estimation in the Dark Francesco Crocetti et.al. 2502.13708 null
2025-02-17 From Gaming to Research: GTA V for Synthetic Data Generation for Robotics and Navigations Matteo Scucchia et.al. 2502.12303 null
2025-02-19 pySLAM: An Open-Source, Modular, and Extensible Framework for SLAM Luigi Freda et.al. 2502.11955 link
2025-02-17 Anti-Degeneracy Scheme for Lidar SLAM based on Particle Filter in Geometry Feature-Less Environments Yanbin Li et.al. 2502.11486 null
2025-02-16 GS-GVINS: A Tightly-integrated GNSS-Visual-Inertial Navigation System Augmented by 3D Gaussian Splatting Zelin Zhou et.al. 2502.10975 null
2025-02-19 MonoForce: Learnable Image-conditioned Physics Engine Ruslan Agishev et.al. 2502.10156 link
2025-02-13 Vision-based Geo-Localization of Future Mars Rotorcraft in Challenging Illumination Conditions Dario Pisanti et.al. 2502.09795 null
2025-02-13 DenseSplat: Densifying Gaussian Splatting SLAM with Neural Radiance Prior Mingrui Li et.al. 2502.09111 null
2025-02-12 LIR-LIVO: A Lightweight, Robust LiDAR/Vision/Inertial Odometry with Illumination-Resilient Deep Features Shujie Zhou et.al. 2502.08676 link
2025-02-10 Occupancy-SLAM: An Efficient and Robust Algorithm for Simultaneously Optimizing Robot Poses and Occupancy Map Yingyu Wang et.al. 2502.06292 link
2025-02-09 PINGS: Gaussian Splatting Meets Distance Fields within a Point-Based Implicit Neural Map Yue Pan et.al. 2502.05752 link
2025-02-07 Joint State and Noise Covariance Estimation Kasra Khosoussi et.al. 2502.04584 null
2025-02-05 GARAD-SLAM: 3D GAussian splatting for Real-time Anti Dynamic SLAM Mingrui Li et.al. 2502.03228 null
2025-02-04 SiLVR: Scalable Lidar-Visual Radiance Field Reconstruction with Uncertainty Quantification Yifu Tao et.al. 2502.02657 null
2025-02-04 HeRCULES: Heterogeneous Radar Dataset in Complex Urban Environment for Multi-session Radar SLAM Hanjun Kim et.al. 2502.01946 null
2025-02-03 Statistical enhance learning for modeling and prediction tennis matches at Grand Slam tournaments Nourah Buhamra et.al. 2502.01613 null
2025-02-03 Enhancing Feature Tracking Reliability for Visual Navigation using Real-Time Safety Filter Dabin Kim et.al. 2502.01092 null
2025-02-01 FlexCloud: Direct, Modular Georeferencing and Drift-Correction of Point Cloud Maps Maximilian Leitenstern et.al. 2502.00395 link
2025-01-31 LiDAR Loop Closure Detection using Semantic Graphs with Graph Attention Networks Liudi Yang et.al. 2501.19382 link
2025-01-31 Advancing Dense Endoscopic Reconstruction with Gaussian Splatting-driven Surface Normal-aware Tracking and Mapping Yiming Huang et.al. 2501.19319 link
2025-01-31 GO: The Great Outdoors Multimodal Dataset Peng Jiang et.al. 2501.19274 null
2025-01-30 Lifelong 3D Mapping Framework for Hand-held & Robot-mounted LiDAR Mapping Systems Liudi Yang et.al. 2501.18110 null
2025-01-28 SSF-PAN: Semantic Scene Flow-Based Perception for Autonomous Navigation in Traffic Scenarios Yinqi Chen et.al. 2501.16754 null
2025-01-27 Visual-Lidar Map Alignment for Infrastructure Inspections Jake McLaughlin et.al. 2501.14486 link
2025-01-24 Scalable Benchmarking and Robust Learning for Noise-Free Ego-Motion and 3D Reconstruction from Noisy Video Xiaohao Xu et.al. 2501.14319 link
2025-01-24 HAMMER: Heterogeneous, Multi-Robot Semantic Gaussian Splatting Javier Yu et.al. 2501.14147 null
2025-01-23 FAST-LIVO2 on Resource-Constrained Platforms: LiDAR-Inertial-Visual Odometry with Efficient Memory and Computation Bingyang Zhou et.al. 2501.13876 null
2025-01-23 VIGS SLAM: IMU-based Large-Scale 3D Gaussian Splatting SLAM Gyuhyeon Pak et.al. 2501.13402 null
2025-01-22 Grid-based Submap Joining: An Efficient Algorithm for Simultaneously Optimizing Global Occupancy Map and Local Submap Frames Yingyu Wang et.al. 2501.12764 null
2025-01-21 DynoSAM: Open-Source Smoothing and Mapping Framework for Dynamic SLAM Jesse Morris et.al. 2501.11893 link
2025-01-21 Survey on Monocular Metric Depth Estimation Jiuling Zhang et.al. 2501.11841 null
2025-01-19 OpenLiDARMap: Zero-Drift Point Cloud Mapping using Map Priors Dominik Kulmer et.al. 2501.11111 link
2025-01-19 Factor Graph-Based Active SLAM for Spacecraft Proximity Operations Lorenzo Ticozzi et.al. 2501.10950 null
2025-01-23 Mesh2SLAM in VR: A Fast Geometry-Based SLAM Framework for Rapid Prototyping in Virtual Reality Applications Carlos Augusto Pinheiro de Sousa et.al. 2501.09600 null
2025-01-16 Comparison of Various SLAM Systems for Mobile Robot in an Indoor Environment Maksim Filipenko et.al. 2501.09490 null
2025-01-15 Unified Few-shot Crack Segmentation and its Precise 3D Automatic Measurement in Concrete Structures Pengru Deng et.al. 2501.09203 null
2025-01-15 AutoLoop: Fast Visual SLAM Fine-tuning through Agentic Curriculum Learning Assaf Lahiany et.al. 2501.09160 null
2025-01-15 SLC$^2$-SLAM: Semantic-guided Loop Closure with Shared Latent Code for NeRF SLAM Yuhang Ming et.al. 2501.08880 null
2025-01-15 GS-LIVO: Real-Time LiDAR, Inertial, and Visual Multi-sensor Fused Odometry with Gaussian Mapping Sheng Hong et.al. 2501.08672 null
2025-01-16 BRIGHT-VO: Brightness-Guided Hybrid Transformer for Visual Odometry with Multi-modality Refinement Module Dongzhihan Wang et.al. 2501.08659 null
2025-01-15 Self-Organizing Edge Computing Distribution Framework for Visual SLAM Jussi Kalliola et.al. 2501.08629 null
2025-01-14 VINGS-Mono: Visual-Inertial Gaussian Splatting Monocular SLAM in Large Scenes Ke Wu et.al. 2501.08286 null
2025-01-13 Efficiently Closing Loops in LiDAR-Based SLAM Using Point Cloud Density Maps Saurabh Gupta et.al. 2501.07399 null
2025-01-13 SplatMAP: Online Dense Monocular SLAM with 3D Gaussian Splatting Yue Hu et.al. 2501.07015 null
2025-01-12 CULTURE3D: Cultural Landmarks and Terrain Dataset for 3D Applications Xinyi Zheng et.al. 2501.06927 link
2025-01-11 SP-SLAM: Neural Real-Time Dense SLAM With Scene Priors Zhen Hong et.al. 2501.06469 null
2025-01-09 Scaffold-SLAM: Structured 3D Gaussians for Simultaneous Localization and Photorealistic Mapping Wen Tianci et.al. 2501.05242 null
2025-01-07 SLAM: Towards Efficient Multilingual Reasoning via Selective Language Alignment Yuchun Fan et.al. 2501.03681 link
2025-01-06 HaWoR: World-Space Hand Motion Reconstruction from Egocentric Videos Jinglei Zhang et.al. 2501.02973 null
2025-01-09 LP-ICP: General Localizability-Aware Point Cloud Registration for Robust Localization in Extreme Unstructured Environments Haosong Yue et.al. 2501.02580 link
2025-01-04 ROLO-SLAM: Rotation-Optimized LiDAR-Only SLAM in Uneven Terrain with Ground Vehicle Yinchuan Wang et.al. 2501.02166 link
2024-12-31 PanoSLAM: Panoptic 3D Scene Reconstruction via Gaussian SLAM Runnan Chen et.al. 2501.00352 null
2024-12-30 Hierarchical Pose Estimation and Mapping with Multi-Scale Neural Feature Fields Evgenii Kruzhkov et.al. 2412.20976 null
2024-12-28 MambaVO: Deep Visual Odometry Based on Sequential Matching Refinement and Training Smoothing Shuo Wang et.al. 2412.20082 null
2024-12-27 DAS3R: Dynamics-Aware Gaussian Splatting for Static Scene Reconstruction Kai Xu et.al. 2412.19584 null
2024-12-26 MVS-GS: High-Quality 3D Gaussian Splatting Mapping via Online Multi-View Stereo Byeonggwon Lee et.al. 2412.19130 null
2024-12-23 End-to-end Generative Spatial-Temporal Ultrasonic Odometry and Mapping Framework Fuhua Jia et.al. 2412.17343 null
2024-12-23 LMD-PGN: Cross-Modal Knowledge Distillation from First-Person-View Images to Third-Person-View BEV Maps for Universal Point Goal Navigation Riku Uemura et.al. 2412.17282 null
2024-12-23 Selective Kalman Filter: When and How to Fuse Multi-Sensor Information to Overcome Degeneracy in SLAM Jie Xu et.al. 2412.17235 null
2025-01-03 Leveraging Consistent Spatio-Temporal Correspondence for Robust Visual Odometry Zhaoxing Zhang et.al. 2412.16923 null
2024-12-21 Query Quantized Neural SLAM Sijia Jiang et.al. 2412.16476 link
2024-12-20 SLAM-Omni: Timbre-Controllable Voice Interaction System with Single-Stage Training Wenxi Chen et.al. 2412.15649 link
2024-12-18 Energy-Efficient SLAM via Joint Design of Sensing, Communication, and Exploration Speed Zidong Han et.al. 2412.13912 null
2024-12-18 Immersive Human-in-the-Loop Control: Real-Time 3D Surface Meshing and Physics Simulation Sait Akturk et.al. 2412.13752 null
2024-12-18 4D Radar-Inertial Odometry based on Gaussian Modeling and Multi-Hypothesis Scan Matching Fernando Amodeo et.al. 2412.13639 link
2024-12-17 NFL-BA: Improving Endoscopic SLAM with Near-Field Light Bundle Adjustment Andrea Dunn Beltran et.al. 2412.13176 null
2024-12-18 Dyn-HaMR: Recovering 4D Interacting Hand Motion from a Dynamic Camera Zhengdi Yu et.al. 2412.12861 null
2024-12-16 Global SLAM in Visual-Inertial Systems with 5G Time-of-Arrival Integration Meisam Kabiri et.al. 2412.12406 null
2024-12-16 MASt3R-SLAM: Real-Time Dense SLAM with 3D Reconstruction Priors Riku Murai et.al. 2412.12392 null
2024-12-16 Sonar-based Deep Learning in Underwater Robotics: Overview, Robustness and Challenges Martin Aubard et.al. 2412.11840 null
2024-12-19 RoMeO: Robust Metric Visual Odometry Junda Cheng et.al. 2412.11530 null
2024-12-14 Affine EKF: Exploring and Utilizing Sufficient and Necessary Conditions for Observability Maintenance to Improve EKF Consistency Yang Song et.al. 2412.10809 link
2024-12-13 RP-SLAM: Real-time Photorealistic SLAM with Efficient 3D Gaussian Splatting Lizhi Bai et.al. 2412.09868 null
2024-12-12 SLAM3R: Real-Time Dense Scene Reconstruction from Monocular RGB Videos Yuzheng Liu et.al. 2412.09401 link
2024-12-12 eCARLA-scenes: A synthetically generated dataset for event-based optical flow prediction Jad Mansour et.al. 2412.09209 link
2024-12-12 Drift-free Visual SLAM using Digital Twins Roxane Merat et.al. 2412.08496 null
2024-12-10 A Real-time Degeneracy Sensing and Compensation Method for Enhanced LiDAR SLAM Zongbo Liao et.al. 2412.07513 null
2024-12-08 DiTer++: Diverse Terrain and Multi-modal Dataset for Multi-Robot SLAM in Multi-session Environments Juwon Kim et.al. 2412.05839 null
2024-12-06 MegaSaM: Accurate, Fast, and Robust Structure and Motion from Casual Dynamic Videos Zhengqi Li et.al. 2412.04463 null
2024-12-05 Multi-cam Multi-map Visual Inertial Localization: System, Validation and Dataset Fuzhang Han et.al. 2412.04287 link
2024-12-10 MOANA: Multi-Radar Dataset for Maritime Odometry and Autonomous Navigation Application Hyesu Jang et.al. 2412.03887 null
2024-12-04 Large-Scale Dense 3D Mapping Using Submaps Derived From Orthogonal Imaging Sonars John McConnell et.al. 2412.03760 null
2024-12-04 BIMCaP: BIM-based AI-supported LiDAR-Camera Pose Refinement Miguel Arturo Vega Torres et.al. 2412.03434 link
2024-12-04 NeRF and Gaussian Splatting SLAM in the Wild Fabian Schmidt et.al. 2412.03263 link
2024-12-04 MCVO: A Generic Visual Odometry for Arbitrarily Arranged Multi-Cameras Huai Yu et.al. 2412.03146 link
2024-12-04 An indoor DSO-based ceiling-vision odometry system for indoor industrial environments Abdelhak Bougouffa et.al. 2412.02950 null
2024-12-03 ROVER: A Multi-Season Dataset for Visual SLAM Fabian Schmidt et.al. 2412.02506 link
2024-12-04 RGBDS-SLAM: A RGB-D Semantic Dense SLAM Based on 3D Multi Level Pyramid Gaussian Splatting Zhenzhong Cao et.al. 2412.01217 link
2024-11-28 Visual SLAMMOT Considering Multiple Motion Models Peilin Tian et.al. 2411.19134 null
2024-11-27 ORB-SLAM3AB: Augmenting ORB-SLAM3 to Counteract Bumps with Optical Flow Inter-frame Matching Yangrui Dong et.al. 2411.18174 null
2024-11-27 HI-SLAM2: Geometry-Aware Gaussian SLAM for Fast Monocular Scene Reconstruction Wei Zhang et.al. 2411.17982 null
2024-11-26 MapEval: Towards Unified, Robust and Efficient SLAM Map Evaluation Framework Xiangcheng Hu et.al. 2411.17928 link
2024-11-29 DROID-Splat: Combining end-to-end SLAM with 3D Gaussian Splatting Christian Homeyer et.al. 2411.17660 link
2024-11-25 MAGiC-SLAM: Multi-Agent Gaussian Globally Consistent SLAM Vladimir Yugay et.al. 2411.16785 null
2024-11-24 Gaussian Scenes: Pose-Free Sparse-View Scene Reconstruction using Depth-Enhanced Diffusion Priors Soumava Paul et.al. 2411.15966 null
2024-11-24 Near-Range Environmental Perception for Inland Waterway Vessels: A Comparative Study of LiDAR and Automotive FMCW RADAR Sensors R. Herrmann et.al. 2411.15901 null
2024-11-24 PG-SLAM: Photo-realistic and Geometry-aware RGB-D SLAM in Dynamic Environments Haoang Li et.al. 2411.15800 null
2024-11-23 Gassidy: Gaussian Splatting SLAM in Dynamic Environments Long Wen et.al. 2411.15476 null
2024-11-22 OVO-SLAM: Open-Vocabulary Online Simultaneous Localization and Mapping Tomas Berriel Martins et.al. 2411.15043 link
2024-11-22 A Benchmark Dataset for Collaborative SLAM in Service Environments Harin Park et.al. 2411.14775 link
2024-11-21 InCrowd-VI: A Realistic Visual-Inertial Dataset for Evaluating SLAM in Indoor Pedestrian-Rich Spaces for Human Navigation Marziyeh Bamdad et.al. 2411.14358 link
2024-11-20 Robust Monocular Visual Odometry using Curriculum Learning Assaf Lahiany et.al. 2411.13438 null
2024-11-20 Moving Horizon Estimation for Simultaneous Localization and Mapping with Robust Estimation Error Bounds Jelena Trisovic et.al. 2411.13310 null
2024-11-19 3D Reconstruction by Looking: Instantaneous Blind Spot Detector for Indoor SLAM through Mixed Reality Hanbeom Chang et.al. 2411.12514 null
2024-11-19 LiV-GS: LiDAR-Vision Integration for 3D Gaussian Splatting SLAM in Outdoor Environments Renxiang Xiao et.al. 2411.12185 null
2024-11-18 Exploring Emerging Trends and Research Opportunities in Visual Place Recognition Antonios Gasteratos et.al. 2411.11481 null
2024-11-18 The Blue Horizontal-Branch Stars From the LAMOST Survey: Atmospheric Parameters Jie Ju et.al. 2411.11250 null
2024-11-17 A Monocular SLAM-based Multi-User Positioning System with Image Occlusion in Augmented Reality Wei-Hsiang Lien et.al. 2411.10940 null
2024-11-16 DGS-SLAM: Gaussian Splatting SLAM in Dynamic Environment Mangyu Kong et.al. 2411.10722 link
2024-11-15 The Oxford Spires Dataset: Benchmarking Large-Scale LiDAR-Visual Localisation, Reconstruction and Radiance Field Methods Yifu Tao et.al. 2411.10546 null
2024-11-15 BEV-ODOM: Reducing Scale Drift in Monocular Visual Odometry with BEV Representation Yufei Wei et.al. 2411.10195 null
2024-11-13 DG-SLAM: Robust Dynamic Gaussian Splatting SLAM with Hybrid Pose Optimization Yueming Xu et.al. 2411.08373 null
2024-11-13 MBA-SLAM: Motion Blur Aware Dense Visual SLAM with Radiance Fields Representation Peng Wang et.al. 2411.08279 link
2024-11-12 Enhanced Monocular Visual Odometry with AR Poses and Integrated INS-GPS for Robust Localization in Urban Environments Ankit Shaw et.al. 2411.08231 null
2024-11-12 NL-SLAM for OC-VLN: Natural Language Grounded SLAM for Object-Centric VLN Sonia Raychaudhuri et.al. 2411.07848 null
2024-11-11 Lost in Tracking Translation: A Comprehensive Analysis of Visual SLAM in Human-Centered XR and IoT Ecosystems Yasra Chandio et.al. 2411.07146 null
2024-11-11 Learning from Feedback: Semantic Enhancement for Object SLAM Using Foundation Models Jungseok Hong et.al. 2411.06752 null
2024-11-11 HomoMatcher: Dense Feature Matching Results with Semi-Dense Efficiency by Homography Estimation Xiaolong Wang et.al. 2411.06700 null
2024-11-08 Development of an indoor localization and navigation system based on monocular SLAM for mobile robots Thanh Nguyen Canh et.al. 2411.05337 null
2024-11-07 Development of a Service Robot for Hospital Environments in Rehabilitation Medicine with LiDAR Based Simultaneous Localization and Mapping Sayat Ibrayev et.al. 2411.04797 null
2024-11-07 MPVO: Motion-Prior based Visual Odometry for PointGoal Navigation Sayan Paul et.al. 2411.04796 null
2024-11-09 DEIO: Deep Event Inertial Odometry Weipeng Guan et.al. 2411.03928 link
2024-11-06 Performance evaluation of SLAM-ASR: The Good, the Bad, the Ugly, and the Way Forward Shashi Kumar et.al. 2411.03866 null
2024-11-06 LCP-Fusion: A Neural Implicit SLAM with Enhanced Local Constraints and Computable Prior Jiahui Wang et.al. 2411.03610 link
2024-11-05 LVI-GS: Tightly-coupled LiDAR-Visual-Inertial SLAM using 3D Gaussian Splatting Huibin Zhao et.al. 2411.02703 null
2024-11-04 Map++: Towards User-Participatory Visual SLAM Systems with Efficient Map Expansion and Sharing Xinran Zhang et.al. 2411.02553 null
2024-11-04 Semantic Masking and Visual Feature Matching for Robust Localization Luisa Mao et.al. 2411.01804 null
2024-10-31 XRDSLAM: A Flexible and Modular Framework for Deep Learning based SLAM Xiaomeng Wang et.al. 2410.23690 link
2024-10-30 LGU-SLAM: Learnable Gaussian Uncertainty Matching with Deformable Correlation Sampling for Deep Visual SLAM Yucheng Huang et.al. 2410.23231 link
2024-10-30 ISAC Prototype System for Multi-Domain Cooperative Communication Networks Jie Yang et.al. 2410.22956 null
2024-10-30 SCRREAM: SCan, Register, REnder And Map: A Framework for Annotating Accurate and Dense 3D Indoor Scenes with a Benchmark HyunJun Jung et.al. 2410.22715 link
2024-10-29 LiVisSfM: Accurate and Robust Structure-from-Motion with LiDAR and Visual Cues Hanqing Jiang et.al. 2410.22213 null
2024-10-29 EnvoDat: A Large-Scale Multisensory Dataset for Robotic Spatial Awareness and Semantic Reasoning in Heterogeneous Environments Linus Nwankwo et.al. 2410.22200 null
2024-10-28 NYC-Event-VPR: A Large-Scale High-Resolution Event-Based Visual Place Recognition Dataset in Dense Urban Environments Taiyi Pan et.al. 2410.21615 link
2024-10-28 coVoxSLAM: GPU Accelerated Globally Consistent Dense SLAM Emiliano Höss et.al. 2410.21149 link
2024-11-01 RopeTP: Global Human Motion Recovery via Integrating Robust Pose Estimation with Diffusion Trajectory Prior Mingjiang Liang et.al. 2410.20358 null
2024-10-25 Context-Based Visual-Language Place Recognition Soojin Woo et.al. 2410.19341 link
2024-10-22 AG-SLAM: Active Gaussian Splatting SLAM Wen Jiang et.al. 2410.17422 null
2024-10-22 Impact of 3D LiDAR Resolution in Graph-based SLAM Approaches: A Comparative Study J. Jorge et.al. 2410.17171 null
2024-10-19 EndoMetric: Near-light metric scale monocular SLAM Raúl Iranzo et.al. 2410.15065 null
2024-10-17 Automatic Navigation and Voice Cloning Technology Deployment on a Humanoid Robot Dongkun Han et.al. 2410.13612 null
2024-10-17 TRLO: An Efficient LiDAR Odometry with 3D Dynamic Object Tracking and Removal Yanpeng Jia et.al. 2410.13240 null
2024-10-16 QueensCAMP: an RGB-D dataset for robust Visual SLAM Hudson M. S. Bruno et.al. 2410.12520 link
2024-10-18 PAPL-SLAM: Principal Axis-Anchored Monocular Point-Line SLAM Guanghao Li et.al. 2410.12324 null
2024-10-16 Towards Autonomous Indoor Parking: A Globally Consistent Semantic SLAM System and A Semantic Localization Subsystem Yichen Sha et.al. 2410.12169 null
2024-10-15 V3D-SLAM: Robust RGB-D SLAM in Dynamic Environments with 3D Semantic Geometry Voting Tuan Dang et.al. 2410.12068 link
2024-10-15 GSORB-SLAM: Gaussian Splatting SLAM benefits from ORB features and Transmittance information Wancai Zheng et.al. 2410.11356 null
2024-10-15 Multiview Scene Graph Juexiao Zhang et.al. 2410.11187 link
2024-10-14 MLP-SLAM: Multilayer Perceptron-Based Simultaneous Localization and Mapping With a Dynamic and Static Object Discriminator Taozhe Li et.al. 2410.10669 null
2024-10-13 Markerless Aerial-Terrestrial Co-Registration of Forest Point Clouds using a Deformable Pose Graph Benoit Casseau et.al. 2410.09896 null
2024-10-12 SLAM-AAC: Enhancing Audio Captioning with Paraphrasing Augmentation and CLAP-Refine through LLMs Wenxi Chen et.al. 2410.09503 link
2024-10-12 An Expeditious Spatial Mean Radiant Temperature Mapping Framework using Visual SLAM and Semantic Segmentation Wei Liang et.al. 2410.09443 null
2024-10-12 ESVO2: Direct Visual-Inertial Odometry with Stereo Event Cameras Junkai Niu et.al. 2410.09374 link
2024-10-11 Voxel-SLAM: A Complete, Accurate, and Versatile LiDAR-Inertial SLAM System Zheng Liu et.al. 2410.08935 link
2024-10-11 Optimizing NeRF-based SLAM with Trajectory Smoothness Constraints Yicheng He et.al. 2410.08780 null
2024-10-10 ROMAN: Open-Set Object Map Alignment for Robust View-Invariant Global Localization Mason B. Peterson et.al. 2410.08262 link
2024-10-10 IncEventGS: Pose-Free Gaussian Splatting from a Single Event Camera Jian Huang et.al. 2410.08107 link
2024-10-08 Monocular Visual Place Recognition in LiDAR Maps via Cross-Modal State Space Model and Multi-View Matching Gongxin Yao et.al. 2410.06285 null
2024-10-08 Submodular Optimization for Keyframe Selection & Usage in SLAM David Thorne et.al. 2410.05576 null
2024-10-07 SharpSLAM: 3D Object-Oriented Visual SLAM with Deblurring for Agile Drones Denis Davletshin et.al. 2410.05405 null
2024-10-07 Enhanced Multi-Robot SLAM System with Cross-Validation Matching and Exponential Threshold Keyframe Selection Ang He et.al. 2410.05017 null
2024-10-05 A Framework for Reproducible Benchmarking and Performance Diagnosis of SLAM Systems Nikola Radulov et.al. 2410.04242 link
2024-10-05 High-Speed Stereo Visual SLAM for Low-Powered Computing Devices Ashish Kumar et.al. 2410.04090 link
2024-10-04 EvenNICER-SLAM: Event-based Neural Implicit Encoding SLAM Shi Chen et.al. 2410.03812 null
2024-10-04 Estimating Body and Hand Motion in an Ego-sensed World Brent Yi et.al. 2410.03665 null
2024-10-03 LiDAR Inertial Odometry And Mapping Using Learned Registration-Relevant Features Zihao Dong et.al. 2410.02961 null
2024-10-02 ReFeree: Radar-Based Lightweight and Robust Localization using Feature and Free space Hogyun Kim et.al. 2410.01325 null
2024-10-01 Under Pressure: Altimeter-Aided ICP for 3D Maps Consistency William Dubois et.al. 2410.00758 null
2024-10-02 CaRtGS: Computational Alignment for Real-Time Gaussian Splatting SLAM Dapeng Feng et.al. 2410.00486 link
2024-09-30 Additively Manufactured Open-Source Quadruped Robots for Multi-Robot SLAM Applications Zachary Fuge et.al. 2410.00122 null
2024-09-30 Direct Multipath-Based SLAM Mingchao Liang et.al. 2409.20552 null
2024-09-30 Robust Gaussian Splatting SLAM by Leveraging Loop Closure Zunjie Zhu et.al. 2409.20111 null
2024-09-30 DynORecon: Dynamic Object Reconstruction for Navigation Yiduo Wang et.al. 2409.19928 null
2024-09-29 CELLmap: Enhancing LiDAR SLAM through Elastic and Lightweight Spherical Map Representation Yifan Duan et.al. 2409.19597 null
2024-09-29 CoT-ST: Enhancing LLM-based Speech Translation with Multimodal Chain-of-Thought Yexing Du et.al. 2409.19510 link
2024-09-29 Fast-UMI: A Scalable and Hardware-Independent Universal Manipulation Interface Ziniu Wu et.al. 2409.19499 null
2024-09-27 Royal Reveals: LiDAR Mapping of Kronborg Castle, Echoes of Hamlet's Halls Leon Davies et.al. 2409.18752 null
2024-09-26 BlinkTrack: Feature Tracking over 100 FPS via Events and Images Yichen Shen et.al. 2409.17981 null
2024-09-26 Neural Implicit Representation for Highly Dynamic LiDAR Mapping and Odometry Qi Zhang et.al. 2409.17729 null
2024-09-26 Event-based Stereo Depth Estimation: A Survey Suman Ghosh et.al. 2409.17680 null
2024-09-25 Efficient Submap-based Autonomous MAV Exploration using Visual-Inertial SLAM Configurable for LiDARs or Depth Cameras Sotiris Papatheodorou et.al. 2409.16972 null
2024-09-25 Go-SLAM: Grounded Object Segmentation and Localization with Gaussian Splatting SLAM Phu Pham et.al. 2409.16944 null
2024-09-25 Inline Photometrically Calibrated Hybrid Visual SLAM Nicolas Abboud et.al. 2409.16810 link
2024-09-25 Topological SLAM in colonoscopies leveraging deep features and topological priors Javier Morlana et.al. 2409.16806 link
2024-09-25 Robo-Platform: A Robotic System for Recording Sensors and Controlling Robots Masoud Dayani Najafabadi et.al. 2409.16595 link
2024-09-25 Task-driven SLAM Benchmarking Yanwei Du et.al. 2409.16573 link
2024-09-24 SoMaSLAM: 2D Graph SLAM for Sparse Range Sensing with Soft Manhattan World Constraints Jeahn Han et.al. 2409.15736 null
2024-09-23 Spectral Graph Theoretic Methods for Enhancing Network Robustness in Robot Localization Neelkamal Somisetty et.al. 2409.15506 null
2024-09-22 SPAQ-DL-SLAM: Towards Optimizing Deep Learning-based SLAM for Resource-Constrained Embedded Platforms Niraj Pudasaini et.al. 2409.14515 null
2024-09-21 Point Cloud Structural Similarity-based Underwater Sonar Loop Detection Donghwi Jung et.al. 2409.14020 link
2024-09-20 HMD$^2$: Environment-aware Motion Generation from Single Egocentric Head-Mounted Device Vladimir Guzov et.al. 2409.13426 null
2024-09-20 Learning Visual Information Utility with PIXER Yash Turkar et.al. 2409.13151 null
2024-09-19 MGSO: Monocular Real-time Photometric SLAM with Efficient 3D Gaussian Splatting Yan Song Hu et.al. 2409.13055 null
2024-09-19 Hi-SLAM: Scaling-up Semantics in SLAM with a Hierarchically Categorical Gaussian Splatting Boying Li et.al. 2409.12518 link
2024-09-18 Bundle Adjustment in the Eager Mode Zitong Zhan et.al. 2409.12190 null
2024-09-23 Uncertainty-Aware Visual-Inertial SLAM with Volumetric Occupancy Mapping Jaehyung Jung et.al. 2409.12051 null
2024-09-18 Metric-Semantic Factor Graph Generation based on Graph Neural Networks Jose Andres Millan-Romera et.al. 2409.11972 null
2024-09-18 Physically-Based Photometric Bundle Adjustment in Non-Lambertian Environments Lei Cheng et.al. 2409.11854 null
2024-09-18 ORB-SfMLearner: ORB-Guided Self-supervised Visual Odometry with Selective Online Adaptation Yanlin Jin et.al. 2409.11692 null
2024-09-18 SLAM assisted 3D tracking system for laparoscopic surgery Jingwei Song et.al. 2409.11688 null
2024-09-17 GLC-SLAM: Gaussian Splatting SLAM with Efficient Loop Closure Ziheng Xu et.al. 2409.10982 null
2024-09-17 Label-free correlative morpho-chemical tomography of 3D kidney mesangial cells Ankit Butola et.al. 2409.10971 null
2024-09-17 Evaluating and Improving the Robustness of LiDAR-based Localization and Mapping Bo Yang et.al. 2409.10824 link
2024-09-16 P2U-SLAM: A Monocular Wide-FoV SLAM System Based on Point Uncertainty and Pose Uncertainty Yufan Zhang et.al. 2409.10143 link
2024-09-16 SHIRE: Enhancing Sample Efficiency using Human Intuition in REinforcement Learning Amogh Joshi et.al. 2409.09990 null
2024-09-16 Enhancing Visual Inertial SLAM with Magnetic Measurements Bharat Joshi et.al. 2409.09904 null
2024-09-15 Marginalizing and Conditioning Gaussians onto Linear Approximations of Smooth Manifolds with Applications in Robotics Zi Cong Guo et.al. 2409.09871 null
2024-09-15 Range-SLAM: Ultra-Wideband-Based Smoke-Resistant Real-Time Localization and Mapping Yi Liu et.al. 2409.09763 null
2024-09-15 High Definition Map Mapping and Update: A General Overview and Future Directions Benny Wijaya et.al. 2409.09726 null
2024-09-14 MAC-VO: Metrics-aware Covariance for Learning-based Stereo Visual Odometry Yuheng Qiu et.al. 2409.09479 null
2024-09-14 Distributed Invariant Kalman Filter for Object-level Multi-robot Pose SLAM Haoying Li et.al. 2409.09410 null
2024-09-14 GEVO: Memory-Efficient Monocular Visual Odometry Using Gaussians Dasong Gao et.al. 2409.09295 link
2024-09-14 Panoramic Direct LiDAR-assisted Visual Odometry Zikang Yuan et.al. 2409.09287 link
2024-09-11 Object Depth and Size Estimation using Stereo-vision and Integration with SLAM Layth Hamad et.al. 2409.07623 null
2024-09-11 Equivariant Filter for Tightly Coupled LiDAR-Inertial Odometry Anbo Tao et.al. 2409.06948 null
2024-09-10 Technical Report of Mobile Manipulator Robot for Industrial Environments Erfan Amoozad Khalili et.al. 2409.06693 null
2024-09-10 Heterogeneous LiDAR Dataset for Benchmarking Robust Localization in Diverse Degenerate Scenarios Zhiqiang Chen et.al. 2409.04961 link
2024-09-08 FLAF: Focal Line and Feature-constrained Active View Planning for Visual Teach and Repeat Changfei Fu et.al. 2409.03457 null
2024-09-03 Integration of Augmented Reality and Mobile Robot Indoor SLAM for Enhanced Spatial Awareness Michael D. Friske et.al. 2409.01915 null
2024-09-03 Explicit Second-order LiDAR Bundle Adjustment Algorithm Using Mean Squared Group Metric Tingchen Ma et.al. 2409.01856 null
2024-09-02 Saying goodbyes to rotating your phone: Magnetometer calibration during SLAM Ilari Vallivaara et.al. 2409.01242 null
2024-09-02 Online One-Dimensional Magnetic Field SLAM with Loop-Closure Detection Manon Kok et.al. 2409.01091 null
2024-09-02 Robust Vehicle Localization and Tracking in Rain using Street Maps Yu Xiang Tan et.al. 2409.01038 link
2024-08-31 UDGS-SLAM : UniDepth Assisted Gaussian Splatting for Monocular SLAM Mostafa Mansour et.al. 2409.00362 null
2024-09-04 Augmented Reality without Borders: Achieving Precise Localization Without Maps Albert Gassol Puigjaner et.al. 2408.17373 null
2024-08-30 Efficient Camera Exposure Control for Visual Odometry via Deep Reinforcement Learning Shuyang Zhang et.al. 2408.17005 link
2024-08-29 Creating a Segmented Pointcloud of Grapevines by Combining Multiple Viewpoints Through Visual Odometry Michael Adlerstein et.al. 2408.16472 null
2024-08-28 Single-Photon 3D Imaging with Equi-Depth Photon Histograms Kaustubh Sadekar et.al. 2408.16150 null
2024-08-28 BIM-SLAM: Integrating BIM Models in Multi-session SLAM for Lifelong Mapping using 3D LiDAR Miguel Arturo Vega Torres et.al. 2408.15870 link
2024-08-30 Addressing the challenges of loop detection in agricultural environments Nicolás Soncini et.al. 2408.15761 link
2024-08-28 ES-PTAM: Event-based Stereo Parallel Tracking and Mapping Suman Ghosh et.al. 2408.15605 link
2024-08-28 PointEMRay: A Novel Efficient SBR Framework on Point Based Geometry Kaiqiao Yang et.al. 2408.15583 null
2024-09-02 Active Semantic Mapping and Pose Graph Spectral Analysis for Robot Exploration Rongge Zhang et.al. 2408.14726 link
2024-08-26 A Survey on Reinforcement Learning Applications in SLAM Mohammad Dehghani Tezerjani et.al. 2408.14518 null
2024-08-28 FAST-LIVO2: Fast, Direct LiDAR-Inertial-Visual Odometry Chunran Zheng et.al. 2408.14035 link
2024-08-21 Informed, Constrained, Aligned: A Field Analysis on Degeneracy-aware Point Cloud Registration in the Wild Turcan Tuna et.al. 2408.11809 null
2024-08-21 LiFCal: Online Light Field Camera Calibration via Bundle Adjustment Aymeric Fleith et.al. 2408.11682 null
2024-08-21 Enhanced Visual SLAM for Collision-free Driving with Lightweight Autonomous Cars Zhihao Lin et.al. 2408.11582 null
2024-08-21 RaNDT SLAM: Radar SLAM Based on Intensity-Augmented Normal Distributions Transform Maximilian Hilger et.al. 2408.11576 link
2024-08-21 Reflex-Based Open-Vocabulary Navigation without Prior Knowledge Using Omnidirectional Camera and Multiple Vision-Language Models Kento Kawaharazuka et.al. 2408.11380 null
2024-08-20 LoopSplat: Loop Closure by Registering 3D Gaussian Splats Liyuan Zhu et.al. 2408.10154 link
2024-08-19 Quantitative 3D Map Accuracy Evaluation Hardware and Algorithm for LiDAR(-Inertial) SLAM Sanghyun Hahn et.al. 2408.09727 link
2024-08-17 GSLAMOT: A Tracklet and Query Graph-based Simultaneous Locating, Mapping, and Multiple Object Tracking System Shuo Wang et.al. 2408.09191 null
2024-08-15 GOReloc: Graph-based Object-Level Relocalization for Visual SLAM Yutong Wang et.al. 2408.07917 link
2024-08-14 Inverse k-visibility for RSSI-based Indoor Geometric Mapping Junseo Kim et.al. 2408.07757 null
2024-08-14 Narrowing your FOV with SOLiD: Spatially Organized and Lightweight Global Descriptor for FOV-constrained LiDAR Place Recognition Hogyun Kim et.al. 2408.07330 link
2024-08-12 CAD-Mesher: A Convenient, Accurate, Dense Mesh-based Mapping Module in SLAM for Dynamic Environments Yanpeng Jia et.al. 2408.05981 null
2024-08-21 Visual SLAM with 3D Gaussian Primitives and Depth Priors Enabling Novel View Synthesis Zhongche Qu et.al. 2408.05635 null
2024-08-10 TOSS: Real-time Tracking and Moving Object Segmentation for Static Scene Mapping Seoyeon Jang et.al. 2408.05453 null
2024-08-08 Evaluating Modern Approaches in 3D Scene Reconstruction: NeRF vs Gaussian-Based Methods Yiming Zhou et.al. 2408.04268 null
2024-08-07 Towards Real-Time Gaussian Splatting: Accelerating 3DGS through Photometric SLAM Yan Song Hu et.al. 2408.03825 null
2024-08-07 AirSLAM: An Efficient and Illumination-Robust Point-Line Visual SLAM System Kuan Xu et.al. 2408.03520 link
2024-08-06 BodySLAM: A Generalized Monocular Visual SLAM Framework for Surgical Applications G. Manni et.al. 2408.03078 link
2024-08-04 SLAMS-Propelled Electron Acceleration at High-Mach Number Astrophysical Shocks Vladimir Zeković et.al. 2408.02084 null
2024-08-03 Visual-Inertial SLAM for Agricultural Robotics: Benchmarking the Benefits and Computational Costs of Loop Closing Fabian Schmidt et.al. 2408.01716 link
2024-08-03 Deep Patch Visual SLAM Lahav Lipson et.al. 2408.01654 link
2024-08-02 Momentum Capture and Prediction System Based on Wimbledon Open2023 Tournament Data Chang Liu et.al. 2408.01544 null
2024-08-07 IG-SLAM: Instant Gaussian SLAM F. Aykut Sarikamis et.al. 2408.01126 null
2024-08-01 Collecting Larg-Scale Robotic Datasets on a High-Speed Mobile Platform Yuxin Lin et.al. 2408.00545 null
2024-08-01 High-Quality, ROS Compatible Video Encoding and Decoding for High-Definition Datasets Jian Li et.al. 2408.00538 link
2024-07-31 SuperVINS: A visual-inertial SLAM framework integrated deep learning features Hongkun Luo et.al. 2407.21348 link
2024-07-30 NIS-SLAM: Neural Implicit Semantic RGB-D SLAM for 3D Consistent Scene Understanding Hongjia Zhai et.al. 2407.20853 null
2024-07-29 A flexible framework for accurate LiDAR odometry, map manipulation, and localization José Luis Blanco-Claraco et.al. 2407.20465 link
2024-07-28 Solving Short-Term Relocalization Problems In Monocular Keyframe Visual SLAM Using Spatial And Semantic Data Azmyin Md. Kamal et.al. 2407.19518 null
2024-07-26 Real-time Uncertainty-Aware Motion Planning for Magnetic-based Navigation Aditya Penumarti et.al. 2407.19046 null
2024-07-26 HERO-SLAM: Hybrid Enhanced Robust Optimization of Neural SLAM Zhe Xin et.al. 2407.18813 null
2024-07-25 CodedVO: Coded Visual Odometry Sachin Shah et.al. 2407.18240 null
2024-07-28 HumanVid: Demystifying Training Data for Camera-controllable Human Image Animation Zhenzhi Wang et.al. 2407.17438 link
2024-07-22 Memory Management for Real-Time Appearance-Based Loop Closure Detection Mathieu Labbé et.al. 2407.15890 null
2024-07-22 Reinforcement Learning Meets Visual Odometry Nico Messikommer et.al. 2407.15626 link
2024-07-22 Online Global Loop Closure Detection for Large-Scale Multi-Session Graph-Based SLAM Mathieu Labbe et.al. 2407.15305 null
2024-07-21 Semi-Supervised Pipe Video Temporal Defect Interval Localization Zhu Huang et.al. 2407.15170 null
2024-07-21 VoxDepth: Rectification of Depth Images on Edge Devices Yashashwee Chakrabarty et.al. 2407.15067 null
2024-07-20 From Underground Mines to Offices: A Versatile and Robust Framework for Range-Inertial SLAM Lorenzo Montano-Oliván et.al. 2407.14797 null
2024-07-19 MSSP : A Versatile Multi-Scenario Adaptable Intelligent Robot Simulation Platform Based on LIDAR-Inertial Fusion Qiyan Li et.al. 2407.14102 null
2024-07-18 A New Tightly-Coupled Dual-VIO for a Mobile Manipulator With Dynamic Locomotion Jianxiang Xu et.al. 2407.13878 link
2024-07-18 Learn to Memorize and to Forget: A Continual Learning Perspective of Dynamic SLAM Baicheng Li et.al. 2407.13338 null
2024-07-18 Attenuation-Aware Weighted Optical Flow with Medium Transmission Map for Learning-based Visual Odometry in Underwater terrain Bach Nguyen Gia et.al. 2407.13159 link
2024-07-17 Is That Rain? Understanding Effects on Visual Odometry Performance for Autonomous UAVs and Efficient DNN-based Rain Classification at the Edge Andrea Albanese et.al. 2407.12663 null
2024-07-17 Towards Revisiting Visual Place Recognition for Joining Submaps in Multimap SLAM Markus Weißflog et.al. 2407.12408 null
2024-07-19 Fisheye-Calib-Adapter: An Easy Tool for Fisheye Camera Model Conversion Sangjun Lee et.al. 2407.12405 link
2024-07-17 Fusion LiDAR-Inertial-Encoder data for High-Accuracy SLAM Manh Do Duc et.al. 2407.11870 null
2024-07-17 GV-Bench: Benchmarking Local Feature Matching for Geometric Verification of Long-term Loop Closure Detection Jingwen Yu et.al. 2407.11736 link
2024-07-16 Snail-Radar: A large-scale diverse dataset for the evaluation of 4D-radar-based SLAM systems Jianzhu Huai et.al. 2407.11705 null
2024-07-16 Batch SLAM with PMBM Data Association Sampling and Graph-Based Optimization Yu Ge et.al. 2407.11643 null
2024-07-16 I$^2$-SLAM: Inverting Imaging Process for Robust Photorealistic Dense SLAM Gwangtak Bae et.al. 2407.11347 null
2024-07-16 FR-SLAM: A SLAM Improvement Method Based on Floor Plan Registration Jiantao Feng et.al. 2407.11299 null
2024-07-15 Evaluating geometric accuracy of NeRF reconstructions compared to SLAM method Adam Korycki et.al. 2407.11238 null
2024-07-12 An Adaptive Indoor Localization Approach Using WiFi RSSI Fingerprinting with SLAM-Enabled Robotic Platform and Deep Neural Networks Seyed Alireza Rahimi Azghadi et.al. 2407.09242 null
2024-07-11 SGLC: Semantic Graph-Guided Coarse-Fine-Refine Full Loop Closing for LiDAR SLAM Neng Wang et.al. 2407.08106 link
2024-07-09 Hyperion -- A fast, versatile symbolic Gaussian Belief Propagation framework for Continuous-Time SLAM David Hug et.al. 2407.07074 link
2024-07-15 A Neurosymbolic Approach to Adaptive Feature Extraction in SLAM Yasra Chandio et.al. 2407.06889 null
2024-07-08 Object-Oriented Material Classification and 3D Clustering for Improved Semantic Perception and Mapping in Mobile Robots Siva Krishna Ravipati et.al. 2407.06077 link
2024-07-10 Co-RaL: Complementary Radar-Leg Odometry with 4-DoF Optimization and Rolling Contact Sangwoo Jung et.al. 2407.05820 null
2024-07-07 Active Collaborative Visual SLAM exploiting ORB Features Muhammad Farhan Ahmed et.al. 2407.05453 null
2024-07-06 VIPS-Odom: Visual-Inertial Odometry Tightly-coupled with Parking Slots for Autonomous Parking Xuefeng Jiang et.al. 2407.05017 null
2024-07-06 Symmetric Linear Arc Monadic Datalog and Gadget Reductions Manuel Bodirsky et.al. 2407.04924 null
2024-07-03 Ultra-Lightweight Collaborative Mapping for Robot Swarms Vlad Niculescu et.al. 2407.03136 null
2024-07-01 RoDyn-SLAM: Robust Dynamic Dense RGB-D SLAM with Neural Radiance Fields Haochen Jiang et.al. 2407.01303 link
2024-07-01 Preserving Relative Localization of FoV-Limited Drone Swarm via Active Mutual Observation Lianjie Guo et.al. 2407.01292 link
2024-07-01 Collaborative Graph Exploration with Reduced Pose-SLAM Uncertainty via Submodular Optimization Ruofei Bai et.al. 2407.01013 link
2024-06-30 Ego-to-Exo: Interfacing Third Person Visuals from Egocentric Views in Real-time for Improved ROV Teleoperation Adnan Abdullah et.al. 2407.00848 null
2024-06-30 OfCaM: Global Human Mesh Recovery via Optimization-free Camera Motion Scale Calibration Fengyuan Yang et.al. 2407.00574 null
2024-06-24 Compressing Search with Language Models Thomas Mulc et.al. 2407.00085 null
2024-06-28 CLOi-Mapper: Consistent, Lightweight, Robust, and Incremental Mapper With Embedded Systems for Commercial Robot Services DongKi Noh et.al. 2406.19634 null
2024-06-25 Benchmarking SLAM Algorithms in the Cloud: The SLAM Hive System Xinzhe Liu et.al. 2406.17586 null
2024-07-02 SlideSLAM: Sparse, Lightweight, Decentralized Metric-Semantic SLAM for Multi-Robot Navigation Xu Liu et.al. 2406.17249 link
2024-06-24 From Perfect to Noisy World Simulation: Customizable Embodied Multi-modal Perturbations for SLAM Robustness Benchmarking Xiaohao Xu et.al. 2406.16850 link
2024-06-23 Imperative Learning: A Self-supervised Neural-Symbolic Learning Framework for Robot Autonomy Chen Wang et.al. 2406.16087 null
2024-06-19 Simultaneous Map and Object Reconstruction Nathaniel Chodosh et.al. 2406.13896 null
2024-06-14 Galibr: Targetless LiDAR-Camera Extrinsic Calibration Method via Ground Plane Initialization Wonho Song et.al. 2406.11599 null
2024-06-16 Self-supervised Pretraining and Finetuning for Monocular Depth and Visual Odometry Boris Chidlovskii et.al. 2406.11019 null
2024-06-15 Detection and Utilization of Reflections in LiDAR Scans Through Plane Optimization and Plane SLAM Yinjie Li et.al. 2406.10494 link
2024-06-12 From Variance to Veracity: Unbundling and Mitigating Gradient Variance in Differentiable Bundle Adjustment Layers Swaminathan Gurumurthy et.al. 2406.07785 link
2024-06-27 Notes on Kalman Filter (KF, EKF, ESKF, IEKF, IESKF) Gyubeom Im et.al. 2406.06427 null
2024-06-10 Notes on Various Errors and Jacobian Derivations for SLAM Gyubeom Im et.al. 2406.06422 null
2024-06-23 Multicam-SLAM: Non-overlapping Multi-camera SLAM for Indirect Visual Localization and Navigation Shenghao Li et.al. 2406.06374 link
2024-06-15 Visual-Inertial SLAM as Simple as A, B, VINS Nathaniel Merrill et.al. 2406.05969 null
2024-06-09 MAP-ADAPT: Real-Time Quality-Adaptive Semantic 3D Maps Jianhao Zheng et.al. 2406.05849 null
2024-06-06 Open Problem: Active Representation Learning Nikola Milosevic et.al. 2406.03845 null
2024-06-04 ProGEO: Generating Prompts through Image-Text Contrastive Learning for Visual Geo-localization Chen Mao et.al. 2406.01906 link
2024-06-03 The Empirical Impact of Forgetting and Transfer in Continual Visual Odometry Paolo Cudrano et.al. 2406.01797 null
2024-06-03 Self-Supervised Geometry-Guided Initialization for Robust Monocular Visual Odometry Takayuki Kanai et.al. 2406.00929 null
2024-06-02 Visual place recognition for aerial imagery: A survey Ivan Moskalenko et.al. 2406.00885 link
2024-05-30 Structure Gaussian SLAM with Manhattan World Hypothesis Shuhong Liu et.al. 2405.20031 null
2024-05-30 Semantic Landmark Detection & Classification Using Neural Networks For 3D In-Air Sonar Wouter Jansen et.al. 2405.19869 null
2024-05-30 SLAM-based Joint Calibration of Multiple Asynchronous Microphone Arrays and Sound Source Localization Jiang Wang et.al. 2405.19813 link
2024-05-30 TAMBRIDGE: Bridging Frame-Centered Tracking and 3D Gaussian Splatting for Enhanced SLAM Peifeng Jiang et.al. 2405.19614 null
2024-05-27 CudaSIFT-SLAM: multiple-map visual SLAM for full procedure mapping in real human endoscopy Richard Elvira et.al. 2405.16932 null
2024-05-26 Splat-SLAM: Globally Optimized RGB-only SLAM with 3D Gaussians Erik Sandström et.al. 2405.16544 link
2024-05-24 NeB-SLAM: Neural Blocks-based Salable RGB-D SLAM for Unknown Scenes Lizhi Bai et.al. 2405.15151 null
2024-05-23 ETA-INIT: Enhancing the Translation Accuracy for Stereo Visual-Inertial SLAM Initialization Han Song et.al. 2405.15082 null
2024-05-23 Synergistic Global-space Camera and Human Reconstruction from Videos Yizhou Zhao et.al. 2405.14855 null
2024-05-23 CoPeD-Advancing Multi-Robot Collaborative Perception: A Comprehensive Dataset in Real-World Environments Yang Zhou et.al. 2405.14731 link
2024-05-23 Efficient Robot Learning for Perception and Mapping Niclas Vödisch et.al. 2405.14688 null
2024-05-22 Monocular Gaussian SLAM with Language Extended Loop Closure Tian Lan et.al. 2405.13748 null
2024-05-26 NV-LIO: LiDAR-Inertial Odometry using Normal Vectors Towards Robust SLAM in Multifloor Environments Dongha Chung et.al. 2405.12563 link
2024-05-20 EdgeLoc: A Communication-Adaptive Parallel System for Real-Time Localization in Infrastructure-Assisted Autonomous Driving Boyi Liu et.al. 2405.12120 null
2024-05-24 Outlier-Robust Long-Term Robotic Mapping Leveraging Ground Segmentation Hyungtae Lim et.al. 2405.11176 null
2024-05-18 MotionGS : Compact Gaussian Splatting SLAM by Motion Filter Xinli Guo et.al. 2405.11129 link
2024-05-17 CCTNet: A Circular Convolutional Transformer Network for LiDAR-based Place Recognition Handling Movable Objects Occlusion Gang Wang et.al. 2405.10793 null
2024-05-17 Occupancy-SLAM: Simultaneously Optimizing Robot Poses and Continuous Occupancy Map Liang Zhao et.al. 2405.10743 null
2024-05-10 MGS-SLAM: Monocular Sparse Tracking and Gaussian Mapping with Depth Smooth Regularization Pengcheng Zhu et.al. 2405.06241 null
2024-05-07 Bayesian Simultaneous Localization and Multi-Lane Tracking Using Onboard Sensors and a SD Map Yuxuan Xia et.al. 2405.04290 null
2024-05-07 IMU-Aided Event-based Stereo Visual Odometry Junkai Niu et.al. 2405.04071 link
2024-04-27 An Attention-Based Deep Learning Architecture for Real-Time Monocular Visual Odometry: Applications to GPS-free Drone Navigation Olivier Brochu Dufour et.al. 2404.17745 null
2024-04-26 Camera Motion Estimation from RGB-D-Inertial Scene Flow Samuel Cerezo et.al. 2404.17251 link
2024-04-23 Multi-Session SLAM with Differentiable Wide-Baseline Pose Optimization Lahav Lipson et.al. 2404.15263 link
2024-04-18 SPOT: Point Cloud Based Stereo Visual Place Recognition for Similar and Opposing Viewpoints Spencer Carmichael et.al. 2404.12339 null
2024-04-17 VBR: A Vision Benchmark in Rome Leonardo Brizi et.al. 2404.11322 link
2024-04-14 Increasing SLAM Pose Accuracy by Ground-to-Satellite Image Registration Yanhao Zhang et.al. 2404.09169 link
2024-04-06 Salient Sparse Visual Odometry With Pose-Only Supervision Siyu Chen et.al. 2404.04677 null
2024-03-25 A Comparative Analysis of Visual Odometry in Virtual and Real-World Railways Environments Gianluca D'Amico et.al. 2403.17084 null
2024-03-19 On Designing Consistent Covariance Recovery from a Deep Learning Visual Odometry Engine Jagatpreet Singh Nir et.al. 2403.13170 null
2024-03-18 The POLAR Traverse Dataset: A Dataset of Stereo Camera Images Simulating Traverses across Lunar Polar Terrain under Extreme Lighting Conditions Margaret Hansen et.al. 2403.12194 null
2024-03-18 An Accurate and Real-time Relative Pose Estimation from Triple Point-line Images by Decoupling Rotation and Translation Zewen Xu et.al. 2403.11639 null
2024-03-16 Efficient Domain Adaptation for Endoscopic Visual Odometry Junyang Wu et.al. 2403.10860 null
2024-03-14 Visual Inertial Odometry using Focal Plane Binary Features (BIT-VIO) Matthew Lisondra et.al. 2403.09882 null
2024-03-02 Grid-based Fast and Structural Visual Odometry Zhang Zhihe et.al. 2403.01110 null
2024-02-25 VOLoc: Visual Place Recognition by Querying Compressed Lidar Map Xudong Cai et.al. 2402.15961 link
2024-02-22 Secure Navigation using Landmark-based Localization in a GPS-denied Environment Ganesh Sapkota et.al. 2402.14280 null
2024-02-19 Landmark-based Localization using Stereo Vision and Deep Learning in GPS-Denied Battlefield Environment Ganesh Sapkota et.al. 2402.12551 null
2024-02-07 Online and Certifiably Correct Visual Odometry and Mapping Devansh R Agrawal et.al. 2402.05254 null
2024-02-06 YOLOPoint Joint Keypoint and Object Detection Anton Backhaus et.al. 2402.03989 link
2024-01-19 Motion Consistency Loss for Monocular Visual Odometry with Attention-Based Deep Learning André O. Françani et.al. 2401.10857 null
2024-01-17 Event-Based Visual Odometry on Non-Holonomic Ground Vehicles Wanting Xu et.al. 2401.09331 link
2024-01-11 On State Estimation in Multi-Sensor Fusion Navigation: Optimization and Filtering Feng Zhu et.al. 2401.05836 null
2023-12-19 Loss it right: Euclidean and Riemannian Metrics in Learning-based Visual Odometry Olaya Álvarez-Tuñón et.al. 2401.05396 link
2024-01-07 Amirkabir campus dataset: Real-world challenges and scenarios of Visual Inertial Odometry (VIO) for visually impaired people Ali Samadzadeh et.al. 2401.03604 link
2024-01-03 LEAP-VO: Long-term Effective Any Point Tracking for Visual Odometry Weirong Chen et.al. 2401.01887 link
2023-12-28 SR-LIVO: LiDAR-Inertial-Visual Odometry and Mapping with Sweep Reconstruction Zikang Yuan et.al. 2312.16800 link
2023-12-20 NeRF-VO: Real-Time Sparse Visual Odometry with Neural Radiance Fields Jens Naumann et.al. 2312.13471 null
2023-12-22 Ternary-type Opacity and Hybrid Odometry for RGB-only NeRF-SLAM Junru Lin et.al. 2312.13332 null
2023-12-20 Brain-Inspired Visual Odometry: Balancing Speed and Interpretability through a System of Systems Approach Habib Boloorchi Tabrizi et.al. 2312.13162 link
2023-12-20 Trajectory Approximation of Video Based on Phase Correlation for Forward Facing Camera Abdulkadhem A. Abdulkadhem et.al. 2312.12680 null
2023-12-15 Deep Event Visual Odometry Simon Klenk et.al. 2312.09800 link
2023-12-10 SuperPrimitive: Scene Reconstruction at a Primitive Level Kirill Mazur et.al. 2312.05889 null
2023-12-04 iMatching: Imperative Correspondence Learning Zitong Zhan et.al. 2312.02141 link
2023-11-30 Event-based Visual Inertial Velometer Xiuyuan Lu et.al. 2311.18189 null
2023-11-21 CoVOR-SLAM: Cooperative SLAM using Visual Odometry and Ranges for Multi-Robot Systems Young-Hee Lee et.al. 2311.12580 null
2023-11-10 Dense Visual Odometry Using Genetic Algorithm Slimane Djema et.al. 2311.06149 null
2023-11-07 Inertial Guided Uncertainty Estimation of Feature Correspondence in Visual-Inertial Odometry/SLAM Seongwook Yoon et.al. 2311.03722 null
2023-10-23 Converting Depth Images and Point Clouds for Feature-based Pose Estimation Robert Lösch et.al. 2310.14924 link
2023-10-17 Open-Structure: a Structural Benchmark Dataset for SLAM Algorithms Yanyan Li et.al. 2310.10931 link
2023-10-12 Jointly Optimized Global-Local Visual Localization of UAVs Haoling Li et.al. 2310.08082 null
2023-10-10 l-dyno: framework to learn consistent visual features using robot's motion Kartikeya Singh et.al. 2310.06249 link
2023-10-08 XVO: Generalized Visual Odometry via Cross-Modal Self-Training Lei Lai et.al. 2309.16772 null
2023-10-22 ObVi-SLAM: Long-Term Object-Visual SLAM Amanda Adkins et.al. 2309.15268 link
2023-09-23 Tag-based Visual Odometry Estimation for Indoor UAVs Localization Massimiliano Bertoni et.al. 2309.13311 null
2023-09-22 Exposing the Unseen: Exposure Time Emulation for Offline Benchmarking of Vision Algorithms Olivier Gamache et.al. 2309.13139 link
2023-09-20 Conformalized Multimodal Uncertainty Regression and Reasoning Domenico Parente et.al. 2309.11018 null
2023-09-20 OCC-VO: Dense Mapping via 3D Occupancy-Based Visual Odometry for Autonomous Driving Heng Li et.al. 2309.11011 link
2023-09-19 LiDAR-Generated Images Derived Keypoints Assisted Point Cloud Registration Scheme in Odometry Estimation Haizhou Zhang et.al. 2309.10436 link
2023-09-21 Dive Deeper into Rectifying Homography for Stereo Camera Online Self-Calibration Hongbo Zhao et.al. 2309.10314 null
2023-09-18 End-to-End Learned Event- and Image-based Visual Odometry Roberto Pellerito et.al. 2309.09947 link
2023-09-14 An Explicit Method for Fast Monocular Depth Recovery in Corridor Environments Yehao Liu et.al. 2309.07408 null
2023-09-11 Evaluating Visual Odometry Methods for Autonomous Driving in Rain Yu Xiang Tan et.al. 2309.05249 null
2023-09-08 Robot Localization and Mapping Final Report -- Sequential Adversarial Learning for Self-Supervised Deep Visual Odometry Akankshya Kar et.al. 2309.04147 null
2023-09-04 EMR-MSF: Self-Supervised Recurrent Monocular Scene Flow Exploiting Ego-Motion Rigidity Zijie Jiang et.al. 2309.01296 null
2023-08-27 Deep Learning for Visual Localization and Mapping: A Survey Changhao Chen et.al. 2308.14039 null
2023-08-19 Enhancing State Estimation in Robots: A Data-Driven Approach with Differentiable Ensemble Kalman Filters Xiao Liu et.al. 2308.09870 link
2023-08-12 4DRVO-Net: Deep 4D Radar-Visual Odometry Using Multi-Modal and Multi-Scale Adaptive Fusion Guirong Zhuo et.al. 2308.06573 null
2023-08-10 Mono-hydra: Real-time 3D scene graph construction from monocular camera input with IMU U. V. B. L. Udugama et.al. 2308.05515 null
2023-08-02 A Small Form Factor Aerial Research Vehicle for Pick-and-Place Tasks with Onboard Real-Time Object Detection and Visual Odometry Cora A. Dimmig et.al. 2308.01398 null
2023-08-02 Stereo Visual Odometry with Deep Learning-Based Point and Line Feature Matching using an Attention Graph Neural Network Shenbagaraj Kannapiran et.al. 2308.01125 null
2023-08-02 Preliminary Design of the Dragonfly Navigation Filter Ben Schilling et.al. 2307.13513 null
2023-07-19 Optimizing the extended Fourier Mellin Transformation Algorithm Wenqing Jiang et.al. 2307.10015 link
2023-07-15 Tightly-Coupled LiDAR-Visual SLAM Based on Geometric Features for Mobile Agents Ke Cao et.al. 2307.07763 null
2023-07-26 Event-based Stereo Visual Odometry with Native Temporal Resolution via Continuous-time Gaussian Process Regression Jianeng Wang et.al. 2306.01188 null
2023-07-06 OSPC: Online Sequential Photometric Calibration Jawad Haidar et.al. 2305.17673 null
2023-05-15 Event Camera-based Visual Odometry for Dynamic Motion Tracking of a Legged Robot Using Adaptive Time Surface Shifan Zhu et.al. 2305.08962 null
2023-05-10 Transformer-based model for monocular visual odometry: a video understanding approach André O. Françani et.al. 2305.06121 link
2023-04-29 Modality-invariant Visual Odometry for Embodied Vision Marius Memmel et.al. 2305.00348 link
2023-04-21 FSNet: Redesign Self-Supervised MonoDepth for Full-Scale Depth Prediction for Autonomous Driving Yuxuan Liu et.al. 2304.10719 null
2023-07-08 Visual-LiDAR Odometry and Mapping with Monocular Scale Correction and Visual Bootstrapping Hanyu Cai et.al. 2304.08978 null
2023-04-12 SiLK -- Simple Learned Keypoints Pierre Gleize et.al. 2304.06194 link
2023-04-11 ClusterFusion: Real-time Relative Positioning and Dense Reconstruction for UAV Cluster Yifei Dong et.al. 2304.04943 null
2023-03-21 Learning a Depth Covariance Function Eric Dexheimer et.al. 2303.12157 null
2023-03-21 Online Learning of Wheel Odometry Correction for Mobile Robots with Attention-based Neural Network Alessandro Navone et.al. 2303.11725 null
2023-03-20 VR-SLAM: A Visual-Range Simultaneous Localization and Mapping System using Monocular Camera and Ultra-wideband Sensors Thien Hoang Nguyen et.al. 2303.10903 null
2023-03-17 CoVIO: Online Continual Learning for Visual-Inertial Odometry Niclas Vödisch et.al. 2303.10149 link
2023-03-15 UMS-VINS: United Monocular-Stereo Features for Visual-Inertial Tightly Coupled Odometry Chaoyang Jiang et.al. 2303.08550 null
2023-03-13 Discovering Multiple Algorithm Configurations Leonid Keselman et.al. 2303.07434 null
2023-03-09 Virtual Inverse Perspective Mapping for Simultaneous Pose and Motion Estimation Masahiro Hirano et.al. 2303.05192 null
2023-03-16 Stereo Event-based Visual-Inertial Odometry Kunfeng Wang et.al. 2303.05086 link
2023-03-07 Long Distance GNSS-Denied Visual Inertial Navigation for Autonomous Fixed Wing Unmanned Air Vehicles: SO(3) Manifold Filter based on Virtual Vision Sensor Eduardo Gallo et.al. 2303.03804 null
2023-03-03 Lightweight, Uncertainty-Aware Conformalized Visual Odometry Alex C. Stutts et.al. 2303.02207 null
2023-02-24 FLSea: Underwater Visual-Inertial and Stereo-Vision Forward-Looking Datasets Yelena Randall et.al. 2302.12772 null
2023-02-27 CP+: Camera Poses Augmentation with Large-scale LiDAR Maps Jiadi Cui et.al. 2302.12198 null
2023-02-19 EdgeVO: An Efficient and Accurate Edge-based Visual Odometry Hui Zhao et.al. 2302.09493 null
2023-01-27 HDPV-SLAM: Hybrid Depth-augmented Panoramic Visual SLAM for Mobile Mapping System with Tilted LiDAR and Panoramic Visual Camera Mostafa Ahmadi et.al. 2301.11823 null
2023-01-26 Distributed Optimization Methods for Multi-Robot Systems: Part I -- A Tutorial Ola Shorinwa et.al. 2301.11313 null
2023-01-24 Generalized Object Search Kaiyu Zheng et.al. 2301.10121 null
2023-01-22 Improving Autonomous Vehicle Mapping and Navigation in Work Zones Using Crowdsourcing Vehicle Trajectories Hanlin Chen et.al. 2301.09194 null
2023-01-21 Dense RGB SLAM with Neural Implicit Maps Heng Li et.al. 2301.08930 null
2023-01-18 Extended FastSLAM Using Cellular Multipath Component Delays and Angular Information Junshi Chen et.al. 2301.07560 null
2023-01-17 COVINS-G: A Generic Back-end for Collaborative Visual-Inertial SLAM Manthan Patel et.al. 2301.07147 link
2023-01-31 Swarm-SLAM : Sparse Decentralized Collaborative Simultaneous Localization and Mapping Framework for Multi-Robot Systems Pierre-Yves Lajoie et.al. 2301.06230 link
2023-01-13 A LiDAR-Inertial-Visual SLAM System with Loop Detection Kangcheng Liu et.al. 2301.05604 null
2023-01-11 AdaptSLAM: Edge-Assisted Adaptive SLAM with Resource Constraints via Uncertainty Minimization Ying Chen et.al. 2301.04620 link
2023-01-12 TBV Radar SLAM -- trust but verify loop candidates Daniel Adolfsson et.al. 2301.04397 link
2022-12-31 Digital Twin-Enabled Domain Adaptation for Zero-Touch UAV Networks: Survey and Challenges Maxwell McManus et.al. 2301.03359 null
2023-01-09 Motion Addition and Motion Optimization Liqun Qi et.al. 2301.03174 null
2023-01-08 Towards Open World NeRF-Based SLAM Daniil Lisus et.al. 2301.03102 null
2023-01-06 CyberLoc: Towards Accurate Long-term Visual Localization Liu Liu et.al. 2301.02403 null
2023-01-03 LunarNav: Crater-based Localization for Long-range Autonomous Lunar Rover Navigation Shreyansh Daftry et.al. 2301.01350 null
2022-12-31 4Seasons: Benchmarking Visual SLAM and Long-Term Localization for Autonomous Driving in Challenging Conditions Patrick Wenzel et.al. 2301.01147 null
2023-01-03 BS3D: Building-scale 3D Reconstruction from RGB-D Images Janne Mustaniemi et.al. 2301.01057 null
2023-01-10 An Event-based Algorithm for Simultaneous 6-DOF Camera Pose Tracking and Mapping Masoud Dayani Najafabadi et.al. 2301.00618 link
2022-12-25 A Combined Approach Toward Consistent Reconstructions of Indoor Spaces Based on 6D RGB-D Odometry and KinectFusion Nadia Figueroa et.al. 2212.14772 null
2022-12-29 An Enhanced LiDAR-Inertial SLAM System for Robotics Localization and Mapping Kangcheng Liu et.al. 2212.14209 link
2022-12-27 Clock and Orientation-Robust Simultaneous Radio Localization and Mapping at Millimeter Wave Bands Felipe Gómez-Cuba et.al. 2212.13477 link
2022-12-26 ESVIO: Event-based Stereo Visual Inertial Odometry Peiyu Chen et.al. 2212.13184 link
2022-12-24 A Comprehensive Review on Autonomous Navigation Saeid Nahavandi et.al. 2212.12808 null
2022-12-23 Radio SLAM for 6G Systems at THz Frequencies: Design and Experimental Validation Marina Lotti et.al. 2212.12388 null
2022-12-23 Implementation of a Blind navigation method in outdoors/indoors areas Mohammad Javadian Farzaneh et.al. 2212.12185 null
2022-12-22 S-Graphs+: Real-time Localization and Mapping leveraging Hierarchical Representations Hriday Bavle et.al. 2212.11770 link
2022-12-22 Active SLAM: A Review On Last Decade Muhammad Farhan Ahmed et.al. 2212.11654 null
2022-12-27 Motion, Unit Dual Quaternion and Motion Optimization Liqun Qi et.al. 2212.11593 null
2022-12-22 Vision-Based Environmental Perception for Autonomous Driving Fei Liu et.al. 2212.11453 null
2022-12-19 Mu$^{2}$SLAM: Multitask, Multilingual Speech and Language Models Yong Cheng et.al. 2212.09553 null
2022-12-16 Cartographer_glass: 2D Graph SLAM Framework using LiDAR for Glass Environments Lasitha Weerakoon et.al. 2212.08633 null
2022-12-16 rWiFiSLAM: Effective WiFi Ranging based SLAM System in Ambient Environments Bo Wei et.al. 2212.08418 null
2023-03-02 AirVO: An Illumination-Robust Point-Line Visual Odometry Kuan Xu et.al. 2212.07595 link
2022-12-14 Autonomous Vehicle Navigation with LIDAR using Path Planning Rahul M K et.al. 2212.07155 null
2022-12-14 RIS-Enabled and Access-Point-Free Simultaneous Radio Localization and Mapping Hyowon Kim et.al. 2212.07141 null
2022-12-13 Know What You Don't Know: Consistency in Sliding Window Filtering with Unobservable States Applied to Visual-Inertial SLAM (Extended Version) Daniil Lisus et.al. 2212.06923 null
2022-12-13 SST: Real-time End-to-end Monocular 3D Reconstruction via Sparse Spatial-Temporal Guidance Chenyangguang Zhang et.al. 2212.06524 null
2022-12-13 Localization and Navigation System for Indoor Mobile Robot Yanbaihui Liu et.al. 2212.06391 null
2022-12-12 Evaluation of RGB-D SLAM in Large Indoor Environments Kirill Muravyev et.al. 2212.05980 null
2022-12-19 A Light-Weight LiDAR-Inertial SLAM System with Loop Closing Kangcheng Liu et.al. 2212.05743 link
2022-12-12 An Integrated LiDAR-SLAM System for Complex Environment with Noisy Point Clouds Kangcheng Liu et.al. 2212.05705 link
2022-12-09 SLAM for Visually Impaired People: A Survey Marziyeh Bamdad et.al. 2212.04745 null
2022-12-09 Ego-Body Pose Estimation via Ego-Head Pose Estimation Jiaman Li et.al. 2212.04636 null
2022-12-06 Receding Horizon Planning with Rule Hierarchies for Autonomous Vehicles Sushant Veer et.al. 2212.03323 link
2022-12-06 PRISM: Probabilistic Real-Time Inference in Spatial World Models Atanas Mirchev et.al. 2212.02988 null
2022-12-06 RGB-L: Enhancing Indirect Visual SLAM using LiDAR-based Dense Depth Maps Florian Sauerbeck et.al. 2212.02085 link
2022-12-05 DL-SLOT: Dynamic LiDAR SLAM and object tracking based on collaborative graph optimization Xuebo Tian et.al. 2212.02077 null
2022-12-05 ObjectMatch: Robust Registration using Canonical Object Correspondences Can Gümeli et.al. 2212.01985 null
2022-12-02 Sparse SPN: Depth Completion from Sparse Keypoints Yuqun Wu et.al. 2212.00987 null
2022-12-01 maplab 2.0 -- A Modular and Multi-Modal Mapping Framework Andrei Cramariuc et.al. 2212.00654 link
2022-12-01 AstroSLAM: Autonomous Monocular Navigation in the Vicinity of a Celestial Small Body -- Theory and Experiments Mehregan Dor et.al. 2212.00350 null
2022-11-30 MVRackLay: Monocular Multi-View Layout Estimation for Warehouse Racks and Shelves Pranjali Pathre et.al. 2211.16882 null
2022-11-29 PatchMatch-Stereo-Panorama, a fast dense reconstruction from 360° video images Hartmut Surmann et.al. 2211.16266 link
2022-11-29 MmWave Mapping and SLAM for 5G and Beyond Yu Ge et.al. 2211.16024 null
2022-11-28 Safety-quantifiable Line Feature-based Monocular Visual Localization with 3D Prior Map Xi Zheng et.al. 2211.15127 null
2022-11-29 BALF: Simple and Efficient Blur Aware Local Feature Detector Zhenjun Zhao et.al. 2211.14731 null
2022-11-27 Development of a Modular Real-time Shared-control System for a Smart Wheelchair Vaishanth Ramaraj et.al. 2211.14711 null
2022-11-26 A1 SLAM: Quadruped SLAM using the A1's Onboard Sensors Jerred Chen et.al. 2211.14432 link
2022-11-23 ActiveRMAP: Radiance Field for Active Mapping And Planning Huangying Zhan et.al. 2211.12656 null
2022-11-22 Vision-based localization methods under GPS-denied conditions Zihao Lu et.al. 2211.11988 null
2022-11-21 Towards Live 3D Reconstruction from Wearable Video: An Evaluation of V-SLAM, NeRF, and Videogrammetry Techniques David Ramirez et.al. 2211.11836 null
2022-11-21 ESLAM: Efficient Dense SLAM System Based on Hybrid Representation of Signed Distance Fields Mohammad Mahdi Johari et.al. 2211.11704 null
2022-11-24 Data Fusion for Multipath-Based SLAM: Combining Information from Multiple Propagation Paths Erik Leitinger et.al. 2211.09241 null
2022-11-16 Self-supervised Egomotion and Depth Learning via Bi-directional Coarse-to-Fine Scale Recovery Hao Qu et.al. 2211.08904 null
2022-11-20 Detecting Line Segments in Motion-blurred Images with Events Huai Yu et.al. 2211.07365 link
2022-11-13 Automatic Eye-in-Hand Calibration using EKF Aditya Ramakrishnan et.al. 2211.06881 null
2022-11-12 Active View Planning for Visual SLAM in Outdoor Environments Based on Continuous Information Modeling Zhihao Wang et.al. 2211.06557 link
2022-11-11 Multi-domain Cooperative SLAM: The Enabler for Integrated Sensing and Communications Jie Yang et.al. 2211.05982 null
2022-11-10 Online Stochastic Variational Gaussian Process Mapping for Large-Scale SLAM in Real Time Ignacio Torroba et.al. 2211.05601 link
2022-11-07 When Geometry is not Enough: Using Reflector Markers in Lidar SLAM Gerhard Kurz et.al. 2211.03484 null
2022-11-07 Detecting Invalid Map Merges in Lifelong SLAM Matthias Holoch et.al. 2211.03423 null
2022-11-06 Wheel-SLAM: Simultaneous Localization and Terrain Mapping Using One Wheel-mounted IMU Yibin Wu et.al. 2211.03174 link
2022-11-07 Lidar-level localization with radar? The CFEAR approach to accurate, fast and robust large-scale radar odometry in diverse environments Daniel Adolfsson et.al. 2211.02445 link
2022-11-03 DyOb-SLAM : Dynamic Object Tracking SLAM System Rushmian Annoy Wadud et.al. 2211.01941 null
2022-11-03 Enhanced Visual Feedback with Decoupled Viewpoint Control in Immersive Humanoid Robot Teleoperation using SLAM Yang Chen et.al. 2211.01749 null
2022-11-04 $D^2$SLAM: Decentralized and Distributed Collaborative Visual-inertial SLAM System for Aerial Swarm Hao Xu et.al. 2211.01538 link
2022-11-02 Semantic SuperPoint: A Deep Semantic Descriptor Gabriel S. Gama et.al. 2211.01098 link
2022-11-02 Ambiguity-Aware Multi-Object Pose Optimization for Visually-Assisted Robot Manipulation Myung-Hwan Jeon et.al. 2211.00960 link
2022-10-31 Mapping Extended Landmarks for Radar SLAM Shuai Sun et.al. 2210.17207 null
2022-10-25 MAROAM: Map-based Radar SLAM through Two-step Feature Selection Dequan Wang et.al. 2210.13797 null
2022-10-25 S3E: A Large-scale Multimodal Dataset for Collaborative SLAM Dapeng Feng et.al. 2210.13723 link
2022-10-24 NeRF-SLAM: Real-Time Dense Monocular SLAM with Neural Radiance Fields Antoni Rosinol et.al. 2210.13641 link
2022-10-24 Compact simultaneous label-free autofluorescence multi-harmonic (SLAM) microscopy for user-friendly photodamage-monitored imaging Geng Wang et.al. 2210.13556 null
2022-10-28 VP-SLAM: A Monocular Real-time Visual SLAM with Points, Lines and Vanishing Points Andreas Georgis et.al. 2210.12756 null
2022-10-22 SLAM: Semantic Learning based Activation Map for Weakly Supervised Semantic Segmentation Junliang Chen et.al. 2210.12417 null
2022-10-21 DCL-SLAM: A Distributed Collaborative LiDAR SLAM Framework for a Robotic Swarm Shipeng Zhong et.al. 2210.11978 link
2022-10-21 Motion Primitives Based Kinodynamic RRT for Autonomous Vehicle Navigation in Complex Environments Shubham Kedia et.al. 2210.11652 null
2022-10-22 Visual SLAM: What are the Current Trends and What to Expect? Ali Tourani et.al. 2210.10491 null
2022-10-18 Split-KalmanNet: A Robust Model-Based Deep Learning Approach for SLAM Geon Choi et.al. 2210.09636 null
2022-10-16 D2SLAM: Semantic visual SLAM based on the influence of Depth for Dynamic environments Ayman Beghdadi et.al. 2210.08647 null
2022-10-16 Indoor Smartphone SLAM with Learned Echoic Location Features Wenjie Luo et.al. 2210.08493 null
2022-10-15 Self-Improving SLAM in Dynamic Environments: Learning When to Mask Adrian Bojko et.al. 2210.08350 link
2022-10-13 Design and Evaluation of a Generic Visual SLAM Framework for Multi-Camera Systems Pushyami Kaveti et.al. 2210.07315 link
2022-10-12 RING++: Roto-translation Invariant Gram for Global Localization on a Sparse Scan Map Xuecheng Xu et.al. 2210.05984 link
2022-10-11 Observability Analysis of Graph SLAM-Based Joint Calibration of Multiple Microphone Arrays and Sound Source Localization Yuanzheng He et.al. 2210.05600 null
2022-10-11 Autonomous Asteroid Characterization Through Nanosatellite Swarming Kaitlin Dennison et.al. 2210.05518 null
2022-10-11 DeepMLE: A Robust Deep Maximum Likelihood Estimator for Two-view Structure from Motion Yuxi Xiao et.al. 2210.05517 null
2022-10-11 Multi-Object Navigation with dynamically learned neural implicit representations Pierre Marza et.al. 2210.05129 link
2022-10-12 Spectral Sparsification for Communication-Efficient Collaborative Rotation and Translation Estimation Yulun Tian et.al. 2210.05020 null
2022-10-10 Using Detection, Tracking and Prediction in Visual SLAM to Achieve Real-time Semantic Mapping of Dynamic Scenarios Xingyu Chen et.al. 2210.04562 null
2022-10-09 Fusing Event-based Camera and Radar for SLAM Using Spiking Neural Networks with Continual STDP Learning Ali Safa et.al. 2210.04236 null
2022-10-06 SCORE: A Second-Order Conic Initialization for Range-Aided SLAM Alan Papalia et.al. 2210.03177 link
2022-10-06 Feature-Realistic Neural Fusion for Real-Time, Open Set Scene Understanding Kirill Mazur et.al. 2210.03043 null
2022-10-06 Feasibility on Detecting Door Slamming towards Monitoring Early Signs of Domestic Violence Osian Morgan et.al. 2210.02642 null
2022-10-05 MOTSLAM: MOT-assisted monocular dynamic SLAM using single-view depth estimation Hanwei Zhang et.al. 2210.02038 null
2022-10-04 O2S: Open-source open shuttle Nwankwo Linus et.al. 2210.01627 null
2022-10-04 Wi-Closure: Reliable and Efficient Search of Inter-robot Loop Closures Using Wireless Sensing Weiying Wang et.al. 2210.01320 null
2022-10-03 Probabilistic Volumetric Fusion for Dense Monocular SLAM Antoni Rosinol et.al. 2210.01276 null
2022-10-03 DRACo-SLAM: Distributed Robust Acoustic Communication-efficient SLAM for Imaging Sonar Equipped Underwater Robot Teams John McConnell et.al. 2210.00867 link
2022-10-03 A Benchmark for Multi-Modal Lidar SLAM with Ground Truth in GNSS-Denied Environments Ha Sier et.al. 2210.00812 link
2022-10-01 Det-SLAM: A semantic visual SLAM for highly dynamic scenes using Detectron2 Ali Eslamian et.al. 2210.00278 null
2022-09-30 PyPose: A Library for Robot Learning with Physics-based Optimization Chen Wang et.al. 2209.15428 link
2022-09-29 DirectTracker: 3D Multi-Object Tracking Using Direct Image Alignment and Photometric Bundle Adjustment Mariia Gladkova et.al. 2209.14965 null
2022-09-28 Robust Incremental Smoothing and Mapping (riSAM) Daniel McGann et.al. 2209.14359 null
2022-09-27 Orbeez-SLAM: A Real-time Monocular Visual SLAM with ORB Features and NeRF-realized Mapping Chi-Ming Chung et.al. 2209.13274 link
2022-09-24 Graph Neural Networks for Multi-Robot Active Information Acquisition Mariliza Tzes et.al. 2209.12091 null
2022-09-24 Closing the Loop: Graph Networks to Unify Semantic Objects and Visual Features for Multi-object Scenes Jonathan J. Y. Kim et.al. 2209.11894 null
2022-09-23 involve-MI: Informative Planning with High-Dimensional Non-Parametric Beliefs Gilad Rotman et.al. 2209.11591 null
2022-09-23 Automatic Sign Reading and Localization for Semantic Mapping with an Office Robot David Balaban et.al. 2209.11432 null
2022-09-22 SQ-SLAM: Monocular Semantic SLAM Based on Superquadric Object Representation Xiao Han et.al. 2209.10817 null
2022-09-22 Acoustic SLAM based on the Direction-of-Arrival and the Direct-to-Reverberant Energy Ratio Wenhao Qiu et.al. 2209.10726 null
2022-09-21 Visual Localization and Mapping in Dynamic and Changing Environments João Carlos Virgolino Soares et.al. 2209.10710 null
2022-09-20 Uncertainty-Aware Tightly-Coupled GPS Fused LIO-SLAM Sabir Hossain et.al. 2209.10047 null
2022-09-20 WGICP: Differentiable Weighted GICP-Based Lidar Odometry Sanghyun Son et.al. 2209.09777 null
2022-09-20 PADLoC: LiDAR-Based Deep Loop Closure Detection and Registration using Panoptic Attention José Arce et.al. 2209.09699 link
2022-09-19 MeSLAM: Memory Efficient SLAM based on Neural Fields Evgenii Kruzhkov et.al. 2209.09357 null
2022-09-19 LMBAO: A Landmark Map for Bundle Adjustment Odometry in LiDAR SLAM Letian Zhang et.al. 2209.08810 null
2022-09-18 HGI-SLAM: Loop Closure With Human and Geometric Importance Features Shuhul Mujoo et.al. 2209.08608 null
2022-09-18 Data-driven Loop Closure Detection in Bathymetric Point Clouds for Underwater SLAM Jiarui Tan et.al. 2209.08578 link
2022-09-17 DytanVO: Joint Refinement of Visual Odometry and Motion Segmentation in Dynamic Environments Shihao Shen et.al. 2209.08430 link
2022-09-17 OA-SLAM: Leveraging Objects for Camera Relocalization in Visual SLAM Matthieu Zins et.al. 2209.08338 null
2022-09-17 PlaneSLAM: Plane-based LiDAR SLAM for Motion Planning in Structured 3D Environments Adam Dai et.al. 2209.08248 link
2022-09-16 ViWiD: Leveraging WiFi for Robust and Resource-Efficient SLAM Aditya Arun et.al. 2209.08091 null
2022-09-16 iDF-SLAM: End-to-End RGB-D SLAM with Neural Implicit Mapping and Deep Feature Tracking Yuhang Ming et.al. 2209.07919 null
2022-09-16 TwistSLAM++: Fusing multiple modalities for accurate dynamic semantic SLAM Mathieu Gonzalez et.al. 2209.07888 null
2022-09-15 Landmark Management in the Application of Radar SLAM Shuai Sun et.al. 2209.07199 link
2022-09-15 PROB-SLAM: Real-time Visual SLAM Based on Probabilistic Graph Optimization Xianwei Meng et.al. 2209.07061 null
2022-09-14 Semantic Visual Simultaneous Localization and Mapping: A Survey Kaiqi Chen et.al. 2209.06428 null
2022-09-13 Optimizing SLAM Evaluation Footprint Through Dynamic Range Coverage Analysis of Datasets Islam Ali et.al. 2209.06316 null
2022-09-12 A Review on Visual-SLAM: Advancements from Geometric Modelling to Learning-based Semantic Scene Understanding Tin Lai et.al. 2209.05222 null
2022-09-12 Attitude-Guided Loop Closure for Cameras with Negative Plane Ze Wang et.al. 2209.05167 link
2022-09-09 General Place Recognition Survey: Towards the Real-world Autonomy Age Peng Yin et.al. 2209.04497 link
2022-09-08 ExplORB-SLAM: Active Visual SLAM Exploiting the Pose-graph Topology Julio A. Placed et.al. 2209.03693 link
2022-09-08 R$^3$LIVE++: A Robust, Real-time, Radiance reconstruction package with a tightly-coupled LiDAR-Inertial-Visual state Estimator Jiarong Lin et.al. 2209.03666 link
2022-09-06 Group-$k$ Consistent Measurement Set Maximization for Robust Outlier Detection Brendon Forsgren et.al. 2209.02658 link
2022-09-05 Neuromorphic Visual Odometry with Resonator Networks Alpha Renner et.al. 2209.02000 null
2022-09-05 MuCaSLAM: CNN-Based Frame Quality Assessment for Mobile Robot with Omnidirectional Visual SLAM Pavel Karpyshev et.al. 2209.01936 null
2022-09-05 ElasticROS: An Elastically Collaborative Robot Operation System for Fog and Cloud Robotics Boyi Liu et.al. 2209.01774 null
2022-09-04 CloudVision: DNN-based Visual Localization of Autonomous Robots using Prebuilt LiDAR Point Cloud Evgeny Yudin et.al. 2209.01605 null
2022-08-31 PFilter: Building Persistent Maps through Feature Filtering for Fast and Accurate LiDAR-based SLAM Yifan Duan et.al. 2208.14848 null
2022-08-30 BioSLAM: A Bio-inspired Lifelong Memory System for General Place Recognition Peng Yin et.al. 2208.14543 null
2022-08-27 Learning to SLAM on the Fly in Unknown Environments: A Continual Learning Approach for Drones in Visually Ambiguous Scenes Ali Safa et.al. 2208.12997 null
2022-08-25 FusionPortable: A Multi-Sensor Campus-Scene Dataset for Evaluation of Localization and Mapping Accuracy on Diverse Platforms Jianhao Jiao et.al. 2208.11865 null
2022-08-25 Lidar SLAM for Autonomous Driving Vehicles Farhad Aghili et.al. 2208.11855 null
2022-08-24 DynaVINS: A Visual-Inertial SLAM for Dynamic Environments Seungwon Song et.al. 2208.11500 link
2022-08-22 Doppler Exploitation in Bistatic mmWave Radio SLAM Yu Ge et.al. 2208.10204 null
2022-08-21 Hilti-Oxford Dataset: A Millimetre-Accurate Benchmark for Simultaneous Localization and Mapping Lintong Zhang et.al. 2208.09825 link
2022-08-26 JVLDLoc: a Joint Optimization of Visual-LiDAR Constraints and Direction Priors for Localization in Driving Scenario Longrui Dong et.al. 2208.09777 null
2022-08-15 BoW3D: Bag of Words for Real-time Loop Closing in 3D LiDAR SLAM Yunge Cui et.al. 2208.07473 link
2022-08-12 Handling Constrained Optimization in Factor Graphs for Autonomous Navigation Barbara Bazzana et.al. 2208.06325 null
2022-08-11 RelPose: Predicting Probabilistic Relative Rotation for Single Objects in the Wild Jason Y. Zhang et.al. 2208.05963 null
2022-08-08 Visual-Inertial Multi-Instance Dynamic SLAM with Object-level Relocalisation Yifei Ren et.al. 2208.04274 link
2022-08-08 SLAM-TKA: Real-time Intra-operative Measurement of Tibial Resection Plane in Conventional Total Knee Arthroplasty Shuai Zhang et.al. 2208.03945 link
2022-08-05 A Survey on Visual Map Localization Using LiDARs and Cameras Elhousni Mahdi et.al. 2208.03376 null
2022-08-04 SROS2: Usable Cyber Security Tools for ROS 2 Victor Mayoral Vilches et.al. 2208.02615 link
2022-08-03 Evaluation and comparison of eight popular Lidar and Visual SLAM algorithms Bharath Garigipati et.al. 2208.02063 null
2022-08-02 Present and Future of SLAM in Extreme Underground Environments Kamak Ebadi et.al. 2208.01787 null
2022-08-01 Visual-Inertial SLAM with Tightly-Coupled Dropout-Tolerant GPS Fusion Simon Boche et.al. 2208.00709 null
2022-07-29 Neural Density-Distance Fields Itsuki Ueda et.al. 2207.14455 link
2022-07-25 DeepFusion: Real-Time Dense 3D Reconstruction for Monocular SLAM using Single-View Depth and Gradient Predictions Tristan Laidlow et.al. 2207.12244 null
2022-07-25 Scalable Fiducial Tag Localization on a 3D Prior Map via Graph-Theoretic Global Tag-Map Registration Kenji Koide et.al. 2207.11942 null
2022-07-22 NeurAR: Neural Uncertainty for Autonomous 3D Reconstruction Yunlong Ran et.al. 2207.10985 null
2022-07-22 Dense RGB-D-Inertial SLAM with Map Deformations Tristan Laidlow et.al. 2207.10940 null
2022-07-22 PLD-SLAM: A Real-Time Visual SLAM Using Points and Line Segments in Dynamic Scenes BaoSheng Zhang et.al. 2207.10916 null
2022-07-21 Multi-Event-Camera Depth Estimation and Outlier Rejection by Refocused Events Fusion Suman Ghosh et.al. 2207.10494 link
2022-07-21 Online Localisation and Colored Mesh Reconstruction Architecture for 3D Visual Feedback in Robotic Exploration Missions Quentin Serdel et.al. 2207.10489 link
2022-07-21 On applicability of von Karman's momentum theory in predicting the water entry load of V-shaped structures with varying initial velocity Yujin Lu et.al. 2207.10413 null
2022-07-19 Hybrid Belief Pruning with Guarantees for Viewpoint-Dependent Semantic SLAM Tuvy Lemberg et.al. 2207.09103 null
2022-07-18 DeFlowSLAM: Self-Supervised Scene Motion Decomposition for Dynamic Dense SLAM Weicai Ye et.al. 2207.08794 link
2022-07-18 Revisiting PatchMatch Multi-View Stereo for Urban 3D Reconstruction Marco Orsingher et.al. 2207.08439 null
2022-07-18 ORB-based SLAM accelerator on SoC FPGA Vibhakar Vemulapati et.al. 2207.08405 null
2022-07-14 Challenges of SLAM in extremely unstructured environments: the DLR Planetary Stereo, Solid-State LiDAR, Inertial Dataset Riccardo Giubilato et.al. 2207.06815 null
2022-07-14 Semi-supervised Vector-Quantization in Visual SLAM using HGCN Amir Zarringhalam et.al. 2207.06738 null
2022-07-14 Self-supervised Vector-Quantization in Visual SLAM using Deep Convolutional Autoencoders Amir Zarringhalam et.al. 2207.06732 null
2022-07-13 SLAM: SLO-Aware Memory Optimization for Serverless Applications Gor Safaryan et.al. 2207.06183 null
2022-07-19 Structure PLP-SLAM: Efficient Sparse Mapping and Localization using Point, Line and Plane for Monocular, RGB-D and Stereo Cameras Fangwen Shu et.al. 2207.06058 link
2022-07-12 Accelerating Certifiable Estimation with Preconditioned Eigensolvers David M. Rosen et.al. 2207.05257 null
2022-07-12 Robust Key-Frame Stereo Visual SLAM with low-threshold Point and Line Features Meiyu Zhi et.al. 2207.05244 null
2022-07-14 SLAM Backends with Objects in Motion: A Unifying Framework and Tutorial Chih-Yuan Chiu et.al. 2207.05043 null
2022-07-08 BlindSpotNet: Seeing Where We Cannot See Taichi Fukuda et.al. 2207.03870 null
2022-07-08 Continuous Target-free Extrinsic Calibration of a Multi-Sensor System from a Sequence of Static Viewpoints Philipp Glira et.al. 2207.03785 null
2022-07-08 Distributed Ranging SLAM for Multiple Robots with Ultra-WideBand and Odometry Measurements Ran Liu et.al. 2207.03700 null
2022-07-07 RWT-SLAM: Robust Visual SLAM for Highly Weak-textured Environments Qihao Peng et.al. 2207.03539 null
2022-07-06 VI-SLAM2tag: Low-Effort Labeled Dataset Collection for Fingerprinting-Based Indoor Localization Marius Laska et.al. 2207.02668 null
2022-07-06 A Novel Hybrid Endoscopic Dataset for Evaluating Machine Learning-based Photometric Image Enhancement Models Axel Garcia-Vega et.al. 2207.02396 null
2022-07-04 VECtor: A Versatile Event-Centric Benchmark for Multi-Sensor SLAM Ling Gao et.al. 2207.01404 null
2022-07-04 VIP-SLAM: An Efficient Tightly-Coupled RGB-D Visual Inertial Planar SLAM Danpeng Chen et.al. 2207.01158 null
2022-07-03 Wireless Channel Prediction in Partially Observed Environments Mingsheng Yin et.al. 2207.00934 null
2022-07-01 A Survey on Active Simultaneous Localization and Mapping: State of the Art and New Frontiers Julio A. Placed et.al. 2207.00254 null
2022-07-01 Keeping Less is More: Point Sparsification for Visual SLAM Yeonsoo Park et.al. 2207.00225 null
2022-06-30 Controlled and impulsive compression of an entrapped air bubble during impact Utkarsh Jain et.al. 2206.15297 null
2022-06-30 Neural Rendering for Stereo 3D Reconstruction of Deformable Tissues in Robotic Surgery Yuehao Wang et.al. 2206.15255 link
2022-06-27 IBISCape: A Simulated Benchmark for multi-modal SLAM Systems Evaluation in Large-scale Dynamic Environments Abanob Soliman et.al. 2206.13455 link
2022-06-26 An Efficient Global Optimality Certificate for Landmark-Based SLAM Connor Holmes et.al. 2206.12961 link
2022-06-21 Object Structural Points Representation for Graph-based Semantic Monocular Localization and Mapping Davide Tateo et.al. 2206.10263 link
2022-06-20 Data Fusion for Radio Frequency SLAM with Robust Sampling Erik Leitinger et.al. 2206.09746 null
2022-06-19 RF-LIO: Removal-First Tightly-coupled Lidar Inertial Odometry in High Dynamic Environments Chenglong Qian et.al. 2206.09463 null
2022-06-17 Efficient WiFi LiDAR SLAM for Autonomous Robots in Large Environments Khairuldanial Ismail et.al. 2206.08733 null
2022-06-17 An Algorithm for the SE(3)-Transformation on Neural Implicit Maps for Remapping Functions Yijun Yuan et.al. 2206.08712 link
2022-06-13 ICP Algorithm: Theory, Practice And Its SLAM-oriented Taxonomy Hao Bai et.al. 2206.06435 null
2022-06-10 Experimental Evaluation of Visual-Inertial Odometry Systems for Arable Farming Javier Cremona et.al. 2206.05066 link
2022-06-09 SparseFormer: Attention-based Depth Completion Network Frederik Warburg et.al. 2206.04557 null
2022-06-07 Robot Self-Calibration Using Actuated 3D Sensors Arne Peters et.al. 2206.03430 null
2022-06-07 Object Scan Context: Object-centric Spatial Descriptor for Place Recognition within 3D Point Cloud Map Haodong Yuan et.al. 2206.03062 null
2022-06-05 DarkSLAM: GAN-assisted Visual SLAM for Reliable Operation in Low-light Conditions Alena Savinykh et.al. 2206.02199 null
2022-06-04 C$^3$Fusion: Consistent Contrastive Colon Fusion, Towards Deep SLAM in Colonoscopy Erez Posner et.al. 2206.01961 null
2022-06-01 PaGO-LOAM: Robust Ground-Optimized LiDAR Odometry Dong-Uk Seo et.al. 2206.00266 link
2022-05-27 A Look at Improving Robustness in Visual-inertial SLAM by Moment Matching Arno Solin et.al. 2205.13821 null
2022-05-31 LAMP 2.0: A Robust Multi-Robot SLAM System for Operation in Challenging Large-Scale Underground Environments Yun Chang et.al. 2205.13135 link
2022-05-25 Wildcat: Online Continuous-Time 3D Lidar-Inertial SLAM Milad Ramezani et.al. 2205.12595 null
2022-05-24 Loop Closure Prioritization for Efficient and Scalable Multi-Robot SLAM Christopher E. Denniston et.al. 2205.12402 link
2022-05-22 ALITA: A Large-scale Incremental Dataset for Long-term Autonomy Peng Yin et.al. 2205.10737 link
2022-05-19 FogROS 2: An Adaptive and Extensible Platform for Cloud and Fog Robotics Using ROS 2 Jeffrey Ichnowski et.al. 2205.09778 link
2022-05-17 Global Data Association for SLAM with 3D Grassmannian Manifold Objects Parker C. Lusk et.al. 2205.08556 null
2022-05-19 Cluster on Wheels Yuanyuan Yang et.al. 2205.08151 null
2022-05-12 Dynamic Dense RGB-D SLAM using Learning-based Visual Odometry Shihao Shen et.al. 2205.05916 link
2022-05-12 S3E-GNN: Sparse Spatial Scene Embedding with Graph Neural Networks for Camera Relocalization Ran Cheng et.al. 2205.05861 null
2022-05-14 Multi-modal Semantic SLAM for Complex Dynamic Environments Han Wang et.al. 2205.04300 link
2022-05-06 OROS: Orchestrating ROS-driven Collaborative Connected Robots in Mission-Critical Operations Carmen Delgado et.al. 2205.03256 null
2022-05-05 CNN-Augmented Visual-Inertial SLAM with Planar Constraints Pan Ji et.al. 2205.02940 null
2022-05-05 PMBM-based SLAM Filters in 5G mmWave Vehicular Networks Hyowon Kim et.al. 2205.02502 null
2022-05-04 BodySLAM: Joint Camera Localisation, Mapping, and Human Motion Tracking Dorian Henning et.al. 2205.02301 null
2022-05-04 A Global Asymptotic Convergent Observer for SLAM Seyed Hamed Hashemi et.al. 2205.01953 null
2022-05-04 Symmetry and Uncertainty-Aware Object SLAM for 6DoF Object Pose Estimation Nathaniel Merrill et.al. 2205.01823 link
2022-05-03 GeoRefine: Self-Supervised Online Depth Refinement for Accurate Dense Mapping Pan Ji et.al. 2205.01656 null
2022-04-29 Struct-MDC: Mesh-Refined Unsupervised Depth Completion Leveraging Structural Regularities from Visual SLAM Jinwoo Jeon et.al. 2204.13877 link
2022-04-27 The Revisiting Problem in Simultaneous Localization and Mapping: A Survey on Visual Loop Closure Detection Konstantinos A. Tsintotas et.al. 2204.12831 null
2022-04-27 Dynamic Registration: Joint Ego Motion Estimation and 3D Moving Object Detection in Dynamic Environment Wenyu Li et.al. 2204.12769 null
2022-04-29 MLO: Multi-Object Tracking and Lidar Odometry in Dynamic Environment Tingchen Ma et.al. 2204.11621 null
2022-04-23 Indoor simultaneous localization and mapping based on fringe projection profilometry Yang Zhao et.al. 2204.11020 null
2022-04-22 Enough is Enough: Towards Autonomous Uncertainty-driven Stopping Criteria Julio A. Placed et.al. 2204.10631 null
2022-04-22 Fast Autonomous Robotic Exploration Using the Underlying Graph Structure Julio A. Placed et.al. 2204.10610 null
2022-04-22 Making Parameterization and Constrains of Object Landmark Globally Consistent via SPD(3) Manifold and Improved Cost Functions Yutong Hu et.al. 2204.10552 null
2022-04-22 Implicit Object Mapping With Noisy Data Jad Abou-Chakra et.al. 2204.10516 link
2022-04-19 Photometric single-view dense 3D reconstruction in endoscopy Victor M. Batlle et.al. 2204.09083 null
2022-04-18 Pulsar skips: Understanding variations in the regular periods of rotating neutron stars Clayton Miller et.al. 2204.08449 null
2022-04-18 Tracking monocular camera pose and deformation for SLAM inside the human body Juan J. Gomez Rodriguez et.al. 2204.08309 null
2022-04-18 Mapping While Following: 2D LiDAR SLAM in Indoor Dynamic Environments with a Person Tracker Hanjing Ye et.al. 2204.08163 null
2022-04-14 ViViD++: Vision for Visibility Dataset Alex Junho Lee et.al. 2204.06183 null
2022-04-12 HiTPR: Hierarchical Transformer for Place Recognition in Point Cloud Zhixing Hou et.al. 2204.05481 null
2022-04-12 RGB-D Semantic SLAM for Surgical Robot Navigation in the Operating Room Cong Gao et.al. 2204.05467 null
2022-04-11 Optimized SC-F-LOAM: Optimized Fast LiDAR Odometry and Mapping Using Scan Context Lizhou Liao et.al. 2204.04932 link
2022-04-04 Monitoring social distancing with single image depth estimation Alessio Mingozzi et.al. 2204.01693 null
2022-04-01 Bi-directional Loop Closure for Visual SLAM Ihtisham Ali et.al. 2204.01524 null
2022-04-04 IMOT: General-Purpose, Fast and Robust Estimation for Spatial Perception Problems with Outliers Lei Sun et.al. 2204.01324 link
2022-04-03 Indoor Navigation Assistance for Visually Impaired People via Dynamic SLAM and Panoptic Segmentation with an RGB-D Sensor Wenyan Ou et.al. 2204.01154 null
2022-04-02 UrbanFly: Uncertainty-Aware Planning for Navigation Amongst High-Rises with Monocular Visual-Inertial SLAM Maps Ayyappa Swamy Thatavarthy et.al. 2204.00865 link
2022-03-31 Curiosity Driven Self-supervised Tactile Exploration of Unknown Objects Yujie Lu et.al. 2204.00035 null
2022-03-30 GTP-SLAM: Game-Theoretic Priors for Simultaneous Localization and Mapping in Multi-Agent Scenarios Chih-Yuan Chiu et.al. 2203.16690 null
2022-03-29 Indoor SLAM Using a Foot-mounted IMU and the local Magnetic Field Mostafa Osman et.al. 2203.15866 null
2022-03-29 Eventor: An Efficient Event-Based Monocular Multi-View Stereo Accelerator on FPGA Platform Mingjun Li et.al. 2203.15439 null
2022-03-29 Sparse Image based Navigation Architecture to Mitigate the need of precise Localization in Mobile Robots Pranay Mathur et.al. 2203.15272 null
2022-03-28 Are High-Resolution Event Cameras Really Needed? Daniel Gehrig et.al. 2203.14672 null
2022-03-25 Spectral Measurement Sparsification for Pose-Graph SLAM Kevin J. Doherty et.al. 2203.13897 link
2022-03-25 FD-SLAM: 3-D Reconstruction Using Features and Dense Matching Xingrui Yang et.al. 2203.13861 null
2022-03-25 Gravity-constrained point cloud registration Vladimír Kubelka et.al. 2203.13799 null
2022-03-24 MD-SLAM: Multi-cue Direct SLAM Luca Di Giammarino et.al. 2203.13237 link
2022-03-24 Unsupervised Simultaneous Learning for Camera Re-Localization and Depth Estimation from Video Shun Taguchi et.al. 2203.12804 null
2022-03-19 Hybrid Active and Passive Sensing for SLAM in Wireless Communication Systems Jie Yang et.al. 2203.10267 null
2022-03-16 Any Way You Look At It: Semantic Crossview Localization and Mapping with LiDAR Ian D. Miller et.al. 2203.08925 link
2022-03-15 Neural RF SLAM for unsupervised positioning and mapping with channel state information Shreya Kadambi et.al. 2203.08264 null
2022-03-15 Simultaneous Localisation and Mapping with Quadric Surfaces Tristan Laidlow et.al. 2203.08040 null
2022-03-14 Drift Reduced Navigation with Deep Explainable Features Mohd Omama et.al. 2203.06897 link
2022-03-11 An Efficient Accelerator for Deep Learning-based Point Cloud Registration on FPGAs Keisuke Sugiura et.al. 2203.05763 null
2022-03-10 High Definition, Inexpensive, Underwater Mapping Bharat Joshi et.al. 2203.05640 link
2022-03-10 SelfTune: Metrically Scaled Monocular Depth Estimation through Self-Supervised Learning Jaehoon Choi et.al. 2203.05332 null
2022-03-08 Tune your Place Recognition: Self-Supervised Domain Calibration via Robust SLAM Pierre-Yves Lajoie et.al. 2203.04446 link
2022-03-08 SLAM-Supported Self-Training for 6D Object Pose Estimation Ziqi Lu et.al. 2203.04424 link
2022-03-08 An Online Semantic Mapping System for Extending and Enhancing Visual SLAM Thorsten Hempel et.al. 2203.03944 null
2022-03-07 Multi-Modal Lidar Dataset for Benchmarking General-Purpose Localization and Mapping Algorithms Qingqing Li et.al. 2203.03454 link
2022-03-07 OverlapTransformer: An Efficient and Rotation-Invariant Transformer Network for LiDAR-Based Place Recognition Junyi Ma et.al. 2203.03397 link
2022-03-06 Minimum Cost Multicuts for Incorrect Landmark Edge Detection in Pose-graph SLAM Kazushi Aiba et.al. 2203.02887 null
2022-03-06 RGB-D SLAM in Indoor Planar Environments with Multiple Large Dynamic Objects Ran Long et.al. 2203.02882 null
2022-03-03 STUN: Self-Teaching Uncertainty Estimation for Place Recognition Kaiwen Cai et.al. 2203.01851 link
2022-03-03 Continual SLAM: Beyond Lifelong Simultaneous Localization and Mapping through Continual Learning Niclas Vödisch et.al. 2203.01578 link
2022-03-02 FAST-LIVO: Fast and Tightly-coupled Sparse-Direct LiDAR-Inertial-Visual Odometry Chunran Zheng et.al. 2203.00893 link
2022-03-02 Distributed Riemannian Optimization with Lazy Communication for Collaborative Geometric Estimation Yulun Tian et.al. 2203.00851 null
2022-03-01 Descriptellation: Deep Learned Constellation Descriptors for SLAM Chunwei Xing et.al. 2203.00567 null
2022-03-01 Collaborative Robot Mapping using Spectral Graph Analysis Lukas Bernreiter et.al. 2203.00308 null
2022-02-26 RL-PGO: Reinforcement Learning-based Planar Pose-Graph Optimization Nikolaos Kourtzanidis et.al. 2202.13221 link
2022-02-25 Probabilistic Data Association for Semantic SLAM at Scale Elad Michael et.al. 2202.12802 link
2022-02-24 TwistSLAM: Constrained SLAM in Dynamic Environment Mathieu Gonzalez et.al. 2202.12384 null
2022-02-24 Light Robust Monocular Depth Estimation For Outdoor Environment Via Monochrome And Color Camera Fusion Hyeonsoo Jang et.al. 2202.12108 null
2022-02-23 MITI: SLAM Benchmark for Laparoscopic Surgery Regine Hartwig et.al. 2202.11496 null
2022-02-23 DL-SLOT: Dynamic Lidar SLAM and Object Tracking Based On Graph Optimization Xuebo Tian et.al. 2202.11431 null
2022-02-23 Are We Ready for Robust and Resilient SLAM? A Framework For Quantitative Characterization of SLAM Datasets Islam Ali et.al. 2202.11312 null
2022-02-22 SAGE: SLAM with Appearance and Geometry Prior for Endoscopy Xingtong Liu et.al. 2202.09487 link
2022-02-18 OKVIS2: Realtime Scalable Visual-Inertial SLAM with Loop Closure Stefan Leutenegger et.al. 2202.09199 null
2022-02-18 MultiRes-NetVLAD: Augmenting Place Recognition Training with Low-Resolution Imagery Ahmad Khaliq et.al. 2202.09146 link
2022-02-18 An Energy-Efficient and Runtime-Reconfigurable FPGA-Based Accelerator for Robotic Localization Systems Qiang Liu et.al. 2202.08952 null
2022-02-17 Continuous-Time vs. Discrete-Time Vision-based SLAM: A Comparative Study Giovanni Cioffi et.al. 2202.08894 link
2022-02-17 LiDAR-Inertial 3D SLAM with Plane Constraint for Multi-story Building Jiashi Zhang et.al. 2202.08487 null
2022-02-16 Virtual Maps for Autonomous Exploration of Cluttered Underwater Environments Jinkun Wang et.al. 2202.08359 null
2022-02-11 Overhead Image Factors for Underwater Sonar-based SLAM John McConnell et.al. 2202.05811 null
2022-02-10 Scale Estimation with Dual Quadrics for Monocular Object SLAM Shuangfu Song et.al. 2202.04816 null
2022-02-08 A Novel Image Descriptor with Aggregated Semantic Skeleton Representation for Long-term Visual Place Recognition Nie Jiwei et.al. 2202.03677 null
2022-01-25 Autonomous Vehicles: Open-Source Technologies, Considerations, and Development Oussama Saoudi et.al. 2202.03148 null
2022-02-07 Temporal Point Cloud Completion with Pose Disturbance Jieqi Shi et.al. 2202.03084 null
2022-02-04 DYP-SLAM: A Real-time Visual SLAM Based on YOLO and Probability in Dynamic Environments Xinggang Hu et.al. 2202.01938 null
2022-02-01 A Model for Multi-View Residual Covariances based on Perspective Deformation Alejandro Fontan et.al. 2202.00765 null
2022-01-30 Joint Vehicular Localization and Reflective Mapping Based on Team Channel-SLAM Xinghe Chu et.al. 2201.12726 null
2022-01-28 RGB-D SLAM Using Attention Guided Frame Association Ali Caglayan et.al. 2201.12047 null
2022-02-04 Learning to Act with Affordance-Aware Multimodal Neural SLAM Zhiwei Jia et.al. 2201.09862 link
2022-01-22 Phase-SLAM: Phase Based Simultaneous Localization and Mapping for Mobile Structured Light Illumination Systems Xi Zheng et.al. 2201.09048 link
2022-01-17 SC-LiDAR-SLAM: a Front-end Agnostic Versatile LiDAR SLAM System Giseop Kim et.al. 2201.06423 null
2022-01-14 SRVIO: Super Robust Visual Inertial Odometry for dynamic environments and challenging Loop-closure conditions Ali Samadzadeh et.al. 2201.05386 link
2022-01-19 Multi-Hypothesis Scan Matching through Clustering Giorgio Iavicoli et.al. 2201.03814 null
2022-01-11 Performance Guarantees for Spectral Initialization in Rotation Averaging and Pose-Graph SLAM Kevin J. Doherty et.al. 2201.03773 null
2022-01-10 High-resolution Ecosystem Mapping in Repetitive Environments Using Dual Camera SLAM Brian M. Hopkinson et.al. 2201.03364 link
2022-01-10 Why-So-Deep: Towards Boosting Previously Trained Models for Visual Place Recognition M. Usman Maqbool Bhutta et.al. 2201.03212 link
2022-01-04 Formulations of Hydrodynamic Force in the Transition Stage of the Water Entry of Linear Wedges with Constant and Varying Speeds Xueliang Wen et.al. 2201.00959 null
2021-12-29 Efficient Belief Space Planning in High-Dimensional State Spaces using PIVOT: Predictive Incremental Variable Ordering Tactic Khen Elimelech et.al. 2112.14428 null
2021-12-19 M2DGR: A Multi-sensor and Multi-scenario SLAM Dataset for Ground Robots Jie Yin et.al. 2112.13659 link
2021-12-27 UV-SLAM: Unconstrained Line-based SLAM Using Vanishing Points for Structural Mapping Hyunjun Lim et.al. 2112.13515 link
2021-12-25 Simultaneous Location of Rail Vehicles and Mapping of Environment with Multiple LiDARs Yusheng Wang et.al. 2112.13224 null
2021-12-25 Edge Robotics: Edge-Computing-Accelerated Multi-Robot Simultaneous Localization and Mapping Peng Huang et.al. 2112.13222 null
2021-12-24 3D Point Cloud Reconstruction and SLAM as an Input Ziyu Li et.al. 2112.12907 null
2021-12-22 NICE-SLAM: Neural Implicit Scalable Encoding for SLAM Zihan Zhu et.al. 2112.12130 link
2021-12-18 Fast and Robust Registration of Partially Overlapping Point Clouds Eduardo Arnold et.al. 2112.09922 link
2021-12-17 Symmetry-aware Neural Architecture for Embodied Visual Navigation Shuang Liu et.al. 2112.09515 null
2021-12-27 Homography Decomposition Networks for Planar Object Tracking Xinrui Zhan et.al. 2112.07909 link
2021-12-14 Autonomous Navigation System from Simultaneous Localization and Mapping Micheal Caracciolo et.al. 2112.07723 link
2021-12-12 360-DFPE: Leveraging Monocular 360-Layouts for Direct Floor Plan Estimation Bolivar Solarte et.al. 2112.06180 link
2021-12-11 Simultaneous Localization and Mapping: Through the Lens of Nonlinear Optimization Amay Saxena et.al. 2112.05921 null
2021-12-07 Hybrid Visual SLAM for Underwater Vehicle Manipulator Systems Gideon Billings et.al. 2112.03826 link
2021-12-05 Iterated Posterior Linearization PMB Filter for 5G SLAM Yu Ge et.al. 2112.02575 null
2021-12-03 Fast Direct Stereo Visual SLAM Jiawei Mo et.al. 2112.01890 link
2021-12-02 MegBA: A High-Performance and Distributed Library for Large-Scale Bundle Adjustment Jie Ren et.al. 2112.01349 link
2021-12-01 Research on Event Accumulator Settings for Event-Based SLAM Kun Xiao et.al. 2112.00427 link
2021-11-29 An in-depth experimental study of sensor usage and visual reasoning of robots navigating in real environments Assem Sadek et.al. 2111.14666 null
2021-11-29 Deployment of Aerial Robots after a major fire of an industrial hall with hazardous substances, a report Hartmut Surmann et.al. 2111.14542 null
2021-11-24 Automatic Mapping with Obstacle Identification for Indoor Human Mobility Assessment V. Ayala-Alfaro et.al. 2111.12690 null
2021-11-24 Autonomous bot with ML-based reactive navigation for indoor environment Yash Srivastava et.al. 2111.12542 null
2021-11-22 A General Framework for Lifelong Localization and Mapping in Changing Environment Min Zhao et.al. 2111.10946 link
2021-11-17 Probabilistic Spatial Distribution Prior Based Attentional Keypoints Matching Network Xiaoming Zhao et.al. 2111.09006 null
2021-11-10 Comparing dominance of tennis' big three via multiple-output Bayesian quantile regression models Bruno Santos et.al. 2111.05631 null
2021-11-10 TomoSLAM: factor graph optimization for rotation angle refinement in microtomography Mark Griguletskii et.al. 2111.05562 null
2021-11-07 Hierarchical Segment-based Optimization for SLAM Yuxin Tian et.al. 2111.04101 null
2021-11-07 Online Mutual Adaptation of Deep Depth Prediction and Visual SLAM Shing Yan Loo et.al. 2111.04096 null
2021-11-05 MSC-VO: Exploiting Manhattan and Structural Constraints for Visual Odometry Joan P. Company-Corcoles et.al. 2111.03408 null
2021-10-31 Loop closure detection using local 3D deep descriptors Youjie Zhou et.al. 2111.00440 link
2021-10-27 Millimeter Wave Wireless Assisted Robot Navigation with Link State Classification Mingsheng Yin et.al. 2110.14789 link
2021-10-27 Efficient Placard Discovery for Semantic Mapping During Frontier Exploration David Balaban et.al. 2110.14742 null
2021-10-26 Robust Multi-view Registration of Point Sets with Laplacian Mixture Model Jin Zhang et.al. 2110.13744 null
2021-10-25 WOLF: A modular estimation framework for robotics based on factor graphs Joan Sola et.al. 2110.12919 null
2021-10-21 Real-Time Ground-Plane Refined LiDAR SLAM Fan Yang et.al. 2110.11517 null
2021-10-21 SymbioLCD: Ensemble-Based Loop Closure Detection using CNN-Extracted Objects and Visual Bag-of-Words Jonathan J. Y. Kim et.al. 2110.11491 null
2021-10-21 InterpolationSLAM: A Novel Robust Visual SLAM System in Rotational Motion Zhenkun Zhu et.al. 2110.11040 null
2021-10-20 SLAM: A Unified Encoder for Speech and Language Modeling via Speech-Text Joint Pre-Training Ankur Bapna et.al. 2110.10329 null
2021-10-18 Enhancing exploration algorithms for navigation with visual SLAM Kirill Muravyev et.al. 2110.09156 null
2021-10-18 Accurate and Robust Object-oriented SLAM with 3D Quadric Landmark Construction in Outdoor Environment Rui Tian et.al. 2110.08977 null
2021-10-16 Partial Hierarchical Pose Graph Optimization for SLAM Alexander Korovko et.al. 2110.08639 null
2021-10-14 Active SLAM over Continuous Trajectory and Control: A Covariance-Feedback Approach Shumon Koga et.al. 2110.07546 null
2021-10-13 Collaborative Radio SLAM for Multiple Robots based on WiFi Fingerprint Similarity Ran Liu et.al. 2110.06541 null
2021-10-12 Learning Efficient Multi-Agent Cooperative Visual Exploration Chao Yu et.al. 2110.05734 null
2021-10-07 Self-Supervised Depth Completion for Active Stereo Frederik Warburg et.al. 2110.03234 null
2021-10-06 InterpolationSLAM: A Novel Robust Visual SLAM System in Rotating Scenes Zhenkun Zhu et.al. 2110.02593 null
2021-10-03 AEROS: Adaptive RObust least-Squares for Graph-Based SLAM Milad Ramezani et.al. 2110.02018 null
2021-10-04 Fast Uncertainty Quantification for Active Graph SLAM Julio A. Placed et.al. 2110.01289 link
2021-10-04 Geometry-based Graph Pruning for Lifelong SLAM Gerhard Kurz et.al. 2110.01286 null
2021-10-03 Quadrotor Control on $SU(2)\times R^3$ with SLAM Integration Marcus Greiff et.al. 2110.01099 null
2021-10-02 Online Incremental Non-Gaussian Inference for SLAM Using Normalizing Flows Qiangqiang Huang et.al. 2110.00876 link

SFM

Publish Date Title Authors PDF Code
2025-04-24 Dynamic Camera Poses and Where to Find Them Chris Rockwell et.al. 2504.17788 null
2025-04-24 EdgePoint2: Compact Descriptors for Superior Efficiency and Accuracy Haodi Yao et.al. 2504.17280 null
2025-04-23 A Low-Cost Photogrammetry System for 3D Plant Modeling and Phenotyping Joe Hrzich et.al. 2504.16840 null
2025-04-23 PRaDA: Projective Radial Distortion Averaging Daniil Sinitsyn et.al. 2504.16499 null
2025-04-21 Traversing the Star-Forming Main Sequence with Molecular Gas Stacks of z~1.6 Cluster Galaxies Alex Pigarelli et.al. 2504.15381 null
2025-04-21 Towards Understanding Camera Motions in Any Video Zhiqiu Lin et.al. 2504.15376 null
2025-04-21 StableQuant: Layer Adaptive Post-Training Quantization for Speech Foundation Models Yeona Hong et.al. 2504.14915 null
2025-04-17 Volume Encoding Gaussians: Transfer Function-Agnostic 3D Gaussians for Volume Rendering Landon Dyken et.al. 2504.13339 null
2025-04-15 EDGS: Eliminating Densification for Efficient Convergence of 3DGS Dmytro Kotovenko et.al. 2504.13204 null
2025-04-15 Deep Learning-based Bathymetry Retrieval without In-situ Depths using Remote Sensing Imagery and SfM-MVS DSMs with Data Gaps Panagiotis Agrafiotis et.al. 2504.11416 link
2025-04-12 A Constrained Optimization Approach for Gaussian Splatting from Coarsely-posed Images and Noisy Lidar Point Clouds Jizong Peng et.al. 2504.09129 null
2025-04-11 Stereophotoclinometry Revisited Travis Driver et.al. 2504.08252 null
2025-04-08 Implementation of a Zed 2i Stereo Camera for High-Frequency Shoreline Change and Coastal Elevation Monitoring José A. Pilartes-Congo et.al. 2504.06464 null
2025-04-07 Decoding the variability in the star-formation histories of z ~ 0.8 galaxies Jenny T. Wan et.al. 2504.05281 null
2025-04-05 3R-GS: Best Practice in Optimizing Camera Poses Along with 3DGS Zhisheng Huang et.al. 2504.04294 null
2025-04-04 An Algebraic Geometry Approach to Viewing Graph Solvability Federica Arrigoni et.al. 2504.03637 null
2025-04-04 Endo3R: Unified Online Reconstruction from Dynamic Monocular Endoscopic Video Jiaxin Guo et.al. 2504.03198 null
2025-04-03 Adaptive Frequency Enhancement Network for Remote Sensing Image Semantic Segmentation Feng Gao et.al. 2504.02647 link
2025-04-09 FIORD: A Fisheye Indoor-Outdoor Dataset with LIDAR Ground Truth for 3D Scene Reconstruction and Benchmarking Ulas Gunes et.al. 2504.01732 null
2025-03-31 LITA-GS: Illumination-Agnostic Novel View Synthesis via Reference-Free 3D Gaussian Splatting and Physical Priors Han Zhou et.al. 2504.00219 null
2025-03-30 AnyCam: Learning to Recover Camera Poses and Intrinsics from Casual Videos Felix Wimbauer et.al. 2503.23282 link
2025-03-24 Ground Penetrating Radar-Assisted Multimodal Robot Odometry Using Subsurface Feature Matrix Haifeng Li et.al. 2503.18301 null
2025-03-22 3D Modeling: Camera Movement Estimation and path Correction for SFM Model using the Combination of Modified A-SIFT and Stereo System Usha Kumari et.al. 2503.17668 null
2025-03-25 ProtoGS: Efficient and High-Quality Rendering with 3D Gaussian Prototypes Zhengqing Gao et.al. 2503.17486 null
2025-03-21 ColabSfM: Collaborative Structure-from-Motion by Point Cloud Registration Johan Edstedt et.al. 2503.17093 null
2025-03-20 From Monocular Vision to Autonomous Action: Guiding Tumor Resection via 3D Reconstruction Ayberk Acar et.al. 2503.16263 null
2025-03-22 Euclid Quick Data Release (Q1). A first view of the star-forming main sequence in the Euclid Deep Fields Euclid Collaboration et.al. 2503.15314 null
2025-03-18 Multi-view Reconstruction via SfM-guided Monocular Depth Estimation Haoyu Guo et.al. 2503.14483 null
2025-03-18 A-SCoRe: Attention-based Scene Coordinate Regression for wide-ranging scenarios Huy-Hoang Bui et.al. 2503.13982 link
2025-03-17 Improving Geometric Consistency for 360-Degree Neural Radiance Fields in Indoor Scenarios Iryna Repinetska et.al. 2503.13710 null
2025-03-17 Gaussian On-the-Fly Splatting: A Progressive Framework for Robust Near Real-Time 3DGS Optimization Yiwei Xu et.al. 2503.13086 null
2025-03-15 SFMNet: Sparse Focal Modulation for 3D Object Detection Oren Shrout et.al. 2503.12093 null
2025-03-11 A Framework for Reducing the Complexity of Geometric Vision Problems and its Application to Two-View Triangulation with Approximation Bounds Felix Rydell et.al. 2503.08142 null
2025-03-11 DaD: Distilled Reinforcement Learning for Diverse Keypoint Detection Johan Edstedt et.al. 2503.07347 link
2025-03-18 Endo-FASt3r: Endoscopic Foundation model Adaptation for Structure from motion Mona Sheikh Zeinoddin et.al. 2503.07204 null
2025-03-10 VidBot: Learning Generalizable 3D Actions from In-the-Wild 2D Human Videos for Zero-Shot Robotic Manipulation Hanzhi Chen et.al. 2503.07135 null
2025-03-09 AxisPose: Model-Free Matching-Free Single-Shot 6D Object Pose Estimation via Axis Generation Yang Zou et.al. 2503.06660 null
2025-03-07 LiDAR-enhanced 3D Gaussian Splatting Mapping Jian Shen et.al. 2503.05425 null
2025-03-06 PLMP -- Point-Line Minimal Problems for Projective SfM Kim Kiehn et.al. 2503.04351 null
2025-03-03 MUSt3R: Multi-view Network for Stereo 3D Reconstruction Yohann Cabon et.al. 2503.01661 null
2025-03-03 ecg2o: A Seamless Extension of g2o for Equality-Constrained Factor Graph Optimization Anas Abdelkarim et.al. 2503.01311 link
2025-03-05 A Multi-Sensor Fusion Approach for Rapid Orthoimage Generation in Large-Scale UAV Mapping Jialei He et.al. 2503.01202 null
2025-03-02 MTReD: 3D Reconstruction Dataset for Fly-over Videos of Maritime Domain Rui Yi Yong et.al. 2503.00853 null
2025-03-02 PSRGS: Progressive Spectral Residual of 3D Gaussian for High-Frequency Recovery BoCheng Li et.al. 2503.00848 null
2025-03-02 Multi-Cali Anything: Dense Feature Multi-Frame Structure-from-Motion for Large-Scale Camera Array Calibration Jinjiang You et.al. 2503.00737 link
2025-02-28 The THESAN-ZOOM project: Burst, quench, repeat -- unveiling the evolution of high-redshift galaxies along the star-forming main sequence William McClymont et.al. 2503.00106 null
2025-02-27 Best Foot Forward: Robust Foot Reconstruction in-the-wild Kyle Fogarty et.al. 2502.20511 null
2025-02-26 SLAM in the Dark: Self-Supervised Learning of Pose, Depth and Loop-Closure from Thermal Images Yangfan Xu et.al. 2502.18932 null
2025-03-04 Unposed Sparse Views Room Layout Reconstruction in the Age of Pretrain Model Yaxuan Huang et.al. 2502.16779 null
2025-02-20 CDGS: Confidence-Aware Depth Regularization for 3D Gaussian Splatting Qilin Zhang et.al. 2502.14684 link
2025-02-19 Structure-from-Sherds++: Robust Incremental 3D Reassembly of Axially Symmetric Pots from Unordered and Mixed Fragment Collections Seong Jong Yoo et.al. 2502.13986 null
2025-02-19 IM360: Textured Mesh Reconstruction for Large-scale Indoor Mapping with 360$^\circ$ Cameras Dongki Jung et.al. 2502.12545 null
2025-02-12 Causal Analysis of ASR Errors for Children: Quantifying the Impact of Physiological, Cognitive, and Extrinsic Factors Vishwanath Pratap Singh et.al. 2502.08587 null
2025-02-10 FOCUS -- Multi-View Foot Reconstruction From Synthetically Trained Dense Correspondences Oliver Boyne et.al. 2502.06367 link
2025-02-09 Audio-Visual Representation Learning via Knowledge Distillation from Speech Foundation Models Jing-Xuan Zhang et.al. 2502.05766 link
2025-02-10 Building Rome with Convex Optimization Haoyu Han et.al. 2502.04640 null
2025-02-04 SiLVR: Scalable Lidar-Visual Radiance Field Reconstruction with Uncertainty Quantification Yifu Tao et.al. 2502.02657 null
2025-02-05 GP-GS: Gaussian Processes for Enhanced Gaussian Splatting Zhihao Guo et.al. 2502.02283 link
2025-02-03 XR-VIO: High-precision Visual Inertial Odometry with Fast Initialization for XR Applications Shangjin Zhai et.al. 2502.01297 null
2025-01-29 Segmentation-Aware Generative Reinforcement Network (GRN) for Tissue Layer Segmentation in 3-D Ultrasound Images for Chronic Low-back Pain (cLBP) Assessment Zixue Zeng et.al. 2501.17690 link
2025-01-28 Automatic Calibration of a Multi-Camera System with Limited Overlapping Fields of View for 3D Surgical Scene Reconstruction Tim Flückiger et.al. 2501.16221 null
2025-01-25 Towards Better Robustness: Progressively Joint Pose-3DGS Learning for Arbitrarily Long Videos Zhen-Hui Dong et.al. 2501.15096 null
2025-01-24 MATCHA: Towards Matching Anything Fei Xue et.al. 2501.14945 null
2025-01-24 Light3R-SfM: Towards Feed-forward Structure-from-Motion Sven Elflein et.al. 2501.14914 null
2025-01-24 Dense-SfM: Structure from Motion with Dense Consistent Matching JongMin Lee et.al. 2501.14277 null
2025-01-21 Theory of quantum-geometric charge and spin Josephson diode effects in strongly spin-polarized hybrid structures with noncoplanar spin textures Niklas L. Schulz et.al. 2501.12232 null
2025-01-14 Selective Attention Merging for low resource tasks: A case study of Child ASR Natarajan Balaji Shankar et.al. 2501.08468 link
2025-01-14 SplatMAP: Online Dense Monocular SLAM with 3D Gaussian Splatting Yue Hu et.al. 2501.07015 null
2025-02-02 CULTURE3D: Cultural Landmarks and Terrain Dataset for 3D Applications Xinyi Zheng et.al. 2501.06927 link
2025-01-11 Aug3D: Augmenting large scale outdoor datasets for Generalizable Novel View Synthesis Aditya Rauniyar et.al. 2501.06431 null
2025-01-09 Existence of dynamical fluctuation in AMPT generated data for Au+Au collisions at 10 AGeV Somen Gope et.al. 2501.05175 null
2025-01-06 Targetless Intrinsics and Extrinsic Calibration of Multiple LiDARs and Cameras with IMU using Continuous-Time Estimation Yuezhang Lv et.al. 2501.02821 null
2025-01-02 On Unifying Video Generation and Camera Pose Estimation Chun-Hao Paul Huang et.al. 2501.01409 null
2025-01-02 EasySplat: View-Adaptive Learning makes 3D Gaussian Splatting Easy Ao Gao et.al. 2501.01003 null
2024-12-30 KeyGS: A Keyframe-Centric Gaussian Splatting Method for Monocular Image Sequences Keng-Wei Chang et.al. 2412.20767 null
2024-12-27 Dust to Tower: Coarse-to-Fine Photo-Realistic Scene Reconstruction from Sparse Uncalibrated Images Xudong Cai et.al. 2412.19518 null
2024-12-25 Structured Speaker-Deficiency Adaptation of Foundation Models for Dysarthric and Elderly Speech Recognition Shujie Hu et.al. 2412.18832 null
2024-12-23 Reconstructing People, Places, and Cameras Lea Müller et.al. 2412.17806 null
2024-12-18 Foundation Models Meet Low-Cost Sensors: Test-Time Adaptation for Rescaling Disparity for Zero-Shot Metric Depth Estimation Rémi Marsal et.al. 2412.14103 null
2024-12-16 Speech Foundation Models and Crowdsourcing for Efficient, High-Quality Data Collection Beomseok Lee et.al. 2412.11978 null
2024-12-18 SplineGS: Robust Motion-Adaptive Spline for Real-Time Dynamic 3D Gaussians from Monocular Video Jongmin Park et.al. 2412.09982 null
2024-12-12 CoDTS: Enhancing Sparsely Supervised Collaborative Perception with a Dual Teacher-Student Framework Yushan Han et.al. 2412.08344 null
2024-12-10 Deep Non-rigid Structure-from-Motion Revisited: Canonicalization and Sequence Modeling Hui Deng et.al. 2412.07230 null
2024-12-08 Unveiling True Talent: The Soccer Factor Model for Skill Evaluation Alexandre Andorra et.al. 2412.05911 null
2024-12-08 Doppelgangers++: Improved Visual Disambiguation with Geometric 3D Features Yuanbo Xiangli et.al. 2412.05826 null
2024-12-06 MegaSaM: Accurate, Fast, and Robust Structure and Motion from Casual Dynamic Videos Zhengqi Li et.al. 2412.04463 null
2024-12-03 ASANet: Asymmetric Semantic Aligning Network for RGB and SAR image land cover classification Pan Zhang et.al. 2412.02044 link
2024-12-02 SfM-Free 3D Gaussian Splatting via Hierarchical Training Bo Ji et.al. 2412.01553 link
2024-12-02 MVImgNet2.0: A Larger-scale Dataset of Multi-view Images Xiaoguang Han et.al. 2412.01430 null
2024-12-02 TAS-TsC: A Data-Driven Framework for Estimating Time of Arrival Using Temporal-Attribute-Spatial Tri-space Coordination of Truck Trajectories Mengran Li et.al. 2412.01122 null
2024-12-02 Look Ma, No Ground Truth! Ground-Truth-Free Tuning of Structure from Motion and Visual SLAM Alejandro Fontan et.al. 2412.01116 null
2024-11-27 RoMo: Robust Motion Segmentation Improves Structure from Motion Lily Goli et.al. 2411.18650 null
2024-11-26 The MAGPI Survey: radial trends in star formation across different cosmological simulations in comparison with observations at $z \sim 0.3$ Marcie Mun et.al. 2411.17882 null
2024-11-25 Characterizing Stellar and Gas Properties in NGC 628: Spatial Distributions, Radial Gradients, and Resolved Scaling Relations Peng Wei et.al. 2411.16150 null
2024-11-24 ZeroGS: Training 3D Gaussian Splatting from Unposed Images Yu Chen et.al. 2411.15779 null
2024-11-20 DATAP-SfM: Dynamic-Aware Tracking Any Point for Robust Structure from Motion in the Wild Weicai Ye et.al. 2411.13291 null
2024-11-15 SPARS3R: Semantic Prior Alignment and Regularization for Sparse 3D Reconstruction Yutao Tang et.al. 2411.12592 link
2024-11-15 The Oxford Spires Dataset: Benchmarking Large-Scale LiDAR-Visual Localisation, Reconstruction and Radiance Field Methods Yifu Tao et.al. 2411.10546 null
2024-11-13 4D Gaussian Splatting in the Wild with Uncertainty-Aware Regularization Mijeong Kim et.al. 2411.08879 null
2024-11-13 Biomass phenotyping of oilseed rape through UAV multi-view oblique imaging with 3DGS and SAM model Yutao Shen et.al. 2411.08453 null
2024-11-08 From Transparent to Opaque: Rethinking Neural Implicit Surfaces with $α$-NeuS Haoran Zhang et.al. 2411.05362 link
2024-10-29 A Cascade Approach for APT Campaign Attribution in System Event Logs: Technique Hunting and Subgraph Matching Yi-Ting Huang et.al. 2410.22602 null
2024-10-29 LiVisSfM: Accurate and Robust Structure-from-Motion with LiDAR and Visual Cues Hanqing Jiang et.al. 2410.22213 null
2024-10-17 Stochastic Flow Matching for Resolving Small-Scale Physics Stathi Fotiadis et.al. 2410.19814 null
2024-10-25 A Robust and Efficient Visual-Inertial Initialization with Probabilistic Normal Epipolar Constraint Changshi Mu et.al. 2410.19473 link
2024-10-30 Large Spatial Model: End-to-end Unposed Images to Semantic 3D Zhiwen Fan et.al. 2410.18956 link
2024-10-23 CO-CAVITY project: Molecular gas and star formation in void galaxies M. I. Rodríguez et.al. 2410.18078 null
2024-10-23 PLGS: Robust Panoptic Lifting with 3D Gaussian Splatting Yu Wang et.al. 2410.17505 null
2024-10-20 Neural Active Structure-from-Motion in Dark and Textureless Environment Kazuto Ichimaru et.al. 2410.15378 null
2024-10-17 SemSim: Revisiting Weak-to-Strong Consistency from a Semantic Similarity Perspective for Semi-supervised Medical Image Segmentation Shiao Xie et.al. 2410.13486 null
2024-10-16 Multi-View Multi-Task Modeling with Speech Foundation Models for Speech Forensic Tasks Orchid Chetia Phukan et.al. 2410.12947 null
2024-10-16 Gravity-aligned Rotation Averaging with Circular Regression Linfei Pan et.al. 2410.12763 link
2024-10-16 Beyond Speech and More: Investigating the Emergent Ability of Speech Foundation Models for Classifying Physiological Time-Series Signals Orchid Chetia Phukan et.al. 2410.12645 null
2024-10-15 SplatPose+: Real-time Image-Based Pose-Agnostic 3D Anomaly Detection Yizhe Liu et.al. 2410.12080 link
2024-10-15 LoGS: Visual Localization via Gaussian Splatting with Fewer Training Images Yuzhou Cheng et.al. 2410.11505 null
2024-10-15 Multiview Scene Graph Juexiao Zhang et.al. 2410.11187 link
2024-10-12 Leveraging Semantic Cues from Foundation Vision Models for Enhanced Local Feature Correspondence Felipe Cadar et.al. 2410.09533 link
2024-10-09 Surgical Depth Anything: Depth Estimation for Surgical Scenes using Foundation Models Ange Lou et.al. 2410.07434 null
2024-10-09 Deep HI Mapping of M 106 Group with FAST Yao Liu et.al. 2410.07038 null
2024-10-09 MaD-Scientist: AI-based Scientist solving Convection-Diffusion-Reaction Equations Using Massive PINN-Based Prior Data Mingu Kang et.al. 2410.06442 null
2024-10-08 Are Minimal Radial Distortion Solvers Necessary for Relative Pose Estimation? Charalambos Tzamos et.al. 2410.05984 link
2024-10-04 Refinement of Monocular Depth Maps via Multi-View Differentiable Rendering Laura Fink et.al. 2410.03861 link
2024-10-01 MOSEL: 950,000 Hours of Speech Data for Open-Source Speech Foundation Model Training on EU Languages Marco Gaido et.al. 2410.01036 link
2024-10-01 Seamless Augmented Reality Integration in Arthroscopy: A Pipeline for Articular Reconstruction and Guidance Hongchao Shu et.al. 2410.00386 null
2024-09-29 Robust Incremental Structure-from-Motion with Hybrid Features Shaohui Liu et.al. 2409.19811 null
2024-09-27 MASt3R-SfM: a Fully-Integrated Solution for Unconstrained Structure-from-Motion Bardienus Duisterhof et.al. 2409.19152 null
2024-09-27 Exploiting Motion Prior for Accurate Pose Estimation of Dashboard Cameras Yipeng Lu et.al. 2409.18673 null
2024-09-26 BlinkTrack: Feature Tracking over 100 FPS via Events and Images Yichen Shen et.al. 2409.17981 null
2024-09-25 How to Connect Speech Foundation Models and Large Language Models? What Matters and What Does Not Francesco Verdini et.al. 2409.17044 null
2024-09-24 Frequency-based View Selection in Gaussian Splatting Reconstruction Monica M. Q. Li et.al. 2409.16470 null
2024-10-07 Initialization of Monocular Visual Navigation for Autonomous Agents Using Modified Structure from Small Motion Juan-Diego Florez et.al. 2409.16465 null
2024-09-24 Exploring the potential of collaborative UAV 3D mapping in Kenyan savanna for wildlife research Vandita Shukla et.al. 2409.15914 null
2024-09-23 Assessment of Submillimeter Precision via Structure from Motion Technique in Close-Range Capture Environments Francisco Roza de Moraes et.al. 2409.15602 null
2024-09-23 Evaluating Robot Influence on Pedestrian Behavior Models for Crowd Simulation and Benchmarking Subham Agrawal et.al. 2409.14844 null
2024-09-21 Are Music Foundation Models Better at Singing Voice Deepfake Detection? Far-Better Fuse them with Speech Foundation Models Orchid Chetia Phukan et.al. 2409.14131 null
2024-09-17 GS-Net: Generalizable Plug-and-Play 3D Gaussian Splatting Module Yichen Zhang et.al. 2409.11307 null
2024-09-13 Dense Point Clouds Matter: Dust-GS for Scene Reconstruction from Sparse Viewpoints Shan Chen et.al. 2409.08613 null
2024-09-09 KRONC: Keypoint-based Robust Camera Optimization for 3D Car Reconstruction Davide Di Nucci et.al. 2409.05407 null
2024-09-06 The Arizona Molecular ISM Survey with the SMT: Variations in the CO(2-1)/CO(1-0) Line Ratio Across the Galaxy Population Ryan P. Keenan et.al. 2409.03963 null
2024-09-05 Active Galactic Nuclei in the Green Valley at $z \sim 0.7$ Charity Woodrum et.al. 2409.03197 null
2024-09-04 Object Gaussian for Monocular 6D Pose Estimation from Sparse Views Luqing Luo et.al. 2409.02581 null
2024-09-11 Geometry-aware Feature Matching for Large-Scale Structure from Motion Gonglin Chen et.al. 2409.02310 null
2024-09-04 The study of strongly intensive observables for $π^{\pm,0}$ in $pp$ collisions at LHC energy in the framework of PYTHIA model Tumpa Biswas et.al. 2409.00525 null
2024-09-04 Augmented Reality without Borders: Achieving Precise Localization Without Maps Albert Gassol Puigjaner et.al. 2408.17373 null
2024-09-15 Mismatched: Evaluating the Limits of Image Matching Approaches and Benchmarks Sierra Bonilla et.al. 2408.16445 link
2024-08-21 Visual Localization in 3D Maps: Comparing Point Cloud, Mesh, and NeRF Representations Lintong Zhang et.al. 2408.11966 null
2024-08-20 TrackNeRF: Bundle Adjusting NeRF from Sparse and Noisy Views via Feature Tracks Jinjie Mai et.al. 2408.10739 null
2024-08-16 Correspondence-Guided SfM-Free 3D Gaussian Splatting for NVS Wei Sun et.al. 2408.08723 null
2024-08-15 CorrAdaptor: Adaptive Local Context Learning for Correspondence Pruning Wei Zhu et.al. 2408.08134 link
2024-08-13 A Miniature Vision-Based Localization System for Indoor Blimps Shicong Ma et.al. 2408.06648 null
2024-08-07 Towards Real-Time Gaussian Splatting: Accelerating 3DGS through Photometric SLAM Yan Song Hu et.al. 2408.03825 null
2024-08-05 Context-aware Mamba-based Reinforcement Learning for social robot navigation Syed Muhammad Mustafa et.al. 2408.02661 null
2024-08-04 Birational geometry of critical loci in Algebraic Vision Marina Bertolini et.al. 2408.02067 null
2024-08-04 PanicleNeRF: low-cost, high-precision in-field phenotyping of rice panicles with smartphone Xin Yang et.al. 2408.02053 null
2024-08-02 Structure from Motion-based Motion Estimation and 3D Reconstruction of Unknown Shaped Space Debris Kentaro Uno et.al. 2408.01035 null
2024-08-01 LoopSparseGS: Loop Based Sparse-View Friendly Gaussian Splatting Zhenyu Bao et.al. 2408.00254 null
2024-07-29 Global Structure-from-Motion Revisited Linfei Pan et.al. 2407.20219 link
2024-08-06 Revisit Self-supervised Depth Estimation with Local Structure-from-Motion Shengjie Zhu et.al. 2407.19166 null
2024-07-23 The Hidden Variables: Harnessing Half-Shell Potentials for Enhanced Precision in Nuclear Reaction Calculations Hao Liu et.al. 2407.16452 null
2024-07-22 Enhancement of 3D Gaussian Splatting using Raw Mesh for Photorealistic Recreation of Architectures Ruizhe Wang et.al. 2407.15435 null
2024-07-16 NeuSurfEmb: A Complete Pipeline for Dense Correspondence-based 6D Object Pose Estimation without CAD Models Francesco Milano et.al. 2407.12207 link
2024-07-15 LVCP: LiDAR-Vision Tightly Coupled Collaborative Real-time Relative Positioning Zhuozhu Jian et.al. 2407.10782 null
2024-07-15 Towards Scale-Aware Full Surround Monodepth with Transformers Yuchen Yang et.al. 2407.10406 null
2024-07-14 3DEgo: 3D Editing on the Go! Umar Khalid et.al. 2407.10102 null
2024-07-10 Hybrid Structure-from-Motion and Camera Relocalization for Enhanced Egocentric Localization Jinjie Mai et.al. 2407.08023 link
2024-07-10 Euclid preparation. Forecasting the recovery of galaxy physical properties and their relations with template-fitting and machine-learning methods Euclid Collaboration et.al. 2407.07940 null
2024-07-10 Controlling Space and Time with Diffusion Models Daniel Watson et.al. 2407.07860 null
2024-07-09 Computer vision tasks for intelligent aerospace missions: An overview Huilin Chen et.al. 2407.06513 null
2024-07-08 Enhancing Neural Radiance Fields with Depth and Normal Completion Priors from Sparse Views Jiawei Guo et.al. 2407.05666 null
2024-07-05 Efficient Detection of Long Consistent Cycles and its Application to Distributed Synchronization Shaohan Li et.al. 2407.04260 null
2024-07-15 SfM on-the-fly: Get better 3D from What You Capture Zongqian Zhan et.al. 2407.03939 null
2024-07-03 Free-SurGS: SfM-Free 3D Gaussian Splatting for Surgical Scene Reconstruction Jiaxin Guo et.al. 2407.02918 link
2024-07-02 Indoor 3D Reconstruction with an Unknown Camera-Projector Pair Zhaoshuai Qi et.al. 2407.01945 null
2024-06-27 SALVe: Semantic Alignment Verification for Floorplan Reconstruction from Sparse Panoramas John Lambert et.al. 2406.19390 link
2024-06-27 STAL3D: Unsupervised Domain Adaptation for 3D Object Detection via Collaborating Self-Training and Adversarial Learning Yanan Zhang et.al. 2406.19362 null
2024-06-26 VDG: Vision-Only Dynamic Gaussian for Driving Simulation Hao Li et.al. 2406.18198 null
2024-06-25 Consensus Learning with Deep Sets for Essential Matrix Estimation Dror Moran et.al. 2406.17414 link
2024-06-24 Crowd-Sourced NeRF: Collecting Data from Production Vehicles for 3D Street View Reconstruction Tong Qin et.al. 2406.16289 null
2024-06-21 The importance of stochasticity in determining galaxy emissivities and UV LFs during cosmic dawn and reionization Ivan Nikolić et.al. 2406.15237 link
2024-06-19 MVSBoost: An Efficient Point Cloud-based 3D Reconstruction Umair Haroon et.al. 2406.13515 null
2024-06-17 MegaScenes: Scene-Level View Synthesis at Scale Joseph Tung et.al. 2406.11819 link
2024-06-15 Benchmarking Children's ASR with Supervised and Self-supervised Speech Foundation Models Ruchao Fan et.al. 2406.10507 link
2024-06-14 On the Evaluation of Speech Foundation Models for Spoken Language Understanding Siddhant Arora et.al. 2406.10083 null
2024-06-12 Self-supervised Learning of Neural Implicit Feature Fields for Camera Pose Refinement Maxime Pietrantoni et.al. 2406.08463 null
2024-06-12 SVSNet+: Enhancing Speaker Voice Similarity Assessment Models with Representations from Speech Foundation Models Chun Yin et.al. 2406.08445 null
2024-06-10 Lighting Every Darkness with 3DGS: Fast Training and Real-Time Rendering for HDR View Synthesis Xin Jin et.al. 2406.06216 link
2024-06-07 The Star-Forming Main Sequence in JADES and CEERS at $z>1.4$: Investigating the Burstiness of Star Formation Leonardo Clarke et.al. 2406.05178 null
2024-06-13 Gaussian Splatting with Localized Points Management Haosen Yang et.al. 2406.04251 null
2024-06-05 L-PR: Exploiting LiDAR Fiducial Marker for Unordered Low Overlap Multiview Point Cloud Registration Yibo Liu et.al. 2406.03298 link
2024-06-04 CamCo: Camera-Controllable 3D-Consistent Image-to-Video Generation Dejia Xu et.al. 2406.02509 null
2024-05-29 Neural Radiance Fields for Novel View Synthesis in Monocular Gastroscopy Zijie Jiang et.al. 2405.18863 null
2024-05-29 3D Reconstruction with Fast Dipole Sums Hanyu Chen et.al. 2405.16788 null
2024-05-26 MCGMapper: Light-Weight Incremental Structure from Motion and Visual Localization With Planar Markers and Camera Groups Yusen Xie et.al. 2405.16599 null
2024-05-26 Categorical Flow Matching on Statistical Manifolds Chaoran Cheng et.al. 2405.16441 link
2024-05-22 Exploring Galaxy Properties of eCALIFA with Contrastive Learning G. Martínez-Solaeche et.al. 2405.13471 null
2024-05-23 Switched Flow Matching: Eliminating Singularities via Switching ODEs Qunxi Zhu et.al. 2405.11605 null
2024-05-28 NeRO: Neural Road Surface Reconstruction Ruibo Wang et.al. 2405.10554 link
2024-05-15 Three Dimensional Spatial Cognition: Bees and Bats Robert Worden et.al. 2405.09413 null
2024-05-09 Similarity Guided Multimodal Fusion Transformer for Semantic Location Prediction in Social Media Zhizhen Zhang et.al. 2405.05760 null
2024-05-09 Power Variable Projection for Initialization-Free Large-Scale Bundle Adjustment Simon Weber et.al. 2405.05079 link
2024-05-07 Novel View Synthesis with Neural Radiance Fields for Industrial Robot Applications Markus Hillemann et.al. 2405.04345 null
2024-05-07 Non-rigid Structure-from-Motion: Temporally-smooth Procrustean Alignment and Spatially-variant Deformation Modeling Jiawei Shi et.al. 2405.04309 null
2024-05-06 Transformer-based RGB-T Tracking with Channel and Spatial Feature Fusion Yunfeng Li et.al. 2405.03177 link
2024-05-03 HoloGS: Instant Depth-based 3D Gaussian Splatting with Microsoft HoloLens 2 Miriam Jäger et.al. 2405.02005 null
2024-04-25 The MAGPI Survey: Evolution of radial trends in star formation activity across cosmic time Marcie Mun et.al. 2404.16319 null
2024-04-22 Scene Coordinate Reconstruction: Posing of Image Collections via Incremental Learning of a Relocalizer Eric Brachmann et.al. 2404.14351 null
2024-04-22 RESFM: Robust Equivariant Multiview Structure from Motion Fadi Khatib et.al. 2404.14280 null
2024-04-22 Does Gaussian Splatting need SFM Initialization? Yalda Foroutan et.al. 2404.12547 null
2024-05-07 A Subspace-Constrained Tyler's Estimator and its Applications to Structure from Motion Feng Yu et.al. 2404.11590 link
2024-04-18 DeblurGS: Gaussian Splatting for Camera Motion Blur Jeongtaek Oh et.al. 2404.11358 null
2024-05-21 LetsGo: Large-Scale Garage Modeling and Rendering via LiDAR-Assisted Gaussian Primitives Jiadi Cui et.al. 2404.09748 null
2024-04-12 MonoPatchNeRF: Improving Neural Radiance Fields with Patch-based Monocular Guidance Yuqun Wu et.al. 2404.08252 null
2024-04-11 Boosting Self-Supervision for Single-View Scene Completion via Knowledge Distillation Keonhee Han et.al. 2404.07933 null
2024-04-07 NeRF2Points: Large-Scale Point Cloud Generation From Street Views' Radiance Field Optimization Peng Tu et.al. 2404.04875 null
2024-04-04 GaSpCT: Gaussian Splatting for Novel CT Projection View Synthesis Emmanouil Nikolakakis et.al. 2404.03126 null
2024-03-29 InstantSplat: Unbounded Sparse-view Pose-free Gaussian Splatting in 40 Seconds Zhiwen Fan et.al. 2403.20309 link
2024-03-29 HO-Gaussian: Hybrid Optimization of 3D Gaussian Splatting for Urban Scenes Zhuopeng Li et.al. 2403.20032 null
2024-03-26 NeRF-HuGS: Improved Neural Radiance Fields in Non-static Scenes Using Heuristics-Guided Segmentation Jiahao Chen et.al. 2403.17537 null
2024-03-25 INPC: Implicit Neural Point Clouds for Radiance Field Rendering Florian Hahlbohm et.al. 2403.16862 null
2024-03-18 An Accurate and Real-time Relative Pose Estimation from Triple Point-line Images by Decoupling Rotation and Translation Zewen Xu et.al. 2403.11639 null
2024-03-14 Relaxing Accurate Initialization Constraint for 3D Gaussian Splatting Jaewoo Jung et.al. 2403.09413 link
2024-03-13 Refractive COLMAP: Refractive Structure-from-Motion Revisited Mengkun She et.al. 2403.08640 null
2024-03-13 NeRF-Supervised Feature Point Detection and Description Ali Youssef et.al. 2403.08156 link
2024-03-11 SiLVR: Scalable Lidar-Visual Reconstruction with Neural Radiance Fields for Robotic Inspection Yifu Tao et.al. 2403.06877 null
2024-03-24 BAGS: Blur Agnostic Gaussian Splatting through Multi-Scale Kernel Modeling Cheng Peng et.al. 2403.04926 link
2024-02-22 GaussianPro: 3D Gaussian Splatting with Progressive Propagation Kai Cheng et.al. 2402.14650 null
2024-02-25 A Robust Error-Resistant View Selection Method for 3D Reconstruction Shaojie Zhang et.al. 2402.11431 null
2024-02-17 Dense Matchers for Dense Tracking Tomáš Jelínek et.al. 2402.11287 null
2024-03-11 Local Feature Matching Using Deep Learning: A Survey Shibiao Xu et.al. 2401.17592 link
2024-01-22 HG3-NeRF: Hierarchical Geometric, Semantic, and Photometric Guided Neural Radiance Fields for Sparse View Inputs Zelin Gao et.al. 2401.11711 null
2024-01-19 SCENES: Subpixel Correspondence Estimation With Epipolar Supervision Dominik A. Kloepfer et.al. 2401.10886 null
2024-01-15 3DMASC: Accessible, explainable 3D point clouds classification. Application to Bi-spectral Topo-bathymetric lidar data Mathilde Letard et.al. 2401.09481 link
2024-01-17 3D Scene Geometry Estimation from 360$^\circ$ Imagery: A Survey Thiago Lopes Trugillo da Silveira et.al. 2401.09252 null
2024-01-17 ICON: Incremental CONfidence for Joint Pose and Radiance Field Optimization Weiyao Wang et.al. 2401.08937 null
2024-01-16 Cross-Modal Semi-Dense 6-DoF Tracking of an Event Camera in Challenging Conditions Yi-Fan Zuo et.al. 2401.08043 link
2024-01-10 Structure from Duplicates: Neural Inverse Graphics from a Pile of Objects Tianhang Cheng et.al. 2401.05236 link
2024-01-07 A Classification of Critical Configurations for any Number of Projective Views Martin Bråtelund et.al. 2401.03450 link
2023-12-24 Residual Learning for Image Point Descriptors Rashik Shrestha et.al. 2312.15471 null
2023-12-16 Transformers in Unsupervised Structure-from-Motion Hemang Chawla et.al. 2312.10529 link
2023-12-14 HeadRecon: High-Fidelity 3D Head Reconstruction from Monocular Video Xueying Wang et.al. 2312.08863 null
2023-12-14 CF-NeRF: Camera Parameter Free Neural Radiance Fields with Incremental Learning Qingsong Yan et.al. 2312.08760 null
2023-12-11 Keypoint-based Stereophotoclinometry for Characterizing and Navigating Small Bodies: A Factor Graph Approach Travis Driver et.al. 2312.06865 link
2023-12-11 Gaussian Splatting SLAM Hidenobu Matsuki et.al. 2312.06741 null
2023-12-10 SuperPrimitive: Scene Reconstruction at a Primitive Level Kirill Mazur et.al. 2312.05889 null
2023-12-07 Visual Geometry Grounded Deep Structure From Motion Jianyuan Wang et.al. 2312.04563 null
2023-11-30 Distributed Global Structure-from-Motion with a Deep Front-End Ayush Baid et.al. 2311.18801 link
2023-11-21 Robot Hand-Eye Calibration using Structure-from-Motion Nicolas Andreff et.al. 2311.11808 null
2023-11-18 LOSTU: Fast, Scalable, and Uncertainty-Aware Triangulation Sébastien Henry et.al. 2311.11171 null
2023-11-10 MonoProb: Self-Supervised Monocular Depth Estimation with Interpretable Uncertainty Rémi Marsal et.al. 2311.06137 link
2023-11-08 VET: Visual Error Tomography for Point Cloud Completion and High-Quality Neural Rendering Linus Franke et.al. 2311.04634 link
2023-10-22 A Quantitative Evaluation of Dense 3D Reconstruction of Sinus Anatomy from Monocular Endoscopic Video Jan Emily Mangulabnan et.al. 2310.14364 null
2023-10-20 FMRT: Learning Accurate Feature Matching with Reconciliatory Transformer Xinyu Zhang et.al. 2310.13605 null
2023-10-09 Colmap-PCD: An Open-source Tool for Fine Image-to-point cloud Registration Chunge Bai et.al. 2310.05504 link
2023-10-08 LocoNeRF: A NeRF-based Approach for Local Structure from Motion for Precise Localization Artem Nenashev et.al. 2310.05134 null
2023-11-29 Pose-Free Generalizable Rendering Transformer Zhiwen Fan et.al. 2310.03704 link
2023-10-02 Leveraging Cutting Edge Deep Learning Based Image Matching for Reconstructing a Large Scene from Sparse Images Georg Bökman et.al. 2310.01092 null
2023-10-01 Propagating Semantic Labels in Video Data David Balaban et.al. 2310.00783 null
2023-09-22 Scalable Semantic 3D Mapping of Coral Reefs with Deep Learning Jonathan Sauder et.al. 2309.12804 null
2023-09-21 On-the-Fly SfM: What you capture is What you get Zongqian Zhan et.al. 2309.11883 link
2023-09-19 Using an Uncrewed Surface Vehicle to Create a Volumetric Model of Non-Navigable Rivers and Other Shallow Bodies of Water Jayesh Tripathi et.al. 2309.10269 null
2023-09-16 DynaMoN: Motion-Aware Fast And Robust Camera Localization for Dynamic NeRF Mert Asim Karaoglu et.al. 2309.08927 link
2023-09-08 Robot Localization and Mapping Final Report -- Sequential Adversarial Learning for Self-Supervised Deep Visual Odometry Akankshya Kar et.al. 2309.04147 null
2023-09-01 SQLdepth: Generalizable Self-Supervised Fine-Structured Monocular Depth Estimation Youhong Wang et.al. 2309.00526 null
2023-09-01 Dense Voxel 3D Reconstruction Using a Monocular Event Camera Haodong Chen et.al. 2309.00385 null
2023-08-30 Learning Structure-from-Motion with Graph Attention Networks Lucas Brynte et.al. 2308.15984 link
2023-08-26 Disjoint Pose and Shape for 3D Face Reconstruction Raja Kumar et.al. 2308.13903 null
2023-08-30 CamP: Camera Preconditioning for Neural Radiance Fields Keunhong Park et.al. 2308.10902 null
2023-08-18 Unsupervised 3D Pose Estimation with Non-Rigid Structure-from-Motion Modeling Haorui Ji et.al. 2308.10705 null
2023-08-14 Large-scale environment mapping and immersive human-robot interaction for agricultural mobile robot teleoperation Tao Liu et.al. 2308.07231 link
2023-08-11 Efficient Large-scale AUV-based Visual Seafloor Mapping Mengkun She et.al. 2308.06147 null
2023-08-04 EDI: ESKF-based Disjoint Initialization for Visual-Inertial SLAM Systems Weihan Wang et.al. 2308.02670 null
2023-08-15 Tirtha -- An Automated Platform to Crowdsource Images and Create 3D Models of Heritage Sites Jyotirmaya Shivottam et.al. 2308.01246 link
2023-08-02 Stereo Visual Odometry with Deep Learning-Based Point and Line Feature Matching using an Attention Graph Neural Network Shenbagaraj Kannapiran et.al. 2308.01125 null
2023-07-27 PointOdyssey: A Large-Scale Synthetic Dataset for Long-Term Point Tracking Yang Zheng et.al. 2307.15055 link
2023-07-28 SACReg: Scene-Agnostic Coordinate Regression for Visual Localization Jerome Revaud et.al. 2307.11702 null
2023-07-19 Lazy Visual Localization via Motion Averaging Siyan Dong et.al. 2307.09981 null
2023-07-10 Efficient Match Pair Retrieval for Large-scale UAV Images via Graph Indexed Global Descriptor San Jiang et.al. 2307.04520 null
2023-07-07 RGB-D Mapping and Tracking in a Plenoxel Radiance Field Andreas L. Teigen et.al. 2307.03404 link
2023-06-29 The Drunkard's Odometry: Estimating Camera Motion in Deforming Scenes David Recasens et.al. 2306.16917 link
2023-06-27 Detector-Free Structure from Motion Xingyi He et.al. 2306.15669 link
2023-06-28 PoseDiffusion: Solving Pose Estimation via Diffusion-aided Bundle Adjustment Jianyuan Wang et.al. 2306.15667 null
2023-06-24 3D Reconstruction of Spherical Images based on Incremental Structure from Motion San Jiang et.al. 2306.12770 link
2023-06-15 NAVI: Category-Agnostic Image Collections with High-Quality 3D Shape and Pose Annotations Varun Jampani et.al. 2306.09109 link
2023-06-15 Yes, we CANN: Constrained Approximate Nearest Neighbors for local feature-based visual localization Dror Aiger et.al. 2306.09012 link
2023-06-10 3D reconstruction using Structure for Motion Kshitij Karnawat et.al. 2306.06360 link
2023-06-02 Self-supervised Interest Point Detection and Description for Fisheye and Perspective Images Marcela Mera-Trujillo et.al. 2306.01938 null
2023-05-31 FlowCam: Training Generalizable 3D Radiance Fields without Camera Poses via Pixel-Aligned Scene Flow Cameron Smith et.al. 2306.00180 null
2023-05-19 SIDAR: Synthetic Image Dataset for Alignment & Restoration Monika Kwiatkowski et.al. 2305.12036 link
2023-05-09 Eiffel Tower: A Deep-Sea Underwater Dataset for Long-Term Visual Localization Clémentin Boittiaux et.al. 2305.05301 link
2023-05-09 Rotation Synchronization via Deep Matrix Factorization Gk Tejus et.al. 2305.05268 link
2023-04-20 A Comparative Neural Radiance Field (NeRF) 3D Analysis of Camera Poses from HoloLens Trajectories and Structure from Motion Miriam Jäger et.al. 2304.10664 null
2023-04-14 Fusing Structure from Motion and Simulation-Augmented Pose Regression from Optical Flow for Challenging Indoor Environments Felix Ott et.al. 2304.07250 null
2023-04-12 Visual Localization using Imperfect 3D Models from the Internet Vojtech Panek et.al. 2304.05947 link
2023-04-08 Photometric Correction for Infrared Sensors Jincheng Zhang et.al. 2304.03930 null
2023-04-07 DualRefine: Self-Supervised Depth and Pose Estimation Through Iterative Epipolar Sampling and Refinement Toward Equilibrium Antyanta Bangunharcana et.al. 2304.03560 link
2023-04-05 Semantic Validation in Structure from Motion Joseph Rowell et.al. 2304.02420 link
2023-03-31 Learning Internal Representations of 3D Transformations from 2D Projected Inputs Marissa Connor et.al. 2303.17776 null
2023-03-30 3D Line Mapping Revisited Shaohui Liu et.al. 2303.17504 link
2023-03-27 TMO: Textured Mesh Acquisition of Objects with a Mobile Device by using Differentiable Rendering Jaehoon Choi et.al. 2303.15060 null
2023-03-26 On the Importance of Accurate Geometry Data for Dense 3D Vision Tasks HyunJun Jung et.al. 2303.14840 link
2023-03-24 Seeing Through the Glass: Neural 3D Reconstruction of Object Inside a Transparent Container Jinguang Tong et.al. 2303.13805 link
2023-03-24 Progressively Optimized Local Radiance Fields for Robust View Synthesis Andreas Meuleman et.al. 2303.13791 null
2023-03-15 RefiNeRF: Modelling dynamic neural radiance fields with inconsistent or missing camera parameters Shuja Khalid et.al. 2303.08695 null
2023-03-09 Revisiting Rotation Averaging: Uncertainties and Robust Losses Ganlin Zhang et.al. 2303.05195 link
2023-02-28 Nonlinear Intensity, Scale and Rotation Invariant Matching for Multimodal Images Zhongli Fan et.al. 2302.14239 link
2023-03-25 BLiRF: Bandlimited Radiance Fields for Dynamic Scene Modeling Sameera Ramasinghe et.al. 2302.13543 null
2023-02-21 EC-SfM: Efficient Covisibility-based Structure-from-Motion for Both Sequential and Unordered Images Zhichao Ye et.al. 2302.10544 link
2023-02-18 Bridge Damage Cause Estimation Using Multiple Images Based on Visual Question Answering Tatsuro Yamane et.al. 2302.09208 null
2023-02-12 Uncertainty-Driven Dense Two-View Structure from Motion Weirong Chen et.al. 2302.00523 null
2023-01-28 AdaSfM: From Coarse Global to Fine Incremental Adaptive Structure from Motion Yu Chen et.al. 2301.12135 null
2023-01-20 A vision-based autonomous UAV inspection framework for unknown tunnel construction sites with dynamic obstacles Zhefan Xu et.al. 2301.08422 link
2023-03-21 Robust Dynamic Radiance Fields Yu-Lun Liu et.al. 2301.02239 link
2022-12-24 Polarimetric Multi-View Inverse Rendering Jinyu Zhao et.al. 2212.12721 null
2022-12-13 Accidental Turntables: Learning 3D Pose by Watching Objects Turn Zezhou Cheng et.al. 2212.06300 null
2022-12-04 3D Object Aided Self-Supervised Monocular Depth Estimation Songlin Wei et.al. 2212.01768 null
2022-12-02 High-Res Facial Appearance Capture from Polarized Smartphone Images Dejan Azinović et.al. 2212.01160 null
2022-11-28 FeatureBooster: Boosting Feature Descriptors with a Lightweight Neural Network Xinjiang Wang et.al. 2211.15069 link
2022-11-24 JigsawPlan: Room Layout Jigsaw Puzzle Extreme Structure from Motion using Diffusion Models Sepidehsadat Hosseini et.al. 2211.13785 null
2022-11-24 SfM-TTR: Using Structure from Motion for Test-Time Refinement of Single-View Depth Networks Sergio Izquierdo et.al. 2211.13551 link
2022-11-22 Level-S$^2$fM: Structure from Motion on Neural Level Set of Implicit Surfaces Yuxi Xiao et.al. 2211.12018 link
2022-11-21 Towards Live 3D Reconstruction from Wearable Video: An Evaluation of V-SLAM, NeRF, and Videogrammetry Techniques David Ramirez et.al. 2211.11836 null
2022-11-14 Controllable GAN Synthesis Using Non-Rigid Structure-from-Motion René Haas et.al. 2211.07195 null
2022-10-13 Quantifying and analyzing rock trait distributions of rocky fault scarps using a deep learning approach Zhiang Chen et.al. 2210.07349 null
2022-10-11 DeepMLE: A Robust Deep Maximum Likelihood Estimator for Two-view Structure from Motion Yuxi Xiao et.al. 2210.05517 null
2022-10-07 Leveraging Structure from Motion to Localize Inaccessible Bus Stops Indu Panigrahi et.al. 2210.03646 link
2022-10-01 Structure-Aware NeRF without Posed Camera via Epipolar Constraint Shu Chen et.al. 2210.00183 link
2022-10-05 FAST-LIO, Then Bayesian ICP, Then GTSFM Jerred Chen et.al. 2210.00146 null
2022-09-20 BuFF: Burst Feature Finder for Light-Constrained 3D Reconstruction Ahalya Ravendran et.al. 2209.09470 null
2022-09-19 A Hybrid Cable-Driven Robot for Non-Destructive Leafy Plant Monitoring and Mass Estimation using Structure from Motion Gerry Chen et.al. 2209.08690 null
2022-09-14 End-to-End Multi-View Structure-from-Motion with Hypercorrelation Volumes Qiao Chen et.al. 2209.06926 null
2022-09-07 Deployment of Aerial Robots during the Flood Disaster in Erftstadt / Blessem in July 2021 Hartmut Surmann et.al. 2209.03084 null
2022-08-27 Weakly and Semi-Supervised Detection, Segmentation and Tracking of Table Grapes with Limited and Noisy Data Thomas A. Ciarfuglia et.al. 2208.13001 null
2022-08-12 Handling Constrained Optimization in Factor Graphs for Autonomous Navigation Barbara Bazzana et.al. 2208.06325 null
2022-08-04 Globally Consistent Video Depth and Pose Estimation with Efficient Test-Time Training Yao-Chih Lee et.al. 2208.02709 link
2022-07-31 One Object at a Time: Accurate and Robust Structure From Motion for Robots Aravind Battaje et.al. 2208.00487 null
2022-07-23 Detection and Initial Assessment of Lunar Landing Sites Using Neural Networks Daniel Posada et.al. 2207.11413 null
2022-07-25 MeshLoc: Mesh-Based Visual Localization Vojtech Panek et.al. 2207.10762 link
2022-07-19 ParticleSfM: Exploiting Dense Point Trajectories for Localizing Moving Cameras in the Wild Wang Zhao et.al. 2207.09137 link
2022-07-16 Organic Priors in Non-Rigid Structure from Motion Suryansh Kumar et.al. 2207.06262 null
2022-07-06 A Novel Hybrid Endoscopic Dataset for Evaluating Machine Learning-based Photometric Image Enhancement Models Axel Garcia-Vega et.al. 2207.02396 null
2022-06-24 Parallel Structure from Motion for UAV Images via Weighted Connected Dominating Set San Jiang et.al. 2206.11499 null
2022-06-13 TC-SfM: Robust Track-Community-Based Structure-from-Motion Lei Wang et.al. 2206.05866 null
2022-06-10 EigenFairing: 3D Model Fairing using Image Coherence Pragyana Mishra et.al. 2206.05309 null
2022-06-01 Semantic Room Wireframe Detection from a Single View David Gillsjö et.al. 2206.00491 link
2022-05-31 Geo-Neus: Geometry-Consistent Neural Implicit Surfaces Learning for Multi-view Reconstruction Qiancheng Fu et.al. 2205.15848 null
2022-05-09 Is my Depth Ground-Truth Good Enough? HAMMER -- Highly Accurate Multi-Modal Dataset for DEnse 3D Scene Regression HyunJun Jung et.al. 2205.04565 null
2022-05-07 Optimizing Terrain Mapping and Landing Site Detection for Autonomous UAVs Pedro F. Proença et.al. 2205.03522 null
2022-05-06 EVIMO2: An Event Camera Dataset for Motion Segmentation, Optical Flow, Structure from Motion, and Visual Inertial Odometry in Indoor Scenes with Monocular or Stereo Algorithms Levi Burner et.al. 2205.03467 null
2022-04-20 Learned Monocular Depth Priors in Visual-Inertial Initialization Yunwen Zhou et.al. 2204.09171 null
2022-04-10 Deep Non-rigid Structure-from-Motion: A Sequence-to-Sequence Translation Perspective Hui Deng et.al. 2204.04730 null
2022-04-08 Constrained Bundle Adjustment for Structure From Motion Using Uncalibrated Multi-Camera Systems Debao Huang et.al. 2204.04145 null
2022-04-07 SurroundDepth: Entangling Surrounding Views for Self-Supervised Multi-Camera Depth Estimation Yi Wei et.al. 2204.03636 link
2022-04-06 Georeferencing of Photovoltaic Modules from Aerial Infrared Videos using Structure-from-Motion Lukas Bommes et.al. 2204.02733 link
2022-04-05 Depth-Guided Sparse Structure-from-Motion for Movies and TV Shows Sheng Liu et.al. 2204.02509 link
2022-03-31 Fast, Accurate and Memory-Efficient Partial Permutation Synchronization Shaohan Li et.al. 2203.16505 null
2022-03-28 Visual Odometry for RGB-D Cameras Afonso Fontes et.al. 2203.15119 null
2022-03-28 Optimizing Elimination Templates by Greedy Parameter Search Evgeniy Martyushev et.al. 2203.14901 link
2022-03-23 Event-Based Dense Reconstruction Pipeline Kun Xiao et.al. 2203.12270 null
2022-03-21 DiffPoseNet: Direct Differentiable Camera Pose Estimation Chethan M. Parameshwara et.al. 2203.11174 null
2022-03-02 Asynchronous Optimisation for Event-based Visual Odometry Daqi Liu et.al. 2203.01037 null
2022-03-02 Distributed Riemannian Optimization with Lazy Communication for Collaborative Geometric Estimation Yulun Tian et.al. 2203.00851 null
2022-02-18 MultiRes-NetVLAD: Augmenting Place Recognition Training with Low-Resolution Imagery Ahmad Khaliq et.al. 2202.09146 link
2022-01-20 GeoFill: Reference-Based Image Inpainting of Scenes with Complex Geometry Yunhan Zhao et.al. 2201.08131 null
2022-01-13 Scalable Cluster-Consistency Statistics for Robust Multi-Object Matching Yunpeng Shi et.al. 2201.04797 link
2022-01-10 High-resolution Ecosystem Mapping in Repetitive Environments Using Dual Camera SLAM Brian M. Hopkinson et.al. 2201.03364 link
2022-01-06 De-rendering 3D Objects in the Wild Felix Wimbauer et.al. 2201.02279 link
2021-12-29 On the Instability of Relative Pose Estimation and RANSAC's Role Hongyi Fan et.al. 2112.14651 null
2021-12-16 Road-aware Monocular Structure from Motion and Homography Estimation Wei Sui et.al. 2112.08635 null
2021-12-10 Critical configurations for three projective views Martin Bråtelund et.al. 2112.05478 null
2021-12-09 Critical configurations for two projective views, a new approach Martin Bråtelund et.al. 2112.05074 null
2021-12-06 Dense Depth Priors for Neural Radiance Fields from Sparse Input Views Barbara Roessle et.al. 2112.03288 link
2021-12-10 MegBA: A High-Performance and Distributed Library for Large-Scale Bundle Adjustment Jie Ren et.al. 2112.01349 link
2021-11-11 Multi-Resolution Elevation Mapping and Safe Landing Site Detection with Applications to Planetary Rotorcraft Pascal Schoppmann et.al. 2111.06271 null
2021-11-10 Damage Estimation and Localization from Sparse Aerial Imagery Rene Garcia Franceschini et.al. 2111.03708 null
2021-11-03 Event and Activity Recognition in Video Surveillance for Cyber-Physical Systems Swarnabja Bhaumik et.al. 2111.02064 null
2021-10-14 Modeling dynamic target deformation in camera calibration Annika Hagemann et.al. 2110.07322 null
2021-10-13 Hyperspectral 3D Mapping of Underwater Environments Maxime Ferrera et.al. 2110.06571 null
2021-09-24 Automatic Map Update Using Dashcam Videos Aziza Zhanabatyrova et.al. 2109.12131 null
2021-09-16 Rotation Averaging in a Split Second: A Primal-Dual Method and a Closed-Form for Cycle Graphs Gabriel Moreira et.al. 2109.08046 link
2021-09-06 Single-Camera 3D Head Fitting for Mixed Reality Clinical Applications Tejas Mane et.al. 2109.02740 null
2021-09-02 Dynamic Scene Novel View Synthesis via Deferred Spatio-temporal Consistency Beatrix-Emőke Fülöp-Balogh et.al. 2109.01018 null
2021-09-01 On the Limits of Pseudo Ground Truth in Visual Camera Re-localisation Eric Brachmann et.al. 2109.00524 link
2021-08-31 DensePose 3D: Lifting Canonical Surface Maps of Articulated Objects to the Third Dimension Roman Shapovalov et.al. 2109.00033 null
2021-08-29 Solving Viewing Graph Optimization for Simultaneous Position and Rotation Registration Seyed-Mahdi Nasiri et.al. 2108.12876 null
2021-08-23 Burst Imaging for Light-Constrained Structure-From-Motion Ahalya Ravendran et.al. 2108.09895 null

(back to top)

Visual Localization

Publish Date Title Authors PDF Code
2025-04-24 A Guide to Structureless Visual Localization Vojtech Panek et.al. 2504.17636 null
2025-04-23 Rethinking Vision Transformer for Large-Scale Fine-Grained Image Retrieval Xin Jiang et.al. 2504.16691 null
2025-04-22 Media Content Atlas: A Pipeline to Explore and Investigate Multidimensional Media Space using Multimodal LLMs Merve Cerit et.al. 2504.16323 link
2025-04-19 A Multimodal Recaptioning Framework to Account for Perceptual Diversity in Multilingual Vision-Language Modeling Kyle Buettner et.al. 2504.14359 null
2025-04-17 SemCORE: A Semantic-Enhanced Generative Cross-Modal Retrieval Framework with MLLMs Haoxuan Li et.al. 2504.13172 null
2025-04-16 Generalized Visual Relation Detection with Diffusion Models Kaifeng Gao et.al. 2504.12100 null
2025-04-15 Visual Re-Ranking with Non-Visual Side Information Gustav Hanning et.al. 2504.11134 link
2025-04-15 TMCIR: Token Merge Benefits Composed Image Retrieval Chaoyang Wang et.al. 2504.10995 null
2025-04-14 Focus on Local: Finding Reliable Discriminative Regions for Visual Place Recognition Changwei Wang et.al. 2504.09881 null
2025-04-12 Evolved Hierarchical Masking for Self-Supervised Learning Zhanzhou Feng et.al. 2504.09155 null
2025-04-11 HAL-NeRF: High Accuracy Localization Leveraging Neural Radiance Fields Asterios Reppas et.al. 2504.08901 null
2025-04-11 Hypergraph Vision Transformers: Images are More than Nodes, More than Edges Joshua Fixelle et.al. 2504.08710 null
2025-04-11 FocalLens: Instruction Tuning Enables Zero-Shot Conditional Image Representations Cheng-Yu Hsieh et.al. 2504.08368 null
2025-04-11 PNE-SGAN: Probabilistic NDT-Enhanced Semantic Graph Attention Network for LiDAR Loop Closure Detection Xiong Li et.al. 2504.08280 null
2025-04-10 Multi-modal Reference Learning for Fine-grained Text-to-Image Retrieval Zehong Ma et.al. 2504.07718 null
2025-04-09 A Pointcloud Registration Framework for Relocalization in Subterranean Environments David Akhihiero et.al. 2504.07231 null
2025-04-09 Patch Matters: Training-free Fine-grained Image Caption Enhancement via Local Perception Ruotian Peng et.al. 2504.06666 null
2025-04-08 To Match or Not to Match: Revisiting Image Matching for Reliable Visual Place Recognition Davide Sferrazza et.al. 2504.06116 link
2025-04-06 NCL-CIR: Noise-aware Contrastive Learning for Composed Image Retrieval Peng Gao et.al. 2504.04339 null
2025-04-04 REJEPA: A Novel Joint-Embedding Predictive Architecture for Efficient Remote Sensing Image Retrieval Shabnam Choudhury et.al. 2504.03169 null
2025-04-06 Re-thinking Temporal Search for Long-Form Video Understanding Jinhui Ye et.al. 2504.02259 link
2025-04-02 A Chefs KISS -- Utilizing semantic information in both ICP and SLAM framework Sven Ochs et.al. 2504.02086 null
2025-04-02 Prompt-Guided Attention Head Selection for Focus-Oriented Image Retrieval Yuji Nozawa et.al. 2504.01348 null
2025-04-01 IDMR: Towards Instance-Driven Precise Visual Correspondence in Multimodal Retrieval Bangwei Liu et.al. 2504.00954 null
2025-04-01 Scaling Prompt Instructed Zero Shot Composed Image Retrieval with Image-Only Data Yiqun Duan et.al. 2504.00812 null
2025-03-31 CIBR: Cross-modal Information Bottleneck Regularization for Robust CLIP Generalization Yingrui Ji et.al. 2503.24182 null
2025-03-31 LiM-Loc: Visual Localization with Dense and Accurate 3D Reference Maps Directly Corresponding 2D Keypoints to 3D LiDAR Point Clouds Masahiko Tsuji et.al. 2503.23664 null
2025-03-30 Multiview Image-Based Localization Cameron Fiore et.al. 2503.23577 null
2025-03-27 LOCORE: Image Re-ranking with Long-Context Sequence Modeling Zilin Xiao et.al. 2503.21772 link
2025-03-27 Fwd2Bot: LVLM Visual Token Compression with Double Forward Bottleneck Adrian Bulat et.al. 2503.21757 null
2025-03-27 UGNA-VPR: A Novel Training Paradigm for Visual Place Recognition Based on Uncertainty-Guided NeRF Augmentation Yehui Shen et.al. 2503.21338 link
2025-03-27 FineCIR: Explicit Parsing of Fine-Grained Modification Semantics for Composed Image Retrieval Zixu Li et.al. 2503.21309 link
2025-03-27 Clean Image May be Dangerous: Data Poisoning Attacks Against Deep Hashing Shuai Li et.al. 2503.21236 null
2025-03-25 CoLLM: A Large Language Model for Composed Image Retrieval Chuong Huynh et.al. 2503.19910 link
2025-03-25 Scene-agnostic Pose Regression for Visual Localization Junwei Zheng et.al. 2503.19543 null
2025-03-25 From Sparse to Dense: Camera Relocalization with Scene-Specific Detector from Feature Gaussian Splatting Zhiwei Huang et.al. 2503.19358 null
2025-03-25 Fine-grained Textual Inversion Network for Zero-Shot Composed Image Retrieval Haoqiang Lin et.al. 2503.19296 link
2025-03-23 LocDiffusion: Identifying Locations on Earth by Diffusing in the Hilbert Space Zhangyu Wang et.al. 2503.18142 null
2025-03-23 Selecting and Pruning: A Differentiable Causal Sequentialized State-Space Model for Two-View Correspondence Learning Xiang Fang et.al. 2503.17938 null
2025-03-23 What Time Tells Us? An Explorative Study of Time Awareness Learned from Static Images Dongheng Lin et.al. 2503.17899 null
2025-03-22 good4cir: Generating Detailed Synthetic Captions for Composed Image Retrieval Pranavi Kolouju et.al. 2503.17871 null
2025-03-21 Missing Target-Relevant Information Prediction with World Model for Accurate Zero-Shot Composed Image Retrieval Yuanmin Tang et.al. 2503.17109 link
2025-03-21 Autonomous Exploration-Based Precise Mapping for Mobile Robots through Stepwise and Consistent Motions Muhua Zhang et.al. 2503.17005 null
2025-03-20 PromptHash: Affinity-Prompted Collaborative Cross-Modal Learning for Adaptive Hashing Retrieval Qiang Zou et.al. 2503.16064 link
2025-03-20 Automating 3D Dataset Generation with Neural Radiance Fields P. Schulz et.al. 2503.15997 link
2025-03-18 3D Densification for Multi-Map Monocular VSLAM in Endoscopy X. Anadón et.al. 2503.14346 null
2025-03-18 A-SCoRe: Attention-based Scene Coordinate Regression for wide-ranging scenarios Huy-Hoang Bui et.al. 2503.13982 link
2025-03-17 Scale Efficient Training for Large Datasets Qing Zhou et.al. 2503.13385 null
2025-03-17 Multi-Platform Teach-and-Repeat Navigation by Visual Place Recognition Based on Deep-Learned Local Features Václav Truhlařík et.al. 2503.13090 null
2025-03-17 All You Need to Know About Training Image Retrieval Models Gabriele Berton et.al. 2503.13045 link
2025-03-12 Exploring the best way for UAV visual localization under Low-altitude Multi-view Observation Condition: a Benchmark Yibin Ye et.al. 2503.10692 link
2025-03-13 ImageScope: Unifying Language-Guided Image Retrieval via Large Multimodal Model Collective Reasoning Pengfei Luo et.al. 2503.10166 link
2025-03-12 Revisiting Medical Image Retrieval via Knowledge Consolidation Yang Nan et.al. 2503.09370 null
2025-03-11 CQVPR: Landmark-aware Contextual Queries for Visual Place Recognition Dongyue Li et.al. 2503.08170 null
2025-03-10 Find your Needle: Small Object Image Retrieval via Multi-Object Attention Optimization Michael Green et.al. 2503.07038 null
2025-03-10 Zero-Shot Hashing Based on Reconstruction With Part Alignment Yan Jiang et.al. 2503.07037 null
2025-03-10 Improving Visual Place Recognition with Sequence-Matching Receptiveness Prediction Somayeh Hussaini et.al. 2503.06840 null
2025-03-09 RoboDesign1M: A Large-scale Dataset for Robot Design Understanding Tri Le et.al. 2503.06796 null
2025-03-09 StructVPR++: Distill Structural and Semantic Knowledge with Weighting Samples for Visual Place Recognition Yanqing Shen et.al. 2503.06601 link
2025-03-09 TextInPlace: Indoor Visual Place Recognition in Repetitive Structures with Scene Text Spotting and Verification Huaqi Tao et.al. 2503.06501 null
2025-03-08 NeuraLoc: Visual Localization in Neural Implicit Map with Dual Complementary Features Hongjia Zhai et.al. 2503.06117 null
2025-03-07 Data-Efficient Generalization for Zero-shot Composed Image Retrieval Zining Chen et.al. 2503.05204 null
2025-03-06 RadIR: A Scalable Framework for Multi-Grained Medical Image Retrieval via Radiology Report Mining Tengfei Zhang et.al. 2503.04653 null
2025-03-06 ForestLPR: LiDAR Place Recognition in Forests Attentioning Multiple BEV Density Images Yanqing Shen et.al. 2503.04475 link
2025-03-06 Geometry-Constrained Monocular Scale Estimation Using Semantic Segmentation for Dynamic Scenes Hui Zhang et.al. 2503.04235 null
2025-03-06 Bridging the Vision-Brain Gap with an Uncertainty-Aware Blur Prior Haitao Wu et.al. 2503.04207 null
2025-03-06 Image-Based Relocalization and Alignment for Long-Term Monitoring of Dynamic Underwater Environments Beverley Gorry et.al. 2503.04096 link
2025-03-04 TeTRA-VPR: A Ternary Transformer Approach for Compact Visual Place Recognition Oliver Grainge et.al. 2503.02511 null
2025-03-04 Introspective Loop Closure for SLAM with 4D Imaging Radar Maximilian Hilger et.al. 2503.02383 null
2025-03-04 Continual Multi-Robot Learning from Black-Box Visual Place Recognition Models Kenta Tsukahara et.al. 2503.02256 null
2025-03-03 Composed Multi-modal Retrieval: A Survey of Approaches and Applications Kun Zhang et.al. 2503.01334 link
2025-03-03 AirRoom: Objects Matter in Room Reidentification Runmao Yao et.al. 2503.01130 null
2025-03-02 Efficient End-to-end Visual Localization for Autonomous Driving with Decoupled BEV Neural Matching Jinyu Miao et.al. 2503.00862 null
2025-03-01 Class-Independent Increment: An Efficient Approach for Multi-label Class-Incremental Learning Songlin Dong et.al. 2503.00515 null
2025-02-28 EVLoc: Event-based Visual Localization in LiDAR Maps via Event-Depth Registration Kuangyi Chen et.al. 2503.00167 link
2025-02-28 CoTMR: Chain-of-Thought Multi-Scale Reasoning for Training-Free Zero-Shot Composed Image Retrieval Zelong Sun et.al. 2502.20826 null
2025-02-28 SciceVPR: Stable Cross-Image Correlation Enhanced Model for Visual Place Recognition Shanshan Wan et.al. 2502.20676 null
2025-02-27 A2-GNN: Angle-Annular GNN for Visual Descriptor-free Camera Relocalization Yejun Zhang et.al. 2502.20036 link
2025-02-27 On the Importance of Text Preprocessing for Multimodal Representation Learning and Pathology Report Generation Ruben T. Lucassen et.al. 2502.19285 null
2025-02-26 BEV-LIO(LC): BEV Image Assisted LiDAR-Inertial Odometry with Loop Closure Haoxin Cai et.al. 2502.19242 link
2025-02-26 SLAM in the Dark: Self-Supervised Learning of Pose, Depth and Loop-Closure from Thermal Images Yangfan Xu et.al. 2502.18932 null
2025-02-25 MegaLoc: One Retrieval to Place Them All Gabriele Berton et.al. 2502.17237 link
2025-02-23 Visual-RAG: Benchmarking Text-to-Image Retrieval Augmented Generation for Visual Knowledge Intensive Queries Yin Wu et.al. 2502.16636 link
2025-02-23 SelaVPR++: Towards Seamless Adaptation of Foundation Models for Efficient Place Recognition Feng Lu et.al. 2502.16601 link
2025-02-21 ELIP: Enhanced Visual-Language Foundation Models for Image Retrieval Guanqi Zhan et.al. 2502.15682 null
2025-02-20 Bridging Text and Vision: A Multi-View Text-Vision Registration Approach for Cross-Modal Place Recognition Tianyi Shang et.al. 2502.14195 link
2025-02-19 3D Gaussian Splatting aided Localization for Large and Complex Indoor-Environments Vincent Ress et.al. 2502.13803 null
2025-02-18 Re-Align: Aligning Vision Language Models via Retrieval-Augmented Direct Preference Optimization Shuo Xing et.al. 2502.13146 link
2025-02-19 IM360: Textured Mesh Reconstruction for Large-scale Indoor Mapping with 360° Cameras Dongki Jung et.al. 2502.12545 null
2025-02-17 From Gaming to Research: GTA V for Synthetic Data Generation for Robotics and Navigations Matteo Scucchia et.al. 2502.12303 null
2025-02-17 Descriminative-Generative Custom Tokens for Vision-Language Models Pramuditha Perera et.al. 2502.12095 null
2025-02-17 ILIAS: Instance-Level Image retrieval At Scale Giorgos Kordopatis-Zilos et.al. 2502.11748 null
2025-02-17 Range and Bird's Eye View Fused Cross-Modal Visual Place Recognition Jianyi Peng et.al. 2502.11742 link
2025-02-17 Adversarially Robust CLIP Models Can Induce Better (Robust) Perceptual Metrics Francesco Croce et.al. 2502.11725 link
2025-02-17 Precise GPS-Denied UAV Self-Positioning via Context-Enhanced Cross-View Geo-Localization Yuanze Xu et.al. 2502.11408 null
2025-02-13 ImageRAG: Dynamic Image Retrieval for Reference-Guided Image Generation Rotem Shalev-Arkushin et.al. 2502.09411 null
2025-02-12 SpeechCompass: Enhancing Mobile Captioning with Diarization and Directional Guidance via Multi-Microphone Localization Artem Dementyev et.al. 2502.08848 null
2025-02-12 Composite Sketch+Text Queries for Retrieving Objects with Elusive Names and Complex Interactions Prajwal Gatti et.al. 2502.08438 null
2025-02-11 Captured by Captions: On Memorization and its Mitigation in CLIP Models Wenhao Wang et.al. 2502.07830 null
2025-02-11 Ultrafast 4D scanning transmission electron microscopy for imaging of localized optical fields Petr Koutenský et.al. 2502.07338 null
2025-02-11 Generative Ghost: Investigating Ranking Bias Hidden in AI-Generated Videos Haowen Gao et.al. 2502.07327 null
2025-02-11 PDV: Prompt Directional Vectors for Zero-shot Composed Image Retrieval Osman Tursun et.al. 2502.07215 null
2025-02-10 AstroLoc: Robust Space to Ground Image Localizer Gabriele Berton et.al. 2502.07003 null
2025-02-09 Uni-Retrieval: A Multi-Style Retrieval Framework for STEM's Education Yanhao Jia et.al. 2502.05863 null
2025-02-07 Learning Street View Representations with Spatiotemporal Contrast Yong Li et.al. 2502.04638 null
2025-02-06 Cross the Gap: Exposing the Intra-modal Misalignment in CLIP via Modality Inversion Marco Mistretta et.al. 2502.04263 link
2025-02-05 Human-Aligned Image Models Improve Visual Decoding from the Brain Nona Rajabi et.al. 2502.03081 null
2025-02-03 ConceptVAE: Self-Supervised Fine-Grained Concept Disentanglement from 2D Echocardiographies Costin F. Ciusdel et.al. 2502.01335 null
2025-01-31 LiDAR Loop Closure Detection using Semantic Graphs with Graph Attention Networks Liudi Yang et.al. 2501.19382 link
2025-01-27 Freestyle Sketch-in-the-Loop Image Segmentation Subhadeep Koley et.al. 2501.16022 null
2025-01-26 Zero-Shot Interactive Text-to-Image Retrieval via Diffusion-Augmented Representations Zijun Long et.al. 2501.15379 null
2025-01-24 Visual Localization via Semantic Structures in Autonomous Photovoltaic Power Plant Inspection Viktor Kozák et.al. 2501.14587 null
2025-01-23 Revisiting CLIP: Efficient Alignment of 3D MRI and Tabular Data using Domain-Specific Foundation Models Jakob Krogh Petersen et.al. 2501.14051 link
2025-01-22 Triplet Synthesis For Enhancing Composed Image Retrieval via Counterfactual Image Generation Kenta Uesugi et.al. 2501.13968 null
2025-01-19 Enhancing Sample Utilization in Noise-Robust Deep Metric Learning With Subgroup-Based Positive-Pair Selection Zhipeng Yu et.al. 2501.11063 link
2025-01-18 A Resource-Efficient Training Framework for Remote Sensing Text-Image Retrieval Weihang Zhang et.al. 2501.10638 null
2025-01-17 FLORA: Formal Language Model Enables Robust Training-free Zero-shot Object Referring Analysis Zhe Chen et.al. 2501.09887 null
2025-01-15 Vision Foundation Models for Computed Tomography Suraj Pai et.al. 2501.09001 link
2025-01-12 SCOT: Self-Supervised Contrastive Pretraining For Zero-Shot Compositional Retrieval Bhavin Jawade et.al. 2501.08347 null
2025-01-14 VINGS-Mono: Visual-Inertial Gaussian Splatting Monocular SLAM in Large Scenes Ke Wu et.al. 2501.08286 null
2025-01-13 Efficiently Closing Loops in LiDAR-Based SLAM Using Point Cloud Density Maps Saurabh Gupta et.al. 2501.07399 null
2025-01-12 Static Segmentation by Tracking: A Frustratingly Label-Efficient Approach to Fine-Grained Segmentation Zhenyang Feng et.al. 2501.06749 null
2025-01-06 Integrating Language-Image Prior into EEG Decoding for Cross-Task Zero-Calibration RSVP-BCI Xujin Li et.al. 2501.02841 null
2025-01-03 A Minimal Subset Approach for Efficient and Scalable Loop Closure Nikolaos Stathoulopoulos et.al. 2501.01791 link
2025-01-03 iCBIR-Sli: Interpretable Content-Based Image Retrieval with 2D Slice Embeddings Shuhei Tomoshige et.al. 2501.01642 null
2025-01-02 R-SCoRe: Revisiting Scene Coordinate Regression for Robust Large-Scale Visual Localization Xudong Jiang et.al. 2501.01421 link
2025-01-02 Training Medical Large Vision-Language Models with Abnormal-Aware Feedback Yucheng Zhou et.al. 2501.01377 null
2025-01-02 Domain-invariant feature learning in brain MR imaging for content-based image retrieval Shuya Tobari et.al. 2501.01326 null
2024-12-28 GSplatLoc: Ultra-Precise Camera Localization via 3D Gaussian Splatting Atticus J. Zeller et.al. 2412.20056 link
2024-12-25 FOR: Finetuning for Object Level Open Vocabulary Image Retrieval Hila Levi et.al. 2412.18806 null
2024-12-24 ERVD: An Efficient and Robust ViT-Based Distillation Framework for Remote Sensing Image Retrieval Le Dong et.al. 2412.18136 link
2024-12-22 Where am I? Cross-View Geo-localization with Natural Language Descriptions Junyan Ye et.al. 2412.17007 null
2024-12-22 Large-Scale UWB Anchor Calibration and One-Shot Localization Using Gaussian Process Shenghai Yuan et.al. 2412.16880 null
2024-12-24 Open-Vocabulary Mobile Manipulation Based on Double Relaxed Contrastive Learning with Dense Labeling Daichi Yashima et.al. 2412.16576 link
2024-12-20 A New Method to Capturing Compositional Knowledge in Linguistic Space Jiahe Wan et.al. 2412.15632 null
2024-12-20 Stabilizing Laplacian Inversion in Fokker-Planck Image Retrieval using the Transport-of-Intensity Equation Samantha J Alloo et.al. 2412.15513 null
2024-12-19 Learning Visual Composition through Improved Semantic Guidance Austin Stone et.al. 2412.15396 null
2024-12-19 MegaPairs: Massive Data Synthesis For Universal Multimodal Retrieval Junjie Zhou et.al. 2412.14475 null
2024-12-18 Adversarial Hubness in Multi-Modal Retrieval Tingwei Zhang et.al. 2412.14113 link
2024-12-18 Maybe you are looking for CroQS: Cross-modal Query Suggestion for Text-to-Image Retrieval Giacomo Pacini et.al. 2412.13834 null
2024-12-18 ConDo: Continual Domain Expansion for Absolute Pose Regression Zijun Li et.al. 2412.13452 link
2024-12-17 Three Things to Know about Deep Metric Learning Yash Patel et.al. 2412.12432 null
2024-12-15 Leveraging Large Vision-Language Model as User Intent-aware Encoder for Composed Image Retrieval Zelong Sun et.al. 2412.11087 null
2024-12-18 Reason-before-Retrieve: One-Stage Reflective Chain-of-Thoughts for Training-Free Zero-Shot Composed Image Retrieval Yuanmin Tang et.al. 2412.11077 link
2024-12-13 MVC-VPR: Mutual Learning of Viewpoint Classification and Visual Place Recognition Qiwen Gu et.al. 2412.09199 null
2024-12-12 A Flexible Plug-and-Play Module for Generating Variable-Length Liyang He et.al. 2412.08922 link
2024-12-11 Image Retrieval Methods in the Dissimilarity Space Madhu Kiran et.al. 2412.08618 null
2024-12-11 Reloc3r: Large-Scale Training of Relative Camera Pose Regression for Generalizable, Fast, and Accurate Visual Localization Siyan Dong et.al. 2412.08376 link
2024-12-11 Intelligent Control of Robotic X-ray Devices using a Language-promptable Digital Twin Benjamin D. Killeen et.al. 2412.08020 null
2024-12-10 On Motion Blur and Deblurring in Visual Place Recognition Timur Ismagilov et.al. 2412.07751 null
2024-12-10 Image Retrieval with Intra-Sweep Representation Learning for Neck Ultrasound Scanning Guidance Wanwen Chen et.al. 2412.07741 null
2024-12-09 An Efficient Scene Coordinate Encoding and Relocalization Method Kuan Xu et.al. 2412.06488 link
2024-12-09 A Hyperdimensional One Place Signature to Represent Them All: Stackable Descriptors For Visual Place Recognition Connor Malone et.al. 2412.06153 null
2024-12-07 Compositional Image Retrieval via Instruction-Aware Contrastive Learning Wenliang Zhong et.al. 2412.05756 link
2024-12-06 DAug: Diffusion-based Channel Augmentation for Radiology Image Retrieval and Classification Ying Jin et.al. 2412.04828 null
2024-12-04 Distillation of Diffusion Features for Semantic Correspondence Frank Fundel et.al. 2412.03512 null
2024-12-04 Composed Image Retrieval for Training-Free Domain Conversion Nikos Efthymiadis et.al. 2412.03297 link
2024-12-03 A Minimalistic 3D Self-Organized UAV Flocking Approach for Desert Exploration Thulio Amorim et.al. 2412.02881 null
2024-12-03 Active Learning via Classifier Impact and Greedy Selection for Interactive Image Retrieval Leah Bar et.al. 2412.02310 link
2024-12-02 Mutli-View 3D Reconstruction using Knowledge Distillation Aditya Dutt et.al. 2412.02039 link
2024-12-02 Optimizing Domain-Specific Image Retrieval: A Benchmark of FAISS and Annoy with Fine-Tuned Features MD Shaikh Rahman et.al. 2412.01555 null
2024-12-02 Neuron Abandoning Attention Flow: Visual Explanation of Dynamics inside CNN Models Yi Liao et.al. 2412.01202 null
2024-12-01 EDTformer: An Efficient Decoder Transformer for Visual Place Recognition Tong Jin et.al. 2412.00784 null
2024-11-28 EFSA: Episodic Few-Shot Adaptation for Text-to-Image Retrieval Muhammad Huzaifa et.al. 2412.00139 null
2024-11-29 A Visual-inertial Localization Algorithm using Opportunistic Visual Beacons and Dead-Reckoning for GNSS-Denied Large-scale Applications Liqiang Zhang, Ye Tian, Dongyan Wei et.al. 2411.19845 null
2024-11-27 Optimizing Image Retrieval with an Extended b-Metric Space Abdelkader Belhenniche et.al. 2411.18800 null
2024-11-26 Learning Visual Hierarchies with Hyperbolic Embeddings Ziwei Wang et.al. 2411.17490 null
2024-11-24 Imagine and Seek: Improving Composed Image Retrieval with an Imagined Proxy You Li et.al. 2411.16752 null
2024-11-24 AnySynth: Harnessing the Power of Image Synthetic Data Generation for Generalized Vision-Language Tasks You Li et.al. 2411.16749 null
2024-11-25 Image Generation Diversity Issues and How to Tame Them Mischa Dombrowski et.al. 2411.16171 link
2024-11-24 PG-SLAM: Photo-realistic and Geometry-aware RGB-D SLAM in Dynamic Environments Haoang Li et.al. 2411.15800 null
2024-11-22 Cross-Modal Pre-Aligned Method with Global and Local Information for Remote-Sensing Image and Text Retrieval Zengbao Sun et.al. 2411.14704 null
2024-11-20 Globally Correlation-Aware Hard Negative Generation Wenjie Peng et.al. 2411.13145 link
2024-11-18 Exploring Emerging Trends and Research Opportunities in Visual Place Recognition Antonios Gasteratos et.al. 2411.11481 null
2024-11-13 OSMLoc: Single Image-Based Visual Localization in OpenStreetMap with Geometric and Semantic Guidances Youqi Liao et.al. 2411.08665 link
2024-11-13 Hopfield-Fenchel-Young Networks: A Unified Framework for Associative Memory Retrieval Saul Santos et.al. 2411.08590 link
2024-11-22 Saliency Map-based Image Retrieval using Invariant Krawtchouk Moments Ashkan Nejad et.al. 2411.08567 link
2024-11-13 MBA-SLAM: Motion Blur Aware Dense Visual SLAM with Radiance Fields Representation Peng Wang et.al. 2411.08279 link
2024-11-05 From Pixels to Prose: Advancing Multi-Modal Language Models for Remote Sensing Xintian Sun et.al. 2411.05826 null
2024-11-04 TripletCLIP: Improving Compositional Reasoning of CLIP via Synthetic Vision-Language Negatives Maitreya Patel et.al. 2411.02545 null
2024-11-11 INQUIRE: A Natural World Text-to-Image Retrieval Benchmark Edward Vendrow et.al. 2411.02537 link
2024-11-20 Exploiting Contextual Uncertainty of Visual Data for Efficient Training of Deep Models Sharat Agarwal et.al. 2411.01925 null
2024-11-04 Semantic Masking and Visual Feature Matching for Robust Localization Luisa Mao et.al. 2411.01804 null
2024-11-03 Efficient Medical Image Retrieval Using DenseNet and FAISS for BIRADS Classification MD Shaikh Rahman et.al. 2411.01473 null
2024-11-01 Identifying Implicit Social Biases in Vision-Language Models Kimia Hamidieh et.al. 2411.00997 null
2024-10-31 Nearest Neighbor Normalization Improves Multimodal Retrieval Neil Chowdhury et.al. 2410.24114 link
2024-10-31 MoTaDual: Modality-Task Dual Alignment for Enhanced Zero-shot Composed Image Retrieval Haiwen Li et.al. 2410.23736 null
2024-10-30 Decoupling Semantic Similarity from Spatial Alignment for Neural Networks Tassilo Wald et.al. 2410.23107 link
2024-10-29 Beyond Text: Optimizing RAG with Multimodal Inputs for Industrial Applications Monica Riedler et.al. 2410.21943 link
2024-10-28 NYC-Event-VPR: A Large-Scale High-Resolution Event-Based Visual Place Recognition Dataset in Dense Urban Environments Taiyi Pan et.al. 2410.21615 link
2024-10-25 Context-Based Visual-Language Place Recognition Soojin Woo et.al. 2410.19341 link
2024-10-24 ChatSearch: a Dataset and a Generative Retrieval Model for General Conversational Image Retrieval Zijia Zhao et.al. 2410.18715 link
2024-10-25 On Model-Free Re-ranking for Visual Place Recognition with Deep Learned Local Features Tomáš Pivoňka et.al. 2410.18573 null
2024-10-22 Denoise-I2W: Mapping Images to Denoising Words for Accurate Zero-Shot Composed Image Retrieval Yuanmin Tang et.al. 2410.17393 null
2024-10-20 GSSF: Generalized Structural Sparse Function for Deep Cross-modal Metric Learning Haiwen Diao et.al. 2410.15266 link
2024-10-19 Visual Navigation of Digital Libraries: Retrieval and Classification of Images in the National Library of Norway's Digitised Book Collection Marie Roald et.al. 2410.14969 link
2024-10-16 Development of Image Collection Method Using YOLO and Siamese Network Chan Young Shin et.al. 2410.12561 null
2024-10-16 LoD-Loc: Aerial Visual Localization using LoD 3D Map with Neural Wireframe Alignment Juelin Zhu et.al. 2410.12269 link
2024-10-16 Leveraging Spatial Attention and Edge Context for Optimized Feature Selection in Visual Localization Nanda Febri Istighfarin et.al. 2410.12240 null
2024-10-15 LoGS: Visual Localization via Gaussian Splatting with Fewer Training Images Yuzhou Cheng et.al. 2410.11505 null
2024-10-15 Multiview Scene Graph Juexiao Zhang et.al. 2410.11187 link
2024-10-12 Leveraging Semantic Cues from Foundation Vision Models for Enhanced Local Feature Correspondence Felipe Cadar et.al. 2410.09533 link
2024-10-11 Voxel-SLAM: A Complete, Accurate, and Versatile LiDAR-Inertial SLAM System Zheng Liu et.al. 2410.08935 link
2024-10-16 Semantic Token Reweighting for Interpretable and Controllable Text Embeddings in CLIP Eunji Kim et.al. 2410.08469 null
2024-10-11 A Unified Deep Semantic Expansion Framework for Domain-Generalized Person Re-identification Eugene P. W. Ang et.al. 2410.08456 null
2024-10-10 A Unified Debiasing Approach for Vision-Language Models across Modalities and Tasks Hoin Jung et.al. 2410.07593 link
2024-10-09 Exploiting Distribution Constraints for Scalable and Efficient Image Retrieval Mohammad Omama et.al. 2410.07022 null
2024-10-09 Pair-VPR: Place-Aware Pre-training and Contrastive Pair Classification for Visual Place Recognition with Vision Transformers Stephen Hausler et.al. 2410.06614 link
2024-10-09 MedImageInsight: An Open-Source Embedding Model for General Domain Medical Imaging Noel C. F. Codella et.al. 2410.06542 null
2024-10-08 Temporal Image Caption Retrieval Competition -- Description and Results Jakub Pokrywka et.al. 2410.06314 null
2024-10-08 Monocular Visual Place Recognition in LiDAR Maps via Cross-Modal State Space Model and Multi-View Matching Gongxin Yao et.al. 2410.06285 null
2024-10-08 GSLoc: Visual Localization with 3D Gaussian Splatting Kazii Botashev et.al. 2410.06165 null
2024-10-08 Beyond Captioning: Task-Specific Prompting for Improved VLM Performance in Mathematical Reasoning Ayush Singh et.al. 2410.05928 null
2024-10-08 RNR-Nav: A Real-World Visual Navigation System Using Renderable Neural Radiance Maps Minsoo Kim et.al. 2410.05621 null
2024-10-11 LoTLIP: Improving Language-Image Pre-training for Long Text Understanding Wei Wu et.al. 2410.05249 null
2024-10-06 LiteVLoc: Map-Lite Visual Localization for Image Goal Navigation Jianhao Jiao et.al. 2410.04419 null
2024-10-02 Boosting Weakly-Supervised Referring Image Segmentation via Progressive Comprehension Zaiquan Yang et.al. 2410.01544 null
2024-10-03 EUFCC-CIR: a Composed Image Retrieval Dataset for GLAM Collections Francesc Net et.al. 2410.01536 link
2024-10-04 CSIM: A Copula-based similarity index sensitive to local changes for Image quality assessment Safouane El Ghazouali et.al. 2410.01411 link
2024-09-30 Class-Agnostic Visio-Temporal Scene Sketch Semantic Segmentation Aleyna Kütük et.al. 2410.00266 null
2024-09-29 CELLmap: Enhancing LiDAR SLAM through Elastic and Lightweight Spherical Map Representation Yifan Duan et.al. 2409.19597 null
2024-09-28 VLAD-BuFF: Burst-aware Fast Feature Aggregation for Visual Place Recognition Ahmad Khaliq et.al. 2409.19293 link
2024-09-27 MASt3R-SfM: a Fully-Integrated Solution for Unconstrained Structure-from-Motion Bardienus Duisterhof et.al. 2409.19152 null
2024-09-26 Search and Detect: Training-Free Long Tail Object Detection via Web-Image Retrieval Mankeerat Sidhu et.al. 2409.18733 null
2024-09-26 Revisit Anything: Visual Place Recognition via Image Segment Retrieval Kartik Garg et.al. 2409.18049 link
2024-09-24 GSplatLoc: Grounding Keypoint Descriptors into 3D Gaussian Splatting for Improved Visual Localization Gennady Sidorov et.al. 2409.16502 link
2024-09-23 CamLoPA: A Hidden Wireless Camera Localization Framework via Signal Propagation Path Analysis Xiang Zhang et.al. 2409.15169 null
2024-09-21 Combining Absolute and Semi-Generalized Relative Poses for Visual Localization Vojtech Panek et.al. 2409.14269 null
2024-09-21 SplatLoc: 3D Gaussian Splatting-based Visual Localization for Augmented Reality Hongjia Zhai et.al. 2409.14067 null
2024-09-20 Efficient and Discriminative Image Feature Extraction for Universal Image Retrieval Morris Florek et.al. 2409.13513 link
2024-09-18 Towards Global Localization using Multi-Modal Object-Instance Re-Identification Aneesh Chavan et.al. 2409.12002 link
2024-09-17 Open-Set Semantic Uncertainty Aware Metric-Semantic Graph Matching Kurran Singh et.al. 2409.11555 null
2024-09-17 Obfuscation Based Privacy Preserving Representations are Recoverable Using Neighborhood Information Kunal Chelani et.al. 2409.11536 null
2024-09-17 Improving the Efficiency of Visually Augmented Language Models Paula Ontalvilla et.al. 2409.11148 link
2024-09-21 HGSLoc: 3DGS-based Heuristic Camera Pose Refinement Zhongyan Niu et.al. 2409.10925 null
2024-09-16 SOLVR: Submap Oriented LiDAR-Visual Re-Localisation Joshua Knights et.al. 2409.10247 null
2024-09-16 Garment Attribute Manipulation with Multi-level Attention Vittorio Casula et.al. 2409.10206 null
2024-09-14 Evaluating Pre-trained Convolutional Neural Networks and Foundation Models as Feature Extractors for Content-based Medical Image Retrieval Amirreza Mahbod et.al. 2409.09430 link
2024-09-12 Structured Pruning for Efficient Visual Place Recognition Oliver Grainge et.al. 2409.07834 null
2024-09-10 GeoCalib: Learning Single-image Calibration with Geometric Optimization Alexander Veicht et.al. 2409.06704 link
2024-09-10 Weakly-supervised Camera Localization by Ground-to-satellite Image Registration Yujiao Shi et.al. 2409.06471 link
2024-09-10 A Cross-Font Image Retrieval Network for Recognizing Undeciphered Oracle Bone Inscriptions Zhicong Wu et.al. 2409.06381 null
2024-09-09 Referring Expression Generation in Visually Grounded Dialogue with Discourse-aware Comprehension Guiding Bram Willemsen et.al. 2409.05721 link
2024-09-09 Open-World Dynamic Prompt and Continual Visual Representation Learning Youngeun Kim et.al. 2409.05312 null
2024-09-12 Training-free ZS-CIR via Weighted Modality Fusion and Similarity Ren-Di Wu et.al. 2409.04918 link
2024-09-12 Zero-Shot Whole Slide Image Retrieval in Histopathology Using Embeddings of Foundation Models Saghir Alfasly et.al. 2409.04631 null
2024-09-06 Reprojection Errors as Prompts for Efficient Scene Coordinate Regression Ting-Ru Liu et.al. 2409.04178 null
2024-09-06 Matched Filtering based LiDAR Place Recognition for Urban and Natural Environments Therese Joseph et.al. 2409.03998 null
2024-09-04 Design and Evaluation of Camera-Centric Mobile Crowdsourcing Applications Abby Stylianou et.al. 2409.03012 null
2024-09-04 NUDGE: Lightweight Non-Parametric Fine-Tuning of Embeddings for Retrieval Sepanta Zeighami et.al. 2409.02343 link
2024-09-03 Optimizing CLIP Models for Image Retrieval with Maintained Joint-Embedding Alignment Konstantin Schall et.al. 2409.01936 link
2024-09-02 A Review of Image Retrieval Techniques: Data Augmentation and Adversarial Learning Approaches Kim Jinwoo et.al. 2409.01219 null
2024-09-02 Online One-Dimensional Magnetic Field SLAM with Loop-Closure Detection Manon Kok et.al. 2409.01091 null
2024-09-02 Evidential Transformers for Improved Image Retrieval Danilo Dordevic et.al. 2409.01082 null
2024-09-05 EgoHDM: An Online Egocentric-Inertial Human Motion Capture, Localization, and Dense Mapping System Bonan Liu et.al. 2409.00343 null
2024-09-04 Augmented Reality without Borders: Achieving Precise Localization Without Maps Albert Gassol Puigjaner et.al. 2408.17373 null
2024-09-02 RISSOLE: Parameter-efficient Diffusion Models via Block-wise Generation and Retrieval-Guidance Avideep Mukherjee et.al. 2408.17095 null
2024-08-29 A compact neuromorphic system for ultra energy-efficient, on-device robot localization Adam D. Hines et.al. 2408.16754 link
2024-08-29 Rethinking Sparse Lexical Representations for Image Retrieval in the Age of Rising Multi-Modal Large Language Models Kengo Nakata et.al. 2408.16296 null
2024-08-28 Temporal Attention for Cross-View Sequential Image Localization Dong Yuan et.al. 2408.15569 link
2024-08-27 Snap and Diagnose: An Advanced Multimodal Retrieval System for Identifying Plant Diseases in the Wild Tianqi Wei et.al. 2408.14723 null
2024-08-25 LowCLIP: Adapting the CLIP Model Architecture for Low-Resource Languages in Multimodal Image Retrieval Task Ali Asgarov et.al. 2408.13909 link
2024-08-15 Cross-Modal Denoising: A Novel Training Paradigm for Enhancing Speech-Image Retrieval Lifeng Zhou et.al. 2408.13705 null
2024-08-15 Coarse-to-fine Alignment Makes Better Speech-image Retrieval Lifeng Zhou et.al. 2408.13119 null
2024-08-21 FUSELOC: Fusing Global and Local Descriptors to Disambiguate 2D-3D Matching in Visual Localization Son Tung Nguyen et.al. 2408.12037 link
2024-08-21 Visual Localization in 3D Maps: Comparing Point Cloud, Mesh, and NeRF Representations Lintong Zhang et.al. 2408.11966 null
2024-08-21 UniFashion: A Unified Vision-Language Model for Multimodal Fashion Retrieval and Generation Xiangyu Zhao et.al. 2408.11305 link
2024-08-20 GSLoc: Efficient Camera Pose Refinement via 3D Gaussian Splatting Changkun Liu et.al. 2408.11085 link
2024-08-19 BrewCLIP: A Bifurcated Representation Learning Framework for Audio-Visual Retrieval Zhenyu Lu et.al. 2408.10383 null
2024-08-23 Fashion Image-to-Image Translation for Complementary Item Retrieval Matteo Attimonelli et.al. 2408.09847 link
2024-08-20 MambaLoc: Efficient Camera Localisation via State Space Model Jialu Wang et.al. 2408.09680 null
2024-08-15 DM2RM: Dual-Mode Multimodal Ranking for Target Objects and Receptacles Based on Open-Vocabulary Instructions Ryosuke Korekata et.al. 2408.07910 null
2024-08-13 A Miniature Vision-Based Localization System for Indoor Blimps Shicong Ma et.al. 2408.06648 null
2024-08-10 Cross-view image geo-localization with Panorama-BEV Co-Retrieval Network Junyan Ye et.al. 2408.05475 link
2024-08-09 Spherical World-Locking for Audio-Visual Localization in Egocentric Videos Heeseung Yun et.al. 2408.05364 null
2024-08-06 AMES: Asymmetric and Memory-Efficient Similarity Estimation for Instance-level Retrieval Pavel Suma et.al. 2408.03282 link
2024-08-05 CMR-Agent: Learning a Cross-Modal Agent for Iterative Image-to-Point Cloud Registration Gongxin Yao et.al. 2408.02394 null
2024-08-09 BEVPlace++: Fast, Robust, and Lightweight LiDAR Global Localization for Unmanned Ground Vehicles Lun Luo et.al. 2408.01841 link
2024-08-02 On Validation of Search & Retrieval of Tissue Images in Digital Pathology H. R. Tizhoosh et.al. 2408.01570 null
2024-07-31 VIPeR: Visual Incremental Place Recognition with Adaptive Mining and Lifelong Learning Yuhang Ming et.al. 2407.21416 null
2024-07-31 SuperVINS: A visual-inertial SLAM framework integrated deep learning features Hongkun Luo et.al. 2407.21348 link
2024-07-30 Re-localization acceleration with Medoid Silhouette Clustering Hongyi Zhang et.al. 2407.20749 null
2024-07-29 A flexible framework for accurate LiDAR odometry, map manipulation, and localization José Luis Blanco-Claraco et.al. 2407.20465 link
2024-07-26 From 2D to 3D: AISG-SLA Visual Localization Challenge Jialin Gao et.al. 2407.18590 null
2024-07-24 Revolutionizing Text-to-Image Retrieval as Autoregressive Token-to-Voken Generation Yongqi Li et.al. 2407.17274 null
2024-07-24 Active Loop Closure for OSM-guided Robotic Mapping in Large-Scale Urban Environments Wei Gao et.al. 2407.17078 null
2024-07-24 Pose Estimation from Camera Images for Underwater Inspection Luyuan Peng et.al. 2407.16961 null
2024-07-22 Memory Management for Real-Time Appearance-Based Loop Closure Detection Mathieu Labbé et.al. 2407.15890 null
2024-07-22 RADA: Robust and Accurate Feature Learning with Domain Adaptation Jingtai He et.al. 2407.15791 null
2024-07-22 Online Global Loop Closure Detection for Large-Scale Multi-Session Graph-Based SLAM Mathieu Labbé et.al. 2407.15305 null
2024-07-22 Appearance-Based Loop Closure Detection for Online Large-Scale and Long-Term Operation Mathieu Labbé et.al. 2407.15304 null
2024-07-19 Double-Layer Soft Data Fusion for Indoor Robot WiFi-Visual Localization Yuehua Ding et.al. 2407.14643 null
2024-07-18 Visual Haystacks: Answering Harder Questions About Sets of Images Tsung-Han Wu et.al. 2407.13766 link
2024-07-17 Towards Revisiting Visual Place Recognition for Joining Submaps in Multimap SLAM Markus Weißflog et.al. 2407.12408 null
2024-07-17 GV-Bench: Benchmarking Local Feature Matching for Geometric Verification of Long-term Loop Closure Detection Jingwen Yu et.al. 2407.11736 link
2024-07-16 EndoFinder: Online Image Retrieval for Explainable Colorectal Polyp Diagnosis Ruijie Yang et.al. 2407.11401 null
2024-07-15 No Train, all Gain: Self-Supervised Gradients Improve Deep Frozen Representations Walter Simoncini et.al. 2407.10964 link
2024-07-15 DINO Pre-training for Vision-based End-to-end Autonomous Driving Shubham Juneja et.al. 2407.10803 null
2024-07-15 Addressing Image Hallucination in Text-to-Image Generation through Factual Image Retrieval Youngsun Lim et.al. 2407.10683 null
2024-07-15 An evaluation of CNN models and data augmentation techniques in hierarchical localization of mobile robots J. J. Cabrera et.al. 2407.10596 link
2024-07-15 An experimental evaluation of Siamese Neural Networks for robot localization using omnidirectional imaging in indoor environments J. J. Cabrera et.al. 2407.10536 null
2024-07-12 Are They the Same Picture? Adapting Concept Bottleneck Models for Human-AI Collaboration in Image Retrieval Vaibhav Balloli et.al. 2407.08908 link
2024-07-11 Improving Visual Place Recognition Based Robot Navigation Through Verification of Localization Estimates Owen Claxton et.al. 2407.08162 link
2024-07-12 Lifelong Histopathology Whole Slide Image Retrieval via Distance Consistency Rehearsal Xinyu Zhu et.al. 2407.08153 link
2024-07-11 SGLC: Semantic Graph-Guided Coarse-Fine-Refine Full Loop Closing for LiDAR SLAM Neng Wang et.al. 2407.08106 link
2024-07-09 LVLM-empowered Multi-modal Representation Learning for Visual Place Recognition Teng Wang et.al. 2407.06730 null
2024-07-09 CEIA: CLIP-Based Event-Image Alignment for Open-World Event-Based Understanding Wenhao Xu et.al. 2407.06611 null
2024-07-08 Pseudo-triplet Guided Few-shot Composed Image Retrieval Bohan Hou et.al. 2407.06001 null
2024-07-09 HyCIR: Boosting Zero-Shot Composed Image Retrieval with Synthetic Labels Yingying Jiang et.al. 2407.05795 null
2024-07-05 Elevating All Zero-Shot Sketch-Based Image Retrieval Through Multimodal Prompt Learning Mainak Singha et.al. 2407.04207 link
2024-07-04 Visualizing Dialogues: Enhancing Image Selection through Dialogue Understanding with Large Language Models Chang-Sheng Kao et.al. 2407.03615 link
2024-07-03 Celeb-FBI: A Benchmark Dataset on Human Full Body Images and Age, Gender, Height and Weight Estimation using Deep Learning Approach Pronay Debnath et.al. 2407.03486 null
2024-07-02 Close, But Not There: Boosting Geographic Distance Sensitivity in Visual Place Recognition Sergio Izquierdo et.al. 2407.02422 link
2024-07-01 Freeview Sketching: View-Aware Fine-Grained Sketch-Based Image Retrieval Aneeshan Sain et.al. 2407.01810 null
2024-07-01 Cross-Modal Attention Alignment Network with Auxiliary Text Description for zero-shot sketch-based image retrieval Hanwen Su et.al. 2407.00979 null
2024-07-01 Dynamically Modulating Visual Place Recognition Sequence Length For Minimum Acceptable Performance Scenarios Connor Malone et.al. 2407.00863 null
2024-06-27 PathAlign: A vision-language model for whole slide images in histopathology Faruk Ahmed et.al. 2406.19578 null
2024-07-05 360 in the Wild: Dataset for Depth Prediction and View Synthesis Kibaek Park et.al. 2406.18898 null
2024-06-27 Zero-shot Composed Image Retrieval Considering Query-target Relationship Leveraging Masked Image-text Pairs Huaying Zhang et.al. 2406.18836 null
2024-06-26 WV-Net: A foundation model for SAR WV-mode satellite imagery trained using contrastive self-supervised learning on 10 million images Yannik Glaser et.al. 2406.18765 null
2024-06-26 View-Invariant Pixelwise Anomaly Detection in Multi-object Scenes with Adaptive View Synthesis Subin Varghese et.al. 2406.18012 null
2024-06-25 Tell Me Where You Are: Multimodal LLMs Meet Place Recognition Zonglin Lyu et.al. 2406.17520 null
2024-06-25 SlideSLAM: Sparse, Lightweight, Decentralized Metric-Semantic SLAM for Multi-Robot Navigation Xu Liu et.al. 2406.17249 link
2024-06-23 Breaking the Frame: Image Retrieval by Visual Overlap Prediction Tong Wei et.al. 2406.16204 link
2024-06-19 Towards a multimodal framework for remote sensing image change retrieval and captioning Roger Ferrod et.al. 2406.13424 link
2024-06-19 CLIP-Branches: Interactive Fine-Tuning for Text-Image Retrieval Christian Lülf et.al. 2406.13322 link
2024-06-17 Matching Query Image Against Selected NeRF Feature for Efficient and Scalable Localization Huaiji Zhou et.al. 2406.11766 null
2024-06-22 Simple Yet Efficient: Towards Self-Supervised FG-SBIR with Unified Sample Feature Alignment Jianan Jiang et.al. 2406.11551 link
2024-06-17 They're All Doctors: Synthesizing Diverse Counterfactuals to Mitigate Associative Bias Salma Abdel Magid et.al. 2406.11331 null
2024-06-17 Accurate and Fast Pixel Retrieval with Spatial and Uncertainty Aware Hypergraph Diffusion Guoyuan An et.al. 2406.11242 null
2024-06-14 Annotation Cost-Efficient Active Learning for Deep Metric Learning Driven Remote Sensing Image Retrieval Genc Hoxha et.al. 2406.10107 null
2024-06-14 BiVLC: Extending Vision-Language Compositionality Evaluation with Text-to-Image Retrieval Imanol Miranda et.al. 2406.09952 link
2024-06-13 Common and Rare Fundus Diseases Identification Using Vision-Language Foundation Model with Knowledge of Over 400 Diseases Meng Wang et.al. 2406.09317 link
2024-06-13 Reducing Task Discrepancy of Text Encoders for Zero-Shot Composed Image Retrieval Jaeseok Byun et.al. 2406.09188 null
2024-06-13 DenoiseReID: Denoising Model for Representation Learning of Person Re-Identification Zhengrui Xu et.al. 2406.08773 link
2024-06-12 Self-supervised Learning of Neural Implicit Feature Fields for Camera Pose Refinement Maxime Pietrantoni et.al. 2406.08463 null
2024-06-12 ConceptHash: Interpretable Fine-Grained Hashing via Concept Discovery Kam Woh Ng et.al. 2406.08457 link
2024-06-11 Image Textualization: An Automatic Framework for Creating Accurate and Detailed Image Descriptions Renjie Pi et.al. 2406.07502 link
2024-06-11 Benchmarking Vision-Language Contrastive Methods for Medical Representation Learning Shuvendu Roy et.al. 2406.07450 link
2024-06-11 Fetch-A-Set: A Large-Scale OCR-Free Benchmark for Historical Document Retrieval Adrià Molina et.al. 2406.07315 null
2024-06-10 Multicam-SLAM: Non-overlapping Multi-camera SLAM for Indirect Visual Localization and Navigation Shenghao Li et.al. 2406.06374 link
2024-06-09 Unified Text-to-Image Generation and Retrieval Leigang Qu et.al. 2406.05814 null
2024-06-07 The Unmet Promise of Synthetic Training Images: Using Retrieved Real Images Performs Better Scott Geng et.al. 2406.05184 link
2024-06-07 PQPP: A Joint Benchmark for Text-to-Image Prompt and Query Performance Prediction Eduard Poesina et.al. 2406.04746 link
2024-06-06 GLACE: Global Local Accelerated Coordinate Encoding Fangjinhua Wang et.al. 2406.04340 link
2024-06-06 Monocular Localization with Semantics Map for Autonomous Vehicles Jixiang Wan et.al. 2406.03835 null
2024-06-05 Interactive Text-to-Image Retrieval with Large Language Models: A Plug-and-Play Approach Saehyung Lee et.al. 2406.03411 link
2024-06-04 MeshVPR: Citywide Visual Place Recognition Using 3D Meshes Gabriele Berton et.al. 2406.02776 null
2024-06-04 Can CLIP help CLIP in learning 3D? Cristian Sbrolli et.al. 2406.02202 null
2024-06-03 Decomposing and Interpreting Image Representations via Text in ViTs Beyond CLIP Sriram Balasubramanian et.al. 2406.01583 link
2024-06-03 Scale-Free Image Keypoints Using Differentiable Persistent Homology Giovanni Barbarani et.al. 2406.01315 link
2024-06-02 Visual place recognition for aerial imagery: A survey Ivan Moskalenko et.al. 2406.00885 link
2024-06-01 NuRF: Nudging the Particle Filter in Radiance Fields for Robot Visual Localization Wugang Meng et.al. 2406.00312 null
2024-05-31 DeCo: Decoupling Token Compression from Semantic Abstraction in Multimodal Large Language Models Linli Yao et.al. 2405.20985 link
2024-05-29 Multi-Modal Generative Embedding Model Feipeng Ma et.al. 2405.19333 null
2024-05-29 ContextBLIP: Doubly Contextual Alignment for Contrastive Image Retrieval from Linguistically Complex Descriptions Honglin Lin et.al. 2405.19226 null
2024-05-30 CaLa: Complementary Association Learning for Augmenting Composed Image Retrieval Xintong Jiang et.al. 2405.19149 link
2024-05-29 SketchTriplet: Self-Supervised Scenarized Sketch-Text-Image Triplet Generation Zhenbei Wu et.al. 2405.18801 null
2024-05-29 Reverse Image Retrieval Cues Parametric Memory in Multimodal LLMs Jialiang Xu et.al. 2405.18740 link
2024-05-28 EffoVPR: Effective Foundation Model Utilization for Visual Place Recognition Issar Tzachor et.al. 2405.18065 null
2024-05-28 AdapNet: Adaptive Noise-Based Network for Low-Quality Image Retrieval Sihe Zhang et.al. 2405.17718 null
2024-05-26 MCGMapper: Light-Weight Incremental Structure from Motion and Visual Localization With Planar Markers and Camera Groups Yusen Xie et.al. 2405.16599 null
2024-05-29 Composed Image Retrieval for Remote Sensing Bill Psomas et.al. 2405.15587 link
2024-05-24 Self-distilled Dynamic Fusion Network for Language-based Fashion Retrieval Yiming Wu et.al. 2405.15451 null
2024-05-20 UAV-VisLoc: A Large-scale Dataset for UAV Visual Localization Wenjia Xu et.al. 2405.11936 link
2024-05-19 Register assisted aggregation for Visual Place Recognition Xuan Yu et.al. 2405.11526 null
2024-05-26 CCTNet: A Circular Convolutional Transformer Network for LiDAR-based Place Recognition Handling Movable Objects Occlusion Gang Wang et.al. 2405.10793 null
2024-05-16 FFF: Fixing Flawed Foundations in contrastive pre-training results in very strong Vision-Language models Adrian Bulat et.al. 2405.10286 null
2024-05-15 Content-Based Image Retrieval for Multi-Class Volumetric Radiology Images: A Benchmark Study Farnaz Khun Jush et.al. 2405.09334 null
2024-05-14 BEVRender: Vision-based Cross-view Vehicle Registration in Off-road GNSS-denied Environment Lihong Jin et.al. 2405.09001 null
2024-05-14 TP3M: Transformer-based Pseudo 3D Image Matching with Reference Liming Han et.al. 2405.08434 null
2024-05-13 OverlapMamba: Novel Shift State Space Model for LiDAR-based Place Recognition Qiuchi Xiang et.al. 2405.07966 link
2024-05-14 HybridHash: Hybrid Convolutional and Self-Attention Deep Hashing for Image Retrieval Chao He et.al. 2405.07524 link
2024-05-13 JointLoc: A Real-time Visual Localization Framework for Planetary UAVs Based on Joint Relative and Absolute Pose Estimation Xubo Luo et.al. 2405.07429 link
2024-05-12 BoQ: A Place is Worth a Bag of Learnable Queries Amar Ali-bey et.al. 2405.07364 link
2024-05-07 Breast Histopathology Image Retrieval by Attention-based Adversarially Regularized Variational Graph Autoencoder with Contrastive Learning-Based Feature Extraction Nematollah Saeidi et.al. 2405.04211 null
2024-05-06 A New Robust Partial $p$-Wasserstein-Based Metric for Comparing Distributions Sharath Raghvendra et.al. 2405.03664 null
2024-05-06 Knowledge-aware Text-Image Retrieval for Remote Sensing Images Li Mi et.al. 2405.03373 null
2024-05-06 Adapting Dual-encoder Vision-language Models for Paraphrased Retrieval Jiacheng Cheng et.al. 2405.03190 null
2024-05-05 iSEARLE: Improving Textual Inversion for Zero-Shot Composed Image Retrieval Lorenzo Agnolucci et.al. 2405.02951 link
2024-05-01 Spherical Linear Interpolation and Text-Anchoring for Zero-shot Composed Image Retrieval Young Kyun Jang et.al. 2405.00571 null
2024-04-30 Large Language Model Informed Patent Image Retrieval Hao-Cheng Lo et.al. 2404.19360 null
2024-04-30 XFeat: Accelerated Features for Lightweight Image Matching Guilherme Potje et.al. 2404.19174 null
2024-04-29 Enhancing Interactive Image Retrieval With Query Rewriting Using Large Language Models and Vision Language Models Hongyi Zhu et.al. 2404.18746 null
2024-04-29 Dual-Modal Prompting for Sketch-Based Image Retrieval Liying Gao et.al. 2404.18695 null
2024-05-01 Semantic Line Combination Detector Jinwon Ko et.al. 2404.18399 link
2024-04-26 Learning text-to-video retrieval from image captioning Lucas Ventura et.al. 2404.17498 null
2024-04-25 CriSp: Leveraging Tread Depth Maps for Enhanced Crime-Scene Shoeprint Matching Samia Shafique et.al. 2404.16972 link
2024-04-29 Revisiting Relevance Feedback for CLIP-based Interactive Image Retrieval Ryoya Nara et.al. 2404.16398 null
2024-04-24 Simple but Effective Raw-Data Level Multimodal Fusion for Composed Image Retrieval Haokun Wen et.al. 2404.15875 link
2024-04-24 DVF: Advancing Robust and Accurate Fine-Grained Image Retrieval with Retrieval Guidelines Xin Jiang et.al. 2404.15771 null
2024-04-23 Visual Delta Generator with Large Multi-modal Models for Semi-supervised Composed Image Retrieval Young Kyun Jang et.al. 2404.15516 null
2024-04-22 EcoPull: Sustainable IoT Image Retrieval Empowered by TinyML Models Mathias Thorsager et.al. 2404.14236 null
2024-04-22 Hierarchical localization with panoramic views and triplet loss functions Marcos Alfaro et.al. 2404.14117 link
2024-04-20 High-fidelity Endoscopic Image Synthesis by Utilizing Depth-guided Neural Surfaces Baoru Huang et.al. 2404.13437 null
2024-04-20 Collaborative Visual Place Recognition through Federated Learning Mattia Dutto et.al. 2404.13324 null
2024-04-18 SPOT: Point Cloud Based Stereo Visual Place Recognition for Similar and Opposing Viewpoints Spencer Carmichael et.al. 2404.12339 null
2024-04-17 Improving Composed Image Retrieval via Contrastive Learning with Scaling Positives and Negatives Zhangchi Feng et.al. 2404.11317 link
2024-04-17 Spatial-Aware Image Retrieval: A Hyperdimensional Computing Approach for Efficient Similarity Hashing Sanggeon Yun et.al. 2404.11025 null
2024-04-16 SPVLoc: Semantic Panoramic Viewport Matching for 6D Camera Localization in Unseen Environments Niklas Gard et.al. 2404.10527 link
2024-04-20 CREST: Cross-modal Resonance through Evidential Deep Learning for Enhanced Zero-Shot Learning Haojian Huang et.al. 2404.09640 link
2024-04-11 PRAM: Place Recognition Anywhere Model for Efficient Visual Localization Fei Xue et.al. 2404.07785 null
2024-04-16 2DLIW-SLAM: 2D LiDAR-Inertial-Wheel Odometry with Real-Time Loop Closure Bin Zhang et.al. 2404.07644 link
2024-04-11 Semantically-correlated memories in a dense associative model Thomas F Burns et.al. 2404.07123 link
2024-04-09 Training-Free Open-Vocabulary Segmentation with Offline Diffusion-Augmented Prototype Generation Luca Barsellotti et.al. 2404.06542 null
2024-04-09 Learning Embeddings with Centroid Triplet Loss for Object Identification in Robotic Grasping Anas Gouda et.al. 2404.06277 link
2024-04-07 Weakly Supervised Deep Hyperspherical Quantization for Image Retrieval Jinpeng Wang et.al. 2404.04998 link
2024-04-06 Soft-Prompting with Graph-of-Thought for Multi-modal Representation Learning Juncheng Yang et.al. 2404.04538 link
2024-04-05 Towards introspective loop closure in 4D radar SLAM Maximilian Hilger et.al. 2404.03940 null
2024-04-02 TSCM: A Teacher-Student Model for Vision Place Recognition Using Cross-Metric Knowledge Distillation Yehui Shen et.al. 2404.01587 link
2024-04-01 On Train-Test Class Overlap and Detection for Image Retrieval Chull Hwan Song et.al. 2404.01524 link
2024-04-01 NVINS: Robust Visual Inertial Navigation Fused with NeRF-augmented Camera Pose Regressor and Uncertainty Quantification Juyeop Han et.al. 2404.01400 null
2024-03-31 On the Estimation of Image-matching Uncertainty in Visual Place Recognition Mubariz Zaffar et.al. 2404.00546 null
2024-03-31 NYC-Indoor-VPR: A Long-Term Indoor Visual Place Recognition Dataset with Semi-Automatic Annotation Diwei Sheng et.al. 2404.00504 null
2024-03-30 SceneGraphLoc: Cross-Modal Coarse Visual Localization on 3D Scene Graphs Yang Miao et.al. 2404.00469 null
2024-03-30 Do Vision-Language Models Understand Compound Nouns? Sonal Kumar et.al. 2404.00419 link
2024-04-05 FairRAG: Fair Human Generation via Fair Retrieval Augmentation Robik Shrestha et.al. 2403.19964 null
2024-03-28 JIST: Joint Image and Sequence Training for Sequential Visual Place Recognition Gabriele Berton et.al. 2403.19787 link
2024-03-28 MagicLens: Self-Supervised Image Retrieval with Open-Ended Instructions Kai Zhang et.al. 2403.19651 link
2024-03-27 AIR-HLoc: Adaptive Image Retrieval for Efficient Visual Localisation Changkun Liu et.al. 2403.18281 null
2024-03-26 Learning to Visually Localize Sound Sources from Mixtures without Prior Source Knowledge Dongjin Kim et.al. 2403.17420 link
2024-03-25 Enhancing Visual Place Recognition via Fast and Slow Adaptive Biasing in Event Cameras Gokul B. Nair et.al. 2403.16425 link
2024-03-24 Knowledge-Enhanced Dual-stream Zero-shot Composed Image Retrieval Yucheng Suo et.al. 2403.16005 link
2024-03-24 BIMCV-R: A Landmark Dataset for 3D CT Text-Image Retrieval Yinda Chen et.al. 2403.15992 null
2024-03-22 Long-CLIP: Unlocking the Long-Text Capability of CLIP Beichen Zhang et.al. 2403.15378 link
2024-03-22 A Multimodal Approach for Cross-Domain Image Retrieval Lucas Iijima et.al. 2403.15152 null
2024-03-22 Piecewise-Linear Manifolds for Deep Metric Learning Shubhang Bhatnagar et.al. 2403.14977 null
2024-03-21 Enhancing Historical Image Retrieval with Compositional Cues Tingyu Lin et.al. 2403.14287 link
2024-03-20 Leveraging High-Resolution Features for Improved Deep Hashing-based Image Retrieval Aymene Berriche et.al. 2403.13747 null
2024-03-20 Flickr30K-CFQ: A Compact and Fragmented Query Dataset for Text-image Retrieval Haoyu Liu et.al. 2403.13317 null
2024-03-19 Learning Neural Volumetric Pose Features for Camera Localization Jingyu Lin et.al. 2403.12800 null
2024-03-19 Quantixar: High-performance Vector Data Management System Gulshan Yadav et.al. 2403.12583 null
2024-03-17 3DGS-ReLoc: 3D Gaussian Splatting for Map Representation and Visual ReLocalization Peng Jiang et.al. 2403.11367 null
2024-03-17 MindEye2: Shared-Subject Models Enable fMRI-To-Image With 1 Hour of Data Paul S. Scotti et.al. 2403.11207 link
2024-03-16 Refining Knowledge Transfer on Audio-Image Temporal Agreement for Audio-Text Cross Retrieval Shunsuke Tsubaki et.al. 2403.10756 null
2024-03-16 Vector search with small radiuses Gergely Szilvasy et.al. 2403.10746 null
2024-03-13 Training Self-localization Models for Unseen Unfamiliar Places via Teacher-to-Student Data-Free Knowledge Transfer Kenta Tsukahara et.al. 2403.10552 null
2024-03-20 Leveraging Neural Radiance Field in Descriptor Synthesis for Keypoints Scene Coordinate Regression Huy-Hoang Bui et.al. 2403.10297 link
2024-03-15 Local positional graphs and attentive local features for a data and runtime-efficient hierarchical place recognition pipeline Fangming Yuan et.al. 2403.10283 null
2024-03-14 The NeRFect Match: Exploring NeRF Features for Visual Localization Qunjie Zhou et.al. 2403.09577 null
2024-03-14 VDNA-PR: Using General Dataset Representations for Robust Sequential Visual Place Recognition Benjamin Ramtoula et.al. 2403.09025 null
2024-03-13 PAPERCLIP: Associating Astronomical Observations and Natural Language with Multi-Modal Models Siddharth Mishra-Sharma et.al. 2403.08851 link
2024-03-13 NeRF-Supervised Feature Point Detection and Description Ali Youssef et.al. 2403.08156 link
2024-03-12 It's All About Your Sketch: Democratising Sketch Control in Diffusion Models Subhadeep Koley et.al. 2403.07234 link
2024-03-12 You'll Never Walk Alone: A Sketch and Text Duet for Fine-Grained Image Retrieval Subhadeep Koley et.al. 2403.07222 null
2024-03-12 Text-to-Image Diffusion Models are Great Sketch-Photo Matchmakers Subhadeep Koley et.al. 2403.07214 null
2024-03-11 How to Handle Sketch-Abstraction in Sketch-Based Image Retrieval? Subhadeep Koley et.al. 2403.07203 null
2024-03-11 EarthLoc: Astronaut Photography Localization by Indexing Earth from Space Gabriele Berton et.al. 2403.06758 link
2024-03-11 BEV2PR: BEV-Enhanced Visual Place Recognition with Structural Cues Fudong Ge et.al. 2403.06600 link
2024-03-11 Leveraging Foundation Models for Content-Based Medical Image Retrieval in Radiology Stefan Denner et.al. 2403.06567 link
2024-03-10 RTAB-Map as an Open-Source Lidar and Visual SLAM Library for Large-Scale and Long-Term Online Operation Mathieu Labbé et.al. 2403.06341 null
2024-03-10 Texture image retrieval using a classification and contourlet-based features Asal Rouhafzay et.al. 2403.06048 null
2024-03-11 LHMap-loc: Cross-Modal Monocular Localization Using LiDAR Point Cloud Heat Map Xinrui Wu et.al. 2403.05002 link
2024-03-11 Efficient LoFTR: Semi-Dense Local Feature Matching with Sparse-Like Speed Yifan Wang et.al. 2403.04765 null
2024-03-07 mmPlace: Robust Place Recognition with Intermediate Frequency Signal of Low-cost Single-chip Millimeter Wave Radar Chengzhen Meng et.al. 2403.04703 null
2024-03-06 Self-supervised Photographic Image Layout Representation Learning Zhaoran Zhao et.al. 2403.03740 link
2024-03-04 Multi-Spectral Remote Sensing Image Retrieval Using Geospatial Foundation Models Benedikt Blumenstiel et.al. 2403.02059 link
2024-03-03 Image2Sentence based Asymmetrical Zero-shot Composed Image Retrieval Yongchao Du et.al. 2403.01431 null
2024-03-01 Asymmetric Feature Fusion for Image Retrieval Hui Wu et.al. 2403.00671 null
2024-03-01 Structure Similarity Preservation Learning for Asymmetric Image Retrieval Hui Wu et.al. 2403.00648 link
2024-02-29 CricaVPR: Cross-image Correlation-aware Representation Learning for Visual Place Recognition Feng Lu et.al. 2402.19231 link
2024-02-28 Unsupervised Cross-Domain Image Retrieval via Prototypical Optimal Transport Bin Li et.al. 2402.18411 link
2024-02-28 Balanced Similarity with Auxiliary Prompts: Towards Alleviating Text-to-Image Retrieval Bias for CLIP in Zero-shot Learning Hanyao Wang et.al. 2402.18400 null
2024-02-28 Representing 3D sparse map points and lines for camera relocalization Bach-Thuan Bui et.al. 2402.18011 link
2024-02-27 Multimodal Learned Sparse Retrieval with Probabilistic Expansion Control Thong Nguyen et.al. 2402.17535 link
2024-02-29 Active propulsion noise shaping for multi-rotor aircraft localization Gabriele Serussi et.al. 2402.17289 link
2024-02-27 NocPlace: Nocturnal Visual Place Recognition Using Generative and Inherited Knowledge Transfer Bingxi Liu et.al. 2402.17159 link
2024-02-25 Deep Homography Estimation for Visual Place Recognition Feng Lu et.al. 2402.16086 link
2024-02-25 VOLoc: Visual Place Recognition by Querying Compressed Lidar Map Xudong Cai et.al. 2402.15961 link
2024-02-28 Text2Pic Swift: Enhancing Long-Text to Image Retrieval for Large-Scale Libraries Zijun Long et.al. 2402.15276 null
2024-02-23 Fine-tuning CLIP Text Encoders with Two-step Paraphrasing Hyunjae Kim et.al. 2402.15120 null
2024-02-22 Towards Seamless Adaptation of Pre-trained Models for Visual Place Recognition Feng Lu et.al. 2402.14505 link
2024-02-16 Spike-EVPR: Deep Spiking Residual Network with Cross-Representation Aggregation for Event-Based Visual Place Recognition Chenming Hu et.al. 2402.10476 null
2024-02-15 Self-Supervised Learning of Visual Robot Localization Using LED State Prediction as a Pretext Task Mirko Nava et.al. 2402.09886 link
2024-02-14 Weatherproofing Retrieval for Localization with Generative AI and Geometric Consistency Yannis Kalantidis et.al. 2402.09237 null
2024-02-13 Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast Xiangming Gu et.al. 2402.08567 link
2024-02-13 Learning to Produce Semi-dense Correspondences for Visual Localization Khang Truong Giang et.al. 2402.08359 link
2024-02-10 Semantic Object-level Modeling for Robust Visual Camera Relocalization Yifan Zhu et.al. 2402.06951 null
2024-02-09 Large Language Models for Captioning and Retrieving Remote Sensing Images João Daniel Silva et.al. 2402.06475 null
2024-02-09 PAS-SLAM: A Visual SLAM System for Planar Ambiguous Scenes Xinggang Hu et.al. 2402.06131 null
2024-02-21 MoD-SLAM: Monocular Dense Mapping for Unbounded 3D Scene Reconstruction Heng Zhou et.al. 2402.03762 null
2024-02-04 Region-Based Representations Revisited Michal Shlapentokh-Rothman et.al. 2402.02352 link
2024-02-03 Zero-shot sketch-based remote sensing image retrieval based on multi-level and attention-guided tokenization Bo Yang et.al. 2402.02141 link
2024-02-01 BrainSLAM: SLAM on Neural Population Activity Data Kipp Freud et.al. 2402.00588 null
2024-02-01 Night-Rider: Nocturnal Vision-aided Localization in Streetlight Maps Using Invariant Extended Kalman Filtering Tianxiao Gao et.al. 2402.00330 link
2024-01-31 Improved Scene Landmark Detection for Camera Localization Tien Do et.al. 2401.18083 link
2024-01-31 Local Feature Matching Using Deep Learning: A Survey Shibiao Xu et.al. 2401.17592 link
2024-01-29 Bridging Generative and Discriminative Models for Unified Visual Perception with Diffusion Priors Shiyin Dong et.al. 2401.16459 null
2024-01-29 Cross-Modal Coordination Across a Diverse Set of Input Modalities Jorge Sánchez et.al. 2401.16347 null
2024-01-29 Regressing Transformers for Data-efficient Visual Place Recognition María Leyva-Vallina et.al. 2401.16304 null
2024-01-27 Transformer-based Clipped Contrastive Quantization Learning for Unsupervised Image Retrieval Ayush Dubey et.al. 2401.15362 null
2024-01-24 Enhancing Image Retrieval : A Comprehensive Study on Photo Search using the CLIP Mode Naresh Kumar Lahajal et.al. 2401.13613 null
2024-01-23 PlaceFormer: Transformer-based Visual Place Recognition using Multi-Scale Patch Selection and Fusion Shyam Sundar Kannan et.al. 2401.13082 null
2024-01-23 SemanticSLAM: Learning based Semantic Map Construction and Robust Camera Localization Mingyang Li et.al. 2401.13076 link
2024-01-25 CBVS: A Large-Scale Chinese Image-Text Benchmark for Real-World Short Video Search Scenarios Xiangshuo Qiao et.al. 2401.10475 link
2024-01-19 PhotoScout: Synthesis-Powered Multi-Modal Image Search Celeste Barnaby et.al. 2401.10464 null
2024-01-19 Cross-Modality Perturbation Synergy Attack for Person Re-identification Yunpeng Gong et.al. 2401.10090 null
2024-01-16 Siamese Content-based Search Engine for a More Transparent Skin and Breast Cancer Diagnosis through Histological Imaging Zahra Tabatabaei et.al. 2401.08272 null
2024-01-16 Multi-Technique Sequential Information Consistency For Dynamic Visual Place Recognition In Changing Environments Bruno Arcanjo et.al. 2401.08263 null
2024-01-15 Exploring Masked Autoencoders for Sensor-Agnostic Image Retrieval in Remote Sensing Jakob Hackstein et.al. 2401.07782 link
2024-01-14 HiHPQ: Hierarchical Hyperbolic Product Quantization for Unsupervised Image Retrieval Zexuan Qiu et.al. 2401.07212 link
2024-01-11 UAVD4L: A Large-Scale Dataset for UAV 6-DoF Localization Rouwan Wu et.al. 2401.05971 link
2024-01-10 Modality-Aware Representation Learning for Zero-shot Sketch-based Image Retrieval Eunyi Lyou et.al. 2401.04860 link
2024-01-05 Benchmarking PathCLIP for Pathology Image Analysis Sunyi Zheng et.al. 2401.02651 null
2024-01-03 DDN-SLAM: Real-time Dense Dynamic Neural Implicit SLAM with Joint Semantic Encoding Mingrui Li et.al. 2401.01545 null
2024-01-02 BEV-CLIP: Multi-modal BEV Retrieval Methodology for Complex Scene in Autonomous Driving Dafeng Wei et.al. 2401.01065 null
2023-12-31 Multi-Granularity Representation Learning for Sketch-based Dynamic Face Image Retrieval Liang Wang et.al. 2401.00371 link
2023-12-29 Bayesian Recursive Information Optical Imaging: A Ghost Imaging Scheme Based on Bayesian Filtering Long-Kun Du et.al. 2401.00032 null
2023-12-27 LIP-Loc: LiDAR Image Pretraining for Cross-Modal Localization Sai Shubodh Puligilla et.al. 2312.16648 null
2023-12-26 Recursive Distillation for Open-Set Distributed Robot Localization Kenta Tsukahara et.al. 2312.15897 null
2023-12-24 Residual Learning for Image Point Descriptors Rashik Shrestha et.al. 2312.15471 null
2023-12-23 CaLDiff: Camera Localization in NeRF via Pose Diffusion Rashik Shrestha et.al. 2312.15242 null
2023-12-20 Aggregating Multiple Bio-Inspired Image Region Classifiers For Effective And Lightweight Visual Place Recognition Bruno Arcanjo et.al. 2312.12995 null
2023-12-19 VQA4CIR: Boosting Composed Image Retrieval with Visual Question Answering Chun-Mei Feng et.al. 2312.12273 link
2023-12-18 Advancing Image Retrieval with Few-Shot Learning and Relevance Feedback Boaz Lerner et.al. 2312.11078 link
2023-12-17 PNeRFLoc: Visual Localization with Point-based Neural Radiance Fields Boming Zhao et.al. 2312.10649 null
2023-12-17 DistilVPR: Cross-Modal Knowledge Distillation for Visual Place Recognition Sijie Wang et.al. 2312.10616 link
2023-12-16 Symmetrical Bidirectional Knowledge Alignment for Zero-Shot Sketch-Based Image Retrieval Decheng Liu et.al. 2312.10320 link
2023-12-15 Data-Efficient Multimodal Fusion on a Single GPU Noël Vouitsis et.al. 2312.10144 link
2023-12-13 Advancements in Content-Based Image Retrieval: A Comprehensive Survey of Relevance Feedback Techniques Hamed Qazanfari et.al. 2312.10089 null
2023-12-15 Let All be Whitened: Multi-teacher Distillation for Efficient Visual Retrieval Zhe Ma et.al. 2312.09716 link
2023-12-14 Design Space Exploration of Low-Bit Quantized Neural Networks for Visual Place Recognition Oliver Grainge et.al. 2312.09028 null
2023-12-14 Training-free Zero-shot Composed Image Retrieval with Local Concept Reranking Shitong Sun et.al. 2312.08924 null
2023-12-13 C-BEV: Contrastive Bird's Eye View Training for Cross-View Image Retrieval and 3-DoF Pose Estimation Florian Fervers et.al. 2312.08060 null
2023-12-12 Contextually Affinitive Neighborhood Refinery for Deep Clustering Chunlin Yu et.al. 2312.07806 link
2023-12-12 Collapse-Oriented Adversarial Training with Triplet Decoupling for Robust Image Retrieval Qiwei Tian et.al. 2312.07364 link
2023-12-12 Attacking the Loop: Adversarial Attacks on Graph-based Loop Closure Detection Jonathan J. Y. Kim et.al. 2312.06991 null
2023-12-11 Dynamic Weighted Combiner for Mixed-Modal Image Retrieval Fuxiang Huang et.al. 2312.06179 link
2023-12-06 Lite-Mind: Towards Efficient and Versatile Brain Representation Network Zixuan Gong et.al. 2312.03781 link
2023-12-08 FreestyleRet: Retrieving Images from Style-Diversified Queries Hao Li et.al. 2312.02428 link
2023-12-04 Implicit Learning of Scene Geometry from Poses for Global Localization Mohammad Altillawi et.al. 2312.02029 null
2023-12-04 Language-only Efficient Training of Zero-shot Composed Image Retrieval Geonmo Gu et.al. 2312.01998 link
2023-12-03 G2D: From Global to Dense Radiography Representation Learning via Vision-Language Pre-training Che Liu et.al. 2312.01522 link
2023-12-01 Improve Supervised Representation Learning with Masked Image Modeling Kaifeng Chen et.al. 2312.00950 null
2023-12-05 Grounding Everything: Emerging Localization Properties in Vision-Language Transformers Walid Bousselham et.al. 2312.00878 link
2023-12-01 Global Localization: Utilizing Relative Spatio-Temporal Geometric Constraints from Adjacent and Distant Cameras Mohammad Altillawi et.al. 2312.00500 null
2023-11-30 HKUST at SemEval-2023 Task 1: Visual Word Sense Disambiguation with Context Augmentation and Visual Assistance Zhuohao Yin et.al. 2311.18273 link
2023-11-30 Label-efficient Training of Small Task-specific Models by Leveraging Vision Foundation Models Raviteja Vemulapalli et.al. 2311.18237 link
2023-11-29 Transformer-empowered Multi-modal Item Embedding for Enhanced Image Search in E-Commerce Chang Liu et.al. 2311.17954 null
2023-11-28 Scene Summarization: Clustering Scene Videos into Spatially Diverse Frames Chao Chen et.al. 2311.17940 null
2023-11-29 360Loc: A Dataset and Benchmark for Omnidirectional Visual Localization with Cross-device Queries Huajian Huang et.al. 2311.17389 link
2023-11-27 Removing NSFW Concepts from Vision-and-Language Models for Text-to-Image Retrieval and Generation Samuele Poppi et.al. 2311.16254 link
2023-11-27 Optimal Transport Aggregation for Visual Place Recognition Sergio Izquierdo et.al. 2311.15937 link
2023-11-27 AI-Generated Images Introduce Invisible Relevance Bias to Text-Image Retrieval Shicheng Xu et.al. 2311.14084 link
2023-11-23 3D-MIR: A Benchmark and Empirical Study on 3D Medical Image Retrieval in Radiology Asma Ben Abacha et.al. 2311.13752 link
2023-11-22 Medical Image Retrieval Using Pretrained Embeddings Farnaz Khun Jush et.al. 2311.13547 null
2023-11-22 Applications of Spiking Neural Networks in Visual Place Recognition Somayeh Hussaini et.al. 2311.13186 link
2023-11-21 Attribute-Aware Deep Hashing with Self-Consistency for Large-Scale Fine-Grained Image Retrieval Xiu-Shen Wei et.al. 2311.12894 null
2023-11-21 Towards Accurate Loop Closure Detection in Semantic SLAM with 3D Semantic Covisibility Graphs Zhentian Qian et.al. 2311.12245 null
2023-11-19 From Categories to Classifier: Name-Only Continual Learning by Exploring the Web Ameya Prabhu et.al. 2311.11293 null
2023-11-18 Lesion Search with Self-supervised Learning Kristin Qi et.al. 2311.11014 null
2023-11-15 Flow reconstruction and particle characterization from inertial Lagrangian tracks Ke Zhou et.al. 2311.09076 null
2023-11-15 Pretrain like Your Inference: Masked Tuning Improves Zero-Shot Composed Image Retrieval Junyang Chen et.al. 2311.07622 link
2023-11-13 VGSG: Vision-Guided Semantic-Group Network for Text-based Person Search Shuting He et.al. 2311.07514 null
2023-11-10 Attributes Grouping and Mining Hashing for Fine-Grained Image Retrieval Xin Lu et.al. 2311.06067 null
2023-11-08 Energy-efficient Wireless Image Retrieval for IoT Devices by Transmitting a TinyML Model Junya Shiraishi et.al. 2311.04788 null
2023-11-08 Training CLIP models on Data from Scientific Papers Calvin Metzger et.al. 2311.04711 link
2023-11-07 DeepPatent2: A Large-Scale Benchmarking Corpus for Technical Drawing Understanding Kehinde Ajayi et.al. 2311.04098 link
2023-11-06 Long-Term Invariant Local Features via Implicit Cross-Domain Correspondences Zador Pataki et.al. 2311.03345 null
2023-11-06 FocusTune: Tuning Visual Localization through Focus-Guided Sampling Son Tung Nguyen et.al. 2311.02872 link
2023-11-01 DINO-Mix: Enhancing Visual Place Recognition with Foundational Vision Model and Feature Mixing Gaoshuang Huang et.al. 2311.00230 link
2023-10-29 Identifiable Contrastive Learning with Automatic Feature Importance Discovery Qi Zhang et.al. 2310.18904 link
2023-10-27 LipSim: A Provably Robust Perceptual Similarity Metric Sara Ghazanfari et.al. 2310.18274 link
2023-10-27 Split Covariance Intersection Filter Based Visual Localization With Accurate AprilTag Map For Warehouse Robot Navigation Susu Fang et.al. 2310.17879 null
2023-10-25 FoundLoc: Vision-based Onboard Aerial Localization in the Wild Yao He et.al. 2310.16299 null
2023-10-24 Cross-view Self-localization from Synthesized Scene-graphs Ryogo Yamamoto et.al. 2310.15504 null
2023-10-23 Semantic-Aware Adversarial Training for Reliable Deep Hashing Retrieval Xu Yuan et.al. 2310.14637 link
2023-10-21 Large Language Models and Multimodal Retrieval for Visual Word Sense Disambiguation Anastasia Kritharoula et.al. 2310.14025 link
2023-10-20 FMRT: Learning Accurate Feature Matching with Reconciliatory Transformer Xinyu Zhang et.al. 2310.13605 null
2023-10-20 CylinderTag: An Accurate and Flexible Marker for Cylinder-Shape Objects Pose Estimation Based on Projective Invariants Shaoan Wang et.al. 2310.13320 link
2023-10-27 Representation Learning via Consistent Assignment of Views over Random Partitions Thalles Silva et.al. 2310.12692 link
2023-10-18 Evaluating the Fairness of Discriminative Foundation Models in Computer Vision Junaid Ali et.al. 2310.11867 link
2023-10-17 Learning Comprehensive Representations with Richer Self for Text-to-Image Person Re-Identification Shuanglin Yan et.al. 2310.11210 null
2023-10-16 Autonomous Mapping and Navigation using Fiducial Markers and Pan-Tilt Camera for Assisting Indoor Mobility of Blind and Visually Impaired People Dharmateja Adapa et.al. 2310.10290 null
2023-10-16 EfficientOCR: An Extensible, Open-Source Package for Efficiently Digitizing World Knowledge Tom Bryan et.al. 2310.10050 null
2023-10-15 CAPro: Webly Supervised Learning with Cross-Modality Aligned Prototypes Yulei Qin et.al. 2310.09761 link
2023-10-13 Pairwise Similarity Learning is SimPLE Yandong Wen et.al. 2310.09449 link
2023-10-13 Vision-by-Language for Training-Free Compositional Image Retrieval Shyamgopal Karthik et.al. 2310.09291 link
2023-10-12 Hyp-UML: Hyperbolic Image Retrieval with Uncertainty-aware Metric Learning Shiyang Yan et.al. 2310.08390 null
2023-10-12 Jointly Optimized Global-Local Visual Localization of UAVs Haoling Li et.al. 2310.08082 null
2023-10-10 Leveraging Neural Radiance Fields for Uncertainty-Aware Visual Localization Le Chen et.al. 2310.06984 null
2023-10-10 Distillation Improves Visual Place Recognition for Low-Quality Queries Anbang Yang et.al. 2310.06906 link
2023-10-10 Efficient Retrieval of Images with Irregular Patterns using Morphological Image Analysis: Applications to Industrial and Healthcare datasets Jiajun Zhang et.al. 2310.06566 null
2023-10-10 Topological RANSAC for instance verification and retrieval without fine-tuning Guoyuan An et.al. 2310.06486 null
2023-10-10 3DS-SLAM: A 3D Object Detection based Semantic SLAM towards Dynamic Indoor Environments Ghanta Sai Krishna et.al. 2310.06385 null
2023-10-09 Collaborative Visual Place Recognition Yiming Li et.al. 2310.05541 null
2023-10-09 Sentence-level Prompts Benefit Composed Image Retrieval Yang Bai et.al. 2310.05473 link
2023-10-08 AANet: Aggregation and Alignment Network with Semi-hard Positive Sample Mining for Hierarchical Place Recognition Feng Lu et.al. 2310.05184 link
2023-10-08 LocoNeRF: A NeRF-based Approach for Local Structure from Motion for Precise Localization Artem Nenashev et.al. 2310.05134 null
2023-10-12 ClusVPR: Efficient Visual Place Recognition with Clustering-based Weighted Transformer Yifan Xu et.al. 2310.04099 null
2023-10-06 Sub-token ViT Embedding via Stochastic Resonance Transformers Dong Lao et.al. 2310.03967 link
2023-10-04 Active Visual Localization for Multi-Agent Collaboration: A Data-Driven Approach Matthew Hanlon et.al. 2310.02650 null
2023-10-02 NEUCORE: Neural Concept Reasoning for Composed Image Retrieval Shu Zhao et.al. 2310.01358 null
2023-10-02 Leveraging Cutting Edge Deep Learning Based Image Matching for Reconstructing a Large Scene from Sparse Images Georg Bökman et.al. 2310.01092 null
2023-10-05 PlaceNav: Topological Navigation through Place Recognition Lauri Suomela et.al. 2309.17260 null
2023-09-29 Segment Anything Model is a Good Teacher for Local Feature Learning Jingqian Wu et.al. 2309.16992 link
2023-09-28 Dark Side Augmentation: Generating Diverse Night Examples for Metric Learning Albert Mohwald et.al. 2309.16351 link
2023-09-28 FORB: A Flat Object Retrieval Benchmark for Universal Image Embedding Pengxiang Wu et.al. 2309.16249 link
2023-09-28 Context-I2W: Mapping Images to Context-dependent Words for Accurate Zero-Shot Composed Image Retrieval Yuanmin Tang et.al. 2309.16137 link
2023-09-27 GeoCLIP: Clip-Inspired Alignment between Locations and Images for Effective Worldwide Geo-localization Vicente Vivanco Cepeda et.al. 2309.16020 link
2023-09-27 Learning Dense Flow Field for Highly-accurate Cross-view Camera Localization Zhenbo Song et.al. 2309.15556 null
2023-09-26 Object-Centric Open-Vocabulary Image-Retrieval with Aggregated Features Hila Levi et.al. 2309.14999 null
2023-09-23 Resolving References in Visually-Grounded Dialogue via Text Generation Bram Willemsen et.al. 2309.13430 link
2023-09-21 Face Identity-Aware Disentanglement in StyleGAN Adrian Suwała et.al. 2309.12033 null
2023-09-21 On-the-Fly SfM: What you capture is What you get Zongqian Zhan et.al. 2309.11883 link
2023-09-20 2D-3D Pose Tracking with Multi-View Constraints Huai Yu et.al. 2309.11335 null
2023-09-19 VPRTempo: A Fast Temporally Encoded Spiking Neural Network for Visual Place Recognition Adam D. Hines et.al. 2309.10225 link
2023-09-18 DynaPix SLAM: A Pixel-Based Dynamic SLAM Approach Chenghao Xu et.al. 2309.09879 null
2023-09-18 Decompose Semantic Shifts for Composed Image Retrieval Xingyu Yang et.al. 2309.09531 null
2023-09-16 Efficient Object Rearrangement via Multi-view Fusion Dehao Huang et.al. 2309.08994 null
2023-09-16 DynaMoN: Motion-Aware Fast And Robust Camera Localization for Dynamic NeRF Mert Asim Karaoglu et.al. 2309.08927 link
2023-09-16 Outram: One-shot Global Localization via Triangulated Scene Graph and Global Outlier Pruning Pengyu Yin et.al. 2309.08914 link
2023-09-15 Active Learning for Fine-Grained Sketch-Based Image Retrieval Himanshu Thakur et.al. 2309.08743 null
2023-09-15 Optimization of Rank Losses for Image Retrieval Elias Ramzi et.al. 2309.08250 link
2023-09-18 Prompting Segmentation with Sound is Generalizable Audio-Visual Source Localizer Yaoting Wang et.al. 2309.07929 link
2023-09-14 EP2P-Loc: End-to-End 3D Point to 2D Pixel Localization for Large-Scale Visual Localization Minjung Kim et.al. 2309.07471 link
2023-09-13 RadarLCD: Learnable Radar-based Loop Closure Detection Pipeline Mirko Usuelli et.al. 2309.07094 null
2023-09-11 Towards Content-based Pixel Retrieval in Revisited Oxford and Paris Guoyuan An et.al. 2309.05438 link
2023-09-08 Representation Synthesis by Probabilistic Many-Valued Logic Operation in Self-Supervised Learning Hiroki Nakamura et.al. 2309.04148 null
2023-09-05 Magnetic Navigation using Attitude-Invariant Magnetic Field Information for Loop Closure Detection Natalia Pavlasek et.al. 2309.02394 null
2023-09-05 Dual Relation Alignment for Composed Image Retrieval Xintong Jiang et.al. 2309.02169 null
2023-09-04 NLLB-CLIP -- train performant multilingual image retrieval model on a budget Alexander Visheratin et.al. 2309.01859 null
2023-09-04 Target-Guided Composed Image Retrieval Haokun Wen et.al. 2309.01366 null
2023-09-02 Deep supervised hashing for fast retrieval of radio image cubes Steven Ndung'u et.al. 2309.00932 null
2023-08-31 Learning with Multi-modal Gradient Attention for Explainable Composed Image Retrieval Prateksha Udhayanan et.al. 2308.16649 null
2023-08-28 Extending Cross-Modal Retrieval with Interactive Learning to Improve Image Retrieval Performance in Forensics Nils Böhne et.al. 2308.14786 null
2023-08-28 CoVR: Learning Composed Video Retrieval from Web Video Captions Lucas Ventura et.al. 2308.14746 link
2023-08-27 Deep Learning for Visual Localization and Mapping: A Survey Changhao Chen et.al. 2308.14039 null
2023-08-26 Learning Efficient Representations for Image-Based Patent Retrieval Hongsong Wang et.al. 2308.13749 null
2023-08-25 Enhancing Landmark Detection in Cluttered Real-World Scenarios with Vision Transformers Mohammad Javad Rajabi et.al. 2308.13671 null
2023-08-24 Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities Jinze Bai et.al. 2308.12966 link
2023-08-23 Progressive Feature Mining and External Knowledge-Assisted Text-Pedestrian Image Retrieval Huafeng Li et.al. 2308.11994 null
2023-08-23 OFVL-MS: Once for Visual Localization across Multiple Indoor Scenes Tao Xie et.al. 2308.11928 link
2023-08-22 Composed Image Retrieval using Contrastive Learning and Task-oriented CLIP-based Features Alberto Baldrati et.al. 2308.11485 link
2023-08-22 GrowCLIP: Data-aware Automatic Model Growing for Large-scale Contrastive Language-Image Pre-training Xinchi Deng et.al. 2308.11331 null
2023-08-22 LDP-Feat: Image Features with Local Differential Privacy Francesco Pittaluga et.al. 2308.11223 null
2023-08-21 EigenPlaces: Training Viewpoint Robust Models for Visual Place Recognition Gabriele Berton et.al. 2308.10832 link
2023-08-20 FashionNTM: Multi-turn Fashion Image Retrieval via Cascaded Memory Anwesan Pal et.al. 2308.10170 null
2023-08-18 3D Model-free Visual localization System from Essential Matrix under Local Planar Motion Yanmei Jiao et.al. 2308.09566 null
2023-08-17 FashionLOGO: Prompting Multimodal Large Language Models for Fashion Logo Embeddings Yulin Su et.al. 2308.09012 link
2023-08-16 Integrating Visual and Semantic Similarity Using Hierarchies for Image Retrieval Aishwarya Venkataramanan et.al. 2308.08431 link
2023-08-16 Ranking-aware Uncertainty for Text-guided Image Retrieval Junyang Chen et.al. 2308.08131 null
2023-08-19 Global Features are All You Need for Image Retrieval and Reranking Shihao Shao et.al. 2308.06954 link
2023-08-14 MixBCT: Towards Self-Adapting Backward-Compatible Training Yu Liang et.al. 2308.06948 link
2023-08-10 KS-APR: Keyframe Selection for Robust Absolute Pose Regression Changkun Liu et.al. 2308.05459 null
2023-08-09 AspectMMKG: A Multi-modal Knowledge Graph with Aspect-aware Entities Jingdan Zhang et.al. 2308.04992 link
2023-08-08 Unifying Two-Stream Encoders with Transformers for Cross-Modal Retrieval Yi Bin et.al. 2308.04343 link
2023-08-08 Coarse-to-Fine: Learning Compact Discriminative Representation for Single-Stage Image Retrieval Yunquan Zhu et.al. 2308.04008 link
2023-08-05 A Comprehensive Analysis of Real-World Image Captioning and Scene Identification Sai Suprabhanu Nallapaneni et.al. 2308.02833 null
2023-08-03 Similar image retrieval using Autoencoder. I. Automatic morphology classification of galaxies Eunsuk Seo et.al. 2308.01871 null
2023-08-01 AnyLoc: Towards Universal Visual Place Recognition Nikhil Keetha et.al. 2308.00688 link
2023-07-31 Guiding Image Captioning Models Toward More Specific Captions Simon Kornblith et.al. 2307.16686 null
2023-07-31 Bridging the Gap: Exploring the Capabilities of Bridge-Architectures for Complex Visual Reasoning Tasks Kousik Rajesh et.al. 2307.16395 null
2023-07-28 D2S: Representing local descriptors and global scene coordinates for camera relocalization Bach-Thuan Bui et.al. 2307.15250 link
2023-07-26 Neural-based Cross-modal Search and Retrieval of Artwork Yan Gong et.al. 2307.14244 null
2023-07-26 Boon: A Neural Search Engine for Cross-Modal Information Retrieval Yan Gong et.al. 2307.14240 null
2023-07-25 Conditional Cross Attention Network for Multi-Space Embedding without Entanglement in Only a SINGLE Network Chull Hwan Song et.al. 2307.13254 null
2023-07-28 SACReg: Scene-Agnostic Coordinate Regression for Visual Localization Jerome Revaud et.al. 2307.11702 null
2023-07-19 Lazy Visual Localization via Motion Averaging Siyan Dong et.al. 2307.09981 null
2023-07-19 Quantum Optics based Algorithm for Measuring the Similarity between Images Vivek Mehta et.al. 2307.09789 null
2023-07-18 Jean-Luc Picard at Touché 2023: Comparing Image Generation, Stance Detection and Feature Matching for Image Retrieval for Arguments Max Moebius et.al. 2307.09172 null
2023-07-18 3D-SeqMOS: A Novel Sequential 3D Moving Object Segmentation in Autonomous Driving Qipeng Li et.al. 2307.09044 null
2023-07-19 Similarity Min-Max: Zero-Shot Day-Night Domain Adaptation Rundong Luo et.al. 2307.08779 null
2023-07-17 Divide&Classify: Fine-Grained Classification for City-Wide Visual Place Recognition Gabriele Trivigno et.al. 2307.08417 link
2023-07-17 Bridging the Gap: Multi-Level Cross-Modality Joint Alignment for Visible-Infrared Person Re-Identification Tengfei Liang et.al. 2307.08316 link
2023-07-17 NDT-Map-Code: A 3D global descriptor for real-time loop closure detection in lidar SLAM Lizhou Liao et.al. 2307.08221 link
2023-07-20 Boosting 3-DoF Ground-to-Satellite Camera Localization Accuracy via Geometry-Guided Cross-View Transformer Yujiao Shi et.al. 2307.08015 link
2023-07-10 Phoneme-retrieval; voice recognition; vowels recognition Brunello Tirozzi et.al. 2307.07407 null
2023-07-14 Risk Controlled Image Retrieval Kaiwen Cai et.al. 2307.07336 link
2023-07-11 ResMatch: Residual Attention Learning for Local Feature Matching Yuxin Deng et.al. 2307.05180 link
2023-07-11 Feature Activation Map: Visual Explanation of Deep Learning Models for Image Classification Yi Liao et.al. 2307.05017 null
2023-07-10 Efficient Match Pair Retrieval for Large-scale UAV Images via Graph Indexed Global Descriptor San Jiang et.al. 2307.04520 null
2023-07-10 RaPlace: Place Recognition for Imaging Radar using Radon Transform and Mutable Threshold Hyesu Jang et.al. 2307.04321 link
2023-07-08 Calibration-Aware Margin Loss: Pushing the Accuracy-Calibration Consistency Pareto Frontier for Deep Metric Learning Qin Zhang et.al. 2307.04047 null
2023-07-04 Unsupervised Quality Prediction for Improved Single-Frame and Weighted Sequential Visual Place Recognition Helen Carson et.al. 2307.01464 null
2023-07-04 Learning Feature Matching via Matchable Keypoint-Assisted Graph Neural Network Zizhuo Li et.al. 2307.01447 null
2023-07-03 Cross-modal Place Recognition in Image Databases using Event-based Sensors Xiang Ji et.al. 2307.01047 null
2023-06-30 DisPlacing Objects: Improving Dynamic Vehicle Detection via Visual Place Recognition under Adverse Conditions Stephen Hausler et.al. 2306.17536 null
2023-06-30 Locking On: Leveraging Dynamic Vehicle-Imposed Motion Constraints to Improve Visual Localization Stephen Hausler et.al. 2306.17529 null
2023-06-27 Dental CLAIRES: Contrastive LAnguage Image REtrieval Search for Dental Research Tanjida Kabir et.al. 2306.15651 null
2023-06-27 Mean Field Theory in Deep Metric Learning Takuya Furusawa et.al. 2306.15368 null
2023-06-26 Hierarchical Matching and Reasoning for Multi-Query Image Retrieval Zhong Ji et.al. 2306.14460 link
2023-06-25 Enhancing Dynamic Image Advertising with Vision-Language Pre-training Zhoufutu Wen et.al. 2306.14112 null
2023-06-23 Catching Image Retrieval Generalization Maksim Zhdanov et.al. 2306.13357 null
2023-06-22 Deep Metric Learning with Soft Orthogonal Proxies Farshad Saberi-Movahed et.al. 2306.13055 null
2023-06-22 What to Learn: Features, Image Transformations, or Both? Yuxuan Chen et.al. 2306.13040 null
2023-06-22 Critical-Reflective Human-AI Collaboration: Exploring Computational Tools for Art Historical Image Retrieval Katrin Glinka et.al. 2306.12843 null
2023-06-26 Annotation Cost Efficient Active Learning for Content Based Image Retrieval Julia Henkel et.al. 2306.11605 null
2023-06-19 Cross-Modal Attribute Insertions for Assessing the Robustness of Vision-and-Language Learning Shivaen Ramshetty et.al. 2306.11065 link
2023-06-18 LiDAR-Based Place Recognition For Autonomous Driving: A Survey Pengcheng Shi et.al. 2306.10561 link
2023-06-15 Yes, we CANN: Constrained Approximate Nearest Neighbors for local feature-based visual localization Dror Aiger et.al. 2306.09012 link
2023-06-15 Prompt Performance Prediction for Generative IR Nicolas Bizzozzero et.al. 2306.08915 null
2023-06-15 Graph Convolution Based Efficient Re-Ranking for Visual Retrieval Yuqi Zhang et.al. 2306.08792 link
2023-06-13 GeneCIS: A Benchmark for General Conditional Image Similarity Sagar Vaze et.al. 2306.07969 null
2023-06-13 MOFI: Learning Image Representations from Noisy Entity Annotated Images Wentao Wu et.al. 2306.07952 link
2023-06-12 Zero-shot Composed Text-Image Retrieval Yikun Liu et.al. 2306.07272 link
2023-06-12 Sticker820K: Empowering Interactive Retrieval with Stickers Sijie Zhao et.al. 2306.06870 null
2023-06-11 Self-Enhancement Improves Text-Image Retrieval in Foundation Visual-Language Models Yuguang Yang et.al. 2306.06691 null
2023-06-03 Relieving Triplet Ambiguity: Consensus Network for Language-Guided Image Retrieval Xu Zhang et.al. 2306.02092 null
2023-06-03 Class Anchor Margin Loss for Content-Based Image Retrieval Alexandru Ghita et.al. 2306.00630 null
2023-05-31 Chatting Makes Perfect -- Chat-based Image Retrieval Matan Levy et.al. 2305.20062 link
2023-05-31 Probabilistic Uncertainty Quantification of Prediction Models with Application to Visual Localization Junan Chen et.al. 2305.20044 null
2023-05-30 A Recipe for Efficient SBIR Models: Combining Relative Triplet Loss with Batch Normalization and Knowledge Distillation Omar Seddati et.al. 2305.18988 null
2023-05-29 Synfeal: A Data-Driven Simulator for End-to-End Camera Localization Daniel Coelho et.al. 2305.18260 link
2023-05-29 Nanoscale visualization of the thermally-driven evolution of antiferromagnetic domains in FeTe thin films Shrinkhala Sharma et.al. 2305.18197 null
2023-05-29 TReR: A Lightweight Transformer Re-Ranking Approach for 3D LiDAR Place Recognition Tiago Barros et.al. 2305.18013 null
2023-05-28 ConaCLIP: Exploring Distillation of Fully-Connected Knowledge Interaction Graph for Lightweight Text-Image Retrieval Jiapeng Wang et.al. 2305.17652 null
2023-06-01 FACTUAL: A Benchmark for Faithful and Consistent Textual Scene Graph Parsing Zhuang Li et.al. 2305.17497 link
2023-05-27 Pentagon-Match (PMatch): Identification of View-Invariant Planar Feature for Local Feature Matching-Based Homography Estimation Yueh-Cheng Huang et.al. 2305.17463 null
2023-05-26 Generating Images with Multimodal Language Models Jing Yu Koh et.al. 2305.17216 link
2023-05-25 Candidate Set Re-ranking for Composed Image Retrieval with Dual Multi-modal Encoder Zheyuan Liu et.al. 2305.16304 link
2023-05-23 Leveraging BEV Representation for 360-degree Visual Place Recognition Xuecheng Xu et.al. 2305.13814 link
2023-05-23 EDIS: Entity-Driven Image Search over Multimodal Web Content Siqi Liu et.al. 2305.13631 link
2023-05-20 DAC: Detector-Agnostic Spatial Covariances for Deep Local Features Javier Tirado-Garín et.al. 2305.12250 link
2023-05-19 Towards More Transparent and Accurate Cancer Diagnosis with an Unsupervised CAE Approach Zahra Tabatabaei et.al. 2305.11728 null
2023-05-19 Learning Sequence Descriptor based on Spatiotemporal Attention for Visual Place Recognition Fenglin Zhang et.al. 2305.11467 link
2023-05-12 IMAGINATOR: Pre-Trained Image+Text Joint Embeddings using Word-Level Grounding of Images Varuna Krishna et.al. 2305.10438 null
2023-05-17 Self-Training Boosted Multi-Faceted Matching Network for Composed Image Retrieval Haokun Wen et.al. 2305.09979 null
2023-05-13 Illumination-insensitive Binary Descriptor for Visual Measurement Based on Local Inter-patch Invariance Xinyu Lin et.al. 2305.07943 link
2023-05-11 Foundations of Spatial Perception for Robotics: Hierarchical Representations and Real-time Systems Nathan Hughes et.al. 2305.07154 link
2023-05-09 Visual Place Recognition with Low-Resolution Images Mihnea-Alexandru Tomita et.al. 2305.05776 null
2023-05-09 Vision-Language Models in Remote Sensing: Current Progress and Future Trends Congcong Wen et.al. 2305.05726 null
2023-05-09 An Evaluation and Ranking of Different Voting Schemes for Improved Visual Place Recognition Maria Waheed et.al. 2305.05705 null
2023-05-09 Region-based Contrastive Pretraining for Medical Image Retrieval with Anatomic Query Ho Hin Lee et.al. 2305.05598 null
2023-05-09 ColonMapper: topological mapping and localization for colonoscopy Javier Morlana et.al. 2305.05546 null
2023-05-09 Eiffel Tower: A Deep-Sea Underwater Dataset for Long-Term Visual Localization Clémentin Boittiaux et.al. 2305.05301 link
2023-05-09 Patch-DrosoNet: Classifying Image Partitions With Fly-Inspired Models For Lightweight Visual Place Recognition Bruno Arcanjo et.al. 2305.05256 null
2023-05-09 Adapt and Align to Improve Zero-Shot Sketch-Based Image Retrieval Shiyin Dong et.al. 2305.05144 null
2023-05-08 Hierarchical Visual Localization Based on Sparse Feature Pyramid for Adaptive Reduction of Keypoint Map Size Andrei Potapov et.al. 2305.04856 null
2023-05-08 Privacy-Preserving Representations are not Enough -- Recovering Scene Content from Camera Poses Kunal Chelani et.al. 2305.04603 link
2023-05-06 Keyword-Based Diverse Image Retrieval by Semantics-aware Contrastive Learning and Transformer Minyi Zhao et.al. 2305.04072 null
2023-05-06 Fairness in Image Search: A Study of Occupational Stereotyping in Image Retrieval and its Debiasing Swagatika Dash et.al. 2305.03881 link
2023-05-05 COLA: How to adapt vision-language models to Compose Objects Localized with Attributes? Arijit Ray et.al. 2305.03689 link
2023-05-05 HSCNet++: Hierarchical Scene Coordinate Classification and Regression for Visual Localization with Transformer Shuzhe Wang et.al. 2305.03595 null
2023-05-05 WWFedCBMIR: World-Wide Federated Content-Based Medical Image Retrieval Zahra Tabatabaei et.al. 2305.03383 null
2023-05-04 Boundary-aware Backward-Compatible Representation via Adversarial Learning in Image Retrieval Tan Pan et.al. 2305.02610 link
2023-05-03 Learning-based Relational Object Matching Across Views Cathrin Elich et.al. 2305.02398 null
2023-05-05 A Neural Divide-and-Conquer Reasoning Framework for Image Retrieval from Linguistically Complex Text Yunxin Li et.al. 2305.02265 link
2023-05-03 AV-SAM: Segment Anything Model Meets Audio-Visual Localization and Segmentation Shentong Mo et.al. 2305.01836 null
2023-04-30 Second-order Anisotropic Gaussian Directional Derivative Filters for Blob Detection Jie Ren et.al. 2305.00435 null
2023-04-28 SFD2: Semantic-guided Feature Detection and Description Fei Xue et.al. 2304.14845 link
2023-04-28 Quantum enhanced non-interferometric quantitative phase imaging Giuseppe Ortolano et.al. 2304.14727 null
2023-04-26 Hydra-Multi: Collaborative Online Construction of 3D Scene Graphs with Multi-Robot Teams Yun Chang et.al. 2304.13487 null
2023-04-27 STIR: Siamese Transformer for Image Retrieval Postprocessing Aleksei Shabanov et.al. 2304.13393 null
2023-04-25 DualSlide: Global-to-Local Sketching Interface for Slide Content and Layout Design Jiahao Weng et.al. 2304.12506 null
2023-04-24 Rank Flow Embedding for Unsupervised and Semi-Supervised Manifold Learning Lucas Pascotti Valem et.al. 2304.12448 link
2023-04-23 IDLL: Inverse Depth Line based Visual Localization in Challenging Environments Wanting Li et.al. 2304.11748 null
2023-04-23 Class-Specific Variational Auto-Encoder for Content-Based Image Retrieval Mehdi Rafiei et.al. 2304.11734 null
2023-04-17 Features-over-the-Air: Contrastive Learning Enabled Cooperative Edge Inference Haotian Wu et.al. 2304.08221 null
2023-04-17 NeRF-Loc: Visual Localization with Conditional Neural Radiance Field Jianlin Liu et.al. 2304.07979 link
2023-04-16 Bent & Broken Bicycles: Leveraging synthetic data for damaged object re-identification Luca Piano et.al. 2304.07883 null
2023-04-16 Language Guided Local Infiltration for Interactive Image Retrieval Fuxiang Huang et.al. 2304.07747 null
2023-04-16 Long-term Visual Localization with Mobile Sensors Shen Yan et.al. 2304.07691 null
2023-04-16 Multimodal Representation Learning of Cardiovascular Magnetic Resonance Imaging Jielin Qiu et.al. 2304.07675 null
2023-04-14 CoPR: Towards Accurate Visual Localization With Continuous Place-descriptor Regression Mubariz Zaffar et.al. 2304.07426 null
2023-04-14 FM-Loc: Using Foundation Models for Improved Vision-based Localization Reihaneh Mirjalili et.al. 2304.07058 null
2023-04-17 Toward Real-Time Image Annotation Using Marginalized Coupled Dictionary Learning Seyed Mahdi Roostaiyan et.al. 2304.06907 link
2023-04-17 You are here! Finding position and orientation on a 2D map from a single image: The Flatlandia localization problem and dataset Matteo Toso et.al. 2304.06373 link
2023-04-12 Open-TransMind: A New Baseline and Benchmark for 1st Foundation Model Challenge of Intelligent Transportation Yifeng Shi et.al. 2304.06051 link
2023-04-12 Visual Localization using Imperfect 3D Models from the Internet Vojtech Panek et.al. 2304.05947 link
2023-04-12 Are Local Features All You Need for Cross-Domain Visual Place Recognition? Giovanni Barbarani et.al. 2304.05887 link
2023-04-12 Unicom: Universal and Compact Representation Learning for Image Retrieval Xiang An et.al. 2304.05884 link
2023-04-12 SGL: Structure Guidance Learning for Camera Localization Xudong Zhang et.al. 2304.05571 null
2023-04-14 Loop Closure Detection Based on Object-level Spatial Layout and Semantic Consistency Xingwu Ji et.al. 2304.05146 link
2023-04-10 CAVL: Learning Contrastive and Adaptive Representations of Vision and Language Shentong Mo et.al. 2304.04399 null
2023-04-09 Unsupervised Multi-Criteria Adversarial Detection in Deep Image Retrieval Yanru Xiao et.al. 2304.04228 null
2023-04-08 SGIDN-LCD: An Appearance-based Loop Closure Detection Algorithm using Superpixel Grids and Incremental Dynamic Nodes Baosheng Zhang et.al. 2304.03872 null
2023-04-06 $R^{2}$Former: Unified $R$etrieval and $R$eranking Transformer for Place Recognition Sijie Zhu et.al. 2304.03410 null
2023-04-06 Distributed formation-enforcing control for UAVs robust to observation noise in relative pose measurements Viktor Walter et.al. 2304.03057 link
2023-04-05 Efficient OCR for Building a Diverse Digital History Jacob Carlson et.al. 2304.02737 link
2023-04-05 LogoNet: a fine-grained network for instance-level logo sketch retrieval Binbin Feng et.al. 2304.02214 link
2023-04-04 OrienterNet: Visual Localization in 2D Public Maps with Neural Matching Paul-Edouard Sarlin et.al. 2304.02009 link
2023-04-04 Cross-Domain Image Captioning with Discriminative Finetuning Roberto Dessì et.al. 2304.01662 link
2023-04-02 Learning Similarity between Scene Graphs and Images with Transformers Yuren Cong et.al. 2304.00590 link
2023-04-01 NPR: Nocturnal Place Recognition in Street Bingxi Liu et.al. 2304.00276 null
2023-03-31 Unsupervised crack detection on complex stone masonry surfaces Panagiotis Agrafiotis et.al. 2303.17989 null
2023-03-30 If At First You Don't Succeed: Test Time Re-ranking for Zero-shot, Cross-domain Retrieval Finlay G. C. Hudson et.al. 2303.17703 null
2023-03-30 Vision-Language Modelling For Radiological Imaging and Reports In The Low Data Regime Rhydian Windsor et.al. 2303.17644 null
2023-03-30 3D Line Mapping Revisited Shaohui Liu et.al. 2303.17504 link
2023-03-30 Methods and advancement of content-based fashion image retrieval: A Review Amin Muhammad Shoib et.al. 2303.17371 null
2023-03-30 Adaptive Cross Batch Normalization for Metric Learning Thalaiyasingam Ajanthan et.al. 2303.17127 null
2023-03-30 MaMMUT: A Simple Architecture for Joint Learning for MultiModal Tasks Weicheng Kuo et.al. 2303.16839 null
2023-03-29 Sketch-an-Anchor: Sub-epoch Fast Model Adaptation for Zero-shot Sketch-based Image Retrieval Leo Sampaio Ferraz Ribeiro et.al. 2303.16769 null
2023-03-29 Bi-directional Training for Composed Image Retrieval via Text Prompt Learning Zheyuan Liu et.al. 2303.16604 link
2023-03-27 Model Cascades for Efficient Image Search Robert Hönig et.al. 2303.15595 null
2023-03-27 Zero-Shot Composed Image Retrieval with Textual Inversion Alberto Baldrati et.al. 2303.15247 link
2023-03-27 What Can Human Sketches Do for Object Detection? Pinaki Nath Chowdhury et.al. 2303.15149 null
2023-03-25 Zero-Shot Everything Sketch-Based Image Retrieval, and in Explainable Style Fengyin Lin et.al. 2303.14348 link
2023-03-24 A-MuSIC: An Adaptive Ensemble System For Visual Place Recognition In Changing Environments Bruno Arcanjo et.al. 2303.14247 null
2023-03-24 PanoVPR: Towards Unified Perspective-to-Equirectangular Visual Place Recognition via Sliding Windows across the Panoramic View Ze Shi et.al. 2303.14095 link
2023-03-24 Exploiting Unlabelled Photos for Stronger Fine-Grained SBIR Aneeshan Sain et.al. 2303.13779 null
2023-03-28 CLIP for All Things Zero-Shot Sketch-Based Image Retrieval, Fine-Grained or Not Aneeshan Sain et.al. 2303.13440 null
2023-03-22 Reliable and Efficient Evaluation of Adversarial Robustness for Deep Hashing-Based Retrieval Xunguang Wang et.al. 2303.12658 null
2023-03-21 CompoDiff: Versatile Composed Image Retrieval With Latent Diffusion Geonmo Gu et.al. 2303.11916 link
2023-03-21 LIMITR: Leveraging Local Information for Medical Image-Text Representation Gefen Dawidowicz et.al. 2303.11755 null
2023-03-25 Data-efficient Large Scale Place Recognition with Graded Similarity Supervision Maria Leyva-Vallina et.al. 2303.11739 link
2023-03-20 Picture that Sketch: Photorealistic Image Generation from Abstract Sketches Subhadeep Koley et.al. 2303.11162 null
2023-03-19 Deep Declarative Dynamic Time Warping for End-to-End Learning of Alignment Paths Ming Xu et.al. 2303.10778 link
2023-03-17 MRIS: A Multi-modal Retrieval Approach for Image Synthesis on Diverse Modalities Boqi Chen et.al. 2303.10249 null
2023-03-17 IRGen: Generative Modeling for Image Retrieval Yidan Zhang et.al. 2303.10126 link
2023-03-16 Data Roaming and Early Fusion for Composed Image Retrieval Matan Levy et.al. 2303.09429 link
2023-03-16 Towards a Smaller Student: Capacity Dynamic Distillation for Efficient Image Retrieval Yi Xie et.al. 2303.09230 null
2023-03-16 Metric-Free Exploration for Topological Mapping by Task and Motion Imitation in Feature Space Yuhang He et.al. 2303.09192 null
2023-03-16 Unsupervised Facial Expression Representation Learning with Contrastive Local Warping Fanglei Xue et.al. 2303.09034 null
2023-03-15 A Triplet-loss Dilated Residual Network for High-Resolution Representation Learning in Image Retrieval Saeideh Yousefzadeh et.al. 2303.08398 null
2023-03-14 Data-Free Sketch-Based Image Retrieval Abhra Chaudhuri et.al. 2303.07775 link
2023-03-14 PATS: Patch Area Transportation with Subdivision for Local Feature Matching Junjie Ni et.al. 2303.07700 null
2023-03-10 Robotic Applications of Pre-Trained Vision-Language Models to Various Recognition Behaviors Kento Kawaharazuka et.al. 2303.05674 null
2023-03-09 Dominating Set Database Selection for Visual Place Recognition Anastasiia Kornilova et.al. 2303.05123 null
2023-03-07 Graph Neural Networks in Vision-Language Image Understanding: A Survey Henry Senior et.al. 2303.03761 null
2023-03-07 Sketch-based Medical Image Retrieval Kazuma Kobayashi et.al. 2303.03633 link
2023-03-06 Visual Place Recognition: A Tutorial Stefan Schubert et.al. 2303.03281 link
2023-03-06 MABNet: Master Assistant Buddy Network with Hybrid Learning for Image Retrieval Rohit Agarwal et.al. 2303.03050 link
2023-03-06 Improving Transformer-based Image Matching by Cascaded Capturing Spatially Informative Keypoints Chenjie Cao et.al. 2303.02885 link
2023-03-05 Composing Mood Board with User Feedback in Concept Space Shin Sano et.al. 2303.02547 null
2023-03-04 FAME-ViL: Multi-Tasking Vision-Language Model for Heterogeneous Fashion Tasks Xiao Han et.al. 2303.02483 link
2023-03-09 Self-Supervised Learning for Place Representation Generalization across Appearance Changes Mohamed Adel Musallam et.al. 2303.02370 null
2023-03-03 MixVPR: Feature Mixing for Visual Place Recognition Amar Ali-bey et.al. 2303.02190 link
2023-03-01 A Complementarity-Based Switch-Fuse System for Improved Visual Place Recognition Maria Waheed et.al. 2303.00714 null
2023-03-01 ORCHNet: A Robust Global Feature Aggregation approach for 3D LiDAR-based Place recognition in Orchards T. Barros et.al. 2303.00477 link
2023-03-03 Renderable Neural Radiance Map for Visual Navigation Obin Kwon et.al. 2303.00304 null
2023-03-01 Region Prediction for Efficient Robot Localization on Large Maps Matteo Scucchia et.al. 2303.00295 link
2023-02-28 OEKG: The Open Event Knowledge Graph Simon Gottschalk et.al. 2302.14688 null
2023-02-28 Global Proxy-based Hard Mining for Visual Place Recognition Amar Ali-bey et.al. 2302.14217 link
2023-02-27 Efficient Informed Proposals for Discrete Distributions via Newton's Series Approximation Yue Xiang et.al. 2302.13929 link
2023-02-26 Data-Efficient Sequence-Based Visual Place Recognition with Highly Compressed JPEG Images Mihnea-Alexandru Tomita et.al. 2302.13314 null
2023-02-26 Learning cross space mapping via DNN using large scale click-through logs Wei Yu et.al. 2302.13275 null
2023-02-25 DeepBrainPrint: A Novel Contrastive Framework for Brain MRI Re-Identification Lemuel Puglisi et.al. 2302.13057 null
2023-02-23 Teaching CLIP to Count to Ten Roni Paiss et.al. 2302.12066 null
2023-02-22 Steerable Equivariant Representation Learning Sangnie Bhardwaj et.al. 2302.11349 null
2023-02-21 iQPP: A Benchmark for Image Query Performance Prediction Eduard Poesina et.al. 2302.10126 link
2023-02-20 Ontology-aware Network for Zero-shot Sketch-based Image Retrieval Haoxiang Zhang et.al. 2302.10040 null
2023-02-20 TBPos: Dataset for Large-Scale Precision Visual Localization Masud Fahim et.al. 2302.09825 link
2023-02-17 Towards Unifying Medical Vision-and-Language Pre-training via Soft Prompts Zhihong Chen et.al. 2302.08958 link
2023-02-22 Fashion Image Retrieval with Multi-Granular Alignment Jinkuan Zhu et.al. 2302.08902 null
2023-02-15 Unsupervised Hashing via Similarity Distribution Calibration Kam Woh Ng et.al. 2302.07669 link
2023-02-13 Render-and-Compare: Cross-View 6 DoF Localization from Noisy Prior Shen Yan et.al. 2302.06287 link
2023-02-13 Contour Context: Abstract Structural Distribution for 3D LiDAR Loop Detection and Metric Pose Estimation Binqian Jiang et.al. 2302.06149 link
2023-02-13 Correspondence-Free Domain Alignment for Unsupervised Cross-Domain Image Retrieval Xu Wang et.al. 2302.06081 link
2023-02-11 Sketch Less Face Image Retrieval: A New Challenge Dawei Dai et.al. 2302.05576 link
2023-02-10 Is multi-modal vision supervision beneficial to language? Avinash Madasu et.al. 2302.05016 link
2023-02-06 Pic2Word: Mapping Pictures to Words for Zero-shot Composed Image Retrieval Kuniaki Saito et.al. 2302.03084 link
2023-02-06 Probabilistic Contrastive Learning Recovers the Correct Aleatoric Uncertainty of Ambiguous Inputs Michael Kirchhof et.al. 2302.02865 link
2023-02-03 Simple, Effective and General: A New Backbone for Cross-view Image Geo-localization Yingying Zhu et.al. 2302.01572 link
2023-02-04 Bayesian Metric Learning for Uncertainty Quantification in Image Retrieval Frederik Warburg et.al. 2302.01332 link
2023-01-31 Grounding Language Models to Images for Multimodal Generation Jing Yu Koh et.al. 2301.13823 link
2023-01-31 UPop: Unified and Progressive Pruning for Compressing Vision-Language Transformers Dachuan Shi et.al. 2301.13741 link
2023-01-23 Lexi: Self-Supervised Learning of the UI Language Pratyay Banerjee et.al. 2301.10165 link
2023-01-17 Distribution Aligned Feature Clustering for Zero-Shot Sketch-Based Image Retrieval Yuchen Wu et.al. 2301.06685 null
2023-01-19 High-bandwidth Close-Range Information Transport through Light Pipes Joowon Lim et.al. 2301.06496 null
2023-01-13 A LiDAR-Inertial-Visual SLAM System with Loop Detection Kangcheng Liu et.al. 2301.05604 null
2023-01-12 GH-Feat: Learning Versatile Generative Hierarchical Features from GANs Yinghao Xu et.al. 2301.05315 null
2023-01-10 Pix2Map: Cross-modal Retrieval for Inferring Street Maps from Images Xindi Wu et.al. 2301.04224 null
2023-01-10 Collaborative Semantic Communication at the Edge Wing Fei Lo et.al. 2301.03996 null
2023-01-10 Online Backfilling with No Regret for Large-Scale Image Retrieval Seonguk Seo et.al. 2301.03767 null
2023-01-06 CyberLoc: Towards Accurate Long-term Visual Localization Liu Liu et.al. 2301.02403 null
2023-01-05 A Probabilistic Framework for Visual Localization in Ambiguous Scenes Fereidoon Zangeneh et.al. 2301.02086 link
2022-12-31 4Seasons: Benchmarking Visual SLAM and Long-Term Localization for Autonomous Driving in Challenging Conditions Patrick Wenzel et.al. 2301.01147 null
2022-12-30 HPointLoc: Point-based Indoor Place Recognition using Synthetic RGB-D Images Dmitry Yudin et.al. 2212.14649 link
2022-12-27 Noise-aware Learning from Web-crawled Image-Text Data for Image Captioning Wooyoung Kang et.al. 2212.13563 link
2022-12-23 SuperGF: Unifying Local and Global Features for Visual Localization Wenzheng Song et.al. 2212.13105 null
2022-12-24 GraffMatch: Global Matching of 3D Lines and Planes for Wide Baseline LiDAR Registration Parker C. Lusk et.al. 2212.12745 null
2022-12-19 From a Bird's Eye View to See: Joint Camera and Subject Registration without the Camera Calibration Zekun Qian et.al. 2212.09298 link
2022-12-14 The Infinite Index: Information Retrieval on Generative Text-To-Image Models Niklas Deckers et.al. 2212.07476 null
2022-12-14 Shared Coupling-bridge for Weakly Supervised Local Feature Learning Jiayuan Sun et.al. 2212.07047 link
2022-12-08 Group Generalized Mean Pooling for Vision Transformer Byungsoo Ko et.al. 2212.04114 null
2022-12-12 Diffusion Art or Digital Forgery? Investigating Data Replication in Diffusion Models Gowthami Somepalli et.al. 2212.03860 null
2022-12-07 LSVL: Large-scale season-invariant visual localization for UAVs Jouko Kinnari et.al. 2212.03581 null
2022-12-06 ADIR: Adaptive Diffusion for Image Reconstruction Shady Abu-Hussein et.al. 2212.03221 null
2022-12-08 Privacy-Preserving Visual Localization with Event Cameras Junho Kim et.al. 2212.03177 link
2022-12-06 Semantic Communication for Internet of Vehicles: A Multi-User Cooperative Approach Wenjun Xu et.al. 2212.03037 null
2022-12-06 Attention-Enhanced Cross-modal Localization Between 360 Images and Point Clouds Zhipeng Zhao et.al. 2212.02757 null
2022-12-04 Fast and Lightweight Scene Regressor for Camera Relocalization Thuan B. Bui et.al. 2212.01830 link
2022-12-02 Information Retrieval from the Digitized Books Riya Gupta et.al. 2212.00999 null
2022-12-09 StructVPR: Distill Structural Knowledge with Weighting Samples for Visual Place Recognition Yanqing Shen et.al. 2212.00937 null
2022-11-30 Self-Supervised Feature Learning for Long-Term Metric Visual Localization Yuxuan Chen et.al. 2212.00122 null
2022-11-30 SGDraw: Scene Graph Drawing Interface Using Object-Oriented Representation Tianyu Zhang et.al. 2211.16697 link
2022-11-28 SLAN: Self-Locator Aided Network for Cross-Modal Understanding Jiang-Tian Zhai et.al. 2211.16208 null
2022-11-29 RankDNN: Learning to Rank for Few-shot Learning Qianyu Guo et.al. 2211.15320 link
2022-11-28 Safety-quantifiable Line Feature-based Monocular Visual Localization with 3D Prior Map Xi Zheng et.al. 2211.15127 null
2022-11-28 FeatureBooster: Boosting Feature Descriptors with a Lightweight Neural Network Xinjiang Wang et.al. 2211.15069 link
2022-11-27 BEV-Locator: An End-to-end Visual Semantic Localization Network Using Multi-View Images Zhihuang Zhang et.al. 2211.14927 null
2022-11-27 A Faster, Lighter and Stronger Deep Learning-Based Approach for Place Recognition Rui Huang et.al. 2211.14864 null
2022-11-26 Visual Place Recognition Bailu Guo et.al. 2211.14533 null
2022-11-26 Instance-level Heterogeneous Domain Adaptation for Limited-labeled Sketch-to-Photo Retrieval Fan Yang et.al. 2211.14515 link
2022-11-30 Roboflow 100: A Rich, Multi-Domain Object Detection Benchmark Floriana Ciaglia et.al. 2211.13523 link
2022-11-23 InDiReCT: Language-Guided Zero-Shot Deep Metric Learning for Images Konstantin Kobs et.al. 2211.12760 link
2022-11-29 Wild-Places: A Large-Scale Dataset for Lidar Place Recognition in Unstructured Natural Environments Joshua Knights et.al. 2211.12732 link
2022-11-23 FE-Fusion-VPR: Attention-based Multi-Scale Network Architecture for Visual Place Recognition by Fusing Frames and Events Kuanxu Hou et.al. 2211.12244 null
2022-11-22 Multimorbidity Content-Based Medical Image Retrieval Using Proxies Yunyan Xing et.al. 2211.12185 null
2022-11-22 Vision-based localization methods under GPS-denied conditions Zihao Lu et.al. 2211.11988 null
2022-11-21 ESLAM: Efficient Dense SLAM System Based on Hybrid Representation of Signed Distance Fields Mohammad Mahdi Johari et.al. 2211.11704 null
2022-11-21 LISA: Localized Image Stylization with Audio via Implicit Neural Representation Seung Hyun Lee et.al. 2211.11381 null
2022-11-21 NeuMap: Neural Coordinate Mapping by Auto-Transdecoder for Camera Localization Shitao Tang et.al. 2211.11177 link
2022-11-16 Improving Feature-based Visual Localization by Geometry-Aided Matching Hailin Yu et.al. 2211.08712 link
2022-11-15 LiePoseNet: Heterogeneous Loss Function Based on Lie Group for Significant Speed-up of PoseNet Training Process Mikhail Kurenkov et.al. 2211.08480 null
2022-11-14 Degeneracy removal of spin bands in antiferromagnets with non-interconvertible spin motif pair Lin-Ding Yuan et.al. 2211.07803 null
2022-11-14 Supervised Fine-tuning Evaluation for Long-term Visual Place Recognition Farid Alijani et.al. 2211.07696 null
2022-11-14 Composed Image Retrieval with Text Feedback via Multi-grained Uncertainty Regularization Yiyang Chen et.al. 2211.07394 link
2022-11-14 Zero-shot Image Captioning by Anchor-augmented Vision-Language Space Alignment Junyang Wang et.al. 2211.07275 null
2022-11-14 ContextCLIP: Contextual Alignment of Image-Text pairs on CLIP visual representations Chanda Grover et.al. 2211.07122 null
2022-11-14 Few-shot Metric Learning: Online Adaptation of Embedding for Retrieval Deunsol Jung et.al. 2211.07116 null
2022-11-12 Partial Visual-Semantic Embedding: Fashion Intelligence System with Sensitive Part-by-Part Learning Ryotaro Shimizu et.al. 2211.06688 null
2022-11-09 Visual Named Entity Linking: A New Dataset and A Baseline Wenxiang Sun et.al. 2211.04872 link
2022-11-07 Ultrafast Image Retrieval from a Holographic Memory Disc for High-Speed Operation of a Shift, Scale, and Rotation Invariant Target Recognition System Julian Gamboa et.al. 2211.03881 null
2022-11-06 A Geometrically Constrained Point Matching based on View-invariant Cross-ratios, and Homography Yueh-Cheng Huang et.al. 2211.03007 null
2022-11-02 Optimizing Fiducial Marker Placement for Improved Visual Localization Qiangqiang Huang et.al. 2211.01513 link
2022-11-02 A comparison of uncertainty estimation approaches for DNN-based camera localization Matteo Vaghi et.al. 2211.01234 null
2022-11-02 M-SpeechCLIP: Leveraging Large-Scale, Pre-Trained Models for Multilingual Speech to Image Retrieval Layne Berry et.al. 2211.01180 null
2022-11-11 Why is Winoground Hard? Investigating Failures in Visuolinguistic Compositionality Anuj Diwan et.al. 2211.00768 link
2022-11-07 Fashion-Specific Attributes Interpretation via Dual Gaussian Visual-Semantic Embedding Ryotaro Shimizu et.al. 2210.17417 null
2022-10-27 Structuring User-Generated Content on Social Media with Multimodal Aspect-Based Sentiment Analysis Miriam Anschütz et.al. 2210.15377 link
2022-10-27 Leveraging Computer Vision Application in Visual Arts: A Case Study on the Use of Residual Neural Network to Classify and Analyze Baroque Paintings Daniel Kvak et.al. 2210.15300 null
2022-10-27 Towards Practicality of Sketch-Based Visual Understanding Ayan Kumar Bhunia et.al. 2210.15146 null
2022-10-27 MMFL-Net: Multi-scale and Multi-granularity Feature Learning for Cross-domain Fashion Retrieval Chen Bao et.al. 2210.15128 null
2022-10-26 FaD-VLP: Fashion Vision-and-Language Pre-training towards Unified Retrieval and Captioning Suvir Mirchandani et.al. 2210.15028 null
2022-10-26 FairCLIP: Social Bias Elimination based on Attribute Prototype Learning and Representation Neutralization Junyang Wang et.al. 2210.14562 null
2022-11-02 A Framework for Collaborative Multi-Robot Mapping using Spectral Graph Wavelets Lukas Bernreiter et.al. 2210.13856 null
2022-10-27 Learning by Hallucinating: Vision-Language Pre-training with Weak Supervision Tzu-Jui Julius Wang et.al. 2210.13591 null
2022-10-24 Reliability-Aware Prediction via Uncertainty Learning for Person Image Retrieval Zhaopeng Dou et.al. 2210.13440 link
2022-10-23 Neural Eigenfunctions Are Structured Representation Learners Zhijie Deng et.al. 2210.12637 link
2022-10-21 Boosting vision transformers for image retrieval Chull Hwan Song et.al. 2210.11909 link
2022-10-20 Communication breakdown: On the low mutual intelligibility between human and neural captioning Roberto Dessì et.al. 2210.11512 link
2022-10-19 Image Semantic Relation Generation Mingzhe Du et.al. 2210.11253 null
2022-10-20 General Image Descriptors for Open World Image Retrieval using ViT CLIP Marcos V. Conde et.al. 2210.11141 link
2022-10-20 DeepRING: Learning Roto-translation Invariant Representation for LiDAR based Place Recognition Sha Lu et.al. 2210.11029 null
2022-10-19 Cross-Modal Fusion Distillation for Fine-Grained Sketch-Based Image Retrieval Abhra Chaudhuri et.al. 2210.10486 link
2022-10-19 GSV-Cities: Toward Appropriate Supervised Visual Place Recognition Amar Ali-bey et.al. 2210.10239 link
2022-10-18 A Real-Time Fusion Framework for Long-term Visual Localization Yuchen Yang et.al. 2210.09757 null
2022-10-17 Bridging the Gap between Local Semantic Concepts and Bag of Visual Words for Natural Scene Image Retrieval Yousef Alqasrawi et.al. 2210.08875 null
2022-10-17 SGRAM: Improving Scene Graph Parsing via Abstract Meaning Representation Woo Suk Choi et.al. 2210.08675 null
2022-10-16 Learning Self-Regularized Adversarial Views for Self-Supervised Vision Transformers Tao Tang et.al. 2210.08458 link
2022-10-14 Cross-Scale Context Extracted Hashing for Fine-Grained Image Binary Encoding Xuetong Xue et.al. 2210.07572 link
2022-10-14 Boosting Performance of a Baseline Visual Place Recognition Technique by Predicting the Maximally Complementary Technique Connor Malone et.al. 2210.07509 null
2022-10-11 Large-to-small Image Resolution Asymmetry in Deep Metric Learning Pavel Suma et.al. 2210.05463 link
2022-10-09 Fusing Event-based Camera and Radar for SLAM Using Spiking Neural Networks with Continual STDP Learning Ali Safa et.al. 2210.04236 null
2022-10-05 Medical Image Retrieval via Nearest Neighbor Search on Pre-trained Image Features Deepak Gupta et.al. 2210.02401 link
2022-10-05 Granularity-aware Adaptation for Image Retrieval over Multiple Tasks Jon Almazán et.al. 2210.02254 null
2022-10-05 Improving Visual-Semantic Embedding with Adaptive Pooling and Optimization Objective Zijian Zhang et.al. 2210.02206 link
2022-10-04 Supervised Metric Learning for Retrieval via Contextual Similarity Optimization Christopher Liao et.al. 2210.01908 link
2022-10-04 Wi-Closure: Reliable and Efficient Search of Inter-robot Loop Closures Using Wireless Sensing Weiying Wang et.al. 2210.01320 null
2022-10-03 Merging Classification Predictions with Sequential Information for Lightweight Visual Place Recognition in Changing Environments Bruno Arcanjo et.al. 2210.00834 null
2022-10-02 Loc-VAE: Learning Structurally Localized Representation from 3D Brain MR Images for Content-Based Image Retrieval Kei Nishimaki et.al. 2210.00506 null
2022-09-29 Guided Unsupervised Learning by Subaperture Decomposition for Ocean SAR Image Retrieval Nicolae-Cătălin Ristea et.al. 2209.15034 null
2022-09-28 TVLT: Textless Vision-Language Transformer Zineng Tang et.al. 2209.14156 link
2022-09-28 SEMICON: A Learning-to-hash Solution for Large-scale Fine-grained Image Retrieval Yang Shen et.al. 2209.13833 link
2022-09-28 Learning Deep Representations via Contrastive Learning for Instance Retrieval Tao Wu et.al. 2209.13832 null
2022-09-28 Mr. Right: Multimodal Retrieval on Representation of ImaGe witH Text Cheng-An Hsieh et.al. 2209.13764 link
2022-09-27 Learning-Based Dimensionality Reduction for Computing Compact and Effective Local Feature Descriptors Hao Dong et.al. 2209.13586 link
2022-09-27 Exploring the Algorithm-Dependent Generalization of AUPRC Optimization with List Stability Peisong Wen et.al. 2209.13262 link
2022-09-26 NDD: A 3D Point Cloud Descriptor Based on Normal Distribution for Loop Closure Detection Ruihao Zhou et.al. 2209.12513 link
2022-09-25 Personalized Saliency in Task-Oriented Semantic Communications: Image Transmission and Performance Analysis Jiawen Kang et.al. 2209.12274 link
2022-09-24 Closing the Loop: Graph Networks to Unify Semantic Objects and Visual Features for Multi-object Scenes Jonathan J. Y. Kim et.al. 2209.11894 null
2022-09-23 Image-to-Image Translation for Autonomous Driving from Coarsely-Aligned Image Pairs Youya Xia et.al. 2209.11673 null
2022-09-23 Query-based Hard-Image Retrieval for Object Detection at Test Time Edward Ayers et.al. 2209.11559 link
2022-09-23 Unsupervised Hashing with Semantic Concept Mining Rong-Cheng Tu et.al. 2209.11475 link
2022-09-22 UNav: An Infrastructure-Independent Vision-Based Navigation System for People with Blindness and Low vision Anbang Yang et.al. 2209.11336 null
2022-09-21 Visual Localization and Mapping in Dynamic and Changing Environments João Carlos Virgolino Soares et.al. 2209.10710 null
2022-09-20 PADLoC: LiDAR-Based Deep Loop Closure Detection and Registration using Panoptic Attention José Arce et.al. 2209.09699 link
2022-09-19 Deep Metric Learning with Chance Constraints Yeti Z. Gurbuz et.al. 2209.09060 link
2022-09-18 HGI-SLAM: Loop Closure With Human and Geometric Importance Features Shuhul Mujoo et.al. 2209.08608 null
2022-09-18 Data-driven Loop Closure Detection in Bathymetric Point Clouds for Underwater SLAM Jiarui Tan et.al. 2209.08578 link
2022-09-17 Data Efficient Visual Place Recognition Using Extremely JPEG-Compressed Images Mihnea-Alexandru Tomita et.al. 2209.08343 null
2022-09-15 Efficient Planar Pose Estimation via UWB Measurements Haodong Jiang et.al. 2209.06779 link
2022-09-14 Transformers and CNNs both Beat Humans on SBIR Omar Seddati et.al. 2209.06629 null
2022-09-14 Tac2Structure: Object Surface Reconstruction Only through Multi Times Touch J. Lu et.al. 2209.06545 link
2022-09-14 iSimLoc: Visual Global Localization for Previously Unseen Environments with Simulated Images Peng Yin et.al. 2209.06376 null
2022-09-09 General Place Recognition Survey: Towards the Real-world Autonomy Age Peng Yin et.al. 2209.04497 link
2022-09-09 Retinal Image Restoration and Vessel Segmentation using Modified Cycle-CBAM and CBAM-UNet Alnur Alimanov et.al. 2209.04234 link
2022-09-13 Segment Augmentation and Differentiable Ranking for Logo Retrieval Feyza Yavuz et.al. 2209.02482 null
2022-09-12 ScaleFace: Uncertainty-aware Deep Metric Learning Roman Kail et.al. 2209.01880 link
2022-09-04 CloudVision: DNN-based Visual Localization of Autonomous Robots using Prebuilt LiDAR Point Cloud Evgeny Yudin et.al. 2209.01605 null
2022-08-31 EViT: Privacy-Preserving Image Retrieval via Encrypted Vision Transformer in Cloud Computing Qihua Feng et.al. 2208.14657 link
2022-08-25 A Deep Perceptual Measure for Lens and Camera Calibration Yannick Hold-Geoffroy et.al. 2208.12300 null
2022-08-25 A Privacy-Preserving and End-to-End-Based Encrypted Image Retrieval Scheme Zhixun Lu et.al. 2208.11876 null
2022-08-23 Satellite Image Search in AgoraEO Ahmet Kerem Aksoy et.al. 2208.10830 null
2022-08-20 Fuse and Attend: Generalized Embedding Learning for Art and Sketches Ujjal Kr Dutta et.al. 2208.09698 null
2022-08-19 Self-Supervised Visual Place Recognition by Mining Temporal and Feature Neighborhoods Chao Chen et.al. 2208.09315 link
2022-08-19 TTT-UCDR: Test-time Training for Universal Cross-Domain Retrieval Soumava Paul et.al. 2208.09198 link
2022-08-17 Visual Cross-View Metric Localization with Dense Uncertainty Estimates Zimin Xia et.al. 2208.08519 link
2022-08-17 Understanding Attention for Vision-and-Language Tasks Feiqi Cao et.al. 2208.08104 link
2022-08-14 Visual Localization via Few-Shot Scene Region Classification Siyan Dong et.al. 2208.06933 link
2022-08-14 HyP$^2$ Loss: Beyond Hypersphere Metric Space for Multi-label Image Retrieval Chengyin Xu et.al. 2208.06866 link
2022-08-13 Finding Point with Image: An End-to-End Benchmark for Vision-based UAV Localization Ming Dai et.al. 2208.06561 link
2022-08-16 Category-Level Pose Retrieval with Contrastive Features Learnt with Occlusion Augmentation Georgios Kouros et.al. 2208.06195 link
2022-08-12 Instance Image Retrieval by Learning Purely From Within the Dataset Zhongyan Zhang et.al. 2208.06119 null
2022-08-07 CVLNet: Cross-View Semantic Correspondence Learning for Video-based Camera Localization Yujiao Shi et.al. 2208.03660 null
2022-08-05 A Sketch Is Worth a Thousand Words: Image Retrieval with Text and Sketch Patsorn Sangkloy et.al. 2208.03354 null
2022-08-05 ChiQA: A Large Scale Image-based Real-World Question Answering Dataset for Multi-Modal Understanding Bingning Wang et.al. 2208.03030 link
2022-08-04 Pattern Spotting and Image Retrieval in Historical Documents using Deep Hashing Caio da S. Dias et.al. 2208.02397 null
2022-07-27 On the robustness of self-supervised representations for multi-view object classification David Torpey et.al. 2208.00787 null
2022-07-26 Multimodal Neural Machine Translation with Search Engine Based Image Retrieval ZhenHao Tang et.al. 2208.00767 null
2022-07-30 Towards Privacy-Preserving, Real-Time and Lossless Feature Matching Qiang Meng et.al. 2208.00214 link
2022-07-30 DAS: Densely-Anchored Sampling for Deep Metric Learning Lizhao Liu et.al. 2208.00119 link
2022-07-29 Curriculum Learning for Data-Efficient Vision-Language Alignment Tejas Srinivasan et.al. 2207.14525 null
2022-07-29 Neural Density-Distance Fields Itsuki Ueda et.al. 2207.14455 link
2022-07-27 Abstracting Sketches through Simple Primitives Stephan Alaniz et.al. 2207.13543 link
2022-07-27 Satellite Image Based Cross-view Localization for Autonomous Vehicle Shan Wang et.al. 2207.13506 null
2022-07-26 RenderNet: Visual Relocalization Using Virtual Viewpoints in Large-Scale Indoor Environments Jiahui Zhang et.al. 2207.12579 null
2022-07-25 A hybrid-qudit representation of digital RGB images Sreetama Das et.al. 2207.12550 null
2022-07-19 ALTO: A Large-Scale Dataset for UAV Visual Place Recognition and Localization Ivan Cisneros et.al. 2207.12317 link
2022-07-22 PLD-SLAM: A Real-Time Visual SLAM Using Points and Line Segments in Dynamic Scenes BaoSheng Zhang et.al. 2207.10916 null
2022-07-25 MeshLoc: Mesh-Based Visual Localization Vojtech Panek et.al. 2207.10762 link
2022-07-20 Revisiting Hotels-50K and Hotel-ID Aarash Feizi et.al. 2207.10200 link
2022-07-20 Feature Representation Learning for Unsupervised Cross-domain Image Retrieval Conghui Hu et.al. 2207.09721 link
2022-07-19 SeasoNet: A Seasonal Scene Classification, segmentation and Retrieval dataset for satellite Imagery over Germany Dominik Koßmann et.al. 2207.09507 null
2022-07-19 Context Unaware Knowledge Distillation for Image Retrieval Bytasandram Yaswanth Reddy et.al. 2207.09070 link
2022-07-17 FashionViL: Fashion-Focused Vision-and-Language Representation Learning Xiao Han et.al. 2207.08150 link
2022-07-14 AutoMerge: A Framework for Map Assembling and Smoothing in City-scale Environments Peng Yin et.al. 2207.06965 null
2022-07-14 Semi-supervised Vector-Quantization in Visual SLAM using HGCN Amir Zarringhalam et.al. 2207.06738 null
2022-07-14 Self-supervised Vector-Quantization in Visual SLAM using Deep Convolutional Autoencoders Amir Zarringhalam et.al. 2207.06732 null
2022-07-19 Structure PLP-SLAM: Efficient Sparse Mapping and Localization using Point, Line and Plane for Monocular, RGB-D and Stereo Cameras Fangwen Shu et.al. 2207.06058 link
2022-07-12 CPO: Change Robust Panorama to Point Cloud Localization Junho Kim et.al. 2207.05317 link
2022-07-05 Hierarchical Average Precision Training for Pertinent Image Retrieval Elias Ramzi et.al. 2207.04873 link
2022-07-11 A clinically motivated self-supervised approach for content-based image retrieval of CT liver images Kristoffer Knutsen Wickstrøm et.al. 2207.04812 link
2022-07-09 BOSS: Bottom-up Cross-modal Semantic Composition with Hybrid Counterfactual Training for Robust Content-based Image Retrieval Wenqiao Zhang et.al. 2207.04211 null
2022-07-08 Learning Sequential Descriptors for Sequence-based Visual Place Recognition Riccardo Mereu et.al. 2207.03868 link
2022-07-08 GEMS: Scene Expansion using Generative Models of Graphs Rishi Agarwal et.al. 2207.03729 null
2022-07-05 Object-Level Targeted Selection via Deep Template Matching Suraj Kothawade et.al. 2207.01778 null
2022-07-06 Adaptive Fine-Grained Sketch-Based Image Retrieval Ayan Kumar Bhunia et.al. 2207.01723 link
2022-07-04 Embedding contrastive unsupervised features to cluster in- and out-of-distribution noise in corrupted image datasets Paul Albert et.al. 2207.01573 link
2022-07-08 Contrastive Cross-Modal Knowledge Sharing Pre-training for Vision-Language Representation Learning and Retrieval Keyu Wen et.al. 2207.00733 null
2022-07-01 DALG: Deep Attentive Local and Global Modeling for Image Retrieval Yuxin Song et.al. 2207.00287 null
2022-07-04 BadHash: Invisible Backdoor Attacks against Deep Hashing with Clean Label Shengshan Hu et.al. 2207.00278 link
2022-06-28 Improving Worst Case Visual Localization Coverage via Place-specific Sub-selection in Multi-camera Systems Stephen Hausler et.al. 2206.13883 null
2022-07-08 How Many Events do You Need? Event-based Visual Place Recognition Using Sparse But Varying Pixels Tobias Fischer et.al. 2206.13673 link
2022-06-25 FreSCo: Frequency-Domain Scan Context for LiDAR-based Place Recognition with Translation and Rotation Invariance Yongzhi Fan et.al. 2206.12628 link
2022-06-25 Inverted Semantic-Index for Image Retrieval Ying Wang et.al. 2206.12623 null
2022-06-17 RetrievalGuard: Provably Robust 1-Nearest Neighbor Image Retrieval Yihan Wu et.al. 2206.11225 null
2022-06-22 ICC++: Explainable Image Retrieval for Art Historical Corpora using Image Composition Canvas Prathmesh Madhu et.al. 2206.11115 null
2022-06-20 Self-Supervised Consistent Quantization for Fully Unsupervised Image Retrieval Guile Wu et.al. 2206.09806 null
2022-06-18 Attention-based Dynamic Subspace Learners for Medical Image Analysis Sukesh Adiga V et.al. 2206.09068 null
2022-06-17 Efficient WiFi LiDAR SLAM for Autonomous Robots in Large Environments Khairuldanial Ismail et.al. 2206.08733 null
2022-06-06 Learning Treatment Plan Representations for Content Based Image Retrieval Charles Huang et.al. 2206.02912 null
2022-06-19 NORPPA: NOvel Ringed seal re-identification by Pelage Pattern Aggregation Ekaterina Nepovinnykh et.al. 2206.02498 link
2022-06-05 Autoregressive Model for Multi-Pass SAR Change Detection Based on Image Stacks B. G. Palm et.al. 2206.02278 null
2022-05-28 FaIRCoP: Facial Image Retrieval using Contrastive Personalization Devansh Gupta et.al. 2205.15870 null
2022-05-31 Investigating the Role of Image Retrieval for Visual Localization -- An exhaustive benchmark Martin Humenberger et.al. 2205.15761 link
2022-05-27 Improving Road Segmentation in Challenging Domains Using Similar Place Priors Connor Malone et.al. 2205.14112 null
2022-05-31 LAMP 2.0: A Robust Multi-Robot SLAM System for Operation in Challenging Large-Scale Underground Environments Yun Chang et.al. 2205.13135 link
2022-05-26 Fine-grained Image Captioning with CLIP Reward Jaemin Cho et.al. 2205.13115 link
2022-05-25 Deep Dense Local Feature Matching and Vehicle Removal for Indoor Visual Localization Kyung Ho Park et.al. 2205.12544 null
2022-05-24 OnePose: One-Shot Object Pose Estimation without CAD Models Jiaming Sun et.al. 2205.12257 link
2022-05-23 VPAIR -- Aerial Visual Place Recognition and Localization in Large-scale Outdoor Environments Michael Schleiss et.al. 2205.11567 link
2022-05-23 VQA-GNN: Reasoning with Multimodal Semantic Graph for Visual Question Answering Yanan Wang et.al. 2205.11501 null
2022-05-23 Deep Image Retrieval is not Robust to Label Noise Stanislav Dereka et.al. 2205.11195 null
2022-05-22 Geo-Localization via Ground-to-Satellite Cross-View Image Retrieval Zelong Zeng et.al. 2205.10878 link
2022-05-20 Visually-Augmented Language Modeling Weizhi Wang et.al. 2205.10178 link
2022-05-18 Deep Features for CBIR with Scarce Data using Hebbian Learning Gabriele Lagani et.al. 2205.08935 null
2022-05-19 Text Detection & Recognition in the Wild for Robot Localization Zobeir Raisi et.al. 2205.08565 null
2022-05-12 One Model, Multiple Modalities: A Sparsely Activated Approach for Text, Sound, Image, Video and Code Yong Dai et.al. 2205.06126 null
2022-05-11 Review on Panoramic Imaging and Its Applications in Scene Understanding Shaohua Gao et.al. 2205.05570 null
2022-05-18 Identical Image Retrieval using Deep Learning Sayan Nath et.al. 2205.04883 link
2022-05-09 Introspective Deep Metric Learning Chengkun Wang et.al. 2205.04449 link
2022-05-11 Improved Evaluation and Generation of Grid Layouts using Distance Preservation Quality and Linear Assignment Sorting Kai Uwe Barthel et.al. 2205.04255 link
2022-05-08 Adversarial Learning of Hard Positives for Place Recognition Wenxuan Fang et.al. 2205.03871 null
2022-05-10 AdaTriplet: Adaptive Gradient Triplet Loss with Automatic Margin Learning for Forensic Medical Image Matching Khanh Nguyen et.al. 2205.02849 link
2022-04-29 Privacy-Preserving Model Upgrades with Bidirectional Compatible Training in Image Retrieval Shupeng Su et.al. 2204.13919 null
2022-04-29 Leaner and Faster: Two-Stage Model Compression for Lightweight Text-Image Retrieval Siyu Ren et.al. 2204.13913 link
2022-04-28 Spatio-Temporal Graph Localization Networks for Image-based Navigation Takahiro Niwa et.al. 2204.13237 null
2022-04-27 The Revisiting Problem in Simultaneous Localization and Mapping: A Survey on Visual Loop Closure Detection Konstantinos A. Tsintotas et.al. 2204.12831 null
2022-04-25 SceneTrilogy: On Scene Sketches and its Relationship with Text and Photo Pinaki Nath Chowdhury et.al. 2204.11964 null
2022-04-23 On Leveraging Variational Graph Embeddings for Open World Compositional Zero-Shot Learning Muhammad Umer Anwaar et.al. 2204.11848 null
2022-04-24 Progressive Learning for Image Retrieval with Hybrid-Modality Queries Yida Zhao et.al. 2204.11212 null
2022-04-23 Training and challenging models for text-guided fashion image retrieval Eric Dodds et.al. 2204.11004 link
2022-04-18 Centralized Adversarial Learning for Robust Deep Hashing Xunguang Wang et.al. 2204.10779 link
2022-04-22 Transferring ConvNet Features from Passive to Active Robot Self-Localization: The Use of Ego-Centric and World-Centric Views Kanya Kurauchi et.al. 2204.10497 null
2022-04-21 Exploring a Fine-Grained Multiscale Method for Cross-Modal Remote Sensing Image Retrieval Zhiqiang Yuan et.al. 2204.09868 link
2022-04-21 Remote Sensing Cross-Modal Text-Image Retrieval Based on Global and Local Information Zhiqiang Yuan et.al. 2204.09860 link
2022-04-20 Uncertainty-based Cross-Modal Retrieval with Probabilistic Representations Leila Pishdad et.al. 2204.09268 null
2022-04-19 Unsupervised Contrastive Hashing for Cross-Modal Retrieval in Remote Sensing Georgii Mikriukov et.al. 2204.08707 null
2022-04-18 Multiple-environment Self-adaptive Network for Aerial-view Geo-localization Tingyu Wang et.al. 2204.08381 link
2022-04-15 Condition-Invariant and Compact Visual Place Description by Convolutional Autoencoder Hanjing Ye et.al. 2204.07350 link
2022-04-14 Composite Code Sparse Autoencoders for first stage retrieval Carlos Lassance et.al. 2204.07023 null
2022-04-13 Reuse your features: unifying retrieval and feature-metric alignment Javier Morlana et.al. 2204.06292 link
2022-04-12 Probabilistic Compositional Embeddings for Multimodal Image Retrieval Andrei Neculai et.al. 2204.05845 link
2022-04-12 Three-Stream Joint Network for Zero-Shot Sketch-Based Image Retrieval Yu-Wei Zhan et.al. 2204.05666 null
2022-04-12 HiTPR: Hierarchical Transformer for Place Recognition in Point Cloud Zhixing Hou et.al. 2204.05481 null
2022-04-11 Optimized SC-F-LOAM: Optimized Fast LiDAR Odometry and Mapping Using Scan Context Lizhou Liao et.al. 2204.04932 link
2022-04-10 Beyond Cross-view Image Retrieval: Highly Accurate Vehicle Localization Using Satellite Image Yujiao Shi et.al. 2204.04752 link
2022-04-08 A Generic Image Retrieval Method for Date Estimation of Historical Document Collections Adrià Molina et.al. 2204.04028 null
2022-04-08 SnapMode: An Intelligent and Distributed Large-Scale Fashion Image Retrieval Platform Based On Big Data and Deep Generative Adversarial Network Technologies Narges Norouzi et.al. 2204.03998 null
2022-04-05 Leveraging Equivariant Features for Absolute Pose Regression Mohamed Adel Musallam et.al. 2204.02163 null
2022-04-04 "This is my unicorn, Fluffy": Personalizing frozen vision-language representations Niv Cohen et.al. 2204.01694 link
2022-04-01 Bi-directional Loop Closure for Visual SLAM Ihtisham Ali et.al. 2204.01524 null
2022-04-01 LASER: LAtent SpacE Rendering for 2D Visual Localization Zhixiang Min et.al. 2204.00157 link
2022-03-31 Semantic Pose Verification for Outdoor Visual Localization with Self-supervised Contrastive Learning Semih Orhan et.al. 2203.16945 null
2022-03-30 AmsterTime: A Visual Place Recognition Benchmark Dataset for Severe Domain Shift Burak Yildiz et.al. 2203.16291 link
2022-03-29 Long-term Visual Map Sparsification with Heterogeneous GNN Ming-Fang Chang et.al. 2203.15182 null
2022-04-01 A Simulation Benchmark for Vision-based Autonomous Navigation Lauri Suomela et.al. 2203.13048 link
2022-03-24 Is Geometry Enough for Matching in Visual Localization? Qunjie Zhou et.al. 2203.12979 link
2022-03-21 MatchFormer: Interleaving Attention in Transformers for Feature Matching Qing Wang et.al. 2203.09645 link
2022-03-10 ReF -- Rotation Equivariant Features for Local Feature Matching Abhishek Peri et.al. 2203.05206 null
2022-03-09 Object-Based Visual Camera Pose Estimation From Ellipsoidal Model and 3D-Aware Ellipse Prediction Matthieu Zins et.al. 2203.04613 null
2022-03-08 Tune your Place Recognition: Self-Supervised Domain Calibration via Robust SLAM Pierre-Yves Lajoie et.al. 2203.04446 link
2022-03-07 ZippyPoint: Fast Interest Point Detection, Description, and Matching through Mixed Precision Discretization Simon Maurer et.al. 2203.03610 link
2022-03-07 Multi-Modal Lidar Dataset for Benchmarking General-Purpose Localization and Mapping Algorithms Qingqing Li et.al. 2203.03454 link
2022-03-01 SwitchHit: A Probabilistic, Complementarity-Based Switching System for Improved Visual Place Recognition in Changing Environments Maria Waheed et.al. 2203.00591 null
2022-02-28 Deep Camera Pose Regression Using Pseudo-LiDAR Ali Raza et.al. 2203.00080 null
2022-02-25 RELMOBNET: A Robust Two-Stage End-To-End Training Approach For MOBILENETV3 Based Relative Camera Pose Estimation Praveen Kumar Rajendran et.al. 2202.12838 null
2022-02-24 Highly-Efficient Binary Neural Networks for Visual Place Recognition Bruno Ferrarini et.al. 2202.12375 null
2022-02-18 MultiRes-NetVLAD: Augmenting Place Recognition Training with Low-Resolution Imagery Ahmad Khaliq et.al. 2202.09146 link
2022-02-14 Tightly Coupled Learning Strategy for Weakly Supervised Hierarchical Place Recognition Y. Shen et.al. 2202.06470 null
2022-02-11 Patch-NetVLAD+: Learned patch descriptor and weighted matching strategy for place recognition Yingfeng Cai et.al. 2202.05738 null
2022-02-09 Object-Guided Day-Night Visual Localization in Urban Scenes Assia Benbihi et.al. 2202.04445 null
2022-02-08 A Novel Image Descriptor with Aggregated Semantic Skeleton Representation for Long-term Visual Place Recognition Nie Jiwei et.al. 2202.03677 null
2022-02-25 CFP-SLAM: A Real-time Visual SLAM Based on Coarse-to-Fine Probability in Dynamic Environments Xinggang Hu et.al. 2202.01938 null
2022-02-03 Danish Airs and Grounds: A Dataset for Aerial-to-Street-Level Place Recognition and Localization Andrea Vallone et.al. 2202.01821 null
2022-02-02 Training Semantic Descriptors for Image-Based Localization Ibrahim Cinaroglu et.al. 2202.01212 null
2022-01-31 Hydra: A Real-time Spatial Perception Engine for 3D Scene Graph Construction and Optimization Nathan Hughes et.al. 2201.13360 null
2022-01-31 Rigidity Preserving Image Transformations and Equivariance in Perspective Lucas Brynte et.al. 2201.13065 null
2022-01-25 Learning Semantics for Visual Place Recognition through Multi-Scale Attention Valerio Paolicelli et.al. 2201.09701 link
2022-01-22 Phase-SLAM: Phase Based Simultaneous Localization and Mapping for Mobile Structured Light Illumination Systems Xi Zheng et.al. 2201.09048 link
2022-01-15 A Critical Analysis of Image-based Camera Pose Estimation Techniques Meng Xu et.al. 2201.05816 null
2022-01-14 SRVIO: Super Robust Visual Inertial Odometry for dynamic environments and challenging Loop-closure conditions Ali Samadzadeh et.al. 2201.05386 link
2021-12-23 NinjaDesc: Content-Concealing Visual Descriptors via Adversarial Learning Tony Ng et.al. 2112.12785 null
2021-12-16 CrossLoc: Scalable Aerial Localization Assisted by Multimodal Synthetic Data Qi Yan et.al. 2112.09081 link
2021-12-05 RADA: Robust Adversarial Data Augmentation for Camera Localization in Challenging Weather Jialu Wang et.al. 2112.02469 null
2021-11-25 MegLoc: A Robust and Accurate Visual Localization Pipeline Shuxue Peng et.al. 2111.13063 null
2021-10-08 Semantic Image Alignment for Vehicle Localization Markus Herb et.al. 2110.04162 null
2021-10-05 Season-invariant GNSS-denied visual localization for UAVs Jouko Kinnari et.al. 2110.01967 link
2021-09-30 Forming a sparse representation for visual place recognition using a neurorobotic approach Sylvain Colomer et.al. 2109.14916 null
2021-09-22 Audio-Visual Grounding Referring Expression for Robotic Manipulation Yefei Wang et.al. 2109.10571 null
2021-09-20 Efficient shape mapping through dense touch and vision Sudharshan Suresh et.al. 2109.09884 link
2021-09-15 S3LAM: Structured Scene SLAM Mathieu Gonzalez et.al. 2109.07339 null
2021-09-13 Monocular Camera Localization for Automated Vehicles Using Image Retrieval Eunhyek Joa et.al. 2109.06296 null
2021-09-10 Line as a Visual Sentence: Context-aware Line Descriptor for Visual Localization Sungho Yoon et.al. 2109.04753 link
2021-09-09 CrowdDriven: A New Challenging Dataset for Outdoor Visual Localization Ara Jafarzadeh et.al. 2109.04527 null
2021-09-09 Keeping an Eye on Things: Deep Learned Features for Long-Term Visual Localization Mona Gridseth et.al. 2109.04041 link

Keypoint Detection

Publish Date Title Authors PDF Code
2025-04-24 EdgePoint2: Compact Descriptors for Superior Efficiency and Accuracy Haodi Yao et.al. 2504.17280 null
2025-04-15 UKDM: Underwater keypoint detection and matching using underwater image enhancement techniques Pedro Diaz-Garcia et.al. 2504.11063 null
2025-04-15 Acquisition of high-quality images for camera calibration in robotics applications via speech prompts Timm Linder et.al. 2504.11031 null
2025-04-11 Stereophotoclinometry Revisited Travis Driver et.al. 2504.08252 null
2025-03-31 SuperEvent: Cross-Modal Learning of Event-based Keypoint Detection Yannick Burkhardt et.al. 2504.00139 null
2025-03-29 Deep Visual Servoing of an Aerial Robot Using Keypoint Feature Extraction Shayan Sepahvand et.al. 2503.23171 null
2025-03-25 Multiscale Feature Importance-based Bit Allocation for End-to-End Feature Coding for Machines Junle Liu et.al. 2503.19278 null
2025-03-05 Periodontal Bone Loss Analysis via Keypoint Detection With Heuristic Post-Processing Ryan Banks et.al. 2503.13477 null
2025-03-16 Histogram Transporter: Learning Rotation-Equivariant Orientation Histograms for High-Precision Robotic Kitting Jiadong Zhou et.al. 2503.12541 null
2025-04-12 Keypoint Detection and Description for Raw Bayer Images Jiakai Lin et.al. 2503.08673 null
2025-03-10 REF-VLM: Triplet-Based Referring Paradigm for Unified Visual Decoding Yan Tai et.al. 2503.07413 link
2025-03-11 DaD: Distilled Reinforcement Learning for Diverse Keypoint Detection Johan Edstedt et.al. 2503.07347 link
2025-03-07 Automatic determination of quasicrystalline patterns from microscopy images Tano Kim Kender et.al. 2503.05472 link
2025-03-07 Spatial regularisation for improved accuracy and interpretability in keypoint-based registration Benjamin Billot et.al. 2503.04499 link
2025-03-04 A Novel Streamline-based diffusion MRI Tractography Registration Method with Probabilistic Keypoint Detection Junyi Wang et.al. 2503.02481 null
2025-03-01 Autonomous Dissection in Robotic Cholecystectomy Ki-Hwan Oh et.al. 2503.00666 null
2025-02-28 CNSv2: Probabilistic Correspondence Encoded Neural Image Servo Anzhe Chen et.al. 2503.00132 null
2025-02-27 Automatic Temporal Segmentation for Post-Stroke Rehabilitation: A Keypoint Detection and Temporal Segmentation Approach for Small Datasets Jisoo Lee et.al. 2502.19766 null
2025-02-23 Rewards-based image analysis in microscopy Kamyar Barakati et.al. 2502.18522 null
2025-02-19 2.5D U-Net with Depth Reduction for 3D CryoET Object Identification Yusuke Uchida et.al. 2502.13484 link
2025-01-30 Transfer Learning for Keypoint Detection in Low-Resolution Thermal TUG Test Images Wei-Lun Chen et.al. 2501.18453 null
2025-01-30 Video-based Surgical Tool-tip and Keypoint Tracking using Multi-frame Context-driven Deep Learning Models Bhargav Ghanekar et.al. 2501.18361 null
2025-01-30 Lifelong 3D Mapping Framework for Hand-held & Robot-mounted LiDAR Mapping Systems Liudi Yang et.al. 2501.18110 null
2025-01-21 Keypoint Detection Empowered Near-Field User Localization and Channel Reconstruction Mengyuan Li et.al. 2501.11844 null
2025-01-20 MIFNet: Learning Modality-Invariant Features for Generalizable Multimodal Image Matching Yepeng Liu et.al. 2501.11299 null
2025-01-19 Refinement Module based on Parse Graph of Feature Map for Human Pose Estimation Shibang Liu et.al. 2501.11069 null
2025-01-13 Empirical Comparison of Four Stereoscopic Depth Sensing Cameras for Robotics Applications Lukas Rustler et.al. 2501.07421 null
2025-01-13 Efficiently Closing Loops in LiDAR-Based SLAM Using Point Cloud Density Maps Saurabh Gupta et.al. 2501.07399 null
2024-12-24 GIMS: Image Matching System Based on Adaptive Graph Construction and Graph Neural Network Xianfeng Song et.al. 2412.18221 link
2024-12-21 A Novel Approach to Tomato Harvesting Using a Hybrid Gripper with Semantic Segmentation and Keypoint Detection Shahid Ansari et.al. 2412.16755 null
2024-12-19 Corn Ear Detection and Orientation Estimation Using Deep Learning Nathan Sprague et.al. 2412.14954 null
2024-12-12 Agtech Framework for Cranberry-Ripening Analysis Using Vision Foundation Models Faith Johnson et.al. 2412.09739 null
2024-12-09 An Efficient Scene Coordinate Encoding and Relocalization Method Kuan Xu et.al. 2412.06488 link
2024-12-09 ZeroKey: Point-Level Reasoning and Zero-Shot 3D Keypoint Detection from Large Language Models Bingchen Gong et.al. 2412.06292 null
2024-12-07 Securing Social Media Against Deepfakes using Identity, Behavioral, and Geometric Signatures Muhammad Umar Farooq et.al. 2412.05487 null
2024-12-04 Measure Anything: Real-time, Multi-stage Vision-based Dimensional Measurement using Segment Anything Yongkyu Lee et.al. 2412.03472 link
2024-12-02 MamKPD: A Simple Mamba Baseline for Real-Time 2D Keypoint Detection Yonghao Dang et.al. 2412.01422 null
2024-11-23 OCDet: Object Center Detection via Bounding Box-Aware Heatmap Prediction on Edge Devices with NPUs Chen Xin et.al. 2411.15653 link
2024-11-19 IoT-Based 3D Pose Estimation and Motion Optimization for Athletes: Application of C3D and OpenPose Fei Ren et.al. 2411.12676 null
2024-11-04 Silver medal Solution for Image Matching Challenge 2024 Yian Wang et.al. 2411.01851 null
2024-11-04 KptLLM: Unveiling the Power of Large Language Model for Keypoint Comprehension Jie Yang et.al. 2411.01846 null
2024-10-31 From Web Data to Real Fields: Low-Cost Unsupervised Domain Adaptation for Agricultural Robots Vasileios Tzouras et.al. 2410.23906 null
2024-10-04 Self-Supervised Keypoint Detection with Distilled Depth Keypoint Representation Aman Anand et.al. 2410.14700 null
2024-11-27 Sim2real Cattle Joint Estimation in 3D point clouds Mohammad Okour et.al. 2410.14419 null
2024-10-16 PND-Net: Plant Nutrition Deficiency and Disease Classification using Graph Convolutional Network Asish Bera et.al. 2410.12742 null
2024-10-16 RAFA-Net: Region Attention Network For Food Items And Agricultural Stress Recognition Asish Bera et.al. 2410.12718 null
2024-10-01 A Robust Multisource Remote Sensing Image Matching Method Utilizing Attention and Feature Enhancement Against Noise Interference Yuan Li et.al. 2410.11848 null
2024-10-11 Facial Chick Sexing: An Automated Chick Sexing System From Chick Facial Image Marta Veganzones Rodriguez et.al. 2410.09155 null
2024-10-08 Unsupervised Model Diagnosis Yinong Oliver Wang et.al. 2410.06243 null
2024-10-08 Equi-GSPR: Equivariant SE(3) Graph Network Model for Sparse Point Cloud Registration Xueyang Kang et.al. 2410.05729 link
2024-10-16 Key-Grid: Unsupervised 3D Keypoints Detection using Grid Heatmap Features Chengkai Hou et.al. 2410.02237 null
2024-10-02 Gaussian-Det: Learning Closed-Surface Gaussians for 3D Object Detection Hongru Yan et.al. 2410.01404 null
2024-09-30 OpenKD: Opening Prompt Diversity for Zero- and Few-shot Keypoint Detection Changsheng Lu et.al. 2409.19899 link
2024-10-07 SKT: Integrating State-Aware Keypoint Trajectories with Vision-Language Models for Robotic Garment Manipulation Xin Li et.al. 2409.18082 null
2024-09-24 GSplatLoc: Grounding Keypoint Descriptors into 3D Gaussian Splatting for Improved Visual Localization Gennady Sidorov et.al. 2409.16502 link
2024-09-20 Keypoint Detection Technique for Image-Based Visual Servoing of Manipulators Niloufar Amiri et.al. 2409.13668 null
2024-09-25 Precision Aquaculture: An Integrated Computer Vision and IoT Approach for Optimized Tilapia Feeding Rania Hossam et.al. 2409.08695 link
2024-09-06 D4: Text-guided diffusion model-based domain adaptive data augmentation for vineyard shoot detection Kentaro Hirahara et.al. 2409.04060 null
2024-10-01 Towards Practical Human Motion Prediction with LiDAR Point Clouds Xiao Han et.al. 2408.08202 null
2024-07-31 Certifying Robustness of Learning-Based Keypoint Detection and Pose Estimation Methods Xusheng Luo et.al. 2408.00117 null
2024-07-26 SHIC: Shape-Image Correspondences with no Keypoint Supervision Aleksandar Shtedritski et.al. 2407.18907 null
2024-07-25 LION: Linear Group RNN for 3D Object Detection in Point Clouds Zhe Liu et.al. 2407.18232 link
2024-07-22 RADA: Robust and Accurate Feature Learning with Domain Adaptation Jingtai He et.al. 2407.15791 null
2024-07-09 LVLM-empowered Multi-modal Representation Learning for Visual Place Recognition Teng Wang et.al. 2407.06730 null
2024-07-04 PFGS: High Fidelity Point Cloud Rendering via Feature Splatting Jiaxu Wang et.al. 2407.03857 link
2024-07-03 A Radiometric Correction based Optical Modeling Approach to Removing Reflection Noise in TLS Point Clouds of Urban Scenes Li Fang et.al. 2407.02830 link
2024-07-02 Multi-Grained Contrast for Data-Efficient Unsupervised Representation Learning Chengchao Shen et.al. 2407.02014 link
2024-06-28 Beyond First-Order: A Multi-Scale Approach to Finger Knuckle Print Biometrics Chengrui Gao et.al. 2406.19672 null
2024-07-23 A Certifiable Algorithm for Simultaneous Shape Estimation and Object Tracking Lorenzo Shaikewitz et.al. 2406.16837 link
2024-06-03 Scale-Free Image Keypoints Using Differentiable Persistent Homology Giovanni Barbarani et.al. 2406.01315 link
2024-06-23 W-Net: A Facial Feature-Guided Face Super-Resolution Network Hao Liu et.al. 2406.00676 null
2024-05-25 Deep-PE: A Learning-Based Pose Evaluator for Point Cloud Registration Junjie Gao et.al. 2405.16085 null
2024-06-01 Benchmarking Fish Dataset and Evaluation Metric in Keypoint Detection -- Towards Precise Fish Morphological Assessment in Aquaculture Breeding Weizhen Liu et.al. 2405.12476 link
2024-05-14 TP3M: Transformer-based Pseudo 3D Image Matching with Reference Liming Han et.al. 2405.08434 null
2024-05-15 Vector-Symbolic Architecture for Event-Based Optical Flow Hongzhi You et.al. 2405.08300 null
2024-05-13 RGBD-Glue: General Feature Combination for Robust RGB-D Point Cloud Registration Congjia Chen et.al. 2405.07594 null
2024-05-08 Unsupervised Skin Feature Tracking with Deep Neural Networks Jose Chang et.al. 2405.04943 null
2024-05-07 A Self-Supervised Method for Body Part Segmentation and Keypoint Detection of Rat Images László Kopácsi et.al. 2405.04650 null
2024-04-30 A Light-weight Transformer-based Self-supervised Matching Network for Heterogeneous Images Wang Zhang et.al. 2404.19311 null
2024-04-25 Adaptive Local Binary Pattern: A Novel Feature Descriptor for Enhanced Analysis of Kidney Abnormalities in CT Scan Images using ensemble based Machine Learning Approach Tahmim Hossain et.al. 2404.14560 null
2024-04-19 SkelFormer: Markerless 3D Pose and Shape Estimation using Skeletal Transformers Vandad Davoodnia et.al. 2404.12625 null
2024-04-17 Pixel-Wise Symbol Spotting via Progressive Points Location for Parsing CAD Images Junbiao Pang et.al. 2404.10985 null
2024-03-28 Towards Long Term SLAM on Thermal Imagery Colin Keil et.al. 2403.19885 link
2024-03-28 Instance-Adaptive and Geometric-Aware Keypoint Learning for Category-Level 6D Object Pose Estimation Xiao Lin et.al. 2403.19527 link
2024-03-27 RoboKeyGen: Robot Pose and Joint Angles Estimation via Diffusion-based 3D Keypoint Generation Yang Tian et.al. 2403.18259 null
2024-03-18 FE-DeTr: Keypoint Detection and Tracking in Low-quality Image Frames with Events Xiangyuan Wang et.al. 2403.11662 link
2024-03-05 Self-supervised 3D Patient Modeling with Multi-modal Attentive Fusion Meng Zheng et.al. 2403.03217 null
2024-02-22 A Self-supervised Pressure Map human keypoint Detection Approch: Optimizing Generalization and Computational Efficiency Across Datasets Chengzhang Yu et.al. 2402.14241 null
2024-02-25 A Feature Matching Method Based on Multi-Level Refinement Strategy Shaojie Zhang et.al. 2402.13488 null
2024-03-05 3D Kinematics Estimation from Video with a Biomechanical Model and Synthetic Training Data Zhi-Yi Lin et.al. 2402.13172 null
2024-02-25 Region Feature Descriptor Adapted to High Affine Transformations Shaojie Zhang et.al. 2402.09724 null
2024-01-29 Reconstructing Close Human Interactions from Multiple Views Qing Shuai et.al. 2401.16173 link
2024-01-17 To deform or not: treatment-aware longitudinal registration for breast DCE-MRI during neoadjuvant chemotherapy via unsupervised keypoints detection Luyi Han et.al. 2401.09336 link
2024-01-08 Flowmind2Digital: The First Comprehensive Flowmind Recognition and Conversion Approach Huanyu Liu et.al. 2401.03742 link
2024-03-22 6D-Diff: A Keypoint Diffusion Framework for 6D Object Pose Estimation Li Xu et.al. 2401.00029 null
2023-12-27 Bezier-based Regression Feature Descriptor for Deformable Linear Objects Fangqing Chen et.al. 2312.16502 null
2023-12-24 Residual Learning for Image Point Descriptors Rashik Shrestha et.al. 2312.15471 null
2023-12-22 BonnBeetClouds3D: A Dataset Towards Point Cloud-based Organ-level Phenotyping of Sugar Beet Plants under Field Conditions Elias Marks et.al. 2312.14706 null
2023-12-19 Adaptive Distribution Masked Autoencoders for Continual Test-Time Adaptation Jiaming Liu et.al. 2312.12480 null
2023-12-19 An effective image copy-move forgery detection using entropy image Zhaowei Lu et.al. 2312.11793 link
2023-12-11 VoxelKP: A Voxel-based Network Architecture for Human Keypoint Estimation in LiDAR Data Jian Shi et.al. 2312.08871 link
2023-12-11 Keypoint-based Stereophotoclinometry for Characterizing and Navigating Small Bodies: A Factor Graph Approach Travis Driver et.al. 2312.06865 link
2023-12-01 Tracking Object Positions in Reinforcement Learning: A Metric for Keypoint Detection (extended version) Emma Cramer et.al. 2312.00592 link
2023-11-30 Utilizing Radiomic Feature Analysis For Automated MRI Keypoint Detection: Enhancing Graph Applications Sahar Almahfouz Nasser et.al. 2311.18281 null
2023-11-29 Back to 3D: Few-Shot 3D Keypoint Detection with Back-Projected 2D Features Thomas Wimmer et.al. 2311.18113 link
2023-11-28 Diffusion 3D Features (Diff3F): Decorating Untextured Shapes with Distilled Semantic Features Niladri Shekhar Dutt et.al. 2311.17024 link
2023-11-28 Riemannian Self-Attention Mechanism for SPD Networks Rui Wang et.al. 2311.16738 null
2023-11-27 A manometric feature descriptor with linear-SVM to distinguish esophageal contraction vigor Jialin Liu et.al. 2311.15609 null
2023-11-21 Instance-aware 3D Semantic Segmentation powered by Shape Generators and Classifiers Bo Sun et.al. 2311.12291 null
2023-11-20 CurriculumLoc: Enhancing Cross-Domain Geolocalization through Multi-Stage Refinement Boni Hu et.al. 2311.11604 link
2023-11-17 Video-based Sequential Bayesian Homography Estimation for Soccer Field Registration Paul J. Claasen et.al. 2311.10361 link
2023-11-13 Processing and Segmentation of Human Teeth from 2D Images using Weakly Supervised Learning Tomáš Kunzo et.al. 2311.07398 null
2023-11-11 CVTHead: One-shot Controllable Head Avatar with Vertex-feature Transformer Haoyu Ma et.al. 2311.06443 link
2023-11-08 3D Pose Estimation of Tomato Peduncle Nodes using Deep Keypoint Detection and Point Cloud Jianchao Ci et.al. 2311.04699 null
2023-11-06 TAMPAR: Visual Tampering Detection for Parcel Logistics in Postal Supply Chains Alexander Naumann et.al. 2311.03124 link
2023-11-06 An invariant feature extraction for multi-modal images matching Chenzhong Gao et.al. 2311.02842 null
2023-10-20 Feature Selection and Hyperparameter Fine-tuning in Artificial Neural Networks for Wood Quality Classification Mateus Roder et.al. 2310.13490 null
2023-10-12 UniPose: Detecting Any Keypoints Jie Yang et.al. 2310.08530 link
2023-10-10 l-dyno: framework to learn consistent visual features using robot's motion Kartikeya Singh et.al. 2310.06249 link
2023-10-10 Language-driven Open-Vocabulary Keypoint Detection for Animal Body and Face Hao Zhang et.al. 2310.05056 link
2023-10-13 H-InDex: Visual Reinforcement Learning with Hand-Informed Representations for Dexterous Manipulation Yanjie Ze et.al. 2310.01404 link
2023-10-04 Self-supervised Learning of Contextualized Local Visual Embeddings Thalles Santos Silva et.al. 2310.00527 link
2023-10-22 ObVi-SLAM: Long-Term Object-Visual SLAM Amanda Adkins et.al. 2309.15268 link
2023-09-19 LiDAR-Generated Images Derived Keypoints Assisted Point Cloud Registration Scheme in Odometry Estimation Haizhou Zhang et.al. 2309.10436 link
2023-09-18 RIDE: Self-Supervised Learning of Rotation-Equivariant Keypoint Detection and Invariant Description for Endoscopy Mert Asim Karaoglu et.al. 2309.09563 null
2023-09-17 CryoAlign: feature-based method for global and local 3D alignment of EM density maps Bintao He et.al. 2309.09217 null
2023-09-14 EP2P-Loc: End-to-End 3D Point to 2D Pixel Localization for Large-Scale Visual Localization Minjung Kim et.al. 2309.07471 link
2023-09-09 Mirror-Aware Neural Humans Daniel Ajisafe et.al. 2309.04750 link
2023-09-07 InstructDiffusion: A Generalist Modeling Interface for Vision Tasks Zigang Geng et.al. 2309.03895 null
2023-09-04 SKoPe3D: A Synthetic Dataset for Vehicle Keypoint Perception in 3D from Traffic Monitoring Cameras Himanshu Pahadia et.al. 2309.01324 null
2023-09-12 Improving the matching of deformable objects by learning to detect keypoints Felipe Cadar et.al. 2309.00434 link
2023-08-31 SportsSloMo: A New Benchmark and Baselines for Human-centric Video Frame Interpolation Jiaben Chen et.al. 2308.16876 null
2023-08-30 Learning Structure-from-Motion with Graph Attention Networks Lucas Brynte et.al. 2308.15984 link
2023-08-29 A lightweight 3D dense facial landmark estimation model from position map data Shubhajit Basak et.al. 2308.15170 link
2023-08-27 Automatic coarse co-registration of point clouds from diverse scan geometries: a test of detectors and descriptors Francesco Pirotti et.al. 2308.14047 null
2023-08-24 VNI-Net: Vector Neurons-based Rotation-Invariant Descriptor for LiDAR Place Recognition Gengxuan Tian et.al. 2308.12870 null
2023-08-22 LDP-Feat: Image Features with Local Differential Privacy Francesco Pittaluga et.al. 2308.11223 null
2023-08-20 Neural Interactive Keypoint Detection Jie Yang et.al. 2308.10174 link
2023-08-19 ClothesNet: An Information-Rich 3D Garment Model Repository with Simulated Clothes Environment Bingyang Zhou et.al. 2308.09987 null
2023-09-03 DeDoDe: Detect, Don't Describe -- Describe, Don't Detect for Local Feature Matching Johan Edstedt et.al. 2308.08479 link
2023-08-15 CoDeF: Content Deformation Fields for Temporally Consistent Video Processing Hao Ouyang et.al. 2308.07926 link
2023-08-15 ChartDETR: A Multi-shape Detection Network for Visual Chart Recognition Wenyuan Xue et.al. 2308.07743 null
2023-08-14 DELO: Deep Evidential LiDAR Odometry using Partial Optimal Transport Sk Aziz Ali et.al. 2308.07153 null
2023-08-14 2D3D-MATR: 2D-3D Matching Transformer for Detection-free Registration between Images and Point Clouds Minhao Li et.al. 2308.05667 link
2023-08-02 Automated Hit-frame Detection for Badminton Match Analysis Yu-Hang Chien et.al. 2307.16000 link
2023-07-25 Mini-PointNetPlus: a local feature descriptor in deep learning model for 3d environment perception Chuanyu Luo et.al. 2307.13300 null
2023-07-21 Reverse Knowledge Distillation: Training a Large Model using a Small One for Retinal Image Matching on Limited Data Sahar Almahfouz Nasser et.al. 2307.10698 link
2023-07-19 SAMConvex: Fast Discrete Optimization for CT Registration using Self-supervised Anatomical Embedding and Correlation Pyramid Zi Li et.al. 2307.09727 link
2023-07-01 SyMFM6D: Symmetry-aware Multi-directional Fusion for Multi-View 6D Object Pose Estimation Fabian Duffhauss et.al. 2307.00306 link
2023-06-27 Detector-Free Structure from Motion Xingyi He et.al. 2306.15669 link
2023-06-26 CLERA: A Unified Model for Joint Cognitive Load and Eye Region Analysis in the Wild Li Ding et.al. 2306.15073 null
2023-06-28 Topology Repairing of Disconnected Pulmonary Airways and Vessels: Baselines and a Dataset Ziqiao Weng et.al. 2306.07089 link
2023-06-07 Learning Probabilistic Coordinate Fields for Robust Correspondences Weiyue Zhao et.al. 2306.04231 null
2023-06-03 LDEB -- Label Digitization with Emotion Binarization and Machine Learning for Emotion Recognition in Conversational Dialogues Amitabha Dey et.al. 2306.02193 null
2023-06-02 Self-supervised Interest Point Detection and Description for Fisheye and Perspective Images Marcela Mera-Trujillo et.al. 2306.01938 null
2023-06-01 A Probabilistic Relaxation of the Two-Stage Object Pose Estimation Paradigm Onur Beker et.al. 2306.00892 null
2023-05-30 Align, Perturb and Decouple: Toward Better Leverage of Difference Information for RSI Change Detection Supeng Wang et.al. 2305.18714 link
2023-05-23 Diffusion Hyperfeatures: Searching Through Time and Space for Semantic Correspondence Grace Luo et.al. 2305.14334 null
2023-05-15 Non-Separable Multi-Dimensional Network Flows for Visual Computing Viktoria Ehm et.al. 2305.08628 null
2023-05-13 Illumination-insensitive Binary Descriptor for Visual Measurement Based on Local Inter-patch Invariance Xinyu Lin et.al. 2305.07943 link
2023-05-05 HD2Reg: Hierarchical Descriptors and Detectors for Point Cloud Registration Canhui Tang et.al. 2305.03487 link
2023-04-17 Human Pose Estimation in Monocular Omnidirectional Top-View Images Jingrui Yu et.al. 2304.08186 null
2023-04-14 CoPR: Towards Accurate Visual Localization With Continuous Place-descriptor Regression Mubariz Zaffar et.al. 2304.07426 null
2023-04-12 SiLK -- Simple Learned Keypoints Pierre Gleize et.al. 2304.06194 link
2023-04-06 From Saliency to DINO: Saliency-guided Vision Transformer for Few-shot Keypoint Detection Changsheng Lu et.al. 2304.03140 null
2023-03-29 NerVE: Neural Volumetric Edges for Parametric Curve Extraction from Point Cloud Xiangyu Zhu et.al. 2303.16465 link
2023-03-24 PanoVPR: Towards Unified Perspective-to-Equirectangular Visual Place Recognition via Sliding Windows across the Panoramic View Ze Shi et.al. 2303.14095 link
2023-03-23 Semantic Image Attack for Visual Model Diagnosis Jinqi Luo et.al. 2303.13010 null
2023-03-22 Object Pose Estimation with Statistical Guarantees: Conformal Keypoint Detection and Geometric Uncertainty Propagation Heng Yang et.al. 2303.12246 link
2023-03-21 RN-Net: Reservoir Nodes-Enabled Neuromorphic Vision Sensing Network Sangmin Yoo et.al. 2303.10770 null
2023-03-17 ShaRPy: Shape Reconstruction and Hand Pose Estimation from RGB-D with Uncertainty Vanessa Wirth et.al. 2303.10042 null
2023-03-15 Descriptor Distillation for Efficient Multi-Robot SLAM Xiyue Guo et.al. 2303.08420 null
2023-03-15 From Local Binary Patterns to Pixel Difference Networks for Efficient Visual Representation Learning Zhuo Su et.al. 2303.08414 null
2023-03-16 KGNv2: Separating Scale and Pose Prediction for Keypoint-based 6-DoF Grasp Synthesis on RGB-D input Yiye Chen et.al. 2303.05617 link
2023-03-07 External Camera-based Mobile Robot Pose Estimation for Collaborative Perception with Smart Edge Sensors Simon Bultmann et.al. 2303.03797 null
2023-02-26 PaRK-Detect: Towards Efficient Multi-Task Satellite Imagery Road Extraction via Patch-Wise Keypoints Detection Shenwei Xie et.al. 2302.13263 null
2023-02-24 Hybrid machine-learned homogenization: Bayesian data mining and convolutional neural networks Julian Lißner et.al. 2302.12545 null
2023-02-21 Deep Reinforcement Learning Based on Local GNN for Goal-conditioned Deformable Object Rearranging Yuhong Deng et.al. 2302.10446 null
2023-02-12 A Correct-and-Certify Approach to Self-Supervise Object Pose Estimators via Ensemble Self-Training Jingnan Shi et.al. 2302.06019 null
2023-02-11 Rethinking Vision Transformer and Masked Autoencoder in Multimodal Face Anti-Spoofing Zitong Yu et.al. 2302.05744 null
2023-02-09 MAPS: A Noise-Robust Progressive Learning Approach for Source-Free Domain Adaptive Keypoint Detection Yuhe Ding et.al. 2302.04589 link
2023-02-03 Explicit Box Detection Unifies End-to-End Multi-Person Pose Estimation Jie Yang et.al. 2302.01593 link
2023-02-03 Simple, Effective and General: A New Backbone for Cross-view Image Geo-localization Yingying Zhu et.al. 2302.01572 link
2023-01-21 Vision Aided Environment Semantics Extraction and Its Application in mmWave Beam Selection Feiyang Wen et.al. 2301.08973 null
2023-01-18 OnePose++: Keypoint-Free One-Shot Object Pose Estimation without CAD Models Xingyi He et.al. 2301.07673 null
2023-01-12 Towards High Performance One-Stage Human Pose Estimation Ling Li et.al. 2301.04842 null
2022-12-31 Rethinking Rotation Invariance with Point Cloud Registration Jianhui Yu et.al. 2301.00149 null
2023-02-06 Fruit Ripeness Classification: a Survey Matteo Rizzo et.al. 2212.14441 null
2022-12-28 NeMo: 3D Neural Motion Fields from Multiple Video Instances of the Same Action Kuan-Chieh Wang et.al. 2212.13660 link
2022-12-24 HandsOff: Labeled Dataset Generation With No Additional Human Annotations Austin Xu et.al. 2212.12645 null
2022-12-13 Learning to Detect Good Keypoints to Match Non-Rigid Objects in RGB Images Welerson Melo et.al. 2212.09589 link
2022-12-15 Learning Markerless Robot-Depth Camera Calibration and End-Effector Pose Estimation Bugra C. Sefercik et.al. 2212.07567 null
2023-02-01 DDM-NET: End-to-end learning of keypoint feature Detection, Description and Matching for 3D localization Xiangyu Xu et.al. 2212.04575 null
2022-12-07 ViTPose+: Vision Transformer Foundation Model for Generic Body Pose Estimation Yufei Xu et.al. 2212.04246 link
2022-12-15 Designing Feature Vector Representations: A case study from Chemistry Signe Sidwall Thygesen et.al. 2212.03731 null
2022-12-09 DiffuPose: Monocular 3D Human Pose Estimation via Denoising Diffusion Probabilistic Model Jeongjun Choi et.al. 2212.02796 link
2022-12-05 Images Speak in Images: A Generalist Painter for In-Context Visual Learning Xinlong Wang et.al. 2212.02499 link
2022-12-06 R2FD2: Fast and Robust Matching of Multimodal Remote Sensing Image via Repeatable Feature Detector and Rotation-invariant Feature Descriptor Bai Zhu et.al. 2212.02277 null
2022-11-28 FeatureBooster: Boosting Feature Descriptors with a Lightweight Neural Network Xinjiang Wang et.al. 2211.15069 link
2022-11-29 BALF: Simple and Efficient Blur Aware Local Feature Detector Zhenjun Zhao et.al. 2211.14731 null
2022-11-21 Conjugate Product Graphs for Globally Optimal 2D-3D Shape Matching Paul Roetzer et.al. 2211.11589 link
2022-11-07 Learning Feature Descriptors for Pre- and Intra-operative Point Cloud Matching for Laparoscopic Liver Registration Zixin Yang et.al. 2211.03688 null
2022-10-31 Tree Detection and Diameter Estimation Based on Deep Learning Vincent Grondin et.al. 2210.17424 link
2022-10-26 Learning a Task-specific Descriptor for Robust Matching of 3D Point Clouds Zhiyuan Zhang et.al. 2210.14899 null
2022-10-23 Few-Shot Meta Learning for Recognizing Facial Phenotypes of Genetic Disorders Ömer Sümer et.al. 2210.12705 null
2022-10-21 Real-time Detection of 2D Tool Landmarks with Synthetic Training Data Bram Vanherle et.al. 2210.11991 null
2022-10-09 Fusing Event-based Camera and Radar for SLAM Using Spiking Neural Networks with Continual STDP Learning Ali Safa et.al. 2210.04236 null
2022-10-04 Centroid Distance Keypoint Detector for Colored Point Clouds Hanzhe Teng et.al. 2210.01298 link
2022-09-28 Category-Level Global Camera Pose Estimation with Multi-Hypothesis Point Cloud Correspondences Jun-Jee Chao et.al. 2209.14419 null
2022-09-28 USEEK: Unsupervised SE(3)-Equivariant 3D Keypoints for Generalizable Manipulation Zhengrong Xue et.al. 2209.13864 null
2022-10-16 Suture Thread Spline Reconstruction from Endoscopic Images for Robotic Surgery with Reliability-driven Keypoint Detection Neelay Joglekar et.al. 2209.13657 link
2022-09-27 Learning-Based Dimensionality Reduction for Computing Compact and Effective Local Feature Descriptors Hao Dong et.al. 2209.13586 link
2022-09-26 Performance Evaluation of 3D Keypoint Detectors and Descriptors on Coloured Point Clouds in Subsea Environments Kyungmin Jung et.al. 2209.12881 null
2022-10-07 Long-Lived Accurate Keypoints in Event Streams Philippe Chiberre et.al. 2209.10385 null
2022-09-20 Integrative Feature and Cost Aggregation with Transformers for Dense Correspondence Sunghwan Hong et.al. 2209.08742 null
2022-09-15 Online Marker-free Extrinsic Camera Calibration using Person Keypoint Detections Bastian Pätzold et.al. 2209.07393 link
2022-09-07 Deep Learning-Based Automatic Diagnosis System for Developmental Dysplasia of the Hip Yang Li et.al. 2209.03440 null
2022-08-27 Learning to SLAM on the Fly in Unknown Environments: A Continual Learning Approach for Drones in Visually Ambiguous Scenes Ali Safa et.al. 2208.12997 null
2022-08-24 Self-Supervised Endoscopic Image Key-Points Matching Manel Farhat et.al. 2208.11424 link
2022-08-19 Blind-Spot Collision Detection System for Commercial Vehicles Using Multi Deep CNN Architecture Muhammad Muzammel et.al. 2208.08224 null
2022-08-08 MetaGraspNet: A Large-Scale Benchmark Dataset for Scene-Aware Ambidextrous Bin Picking via Physics-based Metaverse Synthesis Maximilian Gilles et.al. 2208.03963 null
2022-08-07 CVLNet: Cross-View Semantic Correspondence Learning for Video-based Camera Localization Yujiao Shi et.al. 2208.03660 null
2022-07-29 Explicit Occlusion Reasoning for Multi-person 3D Human Pose Estimation Qihao Liu et.al. 2208.00090 null
2022-07-25 Translating a Visual LEGO Manual to a Machine-Executable Plan Ruocheng Wang et.al. 2207.12572 null
2022-07-21 Multi-modal Retinal Image Registration Using a Keypoint-Based Vessel Structure Aligning Network Aline Sindel et.al. 2207.10506 null
2022-07-15 Human keypoint detection for close proximity human-robot interaction Jan Docekal et.al. 2207.07742 null
2022-07-15 Adversarial Focal Loss: Asking Your Discriminator for Hard Examples Chen Liu et.al. 2207.07739 null
2022-07-13 Rapid Person Re-Identification via Sub-space Consistency Regularization Qingze Yin et.al. 2207.05933 null
2022-07-07 RWT-SLAM: Robust Visual SLAM for Highly Weak-textured Environments Qihao Peng et.al. 2207.03539 null
2022-08-15 Semi-supervised Human Pose Estimation in Art-historical Images Matthias Springstein et.al. 2207.02976 link
2022-07-01 Weakly-supervised High-fidelity Ultrasound Video Synthesis with Feature Decoupling Jiamin Liang et.al. 2207.00474 null
2022-06-24 Motion Estimation for Large Displacements and Deformations Qiao Chen et.al. 2206.12464 null
2022-06-24 Deep embedded clustering algorithm for clustering PACS repositories Teo Manojlović et.al. 2206.12417 null
2022-06-21 KTN: Knowledge Transfer Network for Learning Multi-person 2D-3D Correspondences Xuanhan Wang et.al. 2206.10090 link
2022-06-20 Self-Supervised Consistent Quantization for Fully Unsupervised Image Retrieval Guile Wu et.al. 2206.09806 null
2022-06-15 A Unified Sequence Interface for Vision Tasks Ting Chen et.al. 2206.07669 link
2022-06-09 Beyond RGB: Scene-Property Synthesis with Neural Radiance Fields Mingtong Zhang et.al. 2206.04669 null
2022-06-03 SNAKE: Shape-aware Neural 3D Keypoint Field Chengliang Zhong et.al. 2206.01724 link
2022-05-17 MulT: An End-to-End Multitask Learning Transformer Deblina Bhattacharjee et.al. 2205.08303 null
2022-05-10 ConfLab: A Rich Multimodal Multisensor Dataset of Free-Standing Social Interactions In-the-Wild Chirag Raman et.al. 2205.05177 link
2022-04-28 Polarimetric imaging for the detection of synthetic models of SARS-CoV-2: a proof of concept Emilio Gomez-Gonzalez et.al. 2204.14050 null
2022-05-02 GRIT: General Robust Image Task Benchmark Tanmay Gupta et.al. 2204.13653 link
2022-05-24 ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation Yufei Xu et.al. 2204.12484 link
2022-04-26 Unified GCNs: Towards Connecting GCNs with CNNs Ziyan Zhang et.al. 2204.12300 null
2022-04-19 Self-Supervised Equivariant Learning for Oriented Keypoint Detection Jongmin Lee et.al. 2204.08613 link
2022-04-17 The Z-axis, X-axis, Weight and Disambiguation Methods for Constructing Local Reference Frame in 3D Registration: An Evaluation Bao Zhao et.al. 2204.08024 null
2022-04-15 2D Human Pose Estimation: A Survey Haoming Chen et.al. 2204.07370 null
2022-04-11 Towards Homogeneous Modality Learning and Multi-Granularity Information Exploration for Visible-Infrared Person Re-Identification Haojie Liu et.al. 2204.04842 null
2022-04-07 Cloning Outfits from Real-World Images to 3D Characters for Generalizable Person Re-Identification Yanan Wang et.al. 2204.02611 link
2022-04-02 SkeleVision: Towards Adversarial Resiliency of Person Tracking with Multi-Task Learning Nilaksh Das et.al. 2204.00734 link
2022-04-01 MS-HLMO: Multi-scale Histogram of Local Main Orientation for Remote Sensing Image Registration Chenzhong Gao et.al. 2204.00260 null
2022-03-29 Assessing Evolutionary Terrain Generation Methods for Curriculum Reinforcement Learning David Howard et.al. 2203.15172 null
2022-03-28 REGTR: End-to-end Point Cloud Correspondences with Transformers Zi Jian Yew et.al. 2203.14517 link
2022-03-27 UMT: Unified Multi-modal Transformers for Joint Video Moment Retrieval and Highlight Detection Ye Liu et.al. 2203.12745 link
2022-03-21 MatchFormer: Interleaving Attention in Transformers for Feature Matching Qing Wang et.al. 2203.09645 link
2022-03-16 PosePipe: Open-Source Human Pose Estimation Pipeline for Clinical Research R. James Cotton et.al. 2203.08792 link
2022-03-11 DRTAM: Dual Rank-1 Tensor Attention Module Hanxing Chi et.al. 2203.05893 null
2022-03-07 Weakly Supervised Learning of Keypoints for 6D Object Pose Estimation Meng Tian et.al. 2203.03498 null
2022-02-10 Motion-Aware Transformer For Occluded Person Re-identification Mi Zhou et.al. 2202.04243 null
2022-02-03 Sim2Real Object-Centric Keypoint Detection and Description Chengliang Zhong et.al. 2202.00448 null
2022-01-16 Cross-Centroid Ripple Pattern for Facial Expression Recognition Monu Verma et.al. 2201.05958 null
2022-01-14 Reproducing BowNet: Learning Representations by Predicting Bags of Visual Words Harry Nguyen et.al. 2201.03556 link
2022-01-10 Thai Finger Spelling Recognition: Investigating MediaPipe Hands Potentials Jinnavat Sanalohit et.al. 2201.03170 null
2022-01-06 A Keypoint Detection and Description Network Based on the Vessel Structure for Multi-Modal Retinal Image Registration Aline Sindel et.al. 2201.02242 null
2021-12-28 Skin feature point tracking using deep feature encodings Jose Ramon Chang et.al. 2112.14159 null
2021-12-23 Data-efficient learning for 3D mirror symmetry detection Yancong Lin et.al. 2112.12579 null
2021-12-22 Improved 2D Keypoint Detection in Out-of-Balance and Fall Situations -- combining input rotations and a kinematic model Michael Zwölfer et.al. 2112.12193 null
2021-12-22 Looking Beyond Corners: Contrastive Learning of Visual Representations for Keypoint Detection and Description Extraction Henrique Siqueira et.al. 2112.12002 link
2021-12-19 Parallel Multi-Scale Networks with Deep Supervision for Hand Keypoint Detection Renjie Li et.al. 2112.10275 null
2021-12-19 GPU optimization of the 3D Scale-invariant Feature Transform Algorithm and a Novel BRIEF-inspired 3D Fast Descriptor Jean-Baptiste Carluer et.al. 2112.10258 link
2021-12-16 Masked Feature Prediction for Self-Supervised Visual Pre-Training Chen Wei et.al. 2112.09133 link
2021-12-13 DenseGAP: Graph-Structured Dense Correspondence Learning with Anchor Points Zhengfei Kuang et.al. 2112.06910 null
2021-12-12 Few-shot Keypoint Detection with Uncertainty Learning for Unseen Species Changsheng Lu et.al. 2112.06183 link
2021-12-13 Few-Shot Keypoint Detection as Task Adaptation via Latent Embeddings Mel Vecerik et.al. 2112.04910 null
2021-12-06 ALIKE: Accurate and Lightweight Keypoint Detection and Descriptor Extraction Xiaoming Zhao et.al. 2112.02906 link
2021-11-25 Attend to Who You Are: Supervising Self-Attention for Keypoint Detection and Instance-Aware Association Sen Yang et.al. 2111.12892 link
2021-11-08 Template NeRF: Towards Modeling Dense Shape Correspondences from Category-Specific Object Images Jianfei Guo et.al. 2111.04237 null
2021-11-04 Voxel-based 3D Detection and Reconstruction of Multiple Objects from a Single Image Feng Liu et.al. 2111.03098 null
2021-11-01 Learning Event-based Spatio-Temporal Feature Descriptors via Local Synaptic Plasticity: A Biologically-realistic Perspective of Computer Vision Ali Safa et.al. 2111.00791 null
2021-10-30 Geometry-Aware Hierarchical Bayesian Learning on Manifolds Yonghui Fan et.al. 2111.00184 null
2021-10-26 CoFiNet: Reliable Coarse-to-fine Correspondences for Robust Point Cloud Registration Hao Yu et.al. 2110.14076 link
2021-10-23 HWTool: Fully Automatic Mapping of an Extensible C++ Image Processing Language to Hardware James Hegarty et.al. 2110.12106 null
2021-10-18 Keypoint-Based Bimanual Shaping of Deformable Linear Objects under Environmental Constraints using Hierarchical Action Planning Shengzeng Huo et.al. 2110.08962 null
2021-10-11 High-order Tensor Pooling with Attention for Action Recognition Piotr Koniusz et.al. 2110.05216 null
2021-10-10 Digging Into Self-Supervised Learning of Feature Descriptors Iaroslav Melekhov et.al. 2110.04773 null
2021-10-04 BPFNet: A Unified Framework for Bimodal Palmprint Alignment and Fusion Zhaoqun Li et.al. 2110.01179 link
2021-10-01 Machine learning aided noise filtration and signal classification for CREDO experiment Łukasz Bibrzycki et.al. [2110.00297](h