Updated on 2024.12.21
Usage instructions: here
SLAM
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-12-18 | Energy-Efficient SLAM via Joint Design of Sensing, Communication, and Exploration Speed | Zidong Han et.al. | 2412.13912 | null |
2024-12-18 | Immersive Human-in-the-Loop Control: Real-Time 3D Surface Meshing and Physics Simulation | Sait Akturk et.al. | 2412.13752 | null |
2024-12-18 | 4D Radar-Inertial Odometry based on Gaussian Modeling and Multi-Hypothesis Scan Matching | Fernando Amodeo et.al. | 2412.13639 | link |
2024-12-17 | NFL-BA: Improving Endoscopic SLAM with Near-Field Light Bundle Adjustment | Andrea Dunn Beltran et.al. | 2412.13176 | null |
2024-12-18 | Dyn-HaMR: Recovering 4D Interacting Hand Motion from a Dynamic Camera | Zhengdi Yu et.al. | 2412.12861 | null |
2024-12-16 | Global SLAM in Visual-Inertial Systems with 5G Time-of-Arrival Integration | Meisam Kabiri et.al. | 2412.12406 | null |
2024-12-16 | MASt3R-SLAM: Real-Time Dense SLAM with 3D Reconstruction Priors | Riku Murai et.al. | 2412.12392 | null |
2024-12-16 | Sonar-based Deep Learning in Underwater Robotics: Overview, Robustness and Challenges | Martin Aubard et.al. | 2412.11840 | null |
2024-12-19 | RoMeO: Robust Metric Visual Odometry | Junda Cheng et.al. | 2412.11530 | null |
2024-12-14 | Affine EKF: Exploring and Utilizing Sufficient and Necessary Conditions for Observability Maintenance to Improve EKF Consistency | Yang Song et.al. | 2412.10809 | link |
2024-12-13 | RP-SLAM: Real-time Photorealistic SLAM with Efficient 3D Gaussian Splatting | Lizhi Bai et.al. | 2412.09868 | null |
2024-12-12 | SLAM3R: Real-Time Dense Scene Reconstruction from Monocular RGB Videos | Yuzheng Liu et.al. | 2412.09401 | link |
2024-12-12 | eCARLA-scenes: A synthetically generated dataset for event-based optical flow prediction | Jad Mansour et.al. | 2412.09209 | link |
2024-12-12 | Drift-free Visual SLAM using Digital Twins | Roxane Merat et.al. | 2412.08496 | null |
2024-12-10 | A Real-time Degeneracy Sensing and Compensation Method for Enhanced LiDAR SLAM | Zongbo Liao et.al. | 2412.07513 | null |
2024-12-08 | DiTer++: Diverse Terrain and Multi-modal Dataset for Multi-Robot SLAM in Multi-session Environments | Juwon Kim et.al. | 2412.05839 | null |
2024-12-06 | MegaSaM: Accurate, Fast, and Robust Structure and Motion from Casual Dynamic Videos | Zhengqi Li et.al. | 2412.04463 | null |
2024-12-05 | Multi-cam Multi-map Visual Inertial Localization: System, Validation and Dataset | Fuzhang Han et.al. | 2412.04287 | link |
2024-12-10 | MOANA: Multi-Radar Dataset for Maritime Odometry and Autonomous Navigation Application | Hyesu Jang et.al. | 2412.03887 | null |
2024-12-04 | Large-Scale Dense 3D Mapping Using Submaps Derived From Orthogonal Imaging Sonars | John McConnell et.al. | 2412.03760 | null |
2024-12-04 | BIMCaP: BIM-based AI-supported LiDAR-Camera Pose Refinement | Miguel Arturo Vega Torres et.al. | 2412.03434 | link |
2024-12-04 | NeRF and Gaussian Splatting SLAM in the Wild | Fabian Schmidt et.al. | 2412.03263 | link |
2024-12-04 | MCVO: A Generic Visual Odometry for Arbitrarily Arranged Multi-Cameras | Huai Yu et.al. | 2412.03146 | link |
2024-12-04 | An indoor DSO-based ceiling-vision odometry system for indoor industrial environments | Abdelhak Bougouffa et.al. | 2412.02950 | null |
2024-12-03 | ROVER: A Multi-Season Dataset for Visual SLAM | Fabian Schmidt et.al. | 2412.02506 | link |
2024-12-04 | RGBDS-SLAM: A RGB-D Semantic Dense SLAM Based on 3D Multi Level Pyramid Gaussian Splatting | Zhenzhong Cao et.al. | 2412.01217 | link |
2024-11-28 | Visual SLAMMOT Considering Multiple Motion Models | Peilin Tian et.al. | 2411.19134 | null |
2024-11-27 | ORB-SLAM3AB: Augmenting ORB-SLAM3 to Counteract Bumps with Optical Flow Inter-frame Matching | Yangrui Dong et.al. | 2411.18174 | null |
2024-11-27 | HI-SLAM2: Geometry-Aware Gaussian SLAM for Fast Monocular Scene Reconstruction | Wei Zhang et.al. | 2411.17982 | null |
2024-11-26 | MapEval: Towards Unified, Robust and Efficient SLAM Map Evaluation Framework | Xiangcheng Hu et.al. | 2411.17928 | link |
2024-11-29 | DROID-Splat: Combining end-to-end SLAM with 3D Gaussian Splatting | Christian Homeyer et.al. | 2411.17660 | link |
2024-11-25 | MAGiC-SLAM: Multi-Agent Gaussian Globally Consistent SLAM | Vladimir Yugay et.al. | 2411.16785 | null |
2024-11-24 | Gaussian Scenes: Pose-Free Sparse-View Scene Reconstruction using Depth-Enhanced Diffusion Priors | Soumava Paul et.al. | 2411.15966 | null |
2024-11-24 | Near-Range Environmental Perception for Inland Waterway Vessels: A Comparative Study of LiDAR and Automotive FMCW RADAR Sensors | R. Herrmann et.al. | 2411.15901 | null |
2024-11-24 | PG-SLAM: Photo-realistic and Geometry-aware RGB-D SLAM in Dynamic Environments | Haoang Li et.al. | 2411.15800 | null |
2024-11-23 | Gassidy: Gaussian Splatting SLAM in Dynamic Environments | Long Wen et.al. | 2411.15476 | null |
2024-11-22 | OVO-SLAM: Open-Vocabulary Online Simultaneous Localization and Mapping | Tomas Berriel Martins et.al. | 2411.15043 | null |
2024-11-22 | A Benchmark Dataset for Collaborative SLAM in Service Environments | Harin Park et.al. | 2411.14775 | link |
2024-11-21 | InCrowd-VI: A Realistic Visual-Inertial Dataset for Evaluating SLAM in Indoor Pedestrian-Rich Spaces for Human Navigation | Marziyeh Bamdad et.al. | 2411.14358 | null |
2024-11-20 | Robust Monocular Visual Odometry using Curriculum Learning | Assaf Lahiany et.al. | 2411.13438 | null |
2024-11-20 | Moving Horizon Estimation for Simultaneous Localization and Mapping with Robust Estimation Error Bounds | Jelena Trisovic et.al. | 2411.13310 | null |
2024-11-19 | 3D Reconstruction by Looking: Instantaneous Blind Spot Detector for Indoor SLAM through Mixed Reality | Hanbeom Chang et.al. | 2411.12514 | null |
2024-11-19 | LiV-GS: LiDAR-Vision Integration for 3D Gaussian Splatting SLAM in Outdoor Environments | Renxiang Xiao et.al. | 2411.12185 | null |
2024-11-18 | Exploring Emerging Trends and Research Opportunities in Visual Place Recognition | Antonios Gasteratos et.al. | 2411.11481 | null |
2024-11-18 | The Blue Horizontal-Branch Stars From the LAMOST Survey: Atmospheric Parameters | Jie Ju et.al. | 2411.11250 | null |
2024-11-17 | A Monocular SLAM-based Multi-User Positioning System with Image Occlusion in Augmented Reality | Wei-Hsiang Lien et.al. | 2411.10940 | null |
2024-11-16 | DGS-SLAM: Gaussian Splatting SLAM in Dynamic Environment | Mangyu Kong et.al. | 2411.10722 | link |
2024-11-15 | The Oxford Spires Dataset: Benchmarking Large-Scale LiDAR-Visual Localisation, Reconstruction and Radiance Field Methods | Yifu Tao et.al. | 2411.10546 | null |
2024-11-15 | BEV-ODOM: Reducing Scale Drift in Monocular Visual Odometry with BEV Representation | Yufei Wei et.al. | 2411.10195 | null |
2024-11-13 | DG-SLAM: Robust Dynamic Gaussian Splatting SLAM with Hybrid Pose Optimization | Yueming Xu et.al. | 2411.08373 | null |
2024-11-13 | MBA-SLAM: Motion Blur Aware Dense Visual SLAM with Radiance Fields Representation | Peng Wang et.al. | 2411.08279 | link |
2024-11-12 | Enhanced Monocular Visual Odometry with AR Poses and Integrated INS-GPS for Robust Localization in Urban Environments | Ankit Shaw et.al. | 2411.08231 | null |
2024-11-12 | NL-SLAM for OC-VLN: Natural Language Grounded SLAM for Object-Centric VLN | Sonia Raychaudhuri et.al. | 2411.07848 | null |
2024-11-11 | Lost in Tracking Translation: A Comprehensive Analysis of Visual SLAM in Human-Centered XR and IoT Ecosystems | Yasra Chandio et.al. | 2411.07146 | null |
2024-11-11 | Learning from Feedback: Semantic Enhancement for Object SLAM Using Foundation Models | Jungseok Hong et.al. | 2411.06752 | null |
2024-11-11 | HomoMatcher: Dense Feature Matching Results with Semi-Dense Efficiency by Homography Estimation | Xiaolong Wang et.al. | 2411.06700 | null |
2024-11-08 | Development of an indoor localization and navigation system based on monocular SLAM for mobile robots | Thanh Nguyen Canh et.al. | 2411.05337 | null |
2024-11-07 | Development of a Service Robot for Hospital Environments in Rehabilitation Medicine with LiDAR Based Simultaneous Localization and Mapping | Sayat Ibrayev et.al. | 2411.04797 | null |
2024-11-07 | MPVO: Motion-Prior based Visual Odometry for PointGoal Navigation | Sayan Paul et.al. | 2411.04796 | null |
2024-11-09 | DEIO: Deep Event Inertial Odometry | Weipeng Guan et.al. | 2411.03928 | link |
2024-11-06 | Performance evaluation of SLAM-ASR: The Good, the Bad, the Ugly, and the Way Forward | Shashi Kumar et.al. | 2411.03866 | null |
2024-11-06 | LCP-Fusion: A Neural Implicit SLAM with Enhanced Local Constraints and Computable Prior | Jiahui Wang et.al. | 2411.03610 | link |
2024-11-05 | LVI-GS: Tightly-coupled LiDAR-Visual-Inertial SLAM using 3D Gaussian Splatting | Huibin Zhao et.al. | 2411.02703 | null |
2024-11-04 | Map++: Towards User-Participatory Visual SLAM Systems with Efficient Map Expansion and Sharing | Xinran Zhang et.al. | 2411.02553 | null |
2024-11-04 | Semantic Masking and Visual Feature Matching for Robust Localization | Luisa Mao et.al. | 2411.01804 | null |
2024-10-31 | XRDSLAM: A Flexible and Modular Framework for Deep Learning based SLAM | Xiaomeng Wang et.al. | 2410.23690 | link |
2024-10-30 | LGU-SLAM: Learnable Gaussian Uncertainty Matching with Deformable Correlation Sampling for Deep Visual SLAM | Yucheng Huang et.al. | 2410.23231 | link |
2024-10-30 | ISAC Prototype System for Multi-Domain Cooperative Communication Networks | Jie Yang et.al. | 2410.22956 | null |
2024-10-30 | SCRREAM : SCan, Register, REnder And Map:A Framework for Annotating Accurate and Dense 3D Indoor Scenes with a Benchmark | HyunJun Jung et.al. | 2410.22715 | null |
2024-10-29 | LiVisSfM: Accurate and Robust Structure-from-Motion with LiDAR and Visual Cues | Hanqing Jiang et.al. | 2410.22213 | null |
2024-10-29 | EnvoDat: A Large-Scale Multisensory Dataset for Robotic Spatial Awareness and Semantic Reasoning in Heterogeneous Environments | Linus Nwankwo et.al. | 2410.22200 | null |
2024-10-28 | NYC-Event-VPR: A Large-Scale High-Resolution Event-Based Visual Place Recognition Dataset in Dense Urban Environments | Taiyi Pan et.al. | 2410.21615 | link |
2024-10-28 | coVoxSLAM: GPU Accelerated Globally Consistent Dense SLAM | Emiliano Höss et.al. | 2410.21149 | link |
2024-11-01 | RopeTP: Global Human Motion Recovery via Integrating Robust Pose Estimation with Diffusion Trajectory Prior | Mingjiang Liang et.al. | 2410.20358 | null |
2024-10-25 | Context-Based Visual-Language Place Recognition | Soojin Woo et.al. | 2410.19341 | link |
2024-10-22 | AG-SLAM: Active Gaussian Splatting SLAM | Wen Jiang et.al. | 2410.17422 | null |
2024-10-22 | Impact of 3D LiDAR Resolution in Graph-based SLAM Approaches: A Comparative Study | J. Jorge et.al. | 2410.17171 | null |
2024-10-19 | EndoMetric: Near-light metric scale monocular SLAM | Raúl Iranzo et.al. | 2410.15065 | null |
2024-10-17 | Automatic Navigation and Voice Cloning Technology Deployment on a Humanoid Robot | Dongkun Han et.al. | 2410.13612 | null |
2024-10-17 | TRLO: An Efficient LiDAR Odometry with 3D Dynamic Object Tracking and Removal | Yanpeng Jia et.al. | 2410.13240 | null |
2024-10-16 | QueensCAMP: an RGB-D dataset for robust Visual SLAM | Hudson M. S. Bruno et.al. | 2410.12520 | link |
2024-10-18 | PAPL-SLAM: Principal Axis-Anchored Monocular Point-Line SLAM | Guanghao Li et.al. | 2410.12324 | null |
2024-10-16 | Towards Autonomous Indoor Parking: A Globally Consistent Semantic SLAM System and A Semantic Localization Subsystem | Yichen Sha et.al. | 2410.12169 | null |
2024-10-15 | V3D-SLAM: Robust RGB-D SLAM in Dynamic Environments with 3D Semantic Geometry Voting | Tuan Dang et.al. | 2410.12068 | link |
2024-10-15 | GSORB-SLAM: Gaussian Splatting SLAM benefits from ORB features and Transmittance information | Wancai Zheng et.al. | 2410.11356 | null |
2024-10-15 | Multiview Scene Graph | Juexiao Zhang et.al. | 2410.11187 | link |
2024-10-14 | MLP-SLAM: Multilayer Perceptron-Based Simultaneous Localization and Mapping With a Dynamic and Static Object Discriminator | Taozhe Li et.al. | 2410.10669 | null |
2024-10-13 | Markerless Aerial-Terrestrial Co-Registration of Forest Point Clouds using a Deformable Pose Graph | Benoit Casseau et.al. | 2410.09896 | null |
2024-10-12 | SLAM-AAC: Enhancing Audio Captioning with Paraphrasing Augmentation and CLAP-Refine through LLMs | Wenxi Chen et.al. | 2410.09503 | link |
2024-10-12 | An Expeditious Spatial Mean Radiant Temperature Mapping Framework using Visual SLAM and Semantic Segmentation | Wei Liang et.al. | 2410.09443 | null |
2024-10-12 | ESVO2: Direct Visual-Inertial Odometry with Stereo Event Cameras | Junkai Niu et.al. | 2410.09374 | link |
2024-10-11 | Voxel-SLAM: A Complete, Accurate, and Versatile LiDAR-Inertial SLAM System | Zheng Liu et.al. | 2410.08935 | link |
2024-10-11 | Optimizing NeRF-based SLAM with Trajectory Smoothness Constraints | Yicheng He et.al. | 2410.08780 | null |
2024-10-10 | ROMAN: Open-Set Object Map Alignment for Robust View-Invariant Global Localization | Mason B. Peterson et.al. | 2410.08262 | null |
2024-10-10 | IncEventGS: Pose-Free Gaussian Splatting from a Single Event Camera | Jian Huang et.al. | 2410.08107 | link |
2024-10-08 | Monocular Visual Place Recognition in LiDAR Maps via Cross-Modal State Space Model and Multi-View Matching | Gongxin Yao et.al. | 2410.06285 | null |
2024-10-08 | Submodular Optimization for Keyframe Selection & Usage in SLAM | David Thorne et.al. | 2410.05576 | null |
2024-10-07 | SharpSLAM: 3D Object-Oriented Visual SLAM with Deblurring for Agile Drones | Denis Davletshin et.al. | 2410.05405 | null |
2024-10-07 | Enhanced Multi-Robot SLAM System with Cross-Validation Matching and Exponential Threshold Keyframe Selection | Ang He et.al. | 2410.05017 | null |
2024-10-05 | A Framework for Reproducible Benchmarking and Performance Diagnosis of SLAM Systems | Nikola Radulov et.al. | 2410.04242 | link |
2024-10-05 | High-Speed Stereo Visual SLAM for Low-Powered Computing Devices | Ashish Kumar et.al. | 2410.04090 | link |
2024-10-04 | EvenNICER-SLAM: Event-based Neural Implicit Encoding SLAM | Shi Chen et.al. | 2410.03812 | null |
2024-10-04 | Estimating Body and Hand Motion in an Ego-sensed World | Brent Yi et.al. | 2410.03665 | null |
2024-10-03 | LiDAR Inertial Odometry And Mapping Using Learned Registration-Relevant Features | Zihao Dong et.al. | 2410.02961 | null |
2024-10-02 | ReFeree: Radar-Based Lightweight and Robust Localization using Feature and Free space | Hogyun Kim et.al. | 2410.01325 | null |
2024-10-01 | Under Pressure: Altimeter-Aided ICP for 3D Maps Consistency | William Dubois et.al. | 2410.00758 | null |
2024-10-02 | CaRtGS: Computational Alignment for Real-Time Gaussian Splatting SLAM | Dapeng Feng et.al. | 2410.00486 | link |
2024-09-30 | Additively Manufactured Open-Source Quadruped Robots for Multi-Robot SLAM Applications | Zachary Fuge et.al. | 2410.00122 | null |
2024-09-30 | Direct Multipath-Based SLAM | Mingchao Liang et.al. | 2409.20552 | null |
2024-09-30 | Robust Gaussian Splatting SLAM by Leveraging Loop Closure | Zunjie Zhu et.al. | 2409.20111 | null |
2024-09-30 | DynORecon: Dynamic Object Reconstruction for Navigation | Yiduo Wang et.al. | 2409.19928 | null |
2024-09-29 | CELLmap: Enhancing LiDAR SLAM through Elastic and Lightweight Spherical Map Representation | Yifan Duan et.al. | 2409.19597 | null |
2024-09-29 | CoT-ST: Enhancing LLM-based Speech Translation with Multimodal Chain-of-Thought | Yexing Du et.al. | 2409.19510 | link |
2024-09-29 | Fast-UMI: A Scalable and Hardware-Independent Universal Manipulation Interface | Ziniu Wu et.al. | 2409.19499 | null |
2024-09-27 | Royal Reveals: LiDAR Mapping of Kronborg Castle, Echoes of Hamlet’s Halls | Leon Davies et.al. | 2409.18752 | null |
2024-09-26 | BlinkTrack: Feature Tracking over 100 FPS via Events and Images | Yichen Shen et.al. | 2409.17981 | null |
2024-09-26 | Neural Implicit Representation for Highly Dynamic LiDAR Mapping and Odometry | Qi Zhang et.al. | 2409.17729 | null |
2024-09-26 | Event-based Stereo Depth Estimation: A Survey | Suman Ghosh et.al. | 2409.17680 | null |
2024-09-25 | Efficient Submap-based Autonomous MAV Exploration using Visual-Inertial SLAM Configurable for LiDARs or Depth Cameras | Sotiris Papatheodorou et.al. | 2409.16972 | null |
2024-09-25 | Go-SLAM: Grounded Object Segmentation and Localization with Gaussian Splatting SLAM | Phu Pham et.al. | 2409.16944 | null |
2024-09-25 | Inline Photometrically Calibrated Hybrid Visual SLAM | Nicolas Abboud et.al. | 2409.16810 | link |
2024-09-25 | Topological SLAM in colonoscopies leveraging deep features and topological priors | Javier Morlana et.al. | 2409.16806 | link |
2024-09-25 | Robo-Platform: A Robotic System for Recording Sensors and Controlling Robots | Masoud Dayani Najafabadi et.al. | 2409.16595 | link |
2024-09-25 | Task-driven SLAM Benchmarking | Yanwei Du et.al. | 2409.16573 | null |
2024-09-24 | SoMaSLAM: 2D Graph SLAM for Sparse Range Sensing with Soft Manhattan World Constraints | Jeahn Han et.al. | 2409.15736 | null |
2024-09-23 | Spectral Graph Theoretic Methods for Enhancing Network Robustness in Robot Localization | Neelkamal Somisetty et.al. | 2409.15506 | null |
2024-09-22 | SPAQ-DL-SLAM: Towards Optimizing Deep Learning-based SLAM for Resource-Constrained Embedded Platforms | Niraj Pudasaini et.al. | 2409.14515 | null |
2024-09-21 | Point Cloud Structural Similarity-based Underwater Sonar Loop Detection | Donghwi Jung et.al. | 2409.14020 | link |
2024-09-20 | HMD $^2$ : Environment-aware Motion Generation from Single Egocentric Head-Mounted Device | Vladimir Guzov et.al. | 2409.13426 | null |
2024-09-20 | Learning Visual Information Utility with PIXER | Yash Turkar et.al. | 2409.13151 | null |
2024-09-19 | MGSO: Monocular Real-time Photometric SLAM with Efficient 3D Gaussian Splatting | Yan Song Hu et.al. | 2409.13055 | null |
2024-09-19 | Hi-SLAM: Scaling-up Semantics in SLAM with a Hierarchically Categorical Gaussian Splatting | Boying Li et.al. | 2409.12518 | null |
2024-09-18 | Bundle Adjustment in the Eager Mode | Zitong Zhan et.al. | 2409.12190 | null |
2024-09-23 | Uncertainty-Aware Visual-Inertial SLAM with Volumetric Occupancy Mapping | Jaehyung Jung et.al. | 2409.12051 | null |
2024-09-18 | Metric-Semantic Factor Graph Generation based on Graph Neural Networks | Jose Andres Millan-Romera et.al. | 2409.11972 | null |
2024-09-18 | Physically-Based Photometric Bundle Adjustment in Non-Lambertian Environments | Lei Cheng et.al. | 2409.11854 | null |
2024-09-18 | ORB-SfMLearner: ORB-Guided Self-supervised Visual Odometry with Selective Online Adaptation | Yanlin Jin et.al. | 2409.11692 | null |
2024-09-18 | SLAM assisted 3D tracking system for laparoscopic surgery | Jingwei Song et.al. | 2409.11688 | null |
2024-09-17 | GLC-SLAM: Gaussian Splatting SLAM with Efficient Loop Closure | Ziheng Xu et.al. | 2409.10982 | null |
2024-09-17 | Label-free correlative morpho-chemical tomography of 3D kidney mesangial cells | Ankit Butola et.al. | 2409.10971 | null |
2024-09-17 | Evaluating and Improving the Robustness of LiDAR-based Localization and Mapping | Bo Yang et.al. | 2409.10824 | link |
2024-09-16 | P2U-SLAM: A Monocular Wide-FoV SLAM System Based on Point Uncertainty and Pose Uncertainty | Yufan Zhang et.al. | 2409.10143 | link |
2024-09-16 | SHIRE: Enhancing Sample Efficiency using Human Intuition in REinforcement Learning | Amogh Joshi et.al. | 2409.09990 | null |
2024-09-16 | Enhancing Visual Inertial SLAM with Magnetic Measurements | Bharat Joshi et.al. | 2409.09904 | null |
2024-09-15 | Marginalizing and Conditioning Gaussians onto Linear Approximations of Smooth Manifolds with Applications in Robotics | Zi Cong Guo et.al. | 2409.09871 | null |
2024-09-15 | Range-SLAM: Ultra-Wideband-Based Smoke-Resistant Real-Time Localization and Mapping | Yi Liu et.al. | 2409.09763 | null |
2024-09-15 | High Definition Map Mapping and Update: A General Overview and Future Directions | Benny Wijaya et.al. | 2409.09726 | null |
2024-09-14 | MAC-VO: Metrics-aware Covariance for Learning-based Stereo Visual Odometry | Yuheng Qiu et.al. | 2409.09479 | null |
2024-09-14 | Distributed Invariant Kalman Filter for Object-level Multi-robot Pose SLAM | Haoying Li et.al. | 2409.09410 | null |
2024-09-14 | GEVO: Memory-Efficient Monocular Visual Odometry Using Gaussians | Dasong Gao et.al. | 2409.09295 | null |
2024-09-14 | Panoramic Direct LiDAR-assisted Visual Odometry | Zikang Yuan et.al. | 2409.09287 | link |
2024-09-11 | Object Depth and Size Estimation using Stereo-vision and Integration with SLAM | Layth Hamad et.al. | 2409.07623 | null |
2024-09-11 | Equivariant Filter for Tightly Coupled LiDAR-Inertial Odometry | Anbo Tao et.al. | 2409.06948 | null |
2024-09-10 | Technical Report of Mobile Manipulator Robot for Industrial Environments | Erfan Amoozad Khalili et.al. | 2409.06693 | null |
2024-09-10 | Heterogeneous LiDAR Dataset for Benchmarking Robust Localization in Diverse Degenerate Scenarios | Zhiqiang Chen et.al. | 2409.04961 | link |
2024-09-08 | FLAF: Focal Line and Feature-constrained Active View Planning for Visual Teach and Repeat | Changfei Fu et.al. | 2409.03457 | null |
2024-09-03 | Integration of Augmented Reality and Mobile Robot Indoor SLAM for Enhanced Spatial Awareness | Michael D. Friske et.al. | 2409.01915 | null |
2024-09-03 | Explicit Second-order LiDAR Bundle Adjustment Algorithm Using Mean Squared Group Metric | Tingchen Ma et.al. | 2409.01856 | null |
2024-09-02 | Saying goodbyes to rotating your phone: Magnetometer calibration during SLAM | Ilari Vallivaara et.al. | 2409.01242 | null |
2024-09-02 | Online One-Dimensional Magnetic Field SLAM with Loop-Closure Detection | Manon Kok et.al. | 2409.01091 | null |
2024-09-02 | Robust Vehicle Localization and Tracking in Rain using Street Maps | Yu Xiang Tan et.al. | 2409.01038 | link |
2024-08-31 | UDGS-SLAM : UniDepth Assisted Gaussian Splatting for Monocular SLAM | Mostafa Mansour et.al. | 2409.00362 | null |
2024-09-04 | Augmented Reality without Borders: Achieving Precise Localization Without Maps | Albert Gassol Puigjaner et.al. | 2408.17373 | null |
2024-08-30 | Efficient Camera Exposure Control for Visual Odometry via Deep Reinforcement Learning | Shuyang Zhang et.al. | 2408.17005 | link |
2024-08-29 | Creating a Segmented Pointcloud of Grapevines by Combining Multiple Viewpoints Through Visual Odometry | Michael Adlerstein et.al. | 2408.16472 | null |
2024-08-28 | Single-Photon 3D Imaging with Equi-Depth Photon Histograms | Kaustubh Sadekar et.al. | 2408.16150 | null |
2024-08-28 | BIM-SLAM: Integrating BIM Models in Multi-session SLAM for Lifelong Mapping using 3D LiDAR | Miguel Arturo Vega Torres et.al. | 2408.15870 | link |
2024-08-30 | Addressing the challenges of loop detection in agricultural environments | Nicolás Soncini et.al. | 2408.15761 | link |
2024-08-28 | ES-PTAM: Event-based Stereo Parallel Tracking and Mapping | Suman Ghosh et.al. | 2408.15605 | link |
2024-08-28 | PointEMRay: A Novel Efficient SBR Framework on Point Based Geometry | Kaiqiao Yang et.al. | 2408.15583 | null |
2024-09-02 | Active Semantic Mapping and Pose Graph Spectral Analysis for Robot Exploration | Rongge Zhang et.al. | 2408.14726 | link |
2024-08-26 | A Survey on Reinforcement Learning Applications in SLAM | Mohammad Dehghani Tezerjani et.al. | 2408.14518 | null |
2024-08-28 | FAST-LIVO2: Fast, Direct LiDAR-Inertial-Visual Odometry | Chunran Zheng et.al. | 2408.14035 | link |
2024-08-21 | Informed, Constrained, Aligned: A Field Analysis on Degeneracy-aware Point Cloud Registration in the Wild | Turcan Tuna et.al. | 2408.11809 | null |
2024-08-21 | LiFCal: Online Light Field Camera Calibration via Bundle Adjustment | Aymeric Fleith et.al. | 2408.11682 | null |
2024-08-21 | Enhanced Visual SLAM for Collision-free Driving with Lightweight Autonomous Cars | Zhihao Lin et.al. | 2408.11582 | null |
2024-08-21 | RaNDT SLAM: Radar SLAM Based on Intensity-Augmented Normal Distributions Transform | Maximilian Hilger et.al. | 2408.11576 | link |
2024-08-21 | Reflex-Based Open-Vocabulary Navigation without Prior Knowledge Using Omnidirectional Camera and Multiple Vision-Language Models | Kento Kawaharazuka et.al. | 2408.11380 | null |
2024-08-20 | LoopSplat: Loop Closure by Registering 3D Gaussian Splats | Liyuan Zhu et.al. | 2408.10154 | link |
2024-08-19 | Quantitative 3D Map Accuracy Evaluation Hardware and Algorithm for LiDAR(-Inertial) SLAM | Sanghyun Hahn et.al. | 2408.09727 | link |
2024-08-17 | GSLAMOT: A Tracklet and Query Graph-based Simultaneous Locating, Mapping, and Multiple Object Tracking System | Shuo Wang et.al. | 2408.09191 | null |
2024-08-15 | GOReloc: Graph-based Object-Level Relocalization for Visual SLAM | Yutong Wang et.al. | 2408.07917 | link |
2024-08-14 | Inverse k-visibility for RSSI-based Indoor Geometric Mapping | Junseo Kim et.al. | 2408.07757 | null |
2024-08-14 | Narrowing your FOV with SOLiD: Spatially Organized and Lightweight Global Descriptor for FOV-constrained LiDAR Place Recognition | Hogyun Kim et.al. | 2408.07330 | link |
2024-08-12 | CAD-Mesher: A Convenient, Accurate, Dense Mesh-based Mapping Module in SLAM for Dynamic Environments | Yanpeng Jia et.al. | 2408.05981 | null |
2024-08-21 | Visual SLAM with 3D Gaussian Primitives and Depth Priors Enabling Novel View Synthesis | Zhongche Qu et.al. | 2408.05635 | null |
2024-08-10 | TOSS: Real-time Tracking and Moving Object Segmentation for Static Scene Mapping | Seoyeon Jang et.al. | 2408.05453 | null |
2024-08-08 | Evaluating Modern Approaches in 3D Scene Reconstruction: NeRF vs Gaussian-Based Methods | Yiming Zhou et.al. | 2408.04268 | null |
2024-08-07 | Towards Real-Time Gaussian Splatting: Accelerating 3DGS through Photometric SLAM | Yan Song Hu et.al. | 2408.03825 | null |
2024-08-07 | AirSLAM: An Efficient and Illumination-Robust Point-Line Visual SLAM System | Kuan Xu et.al. | 2408.03520 | link |
2024-08-06 | BodySLAM: A Generalized Monocular Visual SLAM Framework for Surgical Applications | G. Manni et.al. | 2408.03078 | link |
2024-08-04 | SLAMS-Propelled Electron Acceleration at High-Mach Number Astrophysical Shocks | Vladimir Zeković et.al. | 2408.02084 | null |
2024-08-03 | Visual-Inertial SLAM for Agricultural Robotics: Benchmarking the Benefits and Computational Costs of Loop Closing | Fabian Schmidt et.al. | 2408.01716 | null |
2024-08-03 | Deep Patch Visual SLAM | Lahav Lipson et.al. | 2408.01654 | link |
2024-08-02 | Momentum Capture and Prediction System Based on Wimbledon Open2023 Tournament Data | Chang Liu et.al. | 2408.01544 | null |
2024-08-07 | IG-SLAM: Instant Gaussian SLAM | F. Aykut Sarikamis et.al. | 2408.01126 | null |
2024-08-01 | Collecting Larg-Scale Robotic Datasets on a High-Speed Mobile Platform | Yuxin Lin et.al. | 2408.00545 | null |
2024-08-01 | High-Quality, ROS Compatible Video Encoding and Decoding for High-Definition Datasets | Jian Li et.al. | 2408.00538 | link |
2024-07-31 | SuperVINS: A visual-inertial SLAM framework integrated deep learning features | Hongkun Luo et.al. | 2407.21348 | link |
2024-07-30 | NIS-SLAM: Neural Implicit Semantic RGB-D SLAM for 3D Consistent Scene Understanding | Hongjia Zhai et.al. | 2407.20853 | null |
2024-07-29 | A flexible framework for accurate LiDAR odometry, map manipulation, and localization | José Luis Blanco-Claraco et.al. | 2407.20465 | link |
2024-07-28 | Solving Short-Term Relocalization Problems In Monocular Keyframe Visual SLAM Using Spatial And Semantic Data | Azmyin Md. Kamal et.al. | 2407.19518 | null |
2024-07-26 | Real-time Uncertainty-Aware Motion Planning for Magnetic-based Navigation | Aditya Penumarti et.al. | 2407.19046 | null |
2024-07-26 | HERO-SLAM: Hybrid Enhanced Robust Optimization of Neural SLAM | Zhe Xin et.al. | 2407.18813 | null |
2024-07-25 | CodedVO: Coded Visual Odometry | Sachin Shah et.al. | 2407.18240 | null |
2024-07-28 | HumanVid: Demystifying Training Data for Camera-controllable Human Image Animation | Zhenzhi Wang et.al. | 2407.17438 | link |
2024-07-22 | Memory Management for Real-Time Appearance-Based Loop Closure Detection | Mathieu Labbé et.al. | 2407.15890 | null |
2024-07-22 | Reinforcement Learning Meets Visual Odometry | Nico Messikommer et.al. | 2407.15626 | link |
2024-07-22 | Online Global Loop Closure Detection for Large-Scale Multi-Session Graph-Based SLAM | Mathieu Labbe et.al. | 2407.15305 | null |
2024-07-21 | Semi-Supervised Pipe Video Temporal Defect Interval Localization | Zhu Huang et.al. | 2407.15170 | null |
2024-07-21 | VoxDepth: Rectification of Depth Images on Edge Devices | Yashashwee Chakrabarty et.al. | 2407.15067 | null |
2024-07-20 | From Underground Mines to Offices: A Versatile and Robust Framework for Range-Inertial SLAM | Lorenzo Montano-Oliván et.al. | 2407.14797 | null |
2024-07-19 | MSSP : A Versatile Multi-Scenario Adaptable Intelligent Robot Simulation Platform Based on LIDAR-Inertial Fusion | Qiyan Li et.al. | 2407.14102 | null |
2024-07-18 | A New Tightly-Coupled Dual-VIO for a Mobile Manipulator With Dynamic Locomotion | Jianxiang Xu et.al. | 2407.13878 | link |
2024-07-18 | Learn to Memorize and to Forget: A Continual Learning Perspective of Dynamic SLAM | Baicheng Li et.al. | 2407.13338 | null |
2024-07-18 | Attenuation-Aware Weighted Optical Flow with Medium Transmission Map for Learning-based Visual Odometry in Underwater terrain | Bach Nguyen Gia et.al. | 2407.13159 | link |
2024-07-17 | Is That Rain? Understanding Effects on Visual Odometry Performance for Autonomous UAVs and Efficient DNN-based Rain Classification at the Edge | Andrea Albanese et.al. | 2407.12663 | null |
2024-07-17 | Towards Revisiting Visual Place Recognition for Joining Submaps in Multimap SLAM | Markus Weißflog et.al. | 2407.12408 | null |
2024-07-19 | Fisheye-Calib-Adapter: An Easy Tool for Fisheye Camera Model Conversion | Sangjun Lee et.al. | 2407.12405 | link |
2024-07-17 | Fusion LiDAR-Inertial-Encoder data for High-Accuracy SLAM | Manh Do Duc et.al. | 2407.11870 | null |
2024-07-17 | GV-Bench: Benchmarking Local Feature Matching for Geometric Verification of Long-term Loop Closure Detection | Jingwen Yu et.al. | 2407.11736 | link |
2024-07-16 | Snail-Radar: A large-scale diverse dataset for the evaluation of 4D-radar-based SLAM systems | Jianzhu Huai et.al. | 2407.11705 | null |
2024-07-16 | Batch SLAM with PMBM Data Association Sampling and Graph-Based Optimization | Yu Ge et.al. | 2407.11643 | null |
2024-07-16 | I $^2$ -SLAM: Inverting Imaging Process for Robust Photorealistic Dense SLAM | Gwangtak Bae et.al. | 2407.11347 | null |
2024-07-16 | FR-SLAM: A SLAM Improvement Method Based on Floor Plan Registration | Jiantao Feng et.al. | 2407.11299 | null |
2024-07-15 | Evaluating geometric accuracy of NeRF reconstructions compared to SLAM method | Adam Korycki et.al. | 2407.11238 | null |
2024-07-12 | An Adaptive Indoor Localization Approach Using WiFi RSSI Fingerprinting with SLAM-Enabled Robotic Platform and Deep Neural Networks | Seyed Alireza Rahimi Azghadi et.al. | 2407.09242 | null |
2024-07-11 | SGLC: Semantic Graph-Guided Coarse-Fine-Refine Full Loop Closing for LiDAR SLAM | Neng Wang et.al. | 2407.08106 | link |
2024-07-09 | Hyperion – A fast, versatile symbolic Gaussian Belief Propagation framework for Continuous-Time SLAM | David Hug et.al. | 2407.07074 | link |
2024-07-15 | A Neurosymbolic Approach to Adaptive Feature Extraction in SLAM | Yasra Chandio et.al. | 2407.06889 | null |
2024-07-08 | Object-Oriented Material Classification and 3D Clustering for Improved Semantic Perception and Mapping in Mobile Robots | Siva Krishna Ravipati et.al. | 2407.06077 | link |
2024-07-10 | Co-RaL: Complementary Radar-Leg Odometry with 4-DoF Optimization and Rolling Contact | Sangwoo Jung et.al. | 2407.05820 | null |
2024-07-07 | Active Collaborative Visual SLAM exploiting ORB Features | Muhammad Farhan Ahmed et.al. | 2407.05453 | null |
2024-07-06 | VIPS-Odom: Visual-Inertial Odometry Tightly-coupled with Parking Slots for Autonomous Parking | Xuefeng Jiang et.al. | 2407.05017 | null |
2024-07-06 | Symmetric Linear Arc Monadic Datalog and Gadget Reductions | Manuel Bodirsky et.al. | 2407.04924 | null |
2024-07-03 | Ultra-Lightweight Collaborative Mapping for Robot Swarms | Vlad Niculescu et.al. | 2407.03136 | null |
2024-07-01 | RoDyn-SLAM: Robust Dynamic Dense RGB-D SLAM with Neural Radiance Fields | Haochen Jiang et.al. | 2407.01303 | link |
2024-07-01 | Preserving Relative Localization of FoV-Limited Drone Swarm via Active Mutual Observation | Lianjie Guo et.al. | 2407.01292 | link |
2024-07-01 | Collaborative Graph Exploration with Reduced Pose-SLAM Uncertainty via Submodular Optimization | Ruofei Bai et.al. | 2407.01013 | link |
2024-06-30 | Ego-to-Exo: Interfacing Third Person Visuals from Egocentric Views in Real-time for Improved ROV Teleoperation | Adnan Abdullah et.al. | 2407.00848 | null |
2024-06-30 | OfCaM: Global Human Mesh Recovery via Optimization-free Camera Motion Scale Calibration | Fengyuan Yang et.al. | 2407.00574 | null |
2024-06-24 | Compressing Search with Language Models | Thomas Mulc et.al. | 2407.00085 | null |
2024-06-28 | CLOi-Mapper: Consistent, Lightweight, Robust, and Incremental Mapper With Embedded Systems for Commercial Robot Services | DongKi Noh et.al. | 2406.19634 | null |
2024-06-25 | Benchmarking SLAM Algorithms in the Cloud: The SLAM Hive System | Xinzhe Liu et.al. | 2406.17586 | null |
2024-07-02 | SlideSLAM: Sparse, Lightweight, Decentralized Metric-Semantic SLAM for Multi-Robot Navigation | Xu Liu et.al. | 2406.17249 | link |
2024-06-24 | From Perfect to Noisy World Simulation: Customizable Embodied Multi-modal Perturbations for SLAM Robustness Benchmarking | Xiaohao Xu et.al. | 2406.16850 | link |
2024-06-23 | Imperative Learning: A Self-supervised Neural-Symbolic Learning Framework for Robot Autonomy | Chen Wang et.al. | 2406.16087 | null |
2024-06-19 | Simultaneous Map and Object Reconstruction | Nathaniel Chodosh et.al. | 2406.13896 | null |
2024-06-14 | Galibr: Targetless LiDAR-Camera Extrinsic Calibration Method via Ground Plane Initialization | Wonho Song et.al. | 2406.11599 | null |
2024-06-16 | Self-supervised Pretraining and Finetuning for Monocular Depth and Visual Odometry | Boris Chidlovskii et.al. | 2406.11019 | null |
2024-06-15 | Detection and Utilization of Reflections in LiDAR Scans Through Plane Optimization and Plane SLAM | Yinjie Li et.al. | 2406.10494 | link |
2024-06-12 | From Variance to Veracity: Unbundling and Mitigating Gradient Variance in Differentiable Bundle Adjustment Layers | Swaminathan Gurumurthy et.al. | 2406.07785 | link |
2024-06-27 | Notes on Kalman Filter (KF, EKF, ESKF, IEKF, IESKF) | Gyubeom Im et.al. | 2406.06427 | null |
2024-06-10 | Notes on Various Errors and Jacobian Derivations for SLAM | Gyubeom Im et.al. | 2406.06422 | null |
2024-06-23 | Multicam-SLAM: Non-overlapping Multi-camera SLAM for Indirect Visual Localization and Navigation | Shenghao Li et.al. | 2406.06374 | link |
2024-06-15 | Visual-Inertial SLAM as Simple as A, B, VINS | Nathaniel Merrill et.al. | 2406.05969 | null |
2024-06-09 | MAP-ADAPT: Real-Time Quality-Adaptive Semantic 3D Maps | Jianhao Zheng et.al. | 2406.05849 | null |
2024-06-06 | Open Problem: Active Representation Learning | Nikola Milosevic et.al. | 2406.03845 | null |
2024-06-04 | ProGEO: Generating Prompts through Image-Text Contrastive Learning for Visual Geo-localization | Chen Mao et.al. | 2406.01906 | link |
2024-06-03 | The Empirical Impact of Forgetting and Transfer in Continual Visual Odometry | Paolo Cudrano et.al. | 2406.01797 | null |
2024-06-03 | Self-Supervised Geometry-Guided Initialization for Robust Monocular Visual Odometry | Takayuki Kanai et.al. | 2406.00929 | null |
2024-06-02 | Visual place recognition for aerial imagery: A survey | Ivan Moskalenko et.al. | 2406.00885 | link |
2024-05-30 | Structure Gaussian SLAM with Manhattan World Hypothesis | Shuhong Liu et.al. | 2405.20031 | null |
2024-05-30 | Semantic Landmark Detection & Classification Using Neural Networks For 3D In-Air Sonar | Wouter Jansen et.al. | 2405.19869 | null |
2024-05-30 | SLAM-based Joint Calibration of Multiple Asynchronous Microphone Arrays and Sound Source Localization | Jiang Wang et.al. | 2405.19813 | link |
2024-05-30 | TAMBRIDGE: Bridging Frame-Centered Tracking and 3D Gaussian Splatting for Enhanced SLAM | Peifeng Jiang et.al. | 2405.19614 | null |
2024-05-27 | CudaSIFT-SLAM: multiple-map visual SLAM for full procedure mapping in real human endoscopy | Richard Elvira et.al. | 2405.16932 | null |
2024-05-26 | Splat-SLAM: Globally Optimized RGB-only SLAM with 3D Gaussians | Erik Sandström et.al. | 2405.16544 | link |
2024-05-24 | NeB-SLAM: Neural Blocks-based Salable RGB-D SLAM for Unknown Scenes | Lizhi Bai et.al. | 2405.15151 | null |
2024-05-23 | ETA-INIT: Enhancing the Translation Accuracy for Stereo Visual-Inertial SLAM Initialization | Han Song et.al. | 2405.15082 | null |
2024-05-23 | Synergistic Global-space Camera and Human Reconstruction from Videos | Yizhou Zhao et.al. | 2405.14855 | null |
2024-05-23 | CoPeD-Advancing Multi-Robot Collaborative Perception: A Comprehensive Dataset in Real-World Environments | Yang Zhou et.al. | 2405.14731 | link |
2024-05-23 | Efficient Robot Learning for Perception and Mapping | Niclas Vödisch et.al. | 2405.14688 | null |
2024-05-22 | Monocular Gaussian SLAM with Language Extended Loop Closure | Tian Lan et.al. | 2405.13748 | null |
2024-05-26 | NV-LIO: LiDAR-Inertial Odometry using Normal Vectors Towards Robust SLAM in Multifloor Environments | Dongha Chung et.al. | 2405.12563 | link |
2024-05-20 | EdgeLoc: A Communication-Adaptive Parallel System for Real-Time Localization in Infrastructure-Assisted Autonomous Driving | Boyi Liu et.al. | 2405.12120 | null |
2024-05-24 | Outlier-Robust Long-Term Robotic Mapping Leveraging Ground Segmentation | Hyungtae Lim et.al. | 2405.11176 | null |
2024-05-18 | MotionGS : Compact Gaussian Splatting SLAM by Motion Filter | Xinli Guo et.al. | 2405.11129 | link |
2024-05-17 | CCTNet: A Circular Convolutional Transformer Network for LiDAR-based Place Recognition Handling Movable Objects Occlusion | Gang Wang et.al. | 2405.10793 | null |
2024-05-17 | Occupancy-SLAM: Simultaneously Optimizing Robot Poses and Continuous Occupancy Map | Liang Zhao et.al. | 2405.10743 | null |
2024-05-10 | MGS-SLAM: Monocular Sparse Tracking and Gaussian Mapping with Depth Smooth Regularization | Pengcheng Zhu et.al. | 2405.06241 | null |
2024-05-07 | Bayesian Simultaneous Localization and Multi-Lane Tracking Using Onboard Sensors and a SD Map | Yuxuan Xia et.al. | 2405.04290 | null |
2024-05-07 | IMU-Aided Event-based Stereo Visual Odometry | Junkai Niu et.al. | 2405.04071 | link |
2024-04-27 | An Attention-Based Deep Learning Architecture for Real-Time Monocular Visual Odometry: Applications to GPS-free Drone Navigation | Olivier Brochu Dufour et.al. | 2404.17745 | null |
2024-04-26 | Camera Motion Estimation from RGB-D-Inertial Scene Flow | Samuel Cerezo et.al. | 2404.17251 | null |
2024-04-23 | Multi-Session SLAM with Differentiable Wide-Baseline Pose Optimization | Lahav Lipson et.al. | 2404.15263 | link |
2024-04-18 | SPOT: Point Cloud Based Stereo Visual Place Recognition for Similar and Opposing Viewpoints | Spencer Carmichael et.al. | 2404.12339 | null |
2024-04-17 | VBR: A Vision Benchmark in Rome | Leonardo Brizi et.al. | 2404.11322 | link |
2024-04-14 | Increasing SLAM Pose Accuracy by Ground-to-Satellite Image Registration | Yanhao Zhang et.al. | 2404.09169 | link |
2024-04-06 | Salient Sparse Visual Odometry With Pose-Only Supervision | Siyu Chen et.al. | 2404.04677 | null |
2024-03-25 | A Comparative Analysis of Visual Odometry in Virtual and Real-World Railways Environments | Gianluca D’Amico et.al. | 2403.17084 | null |
2024-03-19 | On Designing Consistent Covariance Recovery from a Deep Learning Visual Odometry Engine | Jagatpreet Singh Nir et.al. | 2403.13170 | null |
2024-03-18 | The POLAR Traverse Dataset: A Dataset of Stereo Camera Images Simulating Traverses across Lunar Polar Terrain under Extreme Lighting Conditions | Margaret Hansen et.al. | 2403.12194 | null |
2024-03-18 | An Accurate and Real-time Relative Pose Estimation from Triple Point-line Images by Decoupling Rotation and Translation | Zewen Xu et.al. | 2403.11639 | null |
2024-03-16 | Efficient Domain Adaptation for Endoscopic Visual Odometry | Junyang Wu et.al. | 2403.10860 | null |
2024-03-14 | Visual Inertial Odometry using Focal Plane Binary Features (BIT-VIO) | Matthew Lisondra et.al. | 2403.09882 | null |
2024-03-02 | Grid-based Fast and Structural Visual Odometry | Zhang Zhihe et.al. | 2403.01110 | null |
2024-02-25 | VOLoc: Visual Place Recognition by Querying Compressed Lidar Map | Xudong Cai et.al. | 2402.15961 | link |
2024-02-22 | Secure Navigation using Landmark-based Localization in a GPS-denied Environment | Ganesh Sapkota et.al. | 2402.14280 | null |
2024-02-19 | Landmark-based Localization using Stereo Vision and Deep Learning in GPS-Denied Battlefield Environment | Ganesh Sapkota et.al. | 2402.12551 | null |
2024-02-07 | Online and Certifiably Correct Visual Odometry and Mapping | Devansh R Agrawal et.al. | 2402.05254 | null |
2024-02-06 | YOLOPoint Joint Keypoint and Object Detection | Anton Backhaus et.al. | 2402.03989 | link |
2024-01-19 | Motion Consistency Loss for Monocular Visual Odometry with Attention-Based Deep Learning | André O. Françani et.al. | 2401.10857 | null |
2024-01-17 | Event-Based Visual Odometry on Non-Holonomic Ground Vehicles | Wanting Xu et.al. | 2401.09331 | link |
2024-01-11 | On State Estimation in Multi-Sensor Fusion Navigation: Optimization and Filtering | Feng Zhu et.al. | 2401.05836 | null |
2023-12-19 | Loss it right: Euclidean and Riemannian Metrics in Learning-based Visual Odometry | Olaya Álvarez-Tuñón et.al. | 2401.05396 | link |
2024-01-07 | Amirkabir campus dataset: Real-world challenges and scenarios of Visual Inertial Odometry (VIO) for visually impaired people | Ali Samadzadeh et.al. | 2401.03604 | link |
2024-01-03 | LEAP-VO: Long-term Effective Any Point Tracking for Visual Odometry | Weirong Chen et.al. | 2401.01887 | null |
2023-12-28 | SR-LIVO: LiDAR-Inertial-Visual Odometry and Mapping with Sweep Reconstruction | Zikang Yuan et.al. | 2312.16800 | link |
2023-12-20 | NeRF-VO: Real-Time Sparse Visual Odometry with Neural Radiance Fields | Jens Naumann et.al. | 2312.13471 | null |
2023-12-22 | Ternary-type Opacity and Hybrid Odometry for RGB-only NeRF-SLAM | Junru Lin et.al. | 2312.13332 | null |
2023-12-20 | Brain-Inspired Visual Odometry: Balancing Speed and Interpretability through a System of Systems Approach | Habib Boloorchi Tabrizi et.al. | 2312.13162 | link |
2023-12-20 | Trajectory Approximation of Video Based on Phase Correlation for Forward Facing Camera | Abdulkadhem A. Abdulkadhem et.al. | 2312.12680 | null |
2023-12-15 | Deep Event Visual Odometry | Simon Klenk et.al. | 2312.09800 | link |
2023-12-10 | SuperPrimitive: Scene Reconstruction at a Primitive Level | Kirill Mazur et.al. | 2312.05889 | null |
2023-12-04 | iMatching: Imperative Correspondence Learning | Zitong Zhan et.al. | 2312.02141 | link |
2023-11-30 | Event-based Visual Inertial Velometer | Xiuyuan Lu et.al. | 2311.18189 | null |
2023-11-21 | CoVOR-SLAM: Cooperative SLAM using Visual Odometry and Ranges for Multi-Robot Systems | Young-Hee Lee et.al. | 2311.12580 | null |
2023-11-10 | Dense Visual Odometry Using Genetic Algorithm | Slimane Djema et.al. | 2311.06149 | null |
2023-11-07 | Inertial Guided Uncertainty Estimation of Feature Correspondence in Visual-Inertial Odometry/SLAM | Seongwook Yoon et.al. | 2311.03722 | null |
2023-10-23 | Converting Depth Images and Point Clouds for Feature-based Pose Estimation | Robert Lösch et.al. | 2310.14924 | link |
2023-10-17 | Open-Structure: a Structural Benchmark Dataset for SLAM Algorithms | Yanyan Li et.al. | 2310.10931 | link |
2023-10-12 | Jointly Optimized Global-Local Visual Localization of UAVs | Haoling Li et.al. | 2310.08082 | null |
2023-10-10 | l-dyno: framework to learn consistent visual features using robot’s motion | Kartikeya Singh et.al. | 2310.06249 | link |
2023-10-08 | XVO: Generalized Visual Odometry via Cross-Modal Self-Training | Lei Lai et.al. | 2309.16772 | null |
2023-10-22 | ObVi-SLAM: Long-Term Object-Visual SLAM | Amanda Adkins et.al. | 2309.15268 | link |
2023-09-23 | Tag-based Visual Odometry Estimation for Indoor UAVs Localization | Massimiliano Bertoni et.al. | 2309.13311 | null |
2023-09-22 | Exposing the Unseen: Exposure Time Emulation for Offline Benchmarking of Vision Algorithms | Olivier Gamache et.al. | 2309.13139 | link |
2023-09-20 | Conformalized Multimodal Uncertainty Regression and Reasoning | Domenico Parente et.al. | 2309.11018 | null |
2023-09-20 | OCC-VO: Dense Mapping via 3D Occupancy-Based Visual Odometry for Autonomous Driving | Heng Li et.al. | 2309.11011 | link |
2023-09-19 | LiDAR-Generated Images Derived Keypoints Assisted Point Cloud Registration Scheme in Odometry Estimation | Haizhou Zhang et.al. | 2309.10436 | link |
2023-09-21 | Dive Deeper into Rectifying Homography for Stereo Camera Online Self-Calibration | Hongbo Zhao et.al. | 2309.10314 | null |
2023-09-18 | End-to-End Learned Event- and Image-based Visual Odometry | Roberto Pellerito et.al. | 2309.09947 | link |
2023-09-14 | An Explicit Method for Fast Monocular Depth Recovery in Corridor Environments | Yehao Liu et.al. | 2309.07408 | null |
2023-09-11 | Evaluating Visual Odometry Methods for Autonomous Driving in Rain | Yu Xiang Tan et.al. | 2309.05249 | null |
2023-09-08 | Robot Localization and Mapping Final Report – Sequential Adversarial Learning for Self-Supervised Deep Visual Odometry | Akankshya Kar et.al. | 2309.04147 | null |
2023-09-04 | EMR-MSF: Self-Supervised Recurrent Monocular Scene Flow Exploiting Ego-Motion Rigidity | Zijie Jiang et.al. | 2309.01296 | null |
2023-08-27 | Deep Learning for Visual Localization and Mapping: A Survey | Changhao Chen et.al. | 2308.14039 | null |
2023-08-19 | Enhancing State Estimation in Robots: A Data-Driven Approach with Differentiable Ensemble Kalman Filters | Xiao Liu et.al. | 2308.09870 | link |
2023-08-12 | 4DRVO-Net: Deep 4D Radar-Visual Odometry Using Multi-Modal and Multi-Scale Adaptive Fusion | Guirong Zhuo et.al. | 2308.06573 | null |
2023-08-10 | Mono-hydra: Real-time 3D scene graph construction from monocular camera input with IMU | U. V. B. L. Udugama et.al. | 2308.05515 | null |
2023-08-02 | A Small Form Factor Aerial Research Vehicle for Pick-and-Place Tasks with Onboard Real-Time Object Detection and Visual Odometry | Cora A. Dimmig et.al. | 2308.01398 | null |
2023-08-02 | Stereo Visual Odometry with Deep Learning-Based Point and Line Feature Matching using an Attention Graph Neural Network | Shenbagaraj Kannapiran et.al. | 2308.01125 | null |
2023-08-02 | Preliminary Design of the Dragonfly Navigation Filter | Ben Schilling et.al. | 2307.13513 | null |
2023-07-19 | Optimizing the extended Fourier Mellin Transformation Algorithm | Wenqing Jiang et.al. | 2307.10015 | link |
2023-07-15 | Tightly-Coupled LiDAR-Visual SLAM Based on Geometric Features for Mobile Agents | Ke Cao et.al. | 2307.07763 | null |
2023-07-26 | Event-based Stereo Visual Odometry with Native Temporal Resolution via Continuous-time Gaussian Process Regression | Jianeng Wang et.al. | 2306.01188 | null |
2023-07-06 | OSPC: Online Sequential Photometric Calibration | Jawad Haidar et.al. | 2305.17673 | null |
2023-05-15 | Event Camera-based Visual Odometry for Dynamic Motion Tracking of a Legged Robot Using Adaptive Time Surface | Shifan Zhu et.al. | 2305.08962 | null |
2023-05-10 | Transformer-based model for monocular visual odometry: a video understanding approach | André O. Françani et.al. | 2305.06121 | link |
2023-04-29 | Modality-invariant Visual Odometry for Embodied Vision | Marius Memmel et.al. | 2305.00348 | link |
2023-04-21 | FSNet: Redesign Self-Supervised MonoDepth for Full-Scale Depth Prediction for Autonomous Driving | Yuxuan Liu et.al. | 2304.10719 | null |
2023-07-08 | Visual-LiDAR Odometry and Mapping with Monocular Scale Correction and Visual Bootstrapping | Hanyu Cai et.al. | 2304.08978 | null |
2023-04-12 | SiLK – Simple Learned Keypoints | Pierre Gleize et.al. | 2304.06194 | link |
2023-04-11 | ClusterFusion: Real-time Relative Positioning and Dense Reconstruction for UAV Cluster | Yifei Dong et.al. | 2304.04943 | null |
2023-03-21 | Learning a Depth Covariance Function | Eric Dexheimer et.al. | 2303.12157 | null |
2023-03-21 | Online Learning of Wheel Odometry Correction for Mobile Robots with Attention-based Neural Network | Alessandro Navone et.al. | 2303.11725 | null |
2023-03-20 | VR-SLAM: A Visual-Range Simultaneous Localization and Mapping System using Monocular Camera and Ultra-wideband Sensors | Thien Hoang Nguyen et.al. | 2303.10903 | null |
2023-03-17 | CoVIO: Online Continual Learning for Visual-Inertial Odometry | Niclas Vödisch et.al. | 2303.10149 | link |
2023-03-15 | UMS-VINS: United Monocular-Stereo Features for Visual-Inertial Tightly Coupled Odometry | Chaoyang Jiang et.al. | 2303.08550 | null |
2023-03-13 | Discovering Multiple Algorithm Configurations | Leonid Keselman et.al. | 2303.07434 | null |
2023-03-09 | Virtual Inverse Perspective Mapping for Simultaneous Pose and Motion Estimation | Masahiro Hirano et.al. | 2303.05192 | null |
2023-03-16 | Stereo Event-based Visual-Inertial Odometry | Kunfeng Wang et.al. | 2303.05086 | link |
2023-03-07 | Long Distance GNSS-Denied Visual Inertial Navigation for Autonomous Fixed Wing Unmanned Air Vehicles: SO(3) Manifold Filter based on Virtual Vision Sensor | Eduardo Gallo et.al. | 2303.03804 | null |
2023-03-03 | Lightweight, Uncertainty-Aware Conformalized Visual Odometry | Alex C. Stutts et.al. | 2303.02207 | null |
2023-02-24 | FLSea: Underwater Visual-Inertial and Stereo-Vision Forward-Looking Datasets | Yelena Randall et.al. | 2302.12772 | null |
2023-02-27 | CP+: Camera Poses Augmentation with Large-scale LiDAR Maps | Jiadi Cui et.al. | 2302.12198 | null |
2023-02-19 | EdgeVO: An Efficient and Accurate Edge-based Visual Odometry | Hui Zhao et.al. | 2302.09493 | null |
2023-01-27 | HDPV-SLAM: Hybrid Depth-augmented Panoramic Visual SLAM for Mobile Mapping System with Tilted LiDAR and Panoramic Visual Camera | Mostafa Ahmadi et.al. | 2301.11823 | null |
2023-01-26 | Distributed Optimization Methods for Multi-Robot Systems: Part I – A Tutorial | Ola Shorinwa et.al. | 2301.11313 | null |
2023-01-24 | Generalized Object Search | Kaiyu Zheng et.al. | 2301.10121 | null |
2023-01-22 | Improving Autonomous Vehicle Mapping and Navigation in Work Zones Using Crowdsourcing Vehicle Trajectories | Hanlin Chen et.al. | 2301.09194 | null |
2023-01-21 | Dense RGB SLAM with Neural Implicit Maps | Heng Li et.al. | 2301.08930 | null |
2023-01-18 | Extended FastSLAM Using Cellular Multipath Component Delays and Angular Information | Junshi Chen et.al. | 2301.07560 | null |
2023-01-17 | COVINS-G: A Generic Back-end for Collaborative Visual-Inertial SLAM | Manthan Patel et.al. | 2301.07147 | link |
2023-01-31 | Swarm-SLAM : Sparse Decentralized Collaborative Simultaneous Localization and Mapping Framework for Multi-Robot Systems | Pierre-Yves Lajoie et.al. | 2301.06230 | link |
2023-01-13 | A LiDAR-Inertial-Visual SLAM System with Loop Detection | Kangcheng Liu et.al. | 2301.05604 | null |
2023-01-11 | AdaptSLAM: Edge-Assisted Adaptive SLAM with Resource Constraints via Uncertainty Minimization | Ying Chen et.al. | 2301.04620 | link |
2023-01-12 | TBV Radar SLAM – trust but verify loop candidates | Daniel Adolfsson et.al. | 2301.04397 | link |
2022-12-31 | Digital Twin-Enabled Domain Adaptation for Zero-Touch UAV Networks: Survey and Challenges | Maxwell McManus et.al. | 2301.03359 | null |
2023-01-09 | Motion Addition and Motion Optimization | Liqun Qi et.al. | 2301.03174 | null |
2023-01-08 | Towards Open World NeRF-Based SLAM | Daniil Lisus et.al. | 2301.03102 | null |
2023-01-06 | CyberLoc: Towards Accurate Long-term Visual Localization | Liu Liu et.al. | 2301.02403 | null |
2023-01-03 | LunarNav: Crater-based Localization for Long-range Autonomous Lunar Rover Navigation | Shreyansh Daftry et.al. | 2301.01350 | null |
2022-12-31 | 4Seasons: Benchmarking Visual SLAM and Long-Term Localization for Autonomous Driving in Challenging Conditions | Patrick Wenzel et.al. | 2301.01147 | null |
2023-01-03 | BS3D: Building-scale 3D Reconstruction from RGB-D Images | Janne Mustaniemi et.al. | 2301.01057 | null |
2023-01-10 | An Event-based Algorithm for Simultaneous 6-DOF Camera Pose Tracking and Mapping | Masoud Dayani Najafabadi et.al. | 2301.00618 | link |
2022-12-25 | A Combined Approach Toward Consistent Reconstructions of Indoor Spaces Based on 6D RGB-D Odometry and KinectFusion | Nadia Figueroa et.al. | 2212.14772 | null |
2022-12-29 | An Enhanced LiDAR-Inertial SLAM System for Robotics Localization and Mapping | Kangcheng Liu et.al. | 2212.14209 | link |
2022-12-27 | Clock and Orientation-Robust Simultaneous Radio Localization and Mapping at Millimeter Wave Bands | Felipe Gómez-Cuba et.al. | 2212.13477 | link |
2022-12-26 | ESVIO: Event-based Stereo Visual Inertial Odometry | Peiyu Chen et.al. | 2212.13184 | link |
2022-12-24 | A Comprehensive Review on Autonomous Navigation | Saeid Nahavandi et.al. | 2212.12808 | null |
2022-12-23 | Radio SLAM for 6G Systems at THz Frequencies: Design and Experimental Validation | Marina Lotti et.al. | 2212.12388 | null |
2022-12-23 | Implementation of a Blind navigation method in outdoors/indoors areas | Mohammad Javadian Farzaneh et.al. | 2212.12185 | null |
2022-12-22 | S-Graphs+: Real-time Localization and Mapping leveraging Hierarchical Representations | Hriday Bavle et.al. | 2212.11770 | link |
2022-12-22 | Active SLAM: A Review On Last Decade | Muhammad Farhan Ahmed et.al. | 2212.11654 | null |
2022-12-27 | Motion, Unit Dual Quaternion and Motion Optimization | Liqun Qi et.al. | 2212.11593 | null |
2022-12-22 | Vision-Based Environmental Perception for Autonomous Driving | Fei Liu et.al. | 2212.11453 | null |
2022-12-19 | Mu $^{2}$ SLAM: Multitask, Multilingual Speech and Language Models | Yong Cheng et.al. | 2212.09553 | null |
2022-12-16 | Cartographer_glass: 2D Graph SLAM Framework using LiDAR for Glass Environments | Lasitha Weerakoon et.al. | 2212.08633 | null |
2022-12-16 | rWiFiSLAM: Effective WiFi Ranging based SLAM System in Ambient Environments | Bo Wei et.al. | 2212.08418 | null |
2023-03-02 | AirVO: An Illumination-Robust Point-Line Visual Odometry | Kuan Xu et.al. | 2212.07595 | link |
2022-12-14 | Autonomous Vehicle Navigation with LIDAR using Path Planning | Rahul M K et.al. | 2212.07155 | null |
2022-12-14 | RIS-Enabled and Access-Point-Free Simultaneous Radio Localization and Mapping | Hyowon Kim et.al. | 2212.07141 | null |
2022-12-13 | Know What You Don’t Know: Consistency in Sliding Window Filtering with Unobservable States Applied to Visual-Inertial SLAM (Extended Version) | Daniil Lisus et.al. | 2212.06923 | null |
2022-12-13 | SST: Real-time End-to-end Monocular 3D Reconstruction via Sparse Spatial-Temporal Guidance | Chenyangguang Zhang et.al. | 2212.06524 | null |
2022-12-13 | Localization and Navigation System for Indoor Mobile Robot | Yanbaihui Liu et.al. | 2212.06391 | null |
2022-12-12 | Evaluation of RGB-D SLAM in Large Indoor Environments | Kirill Muravyev et.al. | 2212.05980 | null |
2022-12-19 | A Light-Weight LiDAR-Inertial SLAM System with Loop Closing | Kangcheng Liu et.al. | 2212.05743 | link |
2022-12-12 | An Integrated LiDAR-SLAM System for Complex Environment with Noisy Point Clouds | Kangcheng Liu et.al. | 2212.05705 | link |
2022-12-09 | SLAM for Visually Impaired People: A Survey | Marziyeh Bamdad et.al. | 2212.04745 | null |
2022-12-09 | Ego-Body Pose Estimation via Ego-Head Pose Estimation | Jiaman Li et.al. | 2212.04636 | null |
2022-12-06 | Receding Horizon Planning with Rule Hierarchies for Autonomous Vehicles | Sushant Veer et.al. | 2212.03323 | link |
2022-12-06 | PRISM: Probabilistic Real-Time Inference in Spatial World Models | Atanas Mirchev et.al. | 2212.02988 | null |
2022-12-06 | RGB-L: Enhancing Indirect Visual SLAM using LiDAR-based Dense Depth Maps | Florian Sauerbeck et.al. | 2212.02085 | link |
2022-12-05 | DL-SLOT: Dynamic LiDAR SLAM and object tracking based on collaborative graph optimization | Xuebo Tian et.al. | 2212.02077 | null |
2022-12-05 | ObjectMatch: Robust Registration using Canonical Object Correspondences | Can Gümeli et.al. | 2212.01985 | null |
2022-12-02 | Sparse SPN: Depth Completion from Sparse Keypoints | Yuqun Wu et.al. | 2212.00987 | null |
2022-12-01 | maplab 2.0 – A Modular and Multi-Modal Mapping Framework | Andrei Cramariuc et.al. | 2212.00654 | link |
2022-12-01 | AstroSLAM: Autonomous Monocular Navigation in the Vicinity of a Celestial Small Body – Theory and Experiments | Mehregan Dor et.al. | 2212.00350 | null |
2022-11-30 | MVRackLay: Monocular Multi-View Layout Estimation for Warehouse Racks and Shelves | Pranjali Pathre et.al. | 2211.16882 | null |
2022-11-29 | PatchMatch-Stereo-Panorama, a fast dense reconstruction from 360° video images | Hartmut Surmann et.al. | 2211.16266 | link |
2022-11-29 | MmWave Mapping and SLAM for 5G and Beyond | Yu Ge et.al. | 2211.16024 | null |
2022-11-28 | Safety-quantifiable Line Feature-based Monocular Visual Localization with 3D Prior Map | Xi Zheng et.al. | 2211.15127 | null |
2022-11-29 | BALF: Simple and Efficient Blur Aware Local Feature Detector | Zhenjun Zhao et.al. | 2211.14731 | null |
2022-11-27 | Development of a Modular Real-time Shared-control System for a Smart Wheelchair | Vaishanth Ramaraj et.al. | 2211.14711 | null |
2022-11-26 | A1 SLAM: Quadruped SLAM using the A1’s Onboard Sensors | Jerred Chen et.al. | 2211.14432 | link |
2022-11-23 | ActiveRMAP: Radiance Field for Active Mapping And Planning | Huangying Zhan et.al. | 2211.12656 | null |
2022-11-22 | Vision-based localization methods under GPS-denied conditions | Zihao Lu et.al. | 2211.11988 | null |
2022-11-21 | Towards Live 3D Reconstruction from Wearable Video: An Evaluation of V-SLAM, NeRF, and Videogrammetry Techniques | David Ramirez et.al. | 2211.11836 | null |
2022-11-21 | ESLAM: Efficient Dense SLAM System Based on Hybrid Representation of Signed Distance Fields | Mohammad Mahdi Johari et.al. | 2211.11704 | null |
2022-11-24 | Data Fusion for Multipath-Based SLAM: Combing Information from Multiple Propagation Paths | Erik Leitinger et.al. | 2211.09241 | null |
2022-11-16 | Self-supervised Egomotion and Depth Learning via Bi-directional Coarse-to-Fine Scale Recovery | Hao Qu et.al. | 2211.08904 | null |
2022-11-20 | Detecting Line Segments in Motion-blurred Images with Events | Huai Yu et.al. | 2211.07365 | link |
2022-11-13 | Automatic Eye-in-Hand Calibration using EKF | Aditya Ramakrishnan et.al. | 2211.06881 | null |
2022-11-12 | Active View Planning for Visual SLAM in Outdoor Environments Based on Continuous Information Modeling | Zhihao Wang et.al. | 2211.06557 | link |
2022-11-11 | Multi-domain Cooperative SLAM: The Enabler for Integrated Sensing and Communications | Jie Yang et.al. | 2211.05982 | null |
2022-11-10 | Online Stochastic Variational Gaussian Process Mapping for Large-Scale SLAM in Real Time | Ignacio Torroba et.al. | 2211.05601 | link |
2022-11-07 | When Geometry is not Enough: Using Reflector Markers in Lidar SLAM | Gerhard Kurz et.al. | 2211.03484 | null |
2022-11-07 | Detecting Invalid Map Merges in Lifelong SLAM | Matthias Holoch et.al. | 2211.03423 | null |
2022-11-06 | Wheel-SLAM: Simultaneous Localization and Terrain Mapping Using One Wheel-mounted IMU | Yibin Wu et.al. | 2211.03174 | link |
2022-11-07 | Lidar-level localization with radar? The CFEAR approach to accurate, fast and robust large-scale radar odometry in diverse environments | Daniel Adolfsson et.al. | 2211.02445 | link |
2022-11-03 | DyOb-SLAM : Dynamic Object Tracking SLAM System | Rushmian Annoy Wadud et.al. | 2211.01941 | null |
2022-11-03 | Enhanced Visual Feedback with Decoupled Viewpoint Control in Immersive Humanoid Robot Teleoperation using SLAM | Yang Chen et.al. | 2211.01749 | null |
2022-11-04 | $D^2$ SLAM: Decentralized and Distributed Collaborative Visual-inertial SLAM System for Aerial Swarm | Hao Xu et.al. | 2211.01538 | link |
2022-11-02 | Semantic SuperPoint: A Deep Semantic Descriptor | Gabriel S. Gama et.al. | 2211.01098 | link |
2022-11-02 | Ambiguity-Aware Multi-Object Pose Optimization for Visually-Assisted Robot Manipulation | Myung-Hwan Jeon et.al. | 2211.00960 | link |
2022-10-31 | Mapping Extended Landmarks for Radar SLAM | Shuai Sun et.al. | 2210.17207 | null |
2022-10-25 | MAROAM: Map-based Radar SLAM through Two-step Feature Selection | Dequan Wang et.al. | 2210.13797 | null |
2022-10-25 | S3E: A Large-scale Multimodal Dataset for Collaborative SLAM | Dapeng Feng et.al. | 2210.13723 | link |
2022-10-24 | NeRF-SLAM: Real-Time Dense Monocular SLAM with Neural Radiance Fields | Antoni Rosinol et.al. | 2210.13641 | link |
2022-10-24 | Compact simultaneous label-free autofluorescence multi-harmonic (SLAM) microscopy for user-friendly photodamage-monitored imaging | Geng Wang et.al. | 2210.13556 | null |
2022-10-28 | VP-SLAM: A Monocular Real-time Visual SLAM with Points, Lines and Vanishing Points | Andreas Georgis et.al. | 2210.12756 | null |
2022-10-22 | SLAM: Semantic Learning based Activation Map for Weakly Supervised Semantic Segmentation | Junliang Chen et.al. | 2210.12417 | null |
2022-10-21 | DCL-SLAM: A Distributed Collaborative LiDAR SLAM Framework for a Robotic Swarm | Shipeng Zhong et.al. | 2210.11978 | link |
2022-10-21 | Motion Primitives Based Kinodynamic RRT for Autonomous Vehicle Navigation in Complex Environments | Shubham Kedia et.al. | 2210.11652 | null |
2022-10-22 | Visual SLAM: What are the Current Trends and What to Expect? | Ali Tourani et.al. | 2210.10491 | null |
2022-10-18 | Split-KalmanNet: A Robust Model-Based Deep Learning Approach for SLAM | Geon Choi et.al. | 2210.09636 | null |
2022-10-16 | D2SLAM: Semantic visual SLAM based on the influence of Depth for Dynamic environments | Ayman Beghdadi et.al. | 2210.08647 | null |
2022-10-16 | Indoor Smartphone SLAM with Learned Echoic Location Features | Wenjie Luo et.al. | 2210.08493 | null |
2022-10-15 | Self-Improving SLAM in Dynamic Environments: Learning When to Mask | Adrian Bojko et.al. | 2210.08350 | link |
2022-10-13 | Design and Evaluation of a Generic Visual SLAM Framework for Multi-Camera Systems | Pushyami Kaveti et.al. | 2210.07315 | link |
2022-10-12 | RING++: Roto-translation Invariant Gram for Global Localization on a Sparse Scan Map | Xuecheng Xu et.al. | 2210.05984 | link |
2022-10-11 | Observability Analysis of Graph SLAM-Based Joint Calibration of Multiple Microphone Arrays and Sound Source Localization | Yuanzheng He et.al. | 2210.05600 | null |
2022-10-11 | Autonomous Asteroid Characterization Through Nanosatellite Swarming | Kaitlin Dennison et.al. | 2210.05518 | null |
2022-10-11 | DeepMLE: A Robust Deep Maximum Likelihood Estimator for Two-view Structure from Motion | Yuxi Xiao et.al. | 2210.05517 | null |
2022-10-11 | Multi-Object Navigation with dynamically learned neural implicit representations | Pierre Marza et.al. | 2210.05129 | link |
2022-10-12 | Spectral Sparsification for Communication-Efficient Collaborative Rotation and Translation Estimation | Yulun Tian et.al. | 2210.05020 | null |
2022-10-10 | Using Detection, Tracking and Prediction in Visual SLAM to Achieve Real-time Semantic Mapping of Dynamic Scenarios | Xingyu Chen et.al. | 2210.04562 | null |
2022-10-09 | Fusing Event-based Camera and Radar for SLAM Using Spiking Neural Networks with Continual STDP Learning | Ali Safa et.al. | 2210.04236 | null |
2022-10-06 | SCORE: A Second-Order Conic Initialization for Range-Aided SLAM | Alan Papalia et.al. | 2210.03177 | link |
2022-10-06 | Feature-Realistic Neural Fusion for Real-Time, Open Set Scene Understanding | Kirill Mazur et.al. | 2210.03043 | null |
2022-10-06 | Feasibility on Detecting Door Slamming towards Monitoring Early Signs of Domestic Violence | Osian Morgan et.al. | 2210.02642 | null |
2022-10-05 | MOTSLAM: MOT-assisted monocular dynamic SLAM using single-view depth estimation | Hanwei Zhang et.al. | 2210.02038 | null |
2022-10-04 | O2S: Open-source open shuttle | Nwankwo Linus et.al. | 2210.01627 | null |
2022-10-04 | Wi-Closure: Reliable and Efficient Search of Inter-robot Loop Closures Using Wireless Sensing | Weiying Wang et.al. | 2210.01320 | null |
2022-10-03 | Probabilistic Volumetric Fusion for Dense Monocular SLAM | Antoni Rosinol et.al. | 2210.01276 | null |
2022-10-03 | DRACo-SLAM: Distributed Robust Acoustic Communication-efficient SLAM for Imaging Sonar Equipped Underwater Robot Teams | John McConnell et.al. | 2210.00867 | link |
2022-10-03 | A Benchmark for Multi-Modal Lidar SLAM with Ground Truth in GNSS-Denied Environments | Ha Sier et.al. | 2210.00812 | link |
2022-10-01 | Det-SLAM: A semantic visual SLAM for highly dynamic scenes using Detectron2 | Ali Eslamian et.al. | 2210.00278 | null |
2022-09-30 | PyPose: A Library for Robot Learning with Physics-based Optimization | Chen Wang et.al. | 2209.15428 | link |
2022-09-29 | DirectTracker: 3D Multi-Object Tracking Using Direct Image Alignment and Photometric Bundle Adjustment | Mariia Gladkova et.al. | 2209.14965 | null |
2022-09-28 | Robust Incremental Smoothing and Mapping (riSAM) | Daniel McGann et.al. | 2209.14359 | null |
2022-09-27 | Orbeez-SLAM: A Real-time Monocular Visual SLAM with ORB Features and NeRF-realized Mapping | Chi-Ming Chung et.al. | 2209.13274 | link |
2022-09-24 | Graph Neural Networks for Multi-Robot Active Information Acquisition | Mariliza Tzes et.al. | 2209.12091 | null |
2022-09-24 | Closing the Loop: Graph Networks to Unify Semantic Objects and Visual Features for Multi-object Scenes | Jonathan J. Y. Kim et.al. | 2209.11894 | null |
2022-09-23 | involve-MI: Informative Planning with High-Dimensional Non-Parametric Beliefs | Gilad Rotman et.al. | 2209.11591 | null |
2022-09-23 | Automatic Sign Reading and Localization for Semantic Mapping with an Office Robot | David Balaban et.al. | 2209.11432 | null |
2022-09-22 | SQ-SLAM: Monocular Semantic SLAM Based on Superquadric Object Representation | Xiao Han et.al. | 2209.10817 | null |
2022-09-22 | Acoustic SLAM based on the Direction-of-Arrival and the Direct-to-Reverberant Energy Ratio | Wenhao Qiu et.al. | 2209.10726 | null |
2022-09-21 | Visual Localization and Mapping in Dynamic and Changing Environments | João Carlos Virgolino Soares et.al. | 2209.10710 | null |
2022-09-20 | Uncertainty-Aware Tightly-Coupled GPS Fused LIO-SLAM | Sabir Hossain et.al. | 2209.10047 | null |
2022-09-20 | WGICP: Differentiable Weighted GICP-Based Lidar Odometry | Sanghyun Son et.al. | 2209.09777 | null |
2022-09-20 | PADLoC: LiDAR-Based Deep Loop Closure Detection and Registration using Panoptic Attention | José Arce et.al. | 2209.09699 | link |
2022-09-19 | MeSLAM: Memory Efficient SLAM based on Neural Fields | Evgenii Kruzhkov et.al. | 2209.09357 | null |
2022-09-19 | LMBAO: A Landmark Map for Bundle Adjustment Odometry in LiDAR SLAM | Letian Zhang et.al. | 2209.08810 | null |
2022-09-18 | HGI-SLAM: Loop Closure With Human and Geometric Importance Features | Shuhul Mujoo et.al. | 2209.08608 | null |
2022-09-18 | Data-driven Loop Closure Detection in Bathymetric Point Clouds for Underwater SLAM | Jiarui Tan et.al. | 2209.08578 | link |
2022-09-17 | DytanVO: Joint Refinement of Visual Odometry and Motion Segmentation in Dynamic Environments | Shihao Shen et.al. | 2209.08430 | link |
2022-09-17 | OA-SLAM: Leveraging Objects for Camera Relocalization in Visual SLAM | Matthieu Zins et.al. | 2209.08338 | null |
2022-09-17 | PlaneSLAM: Plane-based LiDAR SLAM for Motion Planning in Structured 3D Environments | Adam Dai et.al. | 2209.08248 | link |
2022-09-16 | ViWiD: Leveraging WiFi for Robust and Resource-Efficient SLAM | Aditya Arun et.al. | 2209.08091 | null |
2022-09-16 | iDF-SLAM: End-to-End RGB-D SLAM with Neural Implicit Mapping and Deep Feature Tracking | Yuhang Ming et.al. | 2209.07919 | null |
2022-09-16 | TwistSLAM++: Fusing multiple modalities for accurate dynamic semantic SLAM | Mathieu Gonzalez et.al. | 2209.07888 | null |
2022-09-15 | Landmark Management in the Application of Radar SLAM | Shuai Sun et.al. | 2209.07199 | link |
2022-09-15 | PROB-SLAM: Real-time Visual SLAM Based on Probabilistic Graph Optimization | Xianwei Meng et.al. | 2209.07061 | null |
2022-09-14 | Semantic Visual Simultaneous Localization and Mapping: A Survey | Kaiqi Chen et.al. | 2209.06428 | null |
2022-09-13 | Optimizing SLAM Evaluation Footprint Through Dynamic Range Coverage Analysis of Datasets | Islam Ali et.al. | 2209.06316 | null |
2022-09-12 | A Review on Visual-SLAM: Advancements from Geometric Modelling to Learning-based Semantic Scene Understanding | Tin Lai et.al. | 2209.05222 | null |
2022-09-12 | Attitude-Guided Loop Closure for Cameras with Negative Plane | Ze Wang et.al. | 2209.05167 | link |
2022-09-09 | General Place Recognition Survey: Towards the Real-world Autonomy Age | Peng Yin et.al. | 2209.04497 | link |
2022-09-08 | ExplORB-SLAM: Active Visual SLAM Exploiting the Pose-graph Topology | Julio A. Placed et.al. | 2209.03693 | link |
2022-09-08 | R $^3$ LIVE++: A Robust, Real-time, Radiance reconstruction package with a tightly-coupled LiDAR-Inertial-Visual state Estimator | Jiarong Lin et.al. | 2209.03666 | link |
2022-09-06 | Group- $k$ Consistent Measurement Set Maximization for Robust Outlier Detection | Brendon Forsgren et.al. | 2209.02658 | link |
2022-09-05 | Neuromorphic Visual Odometry with Resonator Networks | Alpha Renner et.al. | 2209.02000 | null |
2022-09-05 | MuCaSLAM: CNN-Based Frame Quality Assessment for Mobile Robot with Omnidirectional Visual SLAM | Pavel Karpyshev et.al. | 2209.01936 | null |
2022-09-05 | ElasticROS: An Elastically Collaborative Robot Operation System for Fog and Cloud Robotics | Boyi Liu et.al. | 2209.01774 | null |
2022-09-04 | CloudVision: DNN-based Visual Localization of Autonomous Robots using Prebuilt LiDAR Point Cloud | Evgeny Yudin et.al. | 2209.01605 | null |
2022-08-31 | PFilter: Building Persistent Maps through Feature Filtering for Fast and Accurate LiDAR-based SLAM | Yifan Duan et.al. | 2208.14848 | null |
2022-08-30 | BioSLAM: A Bio-inspired Lifelong Memory System for General Place Recognition | Peng Yin et.al. | 2208.14543 | null |
2022-08-27 | Learning to SLAM on the Fly in Unknown Environments: A Continual Learning Approach for Drones in Visually Ambiguous Scenes | Ali Safa et.al. | 2208.12997 | null |
2022-08-25 | FusionPortable: A Multi-Sensor Campus-Scene Dataset for Evaluation of Localization and Mapping Accuracy on Diverse Platforms | Jianhao Jiao et.al. | 2208.11865 | null |
2022-08-25 | Lidar SLAM for Autonomous Driving Vehicles | Farhad Aghili et.al. | 2208.11855 | null |
2022-08-24 | DynaVINS: A Visual-Inertial SLAM for Dynamic Environments | Seungwon Song et.al. | 2208.11500 | link |
2022-08-22 | Doppler Exploitation in Bistatic mmWave Radio SLAM | Yu Ge et.al. | 2208.10204 | null |
2022-08-21 | Hilti-Oxford Dataset: A Millimetre-Accurate Benchmark for Simultaneous Localization and Mapping | Lintong Zhang et.al. | 2208.09825 | link |
2022-08-26 | JVLDLoc: a Joint Optimization of Visual-LiDAR Constraints and Direction Priors for Localization in Driving Scenario | Longrui Dong et.al. | 2208.09777 | null |
2022-08-15 | BoW3D: Bag of Words for Real-time Loop Closing in 3D LiDAR SLAM | Yunge Cui et.al. | 2208.07473 | link |
2022-08-12 | Handling Constrained Optimization in Factor Graphs for Autonomous Navigation | Barbara Bazzana et.al. | 2208.06325 | null |
2022-08-11 | RelPose: Predicting Probabilistic Relative Rotation for Single Objects in the Wild | Jason Y. Zhang et.al. | 2208.05963 | null |
2022-08-08 | Visual-Inertial Multi-Instance Dynamic SLAM with Object-level Relocalisation | Yifei Ren et.al. | 2208.04274 | link |
2022-08-08 | SLAM-TKA: Real-time Intra-operative Measurement of Tibial Resection Plane in Conventional Total Knee Arthroplasty | Shuai Zhang et.al. | 2208.03945 | link |
2022-08-05 | A Survey on Visual Map Localization Using LiDARs and Cameras | Elhousni Mahdi et.al. | 2208.03376 | null |
2022-08-04 | SROS2: Usable Cyber Security Tools for ROS 2 | Victor Mayoral Vilches et.al. | 2208.02615 | link |
2022-08-03 | Evaluation and comparison of eight popular Lidar and Visual SLAM algorithms | Bharath Garigipati et.al. | 2208.02063 | null |
2022-08-02 | Present and Future of SLAM in Extreme Underground Environments | Kamak Ebadi et.al. | 2208.01787 | null |
2022-08-01 | Visual-Inertial SLAM with Tightly-Coupled Dropout-Tolerant GPS Fusion | Simon Boche et.al. | 2208.00709 | null |
2022-07-29 | Neural Density-Distance Fields | Itsuki Ueda et.al. | 2207.14455 | link |
2022-07-25 | DeepFusion: Real-Time Dense 3D Reconstruction for Monocular SLAM using Single-View Depth and Gradient Predictions | Tristan Laidlow et.al. | 2207.12244 | null |
2022-07-25 | Scalable Fiducial Tag Localization on a 3D Prior Map via Graph-Theoretic Global Tag-Map Registration | Kenji Koide et.al. | 2207.11942 | null |
2022-07-22 | NeurAR: Neural Uncertainty for Autonomous 3D Reconstruction | Yunlong Ran et.al. | 2207.10985 | null |
2022-07-22 | Dense RGB-D-Inertial SLAM with Map Deformations | Tristan Laidlow et.al. | 2207.10940 | null |
2022-07-22 | PLD-SLAM: A Real-Time Visual SLAM Using Points and Line Segments in Dynamic Scenes | BaoSheng Zhang et.al. | 2207.10916 | null |
2022-07-21 | Multi-Event-Camera Depth Estimation and Outlier Rejection by Refocused Events Fusion | Suman Ghosh et.al. | 2207.10494 | link |
2022-07-21 | Online Localisation and Colored Mesh Reconstruction Architecture for 3D Visual Feedback in Robotic Exploration Missions | Quentin Serdel et.al. | 2207.10489 | link |
2022-07-21 | On applicability of von Karman’s momentum theory in predicting the water entry load of V-shaped structures with varying initial velocity | Yujin Lu et.al. | 2207.10413 | null |
2022-07-19 | Hybrid Belief Pruning with Guarantees for Viewpoint-Dependent Semantic SLAM | Tuvy Lemberg et.al. | 2207.09103 | null |
2022-07-18 | DeFlowSLAM: Self-Supervised Scene Motion Decomposition for Dynamic Dense SLAM | Weicai Ye et.al. | 2207.08794 | link |
2022-07-18 | Revisiting PatchMatch Multi-View Stereo for Urban 3D Reconstruction | Marco Orsingher et.al. | 2207.08439 | null |
2022-07-18 | ORB-based SLAM accelerator on SoC FPGA | Vibhakar Vemulapati et.al. | 2207.08405 | null |
2022-07-14 | Challenges of SLAM in extremely unstructured environments: the DLR Planetary Stereo, Solid-State LiDAR, Inertial Dataset | Riccardo Giubilato et.al. | 2207.06815 | null |
2022-07-14 | Semi-supervised Vector-Quantization in Visual SLAM using HGCN | Amir Zarringhalam et.al. | 2207.06738 | null |
2022-07-14 | Self-supervised Vector-Quantization in Visual SLAM using Deep Convolutional Autoencoders | Amir Zarringhalam et.al. | 2207.06732 | null |
2022-07-13 | SLAM: SLO-Aware Memory Optimization for Serverless Applications | Gor Safaryan et.al. | 2207.06183 | null |
2022-07-19 | Structure PLP-SLAM: Efficient Sparse Mapping and Localization using Point, Line and Plane for Monocular, RGB-D and Stereo Cameras | Fangwen Shu et.al. | 2207.06058 | link |
2022-07-12 | Accelerating Certifiable Estimation with Preconditioned Eigensolvers | David M. Rosen et.al. | 2207.05257 | null |
2022-07-12 | Robust Key-Frame Stereo Visual SLAM with low-threshold Point and Line Features | Meiyu Zhi et.al. | 2207.05244 | null |
2022-07-14 | SLAM Backends with Objects in Motion: A Unifying Framework and Tutorial | Chih-Yuan Chiu et.al. | 2207.05043 | null |
2022-07-08 | BlindSpotNet: Seeing Where We Cannot See | Taichi Fukuda et.al. | 2207.03870 | null |
2022-07-08 | Continuous Target-free Extrinsic Calibration of a Multi-Sensor System from a Sequence of Static Viewpoints | Philipp Glira et.al. | 2207.03785 | null |
2022-07-08 | Distributed Ranging SLAM for Multiple Robots with Ultra-WideBand and Odometry Measurements | Ran Liu et.al. | 2207.03700 | null |
2022-07-07 | RWT-SLAM: Robust Visual SLAM for Highly Weak-textured Environments | Qihao Peng et.al. | 2207.03539 | null |
2022-07-06 | VI-SLAM2tag: Low-Effort Labeled Dataset Collection for Fingerprinting-Based Indoor Localization | Marius Laska et.al. | 2207.02668 | null |
2022-07-06 | A Novel Hybrid Endoscopic Dataset for Evaluating Machine Learning-based Photometric Image Enhancement Models | Axel Garcia-Vega et.al. | 2207.02396 | null |
2022-07-04 | VECtor: A Versatile Event-Centric Benchmark for Multi-Sensor SLAM | Ling Gao et.al. | 2207.01404 | null |
2022-07-04 | VIP-SLAM: An Efficient Tightly-Coupled RGB-D Visual Inertial Planar SLAM | Danpeng Chen et.al. | 2207.01158 | null |
2022-07-03 | Wireless Channel Prediction in Partially Observed Environments | Mingsheng Yin et.al. | 2207.00934 | null |
2022-07-01 | A Survey on Active Simultaneous Localization and Mapping: State of the Art and New Frontiers | Julio A. Placed et.al. | 2207.00254 | null |
2022-07-01 | Keeping Less is More: Point Sparsification for Visual SLAM | Yeonsoo Park et.al. | 2207.00225 | null |
2022-06-30 | Controlled and impulsive compression of an entrapped air bubble during impact | Utkarsh Jain et.al. | 2206.15297 | null |
2022-06-30 | Neural Rendering for Stereo 3D Reconstruction of Deformable Tissues in Robotic Surgery | Yuehao Wang et.al. | 2206.15255 | link |
2022-06-27 | IBISCape: A Simulated Benchmark for multi-modal SLAM Systems Evaluation in Large-scale Dynamic Environments | Abanob Soliman et.al. | 2206.13455 | link |
2022-06-26 | An Efficient Global Optimality Certificate for Landmark-Based SLAM | Connor Holmes et.al. | 2206.12961 | link |
2022-06-21 | Object Structural Points Representation for Graph-based Semantic Monocular Localization and Mapping | Davide Tateo et.al. | 2206.10263 | link |
2022-06-20 | Data Fusion for Radio Frequency SLAM with Robust Sampling | Erik Leitinger et.al. | 2206.09746 | null |
2022-06-19 | RF-LIO: Removal-First Tightly-coupled Lidar Inertial Odometry in High Dynamic Environments | Chenglong Qian et.al. | 2206.09463 | null |
2022-06-17 | Efficient WiFi LiDAR SLAM for Autonomous Robots in Large Environments | Khairuldanial Ismail et.al. | 2206.08733 | null |
2022-06-17 | An Algorithm for the SE(3)-Transformation on Neural Implicit Maps for Remapping Functions | Yijun Yuan et.al. | 2206.08712 | link |
2022-06-13 | ICP Algorithm: Theory, Practice And Its SLAM-oriented Taxonomy | Hao Bai et.al. | 2206.06435 | null |
2022-06-10 | Experimental Evaluation of Visual-Inertial Odometry Systems for Arable Farming | Javier Cremona et.al. | 2206.05066 | link |
2022-06-09 | SparseFormer: Attention-based Depth Completion Network | Frederik Warburg et.al. | 2206.04557 | null |
2022-06-07 | Robot Self-Calibration Using Actuated 3D Sensors | Arne Peters et.al. | 2206.03430 | null |
2022-06-07 | Object Scan Context: Object-centric Spatial Descriptor for Place Recognition within 3D Point Cloud Map | Haodong Yuan et.al. | 2206.03062 | null |
2022-06-05 | DarkSLAM: GAN-assisted Visual SLAM for Reliable Operation in Low-light Conditions | Alena Savinykh et.al. | 2206.02199 | null |
2022-06-04 | C $^3$ Fusion: Consistent Contrastive Colon Fusion, Towards Deep SLAM in Colonoscopy | Erez Posner et.al. | 2206.01961 | null |
2022-06-01 | PaGO-LOAM: Robust Ground-Optimized LiDAR Odometry | Dong-Uk Seo et.al. | 2206.00266 | link |
2022-05-27 | A Look at Improving Robustness in Visual-inertial SLAM by Moment Matching | Arno Solin et.al. | 2205.13821 | null |
2022-05-31 | LAMP 2.0: A Robust Multi-Robot SLAM System for Operation in Challenging Large-Scale Underground Environments | Yun Chang et.al. | 2205.13135 | link |
2022-05-25 | Wildcat: Online Continuous-Time 3D Lidar-Inertial SLAM | Milad Ramezani et.al. | 2205.12595 | null |
2022-05-24 | Loop Closure Prioritization for Efficient and Scalable Multi-Robot SLAM | Christopher E. Denniston et.al. | 2205.12402 | link |
2022-05-22 | ALITA: A Large-scale Incremental Dataset for Long-term Autonomy | Peng Yin et.al. | 2205.10737 | link |
2022-05-19 | FogROS 2: An Adaptive and Extensible Platform for Cloud and Fog Robotics Using ROS 2 | Jeffrey Ichnowski et.al. | 2205.09778 | link |
2022-05-17 | Global Data Association for SLAM with 3D Grassmannian Manifold Objects | Parker C. Lusk et.al. | 2205.08556 | null |
2022-05-19 | Cluster on Wheels | Yuanyuan Yang et.al. | 2205.08151 | null |
2022-05-12 | Dynamic Dense RGB-D SLAM using Learning-based Visual Odometry | Shihao Shen et.al. | 2205.05916 | link |
2022-05-12 | S3E-GNN: Sparse Spatial Scene Embedding with Graph Neural Networks for Camera Relocalization | Ran Cheng et.al. | 2205.05861 | null |
2022-05-14 | Multi-modal Semantic SLAM for Complex Dynamic Environments | Han Wang et.al. | 2205.04300 | link |
2022-05-06 | OROS: Orchestrating ROS-driven Collaborative Connected Robots in Mission-Critical Operations | Carmen Delgado et.al. | 2205.03256 | null |
2022-05-05 | CNN-Augmented Visual-Inertial SLAM with Planar Constraints | Pan Ji et.al. | 2205.02940 | null |
2022-05-05 | PMBM-based SLAM Filters in 5G mmWave Vehicular Networks | Hyowon Kim et.al. | 2205.02502 | null |
2022-05-04 | BodySLAM: Joint Camera Localisation, Mapping, and Human Motion Tracking | Dorian Henning et.al. | 2205.02301 | null |
2022-05-04 | A Global Asymptotic Convergent Observer for SLAM | Seyed Hamed Hashemi et.al. | 2205.01953 | null |
2022-05-04 | Symmetry and Uncertainty-Aware Object SLAM for 6DoF Object Pose Estimation | Nathaniel Merrill et.al. | 2205.01823 | link |
2022-05-03 | GeoRefine: Self-Supervised Online Depth Refinement for Accurate Dense Mapping | Pan Ji et.al. | 2205.01656 | null |
2022-04-29 | Struct-MDC: Mesh-Refined Unsupervised Depth Completion Leveraging Structural Regularities from Visual SLAM | Jinwoo Jeon et.al. | 2204.13877 | link |
2022-04-27 | The Revisiting Problem in Simultaneous Localization and Mapping: A Survey on Visual Loop Closure Detection | Konstantinos A. Tsintotas et.al. | 2204.12831 | null |
2022-04-27 | Dynamic Registration: Joint Ego Motion Estimation and 3D Moving Object Detection in Dynamic Environment | Wenyu Li et.al. | 2204.12769 | null |
2022-04-29 | MLO: Multi-Object Tracking and Lidar Odometry in Dynamic Environment | Tingchen Ma et.al. | 2204.11621 | null |
2022-04-23 | Indoor simultaneous localization and mapping based on fringe projection profilometry | Yang Zhao et.al. | 2204.11020 | null |
2022-04-22 | Enough is Enough: Towards Autonomous Uncertainty-driven Stopping Criteria | Julio A. Placed et.al. | 2204.10631 | null |
2022-04-22 | Fast Autonomous Robotic Exploration Using the Underlying Graph Structure | Julio A. Placed et.al. | 2204.10610 | null |
2022-04-22 | Making Parameterization and Constrains of Object Landmark Globally Consistent via SPD(3) Manifold and Improved Cost Functions | Yutong Hu et.al. | 2204.10552 | null |
2022-04-22 | Implicit Object Mapping With Noisy Data | Jad Abou-Chakra et.al. | 2204.10516 | link |
2022-04-19 | Photometric single-view dense 3D reconstruction in endoscopy | Victor M. Batlle et.al. | 2204.09083 | null |
2022-04-18 | Pulsar skips: Understanding variations in the regular periods of rotating neutron stars | Clayton Miller et.al. | 2204.08449 | null |
2022-04-18 | Tracking monocular camera pose and deformation for SLAM inside the human body | Juan J. Gomez Rodriguez et.al. | 2204.08309 | null |
2022-04-18 | Mapping While Following: 2D LiDAR SLAM in Indoor Dynamic Environments with a Person Tracker | Hanjing Ye et.al. | 2204.08163 | null |
2022-04-14 | ViViD++: Vision for Visibility Dataset | Alex Junho Lee et.al. | 2204.06183 | null |
2022-04-12 | HiTPR: Hierarchical Transformer for Place Recognition in Point Cloud | Zhixing Hou et.al. | 2204.05481 | null |
2022-04-12 | RGB-D Semantic SLAM for Surgical Robot Navigation in the Operating Room | Cong Gao et.al. | 2204.05467 | null |
2022-04-11 | Optimized SC-F-LOAM: Optimized Fast LiDAR Odometry and Mapping Using Scan Context | Lizhou Liao et.al. | 2204.04932 | link |
2022-04-04 | Monitoring social distancing with single image depth estimation | Alessio Mingozzi et.al. | 2204.01693 | null |
2022-04-01 | Bi-directional Loop Closure for Visual SLAM | Ihtisham Ali et.al. | 2204.01524 | null |
2022-04-04 | IMOT: General-Purpose, Fast and Robust Estimation for Spatial Perception Problems with Outliers | Lei Sun et.al. | 2204.01324 | link |
2022-04-03 | Indoor Navigation Assistance for Visually Impaired People via Dynamic SLAM and Panoptic Segmentation with an RGB-D Sensor | Wenyan Ou et.al. | 2204.01154 | null |
2022-04-02 | UrbanFly: Uncertainty-Aware Planning for Navigation Amongst High-Rises with Monocular Visual-Inertial SLAM Maps | Ayyappa Swamy Thatavarthy et.al. | 2204.00865 | link |
2022-03-31 | Curiosity Driven Self-supervised Tactile Exploration of Unknown Objects | Yujie Lu et.al. | 2204.00035 | null |
2022-03-30 | GTP-SLAM: Game-Theoretic Priors for Simultaneous Localization and Mapping in Multi-Agent Scenarios | Chih-Yuan Chiu et.al. | 2203.16690 | null |
2022-03-29 | Indoor SLAM Using a Foot-mounted IMU and the local Magnetic Field | Mostafa Osman et.al. | 2203.15866 | null |
2022-03-29 | Eventor: An Efficient Event-Based Monocular Multi-View Stereo Accelerator on FPGA Platform | Mingjun Li et.al. | 2203.15439 | null |
2022-03-29 | Sparse Image based Navigation Architecture to Mitigate the need of precise Localization in Mobile Robots | Pranay Mathur et.al. | 2203.15272 | null |
2022-03-28 | Are High-Resolution Event Cameras Really Needed? | Daniel Gehrig et.al. | 2203.14672 | null |
2022-03-25 | Spectral Measurement Sparsification for Pose-Graph SLAM | Kevin J. Doherty et.al. | 2203.13897 | link |
2022-03-25 | FD-SLAM: 3-D Reconstruction Using Features and Dense Matching | Xingrui Yang et.al. | 2203.13861 | null |
2022-03-25 | Gravity-constrained point cloud registration | Vladimír Kubelka et.al. | 2203.13799 | null |
2022-03-24 | MD-SLAM: Multi-cue Direct SLAM | Luca Di Giammarino et.al. | 2203.13237 | link |
2022-03-24 | Unsupervised Simultaneous Learning for Camera Re-Localization and Depth Estimation from Video | Shun Taguchi et.al. | 2203.12804 | null |
2022-03-19 | Hybrid Active and Passive Sensing for SLAM in Wireless Communication Systems | Jie Yang et.al. | 2203.10267 | null |
2022-03-16 | Any Way You Look At It: Semantic Crossview Localization and Mapping with LiDAR | Ian D. Miller et.al. | 2203.08925 | link |
2022-03-15 | Neural RF SLAM for unsupervised positioning and mapping with channel state information | Shreya Kadambi et.al. | 2203.08264 | null |
2022-03-15 | Simultaneous Localisation and Mapping with Quadric Surfaces | Tristan Laidlow et.al. | 2203.08040 | null |
2022-03-14 | Drift Reduced Navigation with Deep Explainable Features | Mohd Omama et.al. | 2203.06897 | link |
2022-03-11 | An Efficient Accelerator for Deep Learning-based Point Cloud Registration on FPGAs | Keisuke Sugiura et.al. | 2203.05763 | null |
2022-03-10 | High Definition, Inexpensive, Underwater Mapping | Bharat Joshi et.al. | 2203.05640 | link |
2022-03-10 | SelfTune: Metrically Scaled Monocular Depth Estimation through Self-Supervised Learning | Jaehoon Choi et.al. | 2203.05332 | null |
2022-03-08 | Tune your Place Recognition: Self-Supervised Domain Calibration via Robust SLAM | Pierre-Yves Lajoie et.al. | 2203.04446 | link |
2022-03-08 | SLAM-Supported Self-Training for 6D Object Pose Estimation | Ziqi Lu et.al. | 2203.04424 | link |
2022-03-08 | An Online Semantic Mapping System for Extending and Enhancing Visual SLAM | Thorsten Hempel et.al. | 2203.03944 | null |
2022-03-07 | Multi-Modal Lidar Dataset for Benchmarking General-Purpose Localization and Mapping Algorithms | Qingqing Li et.al. | 2203.03454 | link |
2022-03-07 | OverlapTransformer: An Efficient and Rotation-Invariant Transformer Network for LiDAR-Based Place Recognition | Junyi Ma et.al. | 2203.03397 | link |
2022-03-06 | Minimum Cost Multicuts for Incorrect Landmark Edge Detection in Pose-graph SLAM | Kazushi Aiba et.al. | 2203.02887 | null |
2022-03-06 | RGB-D SLAM in Indoor Planar Environments with Multiple Large Dynamic Objects | Ran Long et.al. | 2203.02882 | null |
2022-03-03 | STUN: Self-Teaching Uncertainty Estimation for Place Recognition | Kaiwen Cai et.al. | 2203.01851 | link |
2022-03-03 | Continual SLAM: Beyond Lifelong Simultaneous Localization and Mapping through Continual Learning | Niclas Vödisch et.al. | 2203.01578 | link |
2022-03-02 | FAST-LIVO: Fast and Tightly-coupled Sparse-Direct LiDAR-Inertial-Visual Odometry | Chunran Zheng et.al. | 2203.00893 | link |
2022-03-02 | Distributed Riemannian Optimization with Lazy Communication for Collaborative Geometric Estimation | Yulun Tian et.al. | 2203.00851 | null |
2022-03-01 | Descriptellation: Deep Learned Constellation Descriptors for SLAM | Chunwei Xing et.al. | 2203.00567 | null |
2022-03-01 | Collaborative Robot Mapping using Spectral Graph Analysis | Lukas Bernreiter et.al. | 2203.00308 | null |
2022-02-26 | RL-PGO: Reinforcement Learning-based Planar Pose-Graph Optimization | Nikolaos Kourtzanidis et.al. | 2202.13221 | link |
2022-02-25 | Probabilistic Data Association for Semantic SLAM at Scale | Elad Michael et.al. | 2202.12802 | link |
2022-02-24 | TwistSLAM: Constrained SLAM in Dynamic Environment | Mathieu Gonzalez et.al. | 2202.12384 | null |
2022-02-24 | Light Robust Monocular Depth Estimation For Outdoor Environment Via Monochrome And Color Camera Fusion | Hyeonsoo Jang et.al. | 2202.12108 | null |
2022-02-23 | MITI: SLAM Benchmark for Laparoscopic Surgery | Regine Hartwig et.al. | 2202.11496 | null |
2022-02-23 | DL-SLOT: Dynamic Lidar SLAM and Object Tracking Based On Graph Optimization | Xuebo Tian et.al. | 2202.11431 | null |
2022-02-23 | Are We Ready for Robust and Resilient SLAM? A Framework For Quantitative Characterization of SLAM Datasets | Islam Ali et.al. | 2202.11312 | null |
2022-02-22 | SAGE: SLAM with Appearance and Geometry Prior for Endoscopy | Xingtong Liu et.al. | 2202.09487 | link |
2022-02-18 | OKVIS2: Realtime Scalable Visual-Inertial SLAM with Loop Closure | Stefan Leutenegger et.al. | 2202.09199 | null |
2022-02-18 | MultiRes-NetVLAD: Augmenting Place Recognition Training with Low-Resolution Imagery | Ahmad Khaliq et.al. | 2202.09146 | link |
2022-02-18 | An Energy-Efficient and Runtime-Reconfigurable FPGA-Based Accelerator for Robotic Localization Systems | Qiang Liu et.al. | 2202.08952 | null |
2022-02-17 | Continuous-Time vs. Discrete-Time Vision-based SLAM: A Comparative Study | Giovanni Cioffi et.al. | 2202.08894 | link |
2022-02-17 | LiDAR-Inertial 3D SLAM with Plane Constraint for Multi-story Building | Jiashi Zhang et.al. | 2202.08487 | null |
2022-02-16 | Virtual Maps for Autonomous Exploration of Cluttered Underwater Environments | Jinkun Wang et.al. | 2202.08359 | null |
2022-02-11 | Overhead Image Factors for Underwater Sonar-based SLAM | John McConnell et.al. | 2202.05811 | null |
2022-02-10 | Scale Estimation with Dual Quadrics for Monocular Object SLAM | Shuangfu Song et.al. | 2202.04816 | null |
2022-02-08 | A Novel Image Descriptor with Aggregated Semantic Skeleton Representation for Long-term Visual Place Recognition | Nie Jiwei et.al. | 2202.03677 | null |
2022-01-25 | Autonomous Vehicles: Open-Source Technologies, Considerations, and Development | Oussama Saoudi et.al. | 2202.03148 | null |
2022-02-07 | Temporal Point Cloud Completion with Pose Disturbance | Jieqi Shi et.al. | 2202.03084 | null |
2022-02-04 | DYP-SLAM: A Real-time Visual SLAM Based on YOLO and Probability in Dynamic Environments | Xinggang Hu et.al. | 2202.01938 | null |
2022-02-01 | A Model for Multi-View Residual Covariances based on Perspective Deformation | Alejandro Fontan et.al. | 2202.00765 | null |
2022-01-30 | Joint Vehicular Localization and Reflective Mapping Based on Team Channel-SLAM | Xinghe Chu et.al. | 2201.12726 | null |
2022-01-28 | RGB-D SLAM Using Attention Guided Frame Association | Ali Caglayan et.al. | 2201.12047 | null |
2022-02-04 | Learning to Act with Affordance-Aware Multimodal Neural SLAM | Zhiwei Jia et.al. | 2201.09862 | link |
2022-01-22 | Phase-SLAM: Phase Based Simultaneous Localization and Mapping for Mobile Structured Light Illumination Systems | Xi Zheng et.al. | 2201.09048 | link |
2022-01-17 | SC-LiDAR-SLAM: a Front-end Agnostic Versatile LiDAR SLAM System | Giseop Kim et.al. | 2201.06423 | null |
2022-01-14 | SRVIO: Super Robust Visual Inertial Odometry for dynamic environments and challenging Loop-closure conditions | Ali Samadzadeh et.al. | 2201.05386 | link |
2022-01-19 | Multi-Hypothesis Scan Matching through Clustering | Giorgio Iavicoli et.al. | 2201.03814 | null |
2022-01-11 | Performance Guarantees for Spectral Initialization in Rotation Averaging and Pose-Graph SLAM | Kevin J. Doherty et.al. | 2201.03773 | null |
2022-01-10 | High-resolution Ecosystem Mapping in Repetitive Environments Using Dual Camera SLAM | Brian M. Hopkinson et.al. | 2201.03364 | link |
2022-01-10 | Why-So-Deep: Towards Boosting Previously Trained Models for Visual Place Recognition | M. Usman Maqbool Bhutta et.al. | 2201.03212 | link |
2022-01-04 | Formulations of Hydrodynamic Force in the Transition Stage of the Water Entry of Linear Wedges with Constant and Varying Speeds | Xueliang Wen et.al. | 2201.00959 | null |
2021-12-29 | Efficient Belief Space Planning in High-Dimensional State Spaces using PIVOT: Predictive Incremental Variable Ordering Tactic | Khen Elimelech et.al. | 2112.14428 | null |
2021-12-19 | M2DGR: A Multi-sensor and Multi-scenario SLAM Dataset for Ground Robots | Jie Yin et.al. | 2112.13659 | link |
2021-12-27 | UV-SLAM: Unconstrained Line-based SLAM Using Vanishing Points for Structural Mapping | Hyunjun Lim et.al. | 2112.13515 | link |
2021-12-25 | Simultaneous Location of Rail Vehicles and Mapping of Environment with Multiple LiDARs | Yusheng Wang et.al. | 2112.13224 | null |
2021-12-25 | Edge Robotics: Edge-Computing-Accelerated Multi-Robot Simultaneous Localization and Mapping | Peng Huang et.al. | 2112.13222 | null |
2021-12-24 | 3D Point Cloud Reconstruction and SLAM as an Input | Ziyu Li et.al. | 2112.12907 | null |
2021-12-22 | NICE-SLAM: Neural Implicit Scalable Encoding for SLAM | Zihan Zhu et.al. | 2112.12130 | link |
2021-12-18 | Fast and Robust Registration of Partially Overlapping Point Clouds | Eduardo Arnold et.al. | 2112.09922 | link |
2021-12-17 | Symmetry-aware Neural Architecture for Embodied Visual Navigation | Shuang Liu et.al. | 2112.09515 | null |
2021-12-27 | Homography Decomposition Networks for Planar Object Tracking | Xinrui Zhan et.al. | 2112.07909 | link |
2021-12-14 | Autonomous Navigation System from Simultaneous Localization and Mapping | Micheal Caracciolo et.al. | 2112.07723 | link |
2021-12-12 | 360-DFPE: Leveraging Monocular 360-Layouts for Direct Floor Plan Estimation | Bolivar Solarte et.al. | 2112.06180 | link |
2021-12-11 | Simultaneous Localization and Mapping: Through the Lens of Nonlinear Optimization | Amay Saxena et.al. | 2112.05921 | null |
2021-12-07 | Hybrid Visual SLAM for Underwater Vehicle Manipulator Systems | Gideon Billings et.al. | 2112.03826 | link |
2021-12-05 | Iterated Posterior Linearization PMB Filter for 5G SLAM | Yu Ge et.al. | 2112.02575 | null |
2021-12-03 | Fast Direct Stereo Visual SLAM | Jiawei Mo et.al. | 2112.01890 | link |
2021-12-02 | MegBA: A High-Performance and Distributed Library for Large-Scale Bundle Adjustment | Jie Ren et.al. | 2112.01349 | link |
2021-12-01 | Research on Event Accumulator Settings for Event-Based SLAM | Kun Xiao et.al. | 2112.00427 | link |
2021-11-29 | An in-depth experimental study of sensor usage and visual reasoning of robots navigating in real environments | Assem Sadek et.al. | 2111.14666 | null |
2021-11-29 | Deployment of Aerial Robots after a major fire of an industrial hall with hazardous substances, a report | Hartmut Surmann et.al. | 2111.14542 | null |
2021-11-24 | Automatic Mapping with Obstacle Identification for Indoor Human Mobility Assessment | V. Ayala-Alfaro et.al. | 2111.12690 | null |
2021-11-24 | Autonomous bot with ML-based reactive navigation for indoor environment | Yash Srivastava et.al. | 2111.12542 | null |
2021-11-22 | A General Framework for Lifelong Localization and Mapping in Changing Environment | Min Zhao et.al. | 2111.10946 | link |
2021-11-17 | Probabilistic Spatial Distribution Prior Based Attentional Keypoints Matching Network | Xiaoming Zhao et.al. | 2111.09006 | null |
2021-11-10 | Comparing dominance of tennis’ big three via multiple-output Bayesian quantile regression models | Bruno Santos et.al. | 2111.05631 | null |
2021-11-10 | TomoSLAM: factor graph optimization for rotation angle refinement in microtomography | Mark Griguletskii et.al. | 2111.05562 | null |
2021-11-07 | Hierarchical Segment-based Optimization for SLAM | Yuxin Tian et.al. | 2111.04101 | null |
2021-11-07 | Online Mutual Adaptation of Deep Depth Prediction and Visual SLAM | Shing Yan Loo et.al. | 2111.04096 | null |
2021-11-05 | MSC-VO: Exploiting Manhattan and Structural Constraints for Visual Odometry | Joan P. Company-Corcoles et.al. | 2111.03408 | null |
2021-10-31 | Loop closure detection using local 3D deep descriptors | Youjie Zhou et.al. | 2111.00440 | link |
2021-10-27 | Millimeter Wave Wireless Assisted Robot Navigation with Link State Classification | Mingsheng Yin et.al. | 2110.14789 | link |
2021-10-27 | Efficient Placard Discovery for Semantic Mapping During Frontier Exploration | David Balaban et.al. | 2110.14742 | null |
2021-10-26 | Robust Multi-view Registration of Point Sets with Laplacian Mixture Model | Jin Zhang et.al. | 2110.13744 | null |
2021-10-25 | WOLF: A modular estimation framework for robotics based on factor graphs | Joan Sola et.al. | 2110.12919 | null |
2021-10-21 | Real-Time Ground-Plane Refined LiDAR SLAM | Fan Yang et.al. | 2110.11517 | null |
2021-10-21 | SymbioLCD: Ensemble-Based Loop Closure Detection using CNN-Extracted Objects and Visual Bag-of-Words | Jonathan J. Y. Kim et.al. | 2110.11491 | null |
2021-10-21 | InterpolationSLAM: A Novel Robust Visual SLAM System in Rotational Motion | Zhenkun Zhu et.al. | 2110.11040 | null |
2021-10-20 | SLAM: A Unified Encoder for Speech and Language Modeling via Speech-Text Joint Pre-Training | Ankur Bapna et.al. | 2110.10329 | null |
2021-10-18 | Enhancing exploration algorithms for navigation with visual SLAM | Kirill Muravyev et.al. | 2110.09156 | null |
2021-10-18 | Accurate and Robust Object-oriented SLAM with 3D Quadric Landmark Construction in Outdoor Environment | Rui Tian et.al. | 2110.08977 | null |
2021-10-16 | Partial Hierarchical Pose Graph Optimization for SLAM | Alexander Korovko et.al. | 2110.08639 | null |
2021-10-14 | Active SLAM over Continuous Trajectory and Control: A Covariance-Feedback Approach | Shumon Koga et.al. | 2110.07546 | null |
2021-10-13 | Collaborative Radio SLAM for Multiple Robots based on WiFi Fingerprint Similarity | Ran Liu et.al. | 2110.06541 | null |
2021-10-12 | Learning Efficient Multi-Agent Cooperative Visual Exploration | Chao Yu et.al. | 2110.05734 | null |
2021-10-07 | Self-Supervised Depth Completion for Active Stereo | Frederik Warburg et.al. | 2110.03234 | null |
2021-10-06 | InterpolationSLAM: A Novel Robust Visual SLAM System in Rotating Scenes | Zhenkun Zhu et.al. | 2110.02593 | null |
2021-10-03 | AEROS: Adaptive RObust least-Squares for Graph-Based SLAM | Milad Ramezani et.al. | 2110.02018 | null |
2021-10-04 | Fast Uncertainty Quantification for Active Graph SLAM | Julio A. Placed et.al. | 2110.01289 | link |
2021-10-04 | Geometry-based Graph Pruning for Lifelong SLAM | Gerhard Kurz et.al. | 2110.01286 | null |
2021-10-03 | Quadrotor Control on $SU(2)\times R^3$ with SLAM Integration | Marcus Greiff et.al. | 2110.01099 | null |
2021-10-02 | Online Incremental Non-Gaussian Inference for SLAM Using Normalizing Flows | Qiangqiang Huang et.al. | 2110.00876 | link |
SFM
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-12-18 | Foundation Models Meet Low-Cost Sensors: Test-Time Adaptation for Rescaling Disparity for Zero-Shot Metric Depth Estimation | Rémi Marsal et.al. | 2412.14103 | null |
2024-12-16 | Speech Foundation Models and Crowdsourcing for Efficient, High-Quality Data Collection | Beomseok Lee et.al. | 2412.11978 | null |
2024-12-18 | SplineGS: Robust Motion-Adaptive Spline for Real-Time Dynamic 3D Gaussians from Monocular Video | Jongmin Park et.al. | 2412.09982 | null |
2024-12-12 | CoDTS: Enhancing Sparsely Supervised Collaborative Perception with a Dual Teacher-Student Framework | Yushan Han et.al. | 2412.08344 | null |
2024-12-10 | Deep Non-rigid Structure-from-Motion Revisited: Canonicalization and Sequence Modeling | Hui Deng et.al. | 2412.07230 | null |
2024-12-08 | Unveiling True Talent: The Soccer Factor Model for Skill Evaluation | Alexandre Andorra et.al. | 2412.05911 | null |
2024-12-08 | Doppelgangers++: Improved Visual Disambiguation with Geometric 3D Features | Yuanbo Xiangli et.al. | 2412.05826 | null |
2024-12-06 | MegaSaM: Accurate, Fast, and Robust Structure and Motion from Casual Dynamic Videos | Zhengqi Li et.al. | 2412.04463 | null |
2024-12-03 | ASANet: Asymmetric Semantic Aligning Network for RGB and SAR image land cover classification | Pan Zhang et.al. | 2412.02044 | link |
2024-12-02 | SfM-Free 3D Gaussian Splatting via Hierarchical Training | Bo Ji et.al. | 2412.01553 | link |
2024-12-02 | MVImgNet2.0: A Larger-scale Dataset of Multi-view Images | Xiaoguang Han et.al. | 2412.01430 | null |
2024-12-02 | TAS-TsC: A Data-Driven Framework for Estimating Time of Arrival Using Temporal-Attribute-Spatial Tri-space Coordination of Truck Trajectories | Mengran Li et.al. | 2412.01122 | null |
2024-12-02 | Look Ma, No Ground Truth! Ground-Truth-Free Tuning of Structure from Motion and Visual SLAM | Alejandro Fontan et.al. | 2412.01116 | null |
2024-11-27 | RoMo: Robust Motion Segmentation Improves Structure from Motion | Lily Goli et.al. | 2411.18650 | null |
2024-11-26 | The MAGPI Survey: radial trends in star formation across different cosmological simulations in comparison with observations at $z \sim$ 0.3 | Marcie Mun et.al. | 2411.17882 | null |
2024-11-25 | Characterizing Stellar and Gas Properties in NGC 628: Spatial Distributions, Radial Gradients, and Resolved Scaling Relations | Peng Wei et.al. | 2411.16150 | null |
2024-11-24 | ZeroGS: Training 3D Gaussian Splatting from Unposed Images | Yu Chen et.al. | 2411.15779 | null |
2024-11-20 | DATAP-SfM: Dynamic-Aware Tracking Any Point for Robust Structure from Motion in the Wild | Weicai Ye et.al. | 2411.13291 | null |
2024-11-15 | SPARS3R: Semantic Prior Alignment and Regularization for Sparse 3D Reconstruction | Yutao Tang et.al. | 2411.12592 | link |
2024-11-15 | The Oxford Spires Dataset: Benchmarking Large-Scale LiDAR-Visual Localisation, Reconstruction and Radiance Field Methods | Yifu Tao et.al. | 2411.10546 | null |
2024-11-13 | 4D Gaussian Splatting in the Wild with Uncertainty-Aware Regularization | Mijeong Kim et.al. | 2411.08879 | null |
2024-11-13 | Biomass phenotyping of oilseed rape through UAV multi-view oblique imaging with 3DGS and SAM model | Yutao Shen et.al. | 2411.08453 | null |
2024-11-08 | From Transparent to Opaque: Rethinking Neural Implicit Surfaces with $α$ -NeuS | Haoran Zhang et.al. | 2411.05362 | link |
2024-10-29 | A Cascade Approach for APT Campaign Attribution in System Event Logs: Technique Hunting and Subgraph Matching | Yi-Ting Huang et.al. | 2410.22602 | null |
2024-10-29 | LiVisSfM: Accurate and Robust Structure-from-Motion with LiDAR and Visual Cues | Hanqing Jiang et.al. | 2410.22213 | null |
2024-10-17 | Stochastic Flow Matching for Resolving Small-Scale Physics | Stathi Fotiadis et.al. | 2410.19814 | null |
2024-10-25 | A Robust and Efficient Visual-Inertial Initialization with Probabilistic Normal Epipolar Constraint | Changshi Mu et.al. | 2410.19473 | null |
2024-10-30 | Large Spatial Model: End-to-end Unposed Images to Semantic 3D | Zhiwen Fan et.al. | 2410.18956 | null |
2024-10-23 | CO-CAVITY project: Molecular gas and star formation in void galaxies | M. I. Rodríguez et.al. | 2410.18078 | null |
2024-10-23 | PLGS: Robust Panoptic Lifting with 3D Gaussian Splatting | Yu Wang et.al. | 2410.17505 | null |
2024-10-20 | Neural Active Structure-from-Motion in Dark and Textureless Environment | Kazuto Ichimaru et.al. | 2410.15378 | null |
2024-10-17 | SemSim: Revisiting Weak-to-Strong Consistency from a Semantic Similarity Perspective for Semi-supervised Medical Image Segmentation | Shiao Xie et.al. | 2410.13486 | null |
2024-10-16 | Multi-View Multi-Task Modeling with Speech Foundation Models for Speech Forensic Tasks | Orchid Chetia Phukan et.al. | 2410.12947 | null |
2024-10-16 | Gravity-aligned Rotation Averaging with Circular Regression | Linfei Pan et.al. | 2410.12763 | link |
2024-10-16 | Beyond Speech and More: Investigating the Emergent Ability of Speech Foundation Models for Classifying Physiological Time-Series Signals | Orchid Chetia Phukan et.al. | 2410.12645 | null |
2024-10-15 | SplatPose+: Real-time Image-Based Pose-Agnostic 3D Anomaly Detection | Yizhe Liu et.al. | 2410.12080 | link |
2024-10-15 | LoGS: Visual Localization via Gaussian Splatting with Fewer Training Images | Yuzhou Cheng et.al. | 2410.11505 | null |
2024-10-15 | Multiview Scene Graph | Juexiao Zhang et.al. | 2410.11187 | link |
2024-10-12 | Leveraging Semantic Cues from Foundation Vision Models for Enhanced Local Feature Correspondence | Felipe Cadar et.al. | 2410.09533 | link |
2024-10-09 | Surgical Depth Anything: Depth Estimation for Surgical Scenes using Foundation Models | Ange Lou et.al. | 2410.07434 | null |
2024-10-09 | Deep HI Mapping of M 106 Group with FAST | Yao Liu et.al. | 2410.07038 | null |
2024-10-09 | MaD-Scientist: AI-based Scientist solving Convection-Diffusion-Reaction Equations Using Massive PINN-Based Prior Data | Mingu Kang et.al. | 2410.06442 | null |
2024-10-08 | Are Minimal Radial Distortion Solvers Necessary for Relative Pose Estimation? | Charalambos Tzamos et.al. | 2410.05984 | link |
2024-10-04 | Refinement of Monocular Depth Maps via Multi-View Differentiable Rendering | Laura Fink et.al. | 2410.03861 | null |
2024-10-01 | MOSEL: 950,000 Hours of Speech Data for Open-Source Speech Foundation Model Training on EU Languages | Marco Gaido et.al. | 2410.01036 | link |
2024-10-01 | Seamless Augmented Reality Integration in Arthroscopy: A Pipeline for Articular Reconstruction and Guidance | Hongchao Shu et.al. | 2410.00386 | null |
2024-09-29 | Robust Incremental Structure-from-Motion with Hybrid Features | Shaohui Liu et.al. | 2409.19811 | null |
2024-09-27 | MASt3R-SfM: a Fully-Integrated Solution for Unconstrained Structure-from-Motion | Bardienus Duisterhof et.al. | 2409.19152 | null |
2024-09-27 | Exploiting Motion Prior for Accurate Pose Estimation of Dashboard Cameras | Yipeng Lu et.al. | 2409.18673 | null |
2024-09-26 | BlinkTrack: Feature Tracking over 100 FPS via Events and Images | Yichen Shen et.al. | 2409.17981 | null |
2024-09-25 | How to Connect Speech Foundation Models and Large Language Models? What Matters and What Does Not | Francesco Verdini et.al. | 2409.17044 | null |
2024-09-24 | Frequency-based View Selection in Gaussian Splatting Reconstruction | Monica M. Q. Li et.al. | 2409.16470 | null |
2024-10-07 | Initialization of Monocular Visual Navigation for Autonomous Agents Using Modified Structure from Small Motion | Juan-Diego Florez et.al. | 2409.16465 | null |
2024-09-24 | Exploring the potential of collaborative UAV 3D mapping in Kenyan savanna for wildlife research | Vandita Shukla et.al. | 2409.15914 | null |
2024-09-23 | Assessment of Submillimeter Precision via Structure from Motion Technique in Close-Range Capture Environments | Francisco Roza de Moraes et.al. | 2409.15602 | null |
2024-09-23 | Evaluating Robot Influence on Pedestrian Behavior Models for Crowd Simulation and Benchmarking | Subham Agrawal et.al. | 2409.14844 | null |
2024-09-21 | Are Music Foundation Models Better at Singing Voice Deepfake Detection? Far-Better Fuse them with Speech Foundation Models | Orchid Chetia Phukan et.al. | 2409.14131 | null |
2024-09-17 | GS-Net: Generalizable Plug-and-Play 3D Gaussian Splatting Module | Yichen Zhang et.al. | 2409.11307 | null |
2024-09-13 | Dense Point Clouds Matter: Dust-GS for Scene Reconstruction from Sparse Viewpoints | Shan Chen et.al. | 2409.08613 | null |
2024-09-09 | KRONC: Keypoint-based Robust Camera Optimization for 3D Car Reconstruction | Davide Di Nucci et.al. | 2409.05407 | null |
2024-09-06 | The Arizona Molecular ISM Survey with the SMT: Variations in the CO(2-1)/CO(1-0) Line Ratio Across the Galaxy Population | Ryan P. Keenan et.al. | 2409.03963 | null |
2024-09-05 | Active Galactic Nuclei in the Green Valley at z $\sim$ 0.7 | Charity Woodrum et.al. | 2409.03197 | null |
2024-09-04 | Object Gaussian for Monocular 6D Pose Estimation from Sparse Views | Luqing Luo et.al. | 2409.02581 | null |
2024-09-11 | Geometry-aware Feature Matching for Large-Scale Structure from Motion | Gonglin Chen et.al. | 2409.02310 | null |
2024-09-04 | The study of strongly intensive observables for $π^{\pm,0}$ in $pp$ collisions at LHC energy in the framework of PYTHIA model | Tumpa Biswas et.al. | 2409.00525 | null |
2024-09-04 | Augmented Reality without Borders: Achieving Precise Localization Without Maps | Albert Gassol Puigjaner et.al. | 2408.17373 | null |
2024-09-15 | Mismatched: Evaluating the Limits of Image Matching Approaches and Benchmarks | Sierra Bonilla et.al. | 2408.16445 | link |
2024-08-21 | Visual Localization in 3D Maps: Comparing Point Cloud, Mesh, and NeRF Representations | Lintong Zhang et.al. | 2408.11966 | null |
2024-08-20 | TrackNeRF: Bundle Adjusting NeRF from Sparse and Noisy Views via Feature Tracks | Jinjie Mai et.al. | 2408.10739 | null |
2024-08-16 | Correspondence-Guided SfM-Free 3D Gaussian Splatting for NVS | Wei Sun et.al. | 2408.08723 | null |
2024-08-15 | CorrAdaptor: Adaptive Local Context Learning for Correspondence Pruning | Wei Zhu et.al. | 2408.08134 | link |
2024-08-13 | A Miniature Vision-Based Localization System for Indoor Blimps | Shicong Ma et.al. | 2408.06648 | null |
2024-08-07 | Towards Real-Time Gaussian Splatting: Accelerating 3DGS through Photometric SLAM | Yan Song Hu et.al. | 2408.03825 | null |
2024-08-05 | Context-aware Mamba-based Reinforcement Learning for social robot navigation | Syed Muhammad Mustafa et.al. | 2408.02661 | null |
2024-08-04 | Birational geometry of critical loci in Algebraic Vision | Marina Bertolini et.al. | 2408.02067 | null |
2024-08-04 | PanicleNeRF: low-cost, high-precision in-field phenotypingof rice panicles with smartphone | Xin Yang et.al. | 2408.02053 | null |
2024-08-02 | Structure from Motion-based Motion Estimation and 3D Reconstruction of Unknown Shaped Space Debris | Kentaro Uno et.al. | 2408.01035 | null |
2024-08-01 | LoopSparseGS: Loop Based Sparse-View Friendly Gaussian Splatting | Zhenyu Bao et.al. | 2408.00254 | null |
2024-07-29 | Global Structure-from-Motion Revisited | Linfei Pan et.al. | 2407.20219 | link |
2024-08-06 | Revisit Self-supervised Depth Estimation with Local Structure-from-Motion | Shengjie Zhu et.al. | 2407.19166 | null |
2024-07-23 | The Hidden Variables: Harnessing Half-Shell Potentials for Enhanced Precision in Nuclear Reaction Calculations | Hao Liu et.al. | 2407.16452 | null |
2024-07-22 | Enhancement of 3D Gaussian Splatting using Raw Mesh for Photorealistic Recreation of Architectures | Ruizhe Wang et.al. | 2407.15435 | null |
2024-07-16 | NeuSurfEmb: A Complete Pipeline for Dense Correspondence-based 6D Object Pose Estimation without CAD Models | Francesco Milano et.al. | 2407.12207 | link |
2024-07-15 | LVCP: LiDAR-Vision Tightly Coupled Collaborative Real-time Relative Positioning | Zhuozhu Jian et.al. | 2407.10782 | null |
2024-07-15 | Towards Scale-Aware Full Surround Monodepth with Transformers | Yuchen Yang et.al. | 2407.10406 | null |
2024-07-14 | 3DEgo: 3D Editing on the Go! | Umar Khalid et.al. | 2407.10102 | null |
2024-07-10 | Hybrid Structure-from-Motion and Camera Relocalization for Enhanced Egocentric Localization | Jinjie Mai et.al. | 2407.08023 | link |
2024-07-10 | Euclid preparation. Forecasting the recovery of galaxy physical properties and their relations with template-fitting and machine-learning methods | Euclid Collaboration et.al. | 2407.07940 | null |
2024-07-10 | Controlling Space and Time with Diffusion Models | Daniel Watson et.al. | 2407.07860 | null |
2024-07-09 | Computer vision tasks for intelligent aerospace missions: An overview | Huilin Chen et.al. | 2407.06513 | null |
2024-07-08 | Enhancing Neural Radiance Fields with Depth and Normal Completion Priors from Sparse Views | Jiawei Guo et.al. | 2407.05666 | null |
2024-07-05 | Efficient Detection of Long Consistent Cycles and its Application to Distributed Synchronization | Shaohan Li et.al. | 2407.04260 | null |
2024-07-15 | SfM on-the-fly: Get better 3D from What You Capture | Zongqian Zhan et.al. | 2407.03939 | null |
2024-07-03 | Free-SurGS: SfM-Free 3D Gaussian Splatting for Surgical Scene Reconstruction | Jiaxin Guo et.al. | 2407.02918 | link |
2024-07-02 | Indoor 3D Reconstruction with an Unknown Camera-Projector Pair | Zhaoshuai Qi et.al. | 2407.01945 | null |
2024-06-27 | SALVe: Semantic Alignment Verification for Floorplan Reconstruction from Sparse Panoramas | John Lambert et.al. | 2406.19390 | link |
2024-06-27 | STAL3D: Unsupervised Domain Adaptation for 3D Object Detection via Collaborating Self-Training and Adversarial Learning | Yanan Zhang et.al. | 2406.19362 | null |
2024-06-26 | VDG: Vision-Only Dynamic Gaussian for Driving Simulation | Hao Li et.al. | 2406.18198 | null |
2024-06-25 | Consensus Learning with Deep Sets for Essential Matrix Estimation | Dror Moran et.al. | 2406.17414 | link |
2024-06-24 | Crowd-Sourced NeRF: Collecting Data from Production Vehicles for 3D Street View Reconstruction | Tong Qin et.al. | 2406.16289 | null |
2024-06-21 | The importance of stochasticity in determining galaxy emissivities and UV LFs during cosmic dawn and reionization | Ivan Nikolić et.al. | 2406.15237 | link |
2024-06-19 | MVSBoost: An Efficient Point Cloud-based 3D Reconstruction | Umair Haroon et.al. | 2406.13515 | null |
2024-06-17 | MegaScenes: Scene-Level View Synthesis at Scale | Joseph Tung et.al. | 2406.11819 | link |
2024-06-15 | Benchmarking Children’s ASR with Supervised and Self-supervised Speech Foundation Models | Ruchao Fan et.al. | 2406.10507 | link |
2024-06-14 | On the Evaluation of Speech Foundation Models for Spoken Language Understanding | Siddhant Arora et.al. | 2406.10083 | null |
2024-06-12 | Self-supervised Learning of Neural Implicit Feature Fields for Camera Pose Refinement | Maxime Pietrantoni et.al. | 2406.08463 | null |
2024-06-12 | SVSNet+: Enhancing Speaker Voice Similarity Assessment Models with Representations from Speech Foundation Models | Chun Yin et.al. | 2406.08445 | null |
2024-06-10 | Lighting Every Darkness with 3DGS: Fast Training and Real-Time Rendering for HDR View Synthesis | Xin Jin et.al. | 2406.06216 | link |
2024-06-07 | The Star-Forming Main Sequence in JADES and CEERS at $z>1.4$ : Investigating the Burstiness of Star Formation | Leonardo Clarke et.al. | 2406.05178 | null |
2024-06-13 | Gaussian Splatting with Localized Points Management | Haosen Yang et.al. | 2406.04251 | null |
2024-06-05 | L-PR: Exploiting LiDAR Fiducial Marker for Unordered Low Overlap Multiview Point Cloud Registration | Yibo Liu et.al. | 2406.03298 | link |
2024-06-04 | CamCo: Camera-Controllable 3D-Consistent Image-to-Video Generation | Dejia Xu et.al. | 2406.02509 | null |
2024-05-29 | Neural Radiance Fields for Novel View Synthesis in Monocular Gastroscopy | Zijie Jiang et.al. | 2405.18863 | null |
2024-05-29 | 3D Reconstruction with Fast Dipole Sums | Hanyu Chen et.al. | 2405.16788 | null |
2024-05-26 | MCGMapper: Light-Weight Incremental Structure from Motion and Visual Localization With Planar Markers and Camera Groups | Yusen Xie et.al. | 2405.16599 | null |
2024-05-26 | Categorical Flow Matching on Statistical Manifolds | Chaoran Cheng et.al. | 2405.16441 | link |
2024-05-22 | Exploring Galaxy Properties of eCALIFA with Contrastive Learning | G. Martínez-Solaeche et.al. | 2405.13471 | null |
2024-05-23 | Switched Flow Matching: Eliminating Singularities via Switching ODEs | Qunxi Zhu et.al. | 2405.11605 | null |
2024-05-28 | NeRO: Neural Road Surface Reconstruction | Ruibo Wang et.al. | 2405.10554 | link |
2024-05-15 | Three Dimensional Spatial Cognition: Bees and Bats | Robert Worden et.al. | 2405.09413 | null |
2024-05-09 | Similarity Guided Multimodal Fusion Transformer for Semantic Location Prediction in Social Media | Zhizhen Zhang et.al. | 2405.05760 | null |
2024-05-09 | Power Variable Projection for Initialization-Free Large-Scale Bundle Adjustment | Simon Weber et.al. | 2405.05079 | link |
2024-05-07 | Novel View Synthesis with Neural Radiance Fields for Industrial Robot Applications | Markus Hillemann et.al. | 2405.04345 | null |
2024-05-07 | Non-rigid Structure-from-Motion: Temporally-smooth Procrustean Alignment and Spatially-variant Deformation Modeling | Jiawei Shi et.al. | 2405.04309 | null |
2024-05-06 | Transformer-based RGB-T Tracking with Channel and Spatial Feature Fusion | Yunfeng Li et.al. | 2405.03177 | link |
2024-05-03 | HoloGS: Instant Depth-based 3D Gaussian Splatting with Microsoft HoloLens 2 | Miriam Jäger et.al. | 2405.02005 | null |
2024-04-25 | The MAGPI Survey: Evolution of radial trends in star formation activity across cosmic time | Marcie Mun et.al. | 2404.16319 | null |
2024-04-22 | Scene Coordinate Reconstruction: Posing of Image Collections via Incremental Learning of a Relocalizer | Eric Brachmann et.al. | 2404.14351 | null |
2024-04-22 | RESFM: Robust Equivariant Multiview Structure from Motion | Fadi Khatib et.al. | 2404.14280 | null |
2024-04-22 | Does Gaussian Splatting need SFM Initialization? | Yalda Foroutan et.al. | 2404.12547 | null |
2024-05-07 | A Subspace-Constrained Tyler’s Estimator and its Applications to Structure from Motion | Feng Yu et.al. | 2404.11590 | link |
2024-04-18 | DeblurGS: Gaussian Splatting for Camera Motion Blur | Jeongtaek Oh et.al. | 2404.11358 | null |
2024-05-21 | LetsGo: Large-Scale Garage Modeling and Rendering via LiDAR-Assisted Gaussian Primitives | Jiadi Cui et.al. | 2404.09748 | null |
2024-04-12 | MonoPatchNeRF: Improving Neural Radiance Fields with Patch-based Monocular Guidance | Yuqun Wu et.al. | 2404.08252 | null |
2024-04-11 | Boosting Self-Supervision for Single-View Scene Completion via Knowledge Distillation | Keonhee Han et.al. | 2404.07933 | null |
2024-04-07 | NeRF2Points: Large-Scale Point Cloud Generation From Street Views’ Radiance Field Optimization | Peng Tu et.al. | 2404.04875 | null |
2024-04-04 | GaSpCT: Gaussian Splatting for Novel CT Projection View Synthesis | Emmanouil Nikolakakis et.al. | 2404.03126 | null |
2024-03-29 | InstantSplat: Unbounded Sparse-view Pose-free Gaussian Splatting in 40 Seconds | Zhiwen Fan et.al. | 2403.20309 | link |
2024-03-29 | HO-Gaussian: Hybrid Optimization of 3D Gaussian Splatting for Urban Scenes | Zhuopeng Li et.al. | 2403.20032 | null |
2024-03-26 | NeRF-HuGS: Improved Neural Radiance Fields in Non-static Scenes Using Heuristics-Guided Segmentation | Jiahao Chen et.al. | 2403.17537 | null |
2024-03-25 | INPC: Implicit Neural Point Clouds for Radiance Field Rendering | Florian Hahlbohm et.al. | 2403.16862 | null |
2024-03-18 | An Accurate and Real-time Relative Pose Estimation from Triple Point-line Images by Decoupling Rotation and Translation | Zewen Xu et.al. | 2403.11639 | null |
2024-03-14 | Relaxing Accurate Initialization Constraint for 3D Gaussian Splatting | Jaewoo Jung et.al. | 2403.09413 | link |
2024-03-13 | Refractive COLMAP: Refractive Structure-from-Motion Revisited | Mengkun She et.al. | 2403.08640 | null |
2024-03-13 | NeRF-Supervised Feature Point Detection and Description | Ali Youssef et.al. | 2403.08156 | link |
2024-03-11 | SiLVR: Scalable Lidar-Visual Reconstruction with Neural Radiance Fields for Robotic Inspection | Yifu Tao et.al. | 2403.06877 | null |
2024-03-24 | BAGS: Blur Agnostic Gaussian Splatting through Multi-Scale Kernel Modeling | Cheng Peng et.al. | 2403.04926 | link |
2024-02-22 | GaussianPro: 3D Gaussian Splatting with Progressive Propagation | Kai Cheng et.al. | 2402.14650 | null |
2024-02-25 | A Robust Error-Resistant View Selection Method for 3D Reconstruction | Shaojie Zhang et.al. | 2402.11431 | null |
2024-02-17 | Dense Matchers for Dense Tracking | Tomáš Jelínek et.al. | 2402.11287 | null |
2024-03-11 | Local Feature Matching Using Deep Learning: A Survey | Shibiao Xu et.al. | 2401.17592 | link |
2024-01-22 | HG3-NeRF: Hierarchical Geometric, Semantic, and Photometric Guided Neural Radiance Fields for Sparse View Inputs | Zelin Gao et.al. | 2401.11711 | null |
2024-01-19 | SCENES: Subpixel Correspondence Estimation With Epipolar Supervision | Dominik A. Kloepfer et.al. | 2401.10886 | null |
2024-01-15 | 3DMASC: Accessible, explainable 3D point clouds classification. Application to Bi-spectral Topo-bathymetric lidar data | Mathilde Letard et.al. | 2401.09481 | link |
2024-01-17 | 3D Scene Geometry Estimation from 360 $^\circ$ Imagery: A Survey | Thiago Lopes Trugillo da Silveira et.al. | 2401.09252 | null |
2024-01-17 | ICON: Incremental CONfidence for Joint Pose and Radiance Field Optimization | Weiyao Wang et.al. | 2401.08937 | null |
2024-01-16 | Cross-Modal Semi-Dense 6-DoF Tracking of an Event Camera in Challenging Conditions | Yi-Fan Zuo et.al. | 2401.08043 | link |
2024-01-10 | Structure from Duplicates: Neural Inverse Graphics from a Pile of Objects | Tianhang Cheng et.al. | 2401.05236 | link |
2024-01-07 | A Classification of Critical Configurations for any Number of Projective Views | Martin Bråtelund et.al. | 2401.03450 | link |
2023-12-24 | Residual Learning for Image Point Descriptors | Rashik Shrestha et.al. | 2312.15471 | null |
2023-12-16 | Transformers in Unsupervised Structure-from-Motion | Hemang Chawla et.al. | 2312.10529 | link |
2023-12-14 | HeadRecon: High-Fidelity 3D Head Reconstruction from Monocular Video | Xueying Wang et.al. | 2312.08863 | null |
2023-12-14 | CF-NeRF: Camera Parameter Free Neural Radiance Fields with Incremental Learning | Qingsong Yan et.al. | 2312.08760 | null |
2023-12-11 | Keypoint-based Stereophotoclinometry for Characterizing and Navigating Small Bodies: A Factor Graph Approach | Travis Driver et.al. | 2312.06865 | link |
2023-12-11 | Gaussian Splatting SLAM | Hidenobu Matsuki et.al. | 2312.06741 | null |
2023-12-10 | SuperPrimitive: Scene Reconstruction at a Primitive Level | Kirill Mazur et.al. | 2312.05889 | null |
2023-12-07 | Visual Geometry Grounded Deep Structure From Motion | Jianyuan Wang et.al. | 2312.04563 | null |
2023-11-30 | Distributed Global Structure-from-Motion with a Deep Front-End | Ayush Baid et.al. | 2311.18801 | link |
2023-11-21 | Robot Hand-Eye Calibration using Structure-from-Motion | Nicolas Andreff et.al. | 2311.11808 | null |
2023-11-18 | LOSTU: Fast, Scalable, and Uncertainty-Aware Triangulation | Sébastien Henry et.al. | 2311.11171 | null |
2023-11-10 | MonoProb: Self-Supervised Monocular Depth Estimation with Interpretable Uncertainty | Rémi Marsal et.al. | 2311.06137 | link |
2023-11-08 | VET: Visual Error Tomography for Point Cloud Completion and High-Quality Neural Rendering | Linus Franke et.al. | 2311.04634 | link |
2023-10-22 | A Quantitative Evaluation of Dense 3D Reconstruction of Sinus Anatomy from Monocular Endoscopic Video | Jan Emily Mangulabnan et.al. | 2310.14364 | null |
2023-10-20 | FMRT: Learning Accurate Feature Matching with Reconciliatory Transformer | Xinyu Zhang et.al. | 2310.13605 | null |
2023-10-09 | Colmap-PCD: An Open-source Tool for Fine Image-to-point cloud Registration | Chunge Bai et.al. | 2310.05504 | link |
2023-10-08 | LocoNeRF: A NeRF-based Approach for Local Structure from Motion for Precise Localization | Artem Nenashev et.al. | 2310.05134 | null |
2023-11-29 | Pose-Free Generalizable Rendering Transformer | Zhiwen Fan et.al. | 2310.03704 | link |
2023-10-02 | Leveraging Cutting Edge Deep Learning Based Image Matching for Reconstructing a Large Scene from Sparse Images | Georg Bökman et.al. | 2310.01092 | null |
2023-10-01 | Propagating Semantic Labels in Video Data | David Balaban et.al. | 2310.00783 | null |
2023-09-22 | Scalable Semantic 3D Mapping of Coral Reefs with Deep Learning | Jonathan Sauder et.al. | 2309.12804 | null |
2023-09-21 | On-the-Fly SfM: What you capture is What you get | Zongqian Zhan et.al. | 2309.11883 | link |
2023-09-19 | Using an Uncrewed Surface Vehicle to Create a Volumetric Model of Non-Navigable Rivers and Other Shallow Bodies of Water | Jayesh Tripathi et.al. | 2309.10269 | null |
2023-09-16 | DynaMoN: Motion-Aware Fast And Robust Camera Localization for Dynamic NeRF | Mert Asim Karaoglu et.al. | 2309.08927 | null |
2023-09-08 | Robot Localization and Mapping Final Report – Sequential Adversarial Learning for Self-Supervised Deep Visual Odometry | Akankshya Kar et.al. | 2309.04147 | null |
2023-09-01 | SQLdepth: Generalizable Self-Supervised Fine-Structured Monocular Depth Estimation | Youhong Wang et.al. | 2309.00526 | null |
2023-09-01 | Dense Voxel 3D Reconstruction Using a Monocular Event Camera | Haodong Chen et.al. | 2309.00385 | null |
2023-08-30 | Learning Structure-from-Motion with Graph Attention Networks | Lucas Brynte et.al. | 2308.15984 | link |
2023-08-26 | Disjoint Pose and Shape for 3D Face Reconstruction | Raja Kumar et.al. | 2308.13903 | null |
2023-08-30 | CamP: Camera Preconditioning for Neural Radiance Fields | Keunhong Park et.al. | 2308.10902 | null |
2023-08-18 | Unsupervised 3D Pose Estimation with Non-Rigid Structure-from-Motion Modeling | Haorui Ji et.al. | 2308.10705 | null |
2023-08-14 | Large-scale environment mapping and immersive human-robot interaction for agricultural mobile robot teleoperation | Tao Liu et.al. | 2308.07231 | link |
2023-08-11 | Efficient Large-scale AUV-based Visual Seafloor Mapping | Mengkun She et.al. | 2308.06147 | null |
2023-08-04 | EDI: ESKF-based Disjoint Initialization for Visual-Inertial SLAM Systems | Weihan Wang et.al. | 2308.02670 | null |
2023-08-15 | Tirtha – An Automated Platform to Crowdsource Images and Create 3D Models of Heritage Sites | Jyotirmaya Shivottam et.al. | 2308.01246 | link |
2023-08-02 | Stereo Visual Odometry with Deep Learning-Based Point and Line Feature Matching using an Attention Graph Neural Network | Shenbagaraj Kannapiran et.al. | 2308.01125 | null |
2023-07-27 | PointOdyssey: A Large-Scale Synthetic Dataset for Long-Term Point Tracking | Yang Zheng et.al. | 2307.15055 | link |
2023-07-28 | SACReg: Scene-Agnostic Coordinate Regression for Visual Localization | Jerome Revaud et.al. | 2307.11702 | null |
2023-07-19 | Lazy Visual Localization via Motion Averaging | Siyan Dong et.al. | 2307.09981 | null |
2023-07-10 | Efficient Match Pair Retrieval for Large-scale UAV Images via Graph Indexed Global Descriptor | San Jiang et.al. | 2307.04520 | null |
2023-07-07 | RGB-D Mapping and Tracking in a Plenoxel Radiance Field | Andreas L. Teigen et.al. | 2307.03404 | link |
2023-06-29 | The Drunkard’s Odometry: Estimating Camera Motion in Deforming Scenes | David Recasens et.al. | 2306.16917 | link |
2023-06-27 | Detector-Free Structure from Motion | Xingyi He et.al. | 2306.15669 | link |
2023-06-28 | PoseDiffusion: Solving Pose Estimation via Diffusion-aided Bundle Adjustment | Jianyuan Wang et.al. | 2306.15667 | null |
2023-06-24 | 3D Reconstruction of Spherical Images based on Incremental Structure from Motion | San Jiang et.al. | 2306.12770 | link |
2023-06-15 | NAVI: Category-Agnostic Image Collections with High-Quality 3D Shape and Pose Annotations | Varun Jampani et.al. | 2306.09109 | link |
2023-06-15 | Yes, we CANN: Constrained Approximate Nearest Neighbors for local feature-based visual localization | Dror Aiger et.al. | 2306.09012 | link |
2023-06-10 | 3D reconstruction using Structure for Motion | Kshitij Karnawat et.al. | 2306.06360 | link |
2023-06-02 | Self-supervised Interest Point Detection and Description for Fisheye and Perspective Images | Marcela Mera-Trujillo et.al. | 2306.01938 | null |
2023-05-31 | FlowCam: Training Generalizable 3D Radiance Fields without Camera Poses via Pixel-Aligned Scene Flow | Cameron Smith et.al. | 2306.00180 | null |
2023-05-19 | SIDAR: Synthetic Image Dataset for Alignment & Restoration | Monika Kwiatkowski et.al. | 2305.12036 | link |
2023-05-09 | Eiffel Tower: A Deep-Sea Underwater Dataset for Long-Term Visual Localization | Clémentin Boittiaux et.al. | 2305.05301 | link |
2023-05-09 | Rotation Synchronization via Deep Matrix Factorization | Gk Tejus et.al. | 2305.05268 | link |
2023-04-20 | A Comparative Neural Radiance Field (NeRF) 3D Analysis of Camera Poses from HoloLens Trajectories and Structure from Motion | Miriam Jäger et.al. | 2304.10664 | null |
2023-04-14 | Fusing Structure from Motion and Simulation-Augmented Pose Regression from Optical Flow for Challenging Indoor Environments | Felix Ott et.al. | 2304.07250 | null |
2023-04-12 | Visual Localization using Imperfect 3D Models from the Internet | Vojtech Panek et.al. | 2304.05947 | link |
2023-04-08 | Photometric Correction for Infrared Sensors | Jincheng Zhang et.al. | 2304.03930 | null |
2023-04-07 | DualRefine: Self-Supervised Depth and Pose Estimation Through Iterative Epipolar Sampling and Refinement Toward Equilibrium | Antyanta Bangunharcana et.al. | 2304.03560 | link |
2023-04-05 | Semantic Validation in Structure from Motion | Joseph Rowell et.al. | 2304.02420 | link |
2023-03-31 | Learning Internal Representations of 3D Transformations from 2D Projected Inputs | Marissa Connor et.al. | 2303.17776 | null |
2023-03-30 | 3D Line Mapping Revisited | Shaohui Liu et.al. | 2303.17504 | link |
2023-03-27 | TMO: Textured Mesh Acquisition of Objects with a Mobile Device by using Differentiable Rendering | Jaehoon Choi et.al. | 2303.15060 | null |
2023-03-26 | On the Importance of Accurate Geometry Data for Dense 3D Vision Tasks | HyunJun Jung et.al. | 2303.14840 | link |
2023-03-24 | Seeing Through the Glass: Neural 3D Reconstruction of Object Inside a Transparent Container | Jinguang Tong et.al. | 2303.13805 | link |
2023-03-24 | Progressively Optimized Local Radiance Fields for Robust View Synthesis | Andreas Meuleman et.al. | 2303.13791 | null |
2023-03-15 | RefiNeRF: Modelling dynamic neural radiance fields with inconsistent or missing camera parameters | Shuja Khalid et.al. | 2303.08695 | null |
2023-03-09 | Revisiting Rotation Averaging: Uncertainties and Robust Losses | Ganlin Zhang et.al. | 2303.05195 | link |
2023-02-28 | Nonlinear Intensity, Scale and Rotation Invariant Matching for Multimodal Images | Zhongli Fan et.al. | 2302.14239 | link |
2023-03-25 | BLiRF: Bandlimited Radiance Fields for Dynamic Scene Modeling | Sameera Ramasinghe et.al. | 2302.13543 | null |
2023-02-21 | EC-SfM: Efficient Covisibility-based Structure-from-Motion for Both Sequential and Unordered Images | Zhichao Ye et.al. | 2302.10544 | link |
2023-02-18 | Bridge Damage Cause Estimation Using Multiple Images Based on Visual Question Answering | Tatsuro Yamane et.al. | 2302.09208 | null |
2023-02-12 | Uncertainty-Driven Dense Two-View Structure from Motion | Weirong Chen et.al. | 2302.00523 | null |
2023-01-28 | AdaSfM: From Coarse Global to Fine Incremental Adaptive Structure from Motion | Yu Chen et.al. | 2301.12135 | null |
2023-01-20 | A vision-based autonomous UAV inspection framework for unknown tunnel construction sites with dynamic obstacles | Zhefan Xu et.al. | 2301.08422 | link |
2023-03-21 | Robust Dynamic Radiance Fields | Yu-Lun Liu et.al. | 2301.02239 | link |
2022-12-24 | Polarimetric Multi-View Inverse Rendering | Jinyu Zhao et.al. | 2212.12721 | null |
2022-12-13 | Accidental Turntables: Learning 3D Pose by Watching Objects Turn | Zezhou Cheng et.al. | 2212.06300 | null |
2022-12-04 | 3D Object Aided Self-Supervised Monocular Depth Estimation | Songlin Wei et.al. | 2212.01768 | null |
2022-12-02 | High-Res Facial Appearance Capture from Polarized Smartphone Images | Dejan Azinović et.al. | 2212.01160 | null |
2022-11-28 | FeatureBooster: Boosting Feature Descriptors with a Lightweight Neural Network | Xinjiang Wang et.al. | 2211.15069 | link |
2022-11-24 | JigsawPlan: Room Layout Jigsaw Puzzle Extreme Structure from Motion using Diffusion Models | Sepidehsadat Hosseini et.al. | 2211.13785 | null |
2022-11-24 | SfM-TTR: Using Structure from Motion for Test-Time Refinement of Single-View Depth Networks | Sergio Izquierdo et.al. | 2211.13551 | link |
2022-11-22 | Level-S $^2$ fM: Structure from Motion on Neural Level Set of Implicit Surfaces | Yuxi Xiao et.al. | 2211.12018 | link |
2022-11-21 | Towards Live 3D Reconstruction from Wearable Video: An Evaluation of V-SLAM, NeRF, and Videogrammetry Techniques | David Ramirez et.al. | 2211.11836 | null |
2022-11-14 | Controllable GAN Synthesis Using Non-Rigid Structure-from-Motion | René Haas et.al. | 2211.07195 | null |
2022-10-13 | Quantifying and analyzing rock trait distributions of rocky fault scarps using a deep learning approach | Zhiang Chen et.al. | 2210.07349 | null |
2022-10-11 | DeepMLE: A Robust Deep Maximum Likelihood Estimator for Two-view Structure from Motion | Yuxi Xiao et.al. | 2210.05517 | null |
2022-10-07 | Leveraging Structure from Motion to Localize Inaccessible Bus Stops | Indu Panigrahi et.al. | 2210.03646 | link |
2022-10-01 | Structure-Aware NeRF without Posed Camera via Epipolar Constraint | Shu Chen et.al. | 2210.00183 | link |
2022-10-05 | FAST-LIO, Then Bayesian ICP, Then GTSFM | Jerred Chen et.al. | 2210.00146 | null |
2022-09-20 | BuFF: Burst Feature Finder for Light-Constrained 3D Reconstruction | Ahalya Ravendran et.al. | 2209.09470 | null |
2022-09-19 | A Hybrid Cable-Driven Robot for Non-Destructive Leafy Plant Monitoring and Mass Estimation using Structure from Motion | Gerry Chen et.al. | 2209.08690 | null |
2022-09-14 | End-to-End Multi-View Structure-from-Motion with Hypercorrelation Volumes | Qiao Chen et.al. | 2209.06926 | null |
2022-09-07 | Deployment of Aerial Robots during the Flood Disaster in Erftstadt / Blessem in July 2021 | Hartmut Surmann et.al. | 2209.03084 | null |
2022-08-27 | Weakly and Semi-Supervised Detection, Segmentation and Tracking of Table Grapes with Limited and Noisy Data | Thomas A. Ciarfuglia et.al. | 2208.13001 | null |
2022-08-12 | Handling Constrained Optimization in Factor Graphs for Autonomous Navigation | Barbara Bazzana et.al. | 2208.06325 | null |
2022-08-04 | Globally Consistent Video Depth and Pose Estimation with Efficient Test-Time Training | Yao-Chih Lee et.al. | 2208.02709 | link |
2022-07-31 | One Object at a Time: Accurate and Robust Structure From Motion for Robots | Aravind Battaje et.al. | 2208.00487 | null |
2022-07-23 | Detection and Initial Assessment of Lunar Landing Sites Using Neural Networks | Daniel Posada et.al. | 2207.11413 | null |
2022-07-25 | MeshLoc: Mesh-Based Visual Localization | Vojtech Panek et.al. | 2207.10762 | link |
2022-07-19 | ParticleSfM: Exploiting Dense Point Trajectories for Localizing Moving Cameras in the Wild | Wang Zhao et.al. | 2207.09137 | link |
2022-07-16 | Organic Priors in Non-Rigid Structure from Motion | Suryansh Kumar et.al. | 2207.06262 | null |
2022-07-06 | A Novel Hybrid Endoscopic Dataset for Evaluating Machine Learning-based Photometric Image Enhancement Models | Axel Garcia-Vega et.al. | 2207.02396 | null |
2022-06-24 | Parallel Structure from Motion for UAV Images via Weighted Connected Dominating Set | San Jiang et.al. | 2206.11499 | null |
2022-06-13 | TC-SfM: Robust Track-Community-Based Structure-from-Motion | Lei Wang et.al. | 2206.05866 | null |
2022-06-10 | EigenFairing: 3D Model Fairing using Image Coherence | Pragyana Mishra et.al. | 2206.05309 | null |
2022-06-01 | Semantic Room Wireframe Detection from a Single View | David Gillsjö et.al. | 2206.00491 | link |
2022-05-31 | Geo-Neus: Geometry-Consistent Neural Implicit Surfaces Learning for Multi-view Reconstruction | Qiancheng Fu et.al. | 2205.15848 | null |
2022-05-09 | Is my Depth Ground-Truth Good Enough? HAMMER – Highly Accurate Multi-Modal Dataset for DEnse 3D Scene Regression | HyunJun Jung et.al. | 2205.04565 | null |
2022-05-07 | Optimizing Terrain Mapping and Landing Site Detection for Autonomous UAVs | Pedro F. Proença et.al. | 2205.03522 | null |
2022-05-06 | EVIMO2: An Event Camera Dataset for Motion Segmentation, Optical Flow, Structure from Motion, and Visual Inertial Odometry in Indoor Scenes with Monocular or Stereo Algorithms | Levi Burner et.al. | 2205.03467 | null |
2022-04-20 | Learned Monocular Depth Priors in Visual-Inertial Initialization | Yunwen Zhou et.al. | 2204.09171 | null |
2022-04-10 | Deep Non-rigid Structure-from-Motion: A Sequence-to-Sequence Translation Perspective | Hui Deng et.al. | 2204.04730 | null |
2022-04-08 | Constrained Bundle Adjustment for Structure From Motion Using Uncalibrated Multi-Camera Systems | Debao Huang et.al. | 2204.04145 | null |
2022-04-07 | SurroundDepth: Entangling Surrounding Views for Self-Supervised Multi-Camera Depth Estimation | Yi Wei et.al. | 2204.03636 | link |
2022-04-06 | Georeferencing of Photovoltaic Modules from Aerial Infrared Videos using Structure-from-Motion | Lukas Bommes et.al. | 2204.02733 | link |
2022-04-05 | Depth-Guided Sparse Structure-from-Motion for Movies and TV Shows | Sheng Liu et.al. | 2204.02509 | link |
2022-03-31 | Fast, Accurate and Memory-Efficient Partial Permutation Synchronization | Shaohan Li et.al. | 2203.16505 | null |
2022-03-28 | Visual Odometry for RGB-D Cameras | Afonso Fontes et.al. | 2203.15119 | null |
2022-03-28 | Optimizing Elimination Templates by Greedy Parameter Search | Evgeniy Martyushev et.al. | 2203.14901 | link |
2022-03-23 | Event-Based Dense Reconstruction Pipeline | Kun Xiao et.al. | 2203.12270 | null |
2022-03-21 | DiffPoseNet: Direct Differentiable Camera Pose Estimation | Chethan M. Parameshwara et.al. | 2203.11174 | null |
2022-03-02 | Asynchronous Optimisation for Event-based Visual Odometry | Daqi Liu et.al. | 2203.01037 | null |
2022-03-02 | Distributed Riemannian Optimization with Lazy Communication for Collaborative Geometric Estimation | Yulun Tian et.al. | 2203.00851 | null |
2022-02-18 | MultiRes-NetVLAD: Augmenting Place Recognition Training with Low-Resolution Imagery | Ahmad Khaliq et.al. | 2202.09146 | link |
2022-01-20 | GeoFill: Reference-Based Image Inpainting of Scenes with Complex Geometry | Yunhan Zhao et.al. | 2201.08131 | null |
2022-01-13 | Scalable Cluster-Consistency Statistics for Robust Multi-Object Matching | Yunpeng Shi et.al. | 2201.04797 | link |
2022-01-10 | High-resolution Ecosystem Mapping in Repetitive Environments Using Dual Camera SLAM | Brian M. Hopkinson et.al. | 2201.03364 | link |
2022-01-06 | De-rendering 3D Objects in the Wild | Felix Wimbauer et.al. | 2201.02279 | link |
2021-12-29 | On the Instability of Relative Pose Estimation and RANSAC’s Role | Hongyi Fan et.al. | 2112.14651 | null |
2021-12-16 | Road-aware Monocular Structure from Motion and Homography Estimation | Wei Sui et.al. | 2112.08635 | null |
2021-12-10 | Critical configurations for three projective views | Martin Bråtelund et.al. | 2112.05478 | null |
2021-12-09 | Critical configurations for two projective views, a new approach | Martin Bråtelund et.al. | 2112.05074 | null |
2021-12-06 | Dense Depth Priors for Neural Radiance Fields from Sparse Input Views | Barbara Roessle et.al. | 2112.03288 | link |
2021-12-10 | MegBA: A High-Performance and Distributed Library for Large-Scale Bundle Adjustment | Jie Ren et.al. | 2112.01349 | link |
2021-11-11 | Multi-Resolution Elevation Mapping and Safe Landing Site Detection with Applications to Planetary Rotorcraft | Pascal Schoppmann et.al. | 2111.06271 | null |
2021-11-10 | Damage Estimation and Localization from Sparse Aerial Imagery | Rene Garcia Franceschini et.al. | 2111.03708 | null |
2021-11-03 | Event and Activity Recognition in Video Surveillance for Cyber-Physical Systems | Swarnabja Bhaumik et.al. | 2111.02064 | null |
2021-10-14 | Modeling dynamic target deformation in camera calibration | Annika Hagemann et.al. | 2110.07322 | null |
2021-10-13 | Hyperspectral 3D Mapping of Underwater Environments | Maxime Ferrera et.al. | 2110.06571 | null |
2021-09-24 | Automatic Map Update Using Dashcam Videos | Aziza Zhanabatyrova et.al. | 2109.12131 | null |
2021-09-16 | Rotation Averaging in a Split Second: A Primal-Dual Method and a Closed-Form for Cycle Graphs | Gabriel Moreira et.al. | 2109.08046 | link |
2021-09-06 | Single-Camera 3D Head Fitting for Mixed Reality Clinical Applications | Tejas Mane et.al. | 2109.02740 | null |
2021-09-02 | Dynamic Scene Novel View Synthesis via Deferred Spatio-temporal Consistency | Beatrix-Emőke Fülöp-Balogh et.al. | 2109.01018 | null |
2021-09-01 | On the Limits of Pseudo Ground Truth in Visual Camera Re-localisation | Eric Brachmann et.al. | 2109.00524 | link |
2021-08-31 | DensePose 3D: Lifting Canonical Surface Maps of Articulated Objects to the Third Dimension | Roman Shapovalov et.al. | 2109.00033 | null |
2021-08-29 | Solving Viewing Graph Optimization for Simultaneous Position and Rotation Registration | Seyed-Mahdi Nasiri et.al. | 2108.12876 | null |
2021-08-23 | Burst Imaging for Light-Constrained Structure-From-Motion | Ahalya Ravendran et.al. | 2108.09895 | null |
Visual Localization
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-12-19 | MegaPairs: Massive Data Synthesis For Universal Multimodal Retrieval | Junjie Zhou et.al. | 2412.14475 | null |
2024-12-18 | Adversarial Hubness in Multi-Modal Retrieval | Tingwei Zhang et.al. | 2412.14113 | link |
2024-12-18 | Maybe you are looking for CroQS: Cross-modal Query Suggestion for Text-to-Image Retrieval | Giacomo Pacini et.al. | 2412.13834 | null |
2024-12-18 | ConDo: Continual Domain Expansion for Absolute Pose Regression | Zijun Li et.al. | 2412.13452 | null |
2024-12-17 | Three Things to Know about Deep Metric Learning | Yash Patel et.al. | 2412.12432 | null |
2024-12-15 | Leveraging Large Vision-Language Model as User Intent-aware Encoder for Composed Image Retrieval | Zelong Sun et.al. | 2412.11087 | null |
2024-12-18 | Reason-before-Retrieve: One-Stage Reflective Chain-of-Thoughts for Training-Free Zero-Shot Composed Image Retrieval | Yuanmin Tang et.al. | 2412.11077 | null |
2024-12-13 | MVC-VPR: Mutual Learning of Viewpoint Classification and Visual Place Recognition | Qiwen Gu et.al. | 2412.09199 | null |
2024-12-12 | A Flexible Plug-and-Play Module for Generating Variable-Length | Liyang He et.al. | 2412.08922 | link |
2024-12-11 | Image Retrieval Methods in the Dissimilarity Space | Madhu Kiran et.al. | 2412.08618 | null |
2024-12-11 | Reloc3r: Large-Scale Training of Relative Camera Pose Regression for Generalizable, Fast, and Accurate Visual Localization | Siyan Dong et.al. | 2412.08376 | null |
2024-12-11 | Intelligent Control of Robotic X-ray Devices using a Language-promptable Digital Twin | Benjamin D. Killeen et.al. | 2412.08020 | null |
2024-12-10 | On Motion Blur and Deblurring in Visual Place Recognition | Timur Ismagilov et.al. | 2412.07751 | null |
2024-12-10 | Image Retrieval with Intra-Sweep Representation Learning for Neck Ultrasound Scanning Guidance | Wanwen Chen et.al. | 2412.07741 | null |
2024-12-09 | An Efficient Scene Coordinate Encoding and Relocalization Method | Kuan Xu et.al. | 2412.06488 | link |
2024-12-09 | A Hyperdimensional One Place Signature to Represent Them All: Stackable Descriptors For Visual Place Recognition | Connor Malone et.al. | 2412.06153 | null |
2024-12-07 | Compositional Image Retrieval via Instruction-Aware Contrastive Learning | Wenliang Zhong et.al. | 2412.05756 | null |
2024-12-06 | DAug: Diffusion-based Channel Augmentation for Radiology Image Retrieval and Classification | Ying Jin et.al. | 2412.04828 | null |
2024-12-04 | Distillation of Diffusion Features for Semantic Correspondence | Frank Fundel et.al. | 2412.03512 | null |
2024-12-04 | Composed Image Retrieval for Training-Free Domain Conversion | Nikos Efthymiadis et.al. | 2412.03297 | link |
2024-12-03 | A Minimalistic 3D Self-Organized UAV Flocking Approach for Desert Exploration | Thulio Amorim et.al. | 2412.02881 | null |
2024-12-03 | Active Learning via Classifier Impact and Greedy Selection for Interactive Image Retrieval | Leah Bar et.al. | 2412.02310 | link |
2024-12-02 | Mutli-View 3D Reconstruction using Knowledge Distillation | Aditya Dutt et.al. | 2412.02039 | link |
2024-12-02 | Optimizing Domain-Specific Image Retrieval: A Benchmark of FAISS and Annoy with Fine-Tuned Features | MD Shaikh Rahman et.al. | 2412.01555 | null |
2024-12-02 | Neuron Abandoning Attention Flow: Visual Explanation of Dynamics inside CNN Models | Yi Liao et.al. | 2412.01202 | null |
2024-12-01 | EDTformer: An Efficient Decoder Transformer for Visual Place Recognition | Tong Jin et.al. | 2412.00784 | null |
2024-11-28 | EFSA: Episodic Few-Shot Adaptation for Text-to-Image Retrieval | Muhammad Huzaifa et.al. | 2412.00139 | null |
2024-11-29 | A Visual-inertial Localization Algorithm using Opportunistic Visual Beacons and Dead-Reckoning for GNSS-Denied Large-scale Applications | Liqiang Zhang Ye Tian Dongyan Wei et.al. | 2411.19845 | null |
2024-11-27 | Optimizing Image Retrieval with an Extended b-Metric Space | Abdelkader Belhenniche et.al. | 2411.18800 | null |
2024-11-26 | Learning Visual Hierarchies with Hyperbolic Embeddings | Ziwei Wang et.al. | 2411.17490 | null |
2024-11-24 | Imagine and Seek: Improving Composed Image Retrieval with an Imagined Proxy | You Li et.al. | 2411.16752 | null |
2024-11-24 | AnySynth: Harnessing the Power of Image Synthetic Data Generation for Generalized Vision-Language Tasks | You Li et.al. | 2411.16749 | null |
2024-11-25 | Image Generation Diversity Issues and How to Tame Them | Mischa Dombrowski et.al. | 2411.16171 | link |
2024-11-24 | PG-SLAM: Photo-realistic and Geometry-aware RGB-D SLAM in Dynamic Environments | Haoang Li et.al. | 2411.15800 | null |
2024-11-22 | Cross-Modal Pre-Aligned Method with Global and Local Information for Remote-Sensing Image and Text Retrieval | Zengbao Sun et.al. | 2411.14704 | null |
2024-11-20 | Globally Correlation-Aware Hard Negative Generation | Wenjie Peng et.al. | 2411.13145 | link |
2024-11-18 | Exploring Emerging Trends and Research Opportunities in Visual Place Recognition | Antonios Gasteratos et.al. | 2411.11481 | null |
2024-11-13 | OSMLoc: Single Image-Based Visual Localization in OpenStreetMap with Geometric and Semantic Guidances | Youqi Liao et.al. | 2411.08665 | link |
2024-11-13 | Hopfield-Fenchel-Young Networks: A Unified Framework for Associative Memory Retrieval | Saul Santos et.al. | 2411.08590 | link |
2024-11-22 | Saliency Map-based Image Retrieval using Invariant Krawtchouk Moments | Ashkan Nejad et.al. | 2411.08567 | link |
2024-11-13 | MBA-SLAM: Motion Blur Aware Dense Visual SLAM with Radiance Fields Representation | Peng Wang et.al. | 2411.08279 | link |
2024-11-05 | From Pixels to Prose: Advancing Multi-Modal Language Models for Remote Sensing | Xintian Sun et.al. | 2411.05826 | null |
2024-11-04 | TripletCLIP: Improving Compositional Reasoning of CLIP via Synthetic Vision-Language Negatives | Maitreya Patel et.al. | 2411.02545 | null |
2024-11-11 | INQUIRE: A Natural World Text-to-Image Retrieval Benchmark | Edward Vendrow et.al. | 2411.02537 | link |
2024-11-20 | Exploiting Contextual Uncertainty of Visual Data for Efficient Training of Deep Models | Sharat Agarwal et.al. | 2411.01925 | null |
2024-11-04 | Semantic Masking and Visual Feature Matching for Robust Localization | Luisa Mao et.al. | 2411.01804 | null |
2024-11-03 | Efficient Medical Image Retrieval Using DenseNet and FAISS for BIRADS Classification | MD Shaikh Rahman et.al. | 2411.01473 | null |
2024-11-01 | Identifying Implicit Social Biases in Vision-Language Models | Kimia Hamidieh et.al. | 2411.00997 | null |
2024-10-31 | Nearest Neighbor Normalization Improves Multimodal Retrieval | Neil Chowdhury et.al. | 2410.24114 | link |
2024-10-31 | MoTaDual: Modality-Task Dual Alignment for Enhanced Zero-shot Composed Image Retrieval | Haiwen Li et.al. | 2410.23736 | null |
2024-10-30 | Decoupling Semantic Similarity from Spatial Alignment for Neural Networks | Tassilo Wald et.al. | 2410.23107 | link |
2024-10-29 | Beyond Text: Optimizing RAG with Multimodal Inputs for Industrial Applications | Monica Riedler et.al. | 2410.21943 | link |
2024-10-28 | NYC-Event-VPR: A Large-Scale High-Resolution Event-Based Visual Place Recognition Dataset in Dense Urban Environments | Taiyi Pan et.al. | 2410.21615 | link |
2024-10-25 | Context-Based Visual-Language Place Recognition | Soojin Woo et.al. | 2410.19341 | link |
2024-10-24 | ChatSearch: a Dataset and a Generative Retrieval Model for General Conversational Image Retrieval | Zijia Zhao et.al. | 2410.18715 | link |
2024-10-25 | On Model-Free Re-ranking for Visual Place Recognition with Deep Learned Local Features | Tomáš Pivoňka et.al. | 2410.18573 | null |
2024-10-22 | Denoise-I2W: Mapping Images to Denoising Words for Accurate Zero-Shot Composed Image Retrieval | Yuanmin Tang et.al. | 2410.17393 | null |
2024-10-20 | GSSF: Generalized Structural Sparse Function for Deep Cross-modal Metric Learning | Haiwen Diao et.al. | 2410.15266 | link |
2024-10-19 | Visual Navigation of Digital Libraries: Retrieval and Classification of Images in the National Library of Norway’s Digitised Book Collection | Marie Roald et.al. | 2410.14969 | link |
2024-10-16 | Development of Image Collection Method Using YOLO and Siamese Network | Chan Young Shin et.al. | 2410.12561 | null |
2024-10-16 | LoD-Loc: Aerial Visual Localization using LoD 3D Map with Neural Wireframe Alignment | Juelin Zhu et.al. | 2410.12269 | link |
2024-10-16 | Leveraging Spatial Attention and Edge Context for Optimized Feature Selection in Visual Localization | Nanda Febri Istighfarin et.al. | 2410.12240 | null |
2024-10-15 | LoGS: Visual Localization via Gaussian Splatting with Fewer Training Images | Yuzhou Cheng et.al. | 2410.11505 | null |
2024-10-15 | Multiview Scene Graph | Juexiao Zhang et.al. | 2410.11187 | link |
2024-10-12 | Leveraging Semantic Cues from Foundation Vision Models for Enhanced Local Feature Correspondence | Felipe Cadar et.al. | 2410.09533 | link |
2024-10-11 | Voxel-SLAM: A Complete, Accurate, and Versatile LiDAR-Inertial SLAM System | Zheng Liu et.al. | 2410.08935 | link |
2024-10-16 | Semantic Token Reweighting for Interpretable and Controllable Text Embeddings in CLIP | Eunji Kim et.al. | 2410.08469 | null |
2024-10-11 | A Unified Deep Semantic Expansion Framework for Domain-Generalized Person Re-identification | Eugene P. W. Ang et.al. | 2410.08456 | null |
2024-10-10 | A Unified Debiasing Approach for Vision-Language Models across Modalities and Tasks | Hoin Jung et.al. | 2410.07593 | link |
2024-10-09 | Exploiting Distribution Constraints for Scalable and Efficient Image Retrieval | Mohammad Omama et.al. | 2410.07022 | null |
2024-10-09 | Pair-VPR: Place-Aware Pre-training and Contrastive Pair Classification for Visual Place Recognition with Vision Transformers | Stephen Hausler et.al. | 2410.06614 | null |
2024-10-09 | MedImageInsight: An Open-Source Embedding Model for General Domain Medical Imaging | Noel C. F. Codella et.al. | 2410.06542 | null |
2024-10-08 | Temporal Image Caption Retrieval Competition – Description and Results | Jakub Pokrywka et.al. | 2410.06314 | null |
2024-10-08 | Monocular Visual Place Recognition in LiDAR Maps via Cross-Modal State Space Model and Multi-View Matching | Gongxin Yao et.al. | 2410.06285 | null |
2024-10-08 | GSLoc: Visual Localization with 3D Gaussian Splatting | Kazii Botashev et.al. | 2410.06165 | null |
2024-10-08 | Beyond Captioning: Task-Specific Prompting for Improved VLM Performance in Mathematical Reasoning | Ayush Singh et.al. | 2410.05928 | null |
2024-10-08 | RNR-Nav: A Real-World Visual Navigation System Using Renderable Neural Radiance Maps | Minsoo Kim et.al. | 2410.05621 | null |
2024-10-11 | LoTLIP: Improving Language-Image Pre-training for Long Text Understanding | Wei Wu et.al. | 2410.05249 | null |
2024-10-06 | LiteVLoc: Map-Lite Visual Localization for Image Goal Navigation | Jianhao Jiao et.al. | 2410.04419 | null |
2024-10-02 | Boosting Weakly-Supervised Referring Image Segmentation via Progressive Comprehension | Zaiquan Yang et.al. | 2410.01544 | null |
2024-10-03 | EUFCC-CIR: a Composed Image Retrieval Dataset for GLAM Collections | Francesc Net et.al. | 2410.01536 | link |
2024-10-04 | CSIM: A Copula-based similarity index sensitive to local changes for Image quality assessment | Safouane El Ghazouali et.al. | 2410.01411 | link |
2024-09-30 | Class-Agnostic Visio-Temporal Scene Sketch Semantic Segmentation | Aleyna Kütük et.al. | 2410.00266 | null |
2024-09-29 | CELLmap: Enhancing LiDAR SLAM through Elastic and Lightweight Spherical Map Representation | Yifan Duan et.al. | 2409.19597 | null |
2024-09-28 | VLAD-BuFF: Burst-aware Fast Feature Aggregation for Visual Place Recognition | Ahmad Khaliq et.al. | 2409.19293 | link |
2024-09-27 | MASt3R-SfM: a Fully-Integrated Solution for Unconstrained Structure-from-Motion | Bardienus Duisterhof et.al. | 2409.19152 | null |
2024-09-26 | Search and Detect: Training-Free Long Tail Object Detection via Web-Image Retrieval | Mankeerat Sidhu et.al. | 2409.18733 | null |
2024-09-26 | Revisit Anything: Visual Place Recognition via Image Segment Retrieval | Kartik Garg et.al. | 2409.18049 | link |
2024-09-24 | GSplatLoc: Grounding Keypoint Descriptors into 3D Gaussian Splatting for Improved Visual Localization | Gennady Sidorov et.al. | 2409.16502 | link |
2024-09-23 | CamLoPA: A Hidden Wireless Camera Localization Framework via Signal Propagation Path Analysis | Xiang Zhang et.al. | 2409.15169 | null |
2024-09-21 | Combining Absolute and Semi-Generalized Relative Poses for Visual Localization | Vojtech Panek et.al. | 2409.14269 | null |
2024-09-21 | SplatLoc: 3D Gaussian Splatting-based Visual Localization for Augmented Reality | Hongjia Zhai et.al. | 2409.14067 | null |
2024-09-20 | Efficient and Discriminative Image Feature Extraction for Universal Image Retrieval | Morris Florek et.al. | 2409.13513 | link |
2024-09-18 | Towards Global Localization using Multi-Modal Object-Instance Re-Identification | Aneesh Chavan et.al. | 2409.12002 | link |
2024-09-17 | Open-Set Semantic Uncertainty Aware Metric-Semantic Graph Matching | Kurran Singh et.al. | 2409.11555 | null |
2024-09-17 | Obfuscation Based Privacy Preserving Representations are Recoverable Using Neighborhood Information | Kunal Chelani et.al. | 2409.11536 | null |
2024-09-17 | Improving the Efficiency of Visually Augmented Language Models | Paula Ontalvilla et.al. | 2409.11148 | link |
2024-09-21 | HGSLoc: 3DGS-based Heuristic Camera Pose Refinement | Zhongyan Niu et.al. | 2409.10925 | null |
2024-09-16 | SOLVR: Submap Oriented LiDAR-Visual Re-Localisation | Joshua Knights et.al. | 2409.10247 | null |
2024-09-16 | Garment Attribute Manipulation with Multi-level Attention | Vittorio Casula et.al. | 2409.10206 | null |
2024-09-14 | Evaluating Pre-trained Convolutional Neural Networks and Foundation Models as Feature Extractors for Content-based Medical Image Retrieval | Amirreza Mahbod et.al. | 2409.09430 | link |
2024-09-12 | Structured Pruning for Efficient Visual Place Recognition | Oliver Grainge et.al. | 2409.07834 | null |
2024-09-10 | GeoCalib: Learning Single-image Calibration with Geometric Optimization | Alexander Veicht et.al. | 2409.06704 | link |
2024-09-10 | Weakly-supervised Camera Localization by Ground-to-satellite Image Registration | Yujiao Shi et.al. | 2409.06471 | link |
2024-09-10 | A Cross-Font Image Retrieval Network for Recognizing Undeciphered Oracle Bone Inscriptions | Zhicong Wu et.al. | 2409.06381 | null |
2024-09-09 | Referring Expression Generation in Visually Grounded Dialogue with Discourse-aware Comprehension Guiding | Bram Willemsen et.al. | 2409.05721 | link |
2024-09-09 | Open-World Dynamic Prompt and Continual Visual Representation Learning | Youngeun Kim et.al. | 2409.05312 | null |
2024-09-12 | Training-free ZS-CIR via Weighted Modality Fusion and Similarity | Ren-Di Wu et.al. | 2409.04918 | link |
2024-09-12 | Zero-Shot Whole Slide Image Retrieval in Histopathology Using Embeddings of Foundation Models | Saghir Alfasly et.al. | 2409.04631 | null |
2024-09-06 | Reprojection Errors as Prompts for Efficient Scene Coordinate Regression | Ting-Ru Liu et.al. | 2409.04178 | null |
2024-09-06 | Matched Filtering based LiDAR Place Recognition for Urban and Natural Environments | Therese Joseph et.al. | 2409.03998 | null |
2024-09-04 | Design and Evaluation of Camera-Centric Mobile Crowdsourcing Applications | Abby Stylianou et.al. | 2409.03012 | null |
2024-09-04 | NUDGE: Lightweight Non-Parametric Fine-Tuning of Embeddings for Retrieval | Sepanta Zeighami et.al. | 2409.02343 | link |
2024-09-03 | Optimizing CLIP Models for Image Retrieval with Maintained Joint-Embedding Alignment | Konstantin Schall et.al. | 2409.01936 | link |
2024-09-02 | A Review of Image Retrieval Techniques: Data Augmentation and Adversarial Learning Approaches | Kim Jinwoo et.al. | 2409.01219 | null |
2024-09-02 | Online One-Dimensional Magnetic Field SLAM with Loop-Closure Detection | Manon Kok et.al. | 2409.01091 | null |
2024-09-02 | Evidential Transformers for Improved Image Retrieval | Danilo Dordevic et.al. | 2409.01082 | null |
2024-09-05 | EgoHDM: An Online Egocentric-Inertial Human Motion Capture, Localization, and Dense Mapping System | Bonan Liu et.al. | 2409.00343 | null |
2024-09-04 | Augmented Reality without Borders: Achieving Precise Localization Without Maps | Albert Gassol Puigjaner et.al. | 2408.17373 | null |
2024-09-02 | RISSOLE: Parameter-efficient Diffusion Models via Block-wise Generation and Retrieval-Guidance | Avideep Mukherjee et.al. | 2408.17095 | null |
2024-08-29 | A compact neuromorphic system for ultra energy-efficient, on-device robot localization | Adam D. Hines et.al. | 2408.16754 | link |
2024-08-29 | Rethinking Sparse Lexical Representations for Image Retrieval in the Age of Rising Multi-Modal Large Language Models | Kengo Nakata et.al. | 2408.16296 | null |
2024-08-28 | Temporal Attention for Cross-View Sequential Image Localization | Dong Yuan et.al. | 2408.15569 | link |
2024-08-27 | Snap and Diagnose: An Advanced Multimodal Retrieval System for Identifying Plant Diseases in the Wild | Tianqi Wei et.al. | 2408.14723 | null |
2024-08-25 | LowCLIP: Adapting the CLIP Model Architecture for Low-Resource Languages in Multimodal Image Retrieval Task | Ali Asgarov et.al. | 2408.13909 | link |
2024-08-15 | Cross-Modal Denoising: A Novel Training Paradigm for Enhancing Speech-Image Retrieval | Lifeng Zhou et.al. | 2408.13705 | null |
2024-08-15 | Coarse-to-fine Alignment Makes Better Speech-image Retrieval | Lifeng Zhou et.al. | 2408.13119 | null |
2024-08-21 | FUSELOC: Fusing Global and Local Descriptors to Disambiguate 2D-3D Matching in Visual Localization | Son Tung Nguyen et.al. | 2408.12037 | link |
2024-08-21 | Visual Localization in 3D Maps: Comparing Point Cloud, Mesh, and NeRF Representations | Lintong Zhang et.al. | 2408.11966 | null |
2024-08-21 | UniFashion: A Unified Vision-Language Model for Multimodal Fashion Retrieval and Generation | Xiangyu Zhao et.al. | 2408.11305 | link |
2024-08-20 | GSLoc: Efficient Camera Pose Refinement via 3D Gaussian Splatting | Changkun Liu et.al. | 2408.11085 | null |
2024-08-19 | BrewCLIP: A Bifurcated Representation Learning Framework for Audio-Visual Retrieval | Zhenyu Lu et.al. | 2408.10383 | null |
2024-08-23 | Fashion Image-to-Image Translation for Complementary Item Retrieval | Matteo Attimonelli et.al. | 2408.09847 | link |
2024-08-20 | MambaLoc: Efficient Camera Localisation via State Space Model | Jialu Wang et.al. | 2408.09680 | null |
2024-08-15 | DM2RM: Dual-Mode Multimodal Ranking for Target Objects and Receptacles Based on Open-Vocabulary Instructions | Ryosuke Korekata et.al. | 2408.07910 | null |
2024-08-13 | A Miniature Vision-Based Localization System for Indoor Blimps | Shicong Ma et.al. | 2408.06648 | null |
2024-08-10 | Cross-view image geo-localization with Panorama-BEV Co-Retrieval Network | Junyan Ye et.al. | 2408.05475 | link |
2024-08-09 | Spherical World-Locking for Audio-Visual Localization in Egocentric Videos | Heeseung Yun et.al. | 2408.05364 | null |
2024-08-06 | AMES: Asymmetric and Memory-Efficient Similarity Estimation for Instance-level Retrieval | Pavel Suma et.al. | 2408.03282 | link |
2024-08-05 | CMR-Agent: Learning a Cross-Modal Agent for Iterative Image-to-Point Cloud Registration | Gongxin Yao et.al. | 2408.02394 | null |
2024-08-09 | BEVPlace++: Fast, Robust, and Lightweight LiDAR Global Localization for Unmanned Ground Vehicles | Lun Luo et.al. | 2408.01841 | link |
2024-08-02 | On Validation of Search & Retrieval of Tissue Images in Digital Pathology | H. R. Tizhoosh et.al. | 2408.01570 | null |
2024-07-31 | VIPeR: Visual Incremental Place Recognition with Adaptive Mining and Lifelong Learning | Yuhang Ming et.al. | 2407.21416 | null |
2024-07-31 | SuperVINS: A visual-inertial SLAM framework integrated deep learning features | Hongkun Luo et.al. | 2407.21348 | link |
2024-07-30 | Re-localization acceleration with Medoid Silhouette Clustering | Hongyi Zhang et.al. | 2407.20749 | null |
2024-07-29 | A flexible framework for accurate LiDAR odometry, map manipulation, and localization | José Luis Blanco-Claraco et.al. | 2407.20465 | link |
2024-07-26 | From 2D to 3D: AISG-SLA Visual Localization Challenge | Jialin Gao et.al. | 2407.18590 | null |
2024-07-24 | Revolutionizing Text-to-Image Retrieval as Autoregressive Token-to-Voken Generation | Yongqi Li et.al. | 2407.17274 | null |
2024-07-24 | Active Loop Closure for OSM-guided Robotic Mapping in Large-Scale Urban Environments | Wei Gao et.al. | 2407.17078 | null |
2024-07-24 | Pose Estimation from Camera Images for Underwater Inspection | Luyuan Peng et.al. | 2407.16961 | null |
2024-07-22 | Memory Management for Real-Time Appearance-Based Loop Closure Detection | Mathieu Labbé et.al. | 2407.15890 | null |
2024-07-22 | RADA: Robust and Accurate Feature Learning with Domain Adaptation | Jingtai He et.al. | 2407.15791 | null |
2024-07-22 | Online Global Loop Closure Detection for Large-Scale Multi-Session Graph-Based SLAM | Mathieu Labbe et.al. | 2407.15305 | null |
2024-07-22 | Appearance-Based Loop Closure Detection for Online Large-Scale and Long-Term Operation | Mathieu Labbé et.al. | 2407.15304 | null |
2024-07-19 | Double-Layer Soft Data Fusion for Indoor Robot WiFi-Visual Localization | Yuehua Ding et.al. | 2407.14643 | null |
2024-07-18 | Visual Haystacks: Answering Harder Questions About Sets of Images | Tsung-Han Wu et.al. | 2407.13766 | link |
2024-07-17 | Towards Revisiting Visual Place Recognition for Joining Submaps in Multimap SLAM | Markus Weißflog et.al. | 2407.12408 | null |
2024-07-17 | GV-Bench: Benchmarking Local Feature Matching for Geometric Verification of Long-term Loop Closure Detection | Jingwen Yu et.al. | 2407.11736 | link |
2024-07-16 | EndoFinder: Online Image Retrieval for Explainable Colorectal Polyp Diagnosis | Ruijie Yang et.al. | 2407.11401 | null |
2024-07-15 | No Train, all Gain: Self-Supervised Gradients Improve Deep Frozen Representations | Walter Simoncini et.al. | 2407.10964 | link |
2024-07-15 | DINO Pre-training for Vision-based End-to-end Autonomous Driving | Shubham Juneja et.al. | 2407.10803 | null |
2024-07-15 | Addressing Image Hallucination in Text-to-Image Generation through Factual Image Retrieval | Youngsun Lim et.al. | 2407.10683 | null |
2024-07-15 | An evaluation of CNN models and data augmentation techniques in hierarchical localization of mobile robots | J. J. Cabrera et.al. | 2407.10596 | link |
2024-07-15 | An experimental evaluation of Siamese Neural Networks for robot localization using omnidirectional imaging in indoor environments | J. J. Cabrera et.al. | 2407.10536 | null |
2024-07-12 | Are They the Same Picture? Adapting Concept Bottleneck Models for Human-AI Collaboration in Image Retrieval | Vaibhav Balloli et.al. | 2407.08908 | link |
2024-07-11 | Improving Visual Place Recognition Based Robot Navigation Through Verification of Localization Estimates | Owen Claxton et.al. | 2407.08162 | link |
2024-07-12 | Lifelong Histopathology Whole Slide Image Retrieval via Distance Consistency Rehearsal | Xinyu Zhu et.al. | 2407.08153 | link |
2024-07-11 | SGLC: Semantic Graph-Guided Coarse-Fine-Refine Full Loop Closing for LiDAR SLAM | Neng Wang et.al. | 2407.08106 | link |
2024-07-09 | LVLM-empowered Multi-modal Representation Learning for Visual Place Recognition | Teng Wang et.al. | 2407.06730 | null |
2024-07-09 | CEIA: CLIP-Based Event-Image Alignment for Open-World Event-Based Understanding | Wenhao Xu et.al. | 2407.06611 | null |
2024-07-08 | Pseudo-triplet Guided Few-shot Composed Image Retrieval | Bohan Hou et.al. | 2407.06001 | null |
2024-07-09 | HyCIR: Boosting Zero-Shot Composed Image Retrieval with Synthetic Labels | Yingying Jiang et.al. | 2407.05795 | null |
2024-07-05 | Elevating All Zero-Shot Sketch-Based Image Retrieval Through Multimodal Prompt Learning | Mainak Singha et.al. | 2407.04207 | link |
2024-07-04 | Visualizing Dialogues: Enhancing Image Selection through Dialogue Understanding with Large Language Models | Chang-Sheng Kao et.al. | 2407.03615 | link |
2024-07-03 | Celeb-FBI: A Benchmark Dataset on Human Full Body Images and Age, Gender, Height and Weight Estimation using Deep Learning Approach | Pronay Debnath et.al. | 2407.03486 | null |
2024-07-02 | Close, But Not There: Boosting Geographic Distance Sensitivity in Visual Place Recognition | Sergio Izquierdo et.al. | 2407.02422 | link |
2024-07-01 | Freeview Sketching: View-Aware Fine-Grained Sketch-Based Image Retrieval | Aneeshan Sain et.al. | 2407.01810 | null |
2024-07-01 | Cross-Modal Attention Alignment Network with Auxiliary Text Description for zero-shot sketch-based image retrieval | Hanwen Su et.al. | 2407.00979 | null |
2024-07-01 | Dynamically Modulating Visual Place Recognition Sequence Length For Minimum Acceptable Performance Scenarios | Connor Malone et.al. | 2407.00863 | null |
2024-06-27 | PathAlign: A vision-language model for whole slide images in histopathology | Faruk Ahmed et.al. | 2406.19578 | null |
2024-07-05 | 360 in the Wild: Dataset for Depth Prediction and View Synthesis | Kibaek Park et.al. | 2406.18898 | null |
2024-06-27 | Zero-shot Composed Image Retrieval Considering Query-target Relationship Leveraging Masked Image-text Pairs | Huaying Zhang et.al. | 2406.18836 | null |
2024-06-26 | WV-Net: A foundation model for SAR WV-mode satellite imagery trained using contrastive self-supervised learning on 10 million images | Yannik Glaser et.al. | 2406.18765 | null |
2024-06-26 | View-Invariant Pixelwise Anomaly Detection in Multi-object Scenes with Adaptive View Synthesis | Subin Varghese et.al. | 2406.18012 | null |
2024-06-25 | Tell Me Where You Are: Multimodal LLMs Meet Place Recognition | Zonglin Lyu et.al. | 2406.17520 | null |
2024-06-25 | SlideSLAM: Sparse, Lightweight, Decentralized Metric-Semantic SLAM for Multi-Robot Navigation | Xu Liu et.al. | 2406.17249 | link |
2024-06-23 | Breaking the Frame: Image Retrieval by Visual Overlap Prediction | Tong Wei et.al. | 2406.16204 | link |
2024-06-19 | Towards a multimodal framework for remote sensing image change retrieval and captioning | Roger Ferrod et.al. | 2406.13424 | link |
2024-06-19 | CLIP-Branches: Interactive Fine-Tuning for Text-Image Retrieval | Christian Lülf et.al. | 2406.13322 | link |
2024-06-17 | Matching Query Image Against Selected NeRF Feature for Efficient and Scalable Localization | Huaiji Zhou et.al. | 2406.11766 | null |
2024-06-22 | Simple Yet Efficient: Towards Self-Supervised FG-SBIR with Unified Sample Feature Alignment | Jianan Jiang et.al. | 2406.11551 | link |
2024-06-17 | They’re All Doctors: Synthesizing Diverse Counterfactuals to Mitigate Associative Bias | Salma Abdel Magid et.al. | 2406.11331 | null |
2024-06-17 | Accurate and Fast Pixel Retrieval with Spatial and Uncertainty Aware Hypergraph Diffusion | Guoyuan An et.al. | 2406.11242 | null |
2024-06-14 | Annotation Cost-Efficient Active Learning for Deep Metric Learning Driven Remote Sensing Image Retrieval | Genc Hoxha et.al. | 2406.10107 | null |
2024-06-14 | BiVLC: Extending Vision-Language Compositionality Evaluation with Text-to-Image Retrieval | Imanol Miranda et.al. | 2406.09952 | link |
2024-06-13 | Common and Rare Fundus Diseases Identification Using Vision-Language Foundation Model with Knowledge of Over 400 Diseases | Meng Wang et.al. | 2406.09317 | link |
2024-06-13 | Reducing Task Discrepancy of Text Encoders for Zero-Shot Composed Image Retrieval | Jaeseok Byun et.al. | 2406.09188 | null |
2024-06-13 | DenoiseReID: Denoising Model for Representation Learning of Person Re-Identification | Zhengrui Xu et.al. | 2406.08773 | null |
2024-06-12 | Self-supervised Learning of Neural Implicit Feature Fields for Camera Pose Refinement | Maxime Pietrantoni et.al. | 2406.08463 | null |
2024-06-12 | ConceptHash: Interpretable Fine-Grained Hashing via Concept Discovery | Kam Woh Ng et.al. | 2406.08457 | link |
2024-06-11 | Image Textualization: An Automatic Framework for Creating Accurate and Detailed Image Descriptions | Renjie Pi et.al. | 2406.07502 | link |
2024-06-11 | Benchmarking Vision-Language Contrastive Methods for Medical Representation Learning | Shuvendu Roy et.al. | 2406.07450 | link |
2024-06-11 | Fetch-A-Set: A Large-Scale OCR-Free Benchmark for Historical Document Retrieval | Adrià Molina et.al. | 2406.07315 | null |
2024-06-10 | Multicam-SLAM: Non-overlapping Multi-camera SLAM for Indirect Visual Localization and Navigation | Shenghao Li et.al. | 2406.06374 | link |
2024-06-09 | Unified Text-to-Image Generation and Retrieval | Leigang Qu et.al. | 2406.05814 | null |
2024-06-07 | The Unmet Promise of Synthetic Training Images: Using Retrieved Real Images Performs Better | Scott Geng et.al. | 2406.05184 | link |
2024-06-07 | PQPP: A Joint Benchmark for Text-to-Image Prompt and Query Performance Prediction | Eduard Poesina et.al. | 2406.04746 | link |
2024-06-06 | GLACE: Global Local Accelerated Coordinate Encoding | Fangjinhua Wang et.al. | 2406.04340 | link |
2024-06-06 | Monocular Localization with Semantics Map for Autonomous Vehicles | Jixiang Wan et.al. | 2406.03835 | null |
2024-06-05 | Interactive Text-to-Image Retrieval with Large Language Models: A Plug-and-Play Approach | Saehyung Lee et.al. | 2406.03411 | link |
2024-06-04 | MeshVPR: Citywide Visual Place Recognition Using 3D Meshes | Gabriele Berton et.al. | 2406.02776 | null |
2024-06-04 | Can CLIP help CLIP in learning 3D? | Cristian Sbrolli et.al. | 2406.02202 | null |
2024-06-03 | Decomposing and Interpreting Image Representations via Text in ViTs Beyond CLIP | Sriram Balasubramanian et.al. | 2406.01583 | link |
2024-06-03 | Scale-Free Image Keypoints Using Differentiable Persistent Homology | Giovanni Barbarani et.al. | 2406.01315 | link |
2024-06-02 | Visual place recognition for aerial imagery: A survey | Ivan Moskalenko et.al. | 2406.00885 | link |
2024-06-01 | NuRF: Nudging the Particle Filter in Radiance Fields for Robot Visual Localization | Wugang Meng et.al. | 2406.00312 | null |
2024-05-31 | DeCo: Decoupling Token Compression from Semantic Abstraction in Multimodal Large Language Models | Linli Yao et.al. | 2405.20985 | link |
2024-05-29 | Multi-Modal Generative Embedding Model | Feipeng Ma et.al. | 2405.19333 | null |
2024-05-29 | ContextBLIP: Doubly Contextual Alignment for Contrastive Image Retrieval from Linguistically Complex Descriptions | Honglin Lin et.al. | 2405.19226 | null |
2024-05-30 | CaLa: Complementary Association Learning for Augmenting Composed Image Retrieval | Xintong Jiang et.al. | 2405.19149 | link |
2024-05-29 | SketchTriplet: Self-Supervised Scenarized Sketch-Text-Image Triplet Generation | Zhenbei Wu et.al. | 2405.18801 | null |
2024-05-29 | Reverse Image Retrieval Cues Parametric Memory in Multimodal LLMs | Jialiang Xu et.al. | 2405.18740 | link |
2024-05-28 | EffoVPR: Effective Foundation Model Utilization for Visual Place Recognition | Issar Tzachor et.al. | 2405.18065 | null |
2024-05-28 | AdapNet: Adaptive Noise-Based Network for Low-Quality Image Retrieval | Sihe Zhang et.al. | 2405.17718 | null |
2024-05-26 | MCGMapper: Light-Weight Incremental Structure from Motion and Visual Localization With Planar Markers and Camera Groups | Yusen Xie et.al. | 2405.16599 | null |
2024-05-29 | Composed Image Retrieval for Remote Sensing | Bill Psomas et.al. | 2405.15587 | link |
2024-05-24 | Self-distilled Dynamic Fusion Network for Language-based Fashion Retrieval | Yiming Wu et.al. | 2405.15451 | null |
2024-05-20 | UAV-VisLoc: A Large-scale Dataset for UAV Visual Localization | Wenjia Xu et.al. | 2405.11936 | link |
2024-05-19 | Register assisted aggregation for Visual Place Recognition | Xuan Yu et.al. | 2405.11526 | null |
2024-05-26 | CCTNet: A Circular Convolutional Transformer Network for LiDAR-based Place Recognition Handling Movable Objects Occlusion | Gang Wang et.al. | 2405.10793 | null |
2024-05-16 | FFF: Fixing Flawed Foundations in contrastive pre-training results in very strong Vision-Language models | Adrian Bulat et.al. | 2405.10286 | null |
2024-05-15 | Content-Based Image Retrieval for Multi-Class Volumetric Radiology Images: A Benchmark Study | Farnaz Khun Jush et.al. | 2405.09334 | null |
2024-05-14 | BEVRender: Vision-based Cross-view Vehicle Registration in Off-road GNSS-denied Environment | Lihong Jin et.al. | 2405.09001 | null |
2024-05-14 | TP3M: Transformer-based Pseudo 3D Image Matching with Reference | Liming Han et.al. | 2405.08434 | null |
2024-05-13 | OverlapMamba: Novel Shift State Space Model for LiDAR-based Place Recognition | Qiuchi Xiang et.al. | 2405.07966 | link |
2024-05-14 | HybridHash: Hybrid Convolutional and Self-Attention Deep Hashing for Image Retrieval | Chao He et.al. | 2405.07524 | link |
2024-05-13 | JointLoc: A Real-time Visual Localization Framework for Planetary UAVs Based on Joint Relative and Absolute Pose Estimation | Xubo Luo et.al. | 2405.07429 | link |
2024-05-12 | BoQ: A Place is Worth a Bag of Learnable Queries | Amar Ali-bey et.al. | 2405.07364 | link |
2024-05-07 | Breast Histopathology Image Retrieval by Attention-based Adversarially Regularized Variational Graph Autoencoder with Contrastive Learning-Based Feature Extraction | Nematollah Saeidi et.al. | 2405.04211 | null |
2024-05-06 | A New Robust Partial $p$ -Wasserstein-Based Metric for Comparing Distributions | Sharath Raghvendra et.al. | 2405.03664 | null |
2024-05-06 | Knowledge-aware Text-Image Retrieval for Remote Sensing Images | Li Mi et.al. | 2405.03373 | null |
2024-05-06 | Adapting Dual-encoder Vision-language Models for Paraphrased Retrieval | Jiacheng Cheng et.al. | 2405.03190 | null |
2024-05-05 | iSEARLE: Improving Textual Inversion for Zero-Shot Composed Image Retrieval | Lorenzo Agnolucci et.al. | 2405.02951 | link |
2024-05-01 | Spherical Linear Interpolation and Text-Anchoring for Zero-shot Composed Image Retrieval | Young Kyun Jang et.al. | 2405.00571 | null |
2024-04-30 | Large Language Model Informed Patent Image Retrieval | Hao-Cheng Lo et.al. | 2404.19360 | null |
2024-04-30 | XFeat: Accelerated Features for Lightweight Image Matching | Guilherme Potje et.al. | 2404.19174 | null |
2024-04-29 | Enhancing Interactive Image Retrieval With Query Rewriting Using Large Language Models and Vision Language Models | Hongyi Zhu et.al. | 2404.18746 | null |
2024-04-29 | Dual-Modal Prompting for Sketch-Based Image Retrieval | Liying Gao et.al. | 2404.18695 | null |
2024-05-01 | Semantic Line Combination Detector | Jinwon Ko et.al. | 2404.18399 | link |
2024-04-26 | Learning text-to-video retrieval from image captioning | Lucas Ventura et.al. | 2404.17498 | null |
2024-04-25 | CriSp: Leveraging Tread Depth Maps for Enhanced Crime-Scene Shoeprint Matching | Samia Shafique et.al. | 2404.16972 | link |
2024-04-29 | Revisiting Relevance Feedback for CLIP-based Interactive Image Retrieval | Ryoya Nara et.al. | 2404.16398 | null |
2024-04-24 | Simple but Effective Raw-Data Level Multimodal Fusion for Composed Image Retrieval | Haokun Wen et.al. | 2404.15875 | link |
2024-04-24 | DVF: Advancing Robust and Accurate Fine-Grained Image Retrieval with Retrieval Guidelines | Xin Jiang et.al. | 2404.15771 | null |
2024-04-23 | Visual Delta Generator with Large Multi-modal Models for Semi-supervised Composed Image Retrieval | Young Kyun Jang et.al. | 2404.15516 | null |
2024-04-22 | EcoPull: Sustainable IoT Image Retrieval Empowered by TinyML Models | Mathias Thorsager et.al. | 2404.14236 | null |
2024-04-22 | Hierarchical localization with panoramic views and triplet loss functions | Marcos Alfaro et.al. | 2404.14117 | link |
2024-04-20 | High-fidelity Endoscopic Image Synthesis by Utilizing Depth-guided Neural Surfaces | Baoru Huang et.al. | 2404.13437 | null |
2024-04-20 | Collaborative Visual Place Recognition through Federated Learning | Mattia Dutto et.al. | 2404.13324 | null |
2024-04-18 | SPOT: Point Cloud Based Stereo Visual Place Recognition for Similar and Opposing Viewpoints | Spencer Carmichael et.al. | 2404.12339 | null |
2024-04-17 | Improving Composed Image Retrieval via Contrastive Learning with Scaling Positives and Negatives | Zhangchi Feng et.al. | 2404.11317 | link |
2024-04-17 | Spatial-Aware Image Retrieval: A Hyperdimensional Computing Approach for Efficient Similarity Hashing | Sanggeon Yun et.al. | 2404.11025 | null |
2024-04-16 | SPVLoc: Semantic Panoramic Viewport Matching for 6D Camera Localization in Unseen Environments | Niklas Gard et.al. | 2404.10527 | link |
2024-04-20 | CREST: Cross-modal Resonance through Evidential Deep Learning for Enhanced Zero-Shot Learning | Haojian Huang et.al. | 2404.09640 | link |
2024-04-11 | PRAM: Place Recognition Anywhere Model for Efficient Visual Localization | Fei Xue et.al. | 2404.07785 | null |
2024-04-16 | 2DLIW-SLAM:2D LiDAR-Inertial-Wheel Odometry with Real-Time Loop Closure | Bin Zhang et.al. | 2404.07644 | link |
2024-04-11 | Semantically-correlated memories in a dense associative model | Thomas F Burns et.al. | 2404.07123 | link |
2024-04-09 | Training-Free Open-Vocabulary Segmentation with Offline Diffusion-Augmented Prototype Generation | Luca Barsellotti et.al. | 2404.06542 | null |
2024-04-09 | Learning Embeddings with Centroid Triplet Loss for Object Identification in Robotic Grasping | Anas Gouda et.al. | 2404.06277 | link |
2024-04-07 | Weakly Supervised Deep Hyperspherical Quantization for Image Retrieval | Jinpeng Wang et.al. | 2404.04998 | link |
2024-04-06 | Soft-Prompting with Graph-of-Thought for Multi-modal Representation Learning | Juncheng Yang et.al. | 2404.04538 | link |
2024-04-05 | Towards introspective loop closure in 4D radar SLAM | Maximilian Hilger et.al. | 2404.03940 | null |
2024-04-02 | TSCM: A Teacher-Student Model for Vision Place Recognition Using Cross-Metric Knowledge Distillation | Yehui Shen et.al. | 2404.01587 | link |
2024-04-01 | On Train-Test Class Overlap and Detection for Image Retrieval | Chull Hwan Song et.al. | 2404.01524 | link |
2024-04-01 | NVINS: Robust Visual Inertial Navigation Fused with NeRF-augmented Camera Pose Regressor and Uncertainty Quantification | Juyeop Han et.al. | 2404.01400 | null |
2024-03-31 | On the Estimation of Image-matching Uncertainty in Visual Place Recognition | Mubariz Zaffar et.al. | 2404.00546 | null |
2024-03-31 | NYC-Indoor-VPR: A Long-Term Indoor Visual Place Recognition Dataset with Semi-Automatic Annotation | Diwei Sheng et.al. | 2404.00504 | null |
2024-03-30 | SceneGraphLoc: Cross-Modal Coarse Visual Localization on 3D Scene Graphs | Yang Miao et.al. | 2404.00469 | null |
2024-03-30 | Do Vision-Language Models Understand Compound Nouns? | Sonal Kumar et.al. | 2404.00419 | link |
2024-04-05 | FairRAG: Fair Human Generation via Fair Retrieval Augmentation | Robik Shrestha et.al. | 2403.19964 | null |
2024-03-28 | JIST: Joint Image and Sequence Training for Sequential Visual Place Recognition | Gabriele Berton et.al. | 2403.19787 | link |
2024-03-28 | MagicLens: Self-Supervised Image Retrieval with Open-Ended Instructions | Kai Zhang et.al. | 2403.19651 | link |
2024-03-27 | AIR-HLoc: Adaptive Image Retrieval for Efficient Visual Localisation | Changkun Liu et.al. | 2403.18281 | null |
2024-03-26 | Learning to Visually Localize Sound Sources from Mixtures without Prior Source Knowledge | Dongjin Kim et.al. | 2403.17420 | link |
2024-03-25 | Enhancing Visual Place Recognition via Fast and Slow Adaptive Biasing in Event Cameras | Gokul B. Nair et.al. | 2403.16425 | link |
2024-03-24 | Knowledge-Enhanced Dual-stream Zero-shot Composed Image Retrieval | Yucheng Suo et.al. | 2403.16005 | link |
2024-03-24 | BIMCV-R: A Landmark Dataset for 3D CT Text-Image Retrieval | Yinda Chen et.al. | 2403.15992 | null |
2024-03-22 | Long-CLIP: Unlocking the Long-Text Capability of CLIP | Beichen Zhang et.al. | 2403.15378 | link |
2024-03-22 | A Multimodal Approach for Cross-Domain Image Retrieval | Lucas Iijima et.al. | 2403.15152 | null |
2024-03-22 | Piecewise-Linear Manifolds for Deep Metric Learning | Shubhang Bhatnagar et.al. | 2403.14977 | null |
2024-03-21 | Enhancing Historical Image Retrieval with Compositional Cues | Tingyu Lin et.al. | 2403.14287 | link |
2024-03-20 | Leveraging High-Resolution Features for Improved Deep Hashing-based Image Retrieval | Aymene Berriche et.al. | 2403.13747 | null |
2024-03-20 | Flickr30K-CFQ: A Compact and Fragmented Query Dataset for Text-image Retrieval | Haoyu Liu et.al. | 2403.13317 | null |
2024-03-19 | Learning Neural Volumetric Pose Features for Camera Localization | Jingyu Lin et.al. | 2403.12800 | null |
2024-03-19 | Quantixar: High-performance Vector Data Management System | Gulshan Yadav et.al. | 2403.12583 | null |
2024-03-17 | 3DGS-ReLoc: 3D Gaussian Splatting for Map Representation and Visual ReLocalization | Peng Jiang et.al. | 2403.11367 | null |
2024-03-17 | MindEye2: Shared-Subject Models Enable fMRI-To-Image With 1 Hour of Data | Paul S. Scotti et.al. | 2403.11207 | link |
2024-03-16 | Refining Knowledge Transfer on Audio-Image Temporal Agreement for Audio-Text Cross Retrieval | Shunsuke Tsubaki et.al. | 2403.10756 | null |
2024-03-16 | Vector search with small radiuses | Gergely Szilvasy et.al. | 2403.10746 | null |
2024-03-13 | Training Self-localization Models for Unseen Unfamiliar Places via Teacher-to-Student Data-Free Knowledge Transfer | Kenta Tsukahara et.al. | 2403.10552 | null |
2024-03-20 | Leveraging Neural Radiance Field in Descriptor Synthesis for Keypoints Scene Coordinate Regression | Huy-Hoang Bui et.al. | 2403.10297 | link |
2024-03-15 | Local positional graphs and attentive local features for a data and runtime-efficient hierarchical place recognition pipeline | Fangming Yuan et.al. | 2403.10283 | null |
2024-03-14 | The NeRFect Match: Exploring NeRF Features for Visual Localization | Qunjie Zhou et.al. | 2403.09577 | null |
2024-03-14 | VDNA-PR: Using General Dataset Representations for Robust Sequential Visual Place Recognition | Benjamin Ramtoula et.al. | 2403.09025 | null |
2024-03-13 | PAPERCLIP: Associating Astronomical Observations and Natural Language with Multi-Modal Models | Siddharth Mishra-Sharma et.al. | 2403.08851 | link |
2024-03-13 | NeRF-Supervised Feature Point Detection and Description | Ali Youssef et.al. | 2403.08156 | link |
2024-03-12 | It’s All About Your Sketch: Democratising Sketch Control in Diffusion Models | Subhadeep Koley et.al. | 2403.07234 | link |
2024-03-12 | You’ll Never Walk Alone: A Sketch and Text Duet for Fine-Grained Image Retrieval | Subhadeep Koley et.al. | 2403.07222 | null |
2024-03-12 | Text-to-Image Diffusion Models are Great Sketch-Photo Matchmakers | Subhadeep Koley et.al. | 2403.07214 | null |
2024-03-11 | How to Handle Sketch-Abstraction in Sketch-Based Image Retrieval? | Subhadeep Koley et.al. | 2403.07203 | null |
2024-03-11 | EarthLoc: Astronaut Photography Localization by Indexing Earth from Space | Gabriele Berton et.al. | 2403.06758 | link |
2024-03-11 | BEV2PR: BEV-Enhanced Visual Place Recognition with Structural Cues | Fudong Ge et.al. | 2403.06600 | link |
2024-03-11 | Leveraging Foundation Models for Content-Based Medical Image Retrieval in Radiology | Stefan Denner et.al. | 2403.06567 | link |
2024-03-10 | RTAB-Map as an Open-Source Lidar and Visual SLAM Library for Large-Scale and Long-Term Online Operation | Mathieu Labbé et.al. | 2403.06341 | null |
2024-03-10 | Texture image retrieval using a classification and contourlet-based features | Asal Rouhafzay et.al. | 2403.06048 | null |
2024-03-11 | LHMap-loc: Cross-Modal Monocular Localization Using LiDAR Point Cloud Heat Map | Xinrui Wu et.al. | 2403.05002 | link |
2024-03-11 | Efficient LoFTR: Semi-Dense Local Feature Matching with Sparse-Like Speed | Yifan Wang et.al. | 2403.04765 | null |
2024-03-07 | mmPlace: Robust Place Recognition with Intermediate Frequency Signal of Low-cost Single-chip Millimeter Wave Radar | Chengzhen Meng et.al. | 2403.04703 | null |
2024-03-06 | Self-supervised Photographic Image Layout Representation Learning | Zhaoran Zhao et.al. | 2403.03740 | link |
2024-03-04 | Multi-Spectral Remote Sensing Image Retrieval Using Geospatial Foundation Models | Benedikt Blumenstiel et.al. | 2403.02059 | link |
2024-03-03 | Image2Sentence based Asymmetrical Zero-shot Composed Image Retrieval | Yongchao Du et.al. | 2403.01431 | null |
2024-03-01 | Asymmetric Feature Fusion for Image Retrieval | Hui Wu et.al. | 2403.00671 | null |
2024-03-01 | Structure Similarity Preservation Learning for Asymmetric Image Retrieval | Hui Wu et.al. | 2403.00648 | link |
2024-02-29 | CricaVPR: Cross-image Correlation-aware Representation Learning for Visual Place Recognition | Feng Lu et.al. | 2402.19231 | link |
2024-02-28 | Unsupervised Cross-Domain Image Retrieval via Prototypical Optimal Transport | Bin Li et.al. | 2402.18411 | link |
2024-02-28 | Balanced Similarity with Auxiliary Prompts: Towards Alleviating Text-to-Image Retrieval Bias for CLIP in Zero-shot Learning | Hanyao Wang et.al. | 2402.18400 | null |
2024-02-28 | Representing 3D sparse map points and lines for camera relocalization | Bach-Thuan Bui et.al. | 2402.18011 | link |
2024-02-27 | Multimodal Learned Sparse Retrieval with Probabilistic Expansion Control | Thong Nguyen et.al. | 2402.17535 | link |
2024-02-29 | Active propulsion noise shaping for multi-rotor aircraft localization | Gabriele Serussi et.al. | 2402.17289 | link |
2024-02-27 | NocPlace: Nocturnal Visual Place Recognition Using Generative and Inherited Knowledge Transfer | Bingxi Liu et.al. | 2402.17159 | link |
2024-02-25 | Deep Homography Estimation for Visual Place Recognition | Feng Lu et.al. | 2402.16086 | link |
2024-02-25 | VOLoc: Visual Place Recognition by Querying Compressed Lidar Map | Xudong Cai et.al. | 2402.15961 | link |
2024-02-28 | Text2Pic Swift: Enhancing Long-Text to Image Retrieval for Large-Scale Libraries | Zijun Long et.al. | 2402.15276 | null |
2024-02-23 | Fine-tuning CLIP Text Encoders with Two-step Paraphrasing | Hyunjae Kim et.al. | 2402.15120 | null |
2024-02-22 | Towards Seamless Adaptation of Pre-trained Models for Visual Place Recognition | Feng Lu et.al. | 2402.14505 | link |
2024-02-16 | Spike-EVPR: Deep Spiking Residual Network with Cross-Representation Aggregation for Event-Based Visual Place Recognition | Chenming Hu et.al. | 2402.10476 | null |
2024-02-15 | Self-Supervised Learning of Visual Robot Localization Using LED State Prediction as a Pretext Task | Mirko Nava et.al. | 2402.09886 | link |
2024-02-14 | Weatherproofing Retrieval for Localization with Generative AI and Geometric Consistency | Yannis Kalantidis et.al. | 2402.09237 | null |
2024-02-13 | Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast | Xiangming Gu et.al. | 2402.08567 | link |
2024-02-13 | Learning to Produce Semi-dense Correspondences for Visual Localization | Khang Truong Giang et.al. | 2402.08359 | link |
2024-02-10 | Semantic Object-level Modeling for Robust Visual Camera Relocalization | Yifan Zhu et.al. | 2402.06951 | null |
2024-02-09 | Large Language Models for Captioning and Retrieving Remote Sensing Images | João Daniel Silva et.al. | 2402.06475 | null |
2024-02-09 | PAS-SLAM: A Visual SLAM System for Planar Ambiguous Scenes | Xinggang Hu et.al. | 2402.06131 | null |
2024-02-21 | MoD-SLAM: Monocular Dense Mapping for Unbounded 3D Scene Reconstruction | Heng Zhou et.al. | 2402.03762 | null |
2024-02-04 | Region-Based Representations Revisited | Michal Shlapentokh-Rothman et.al. | 2402.02352 | link |
2024-02-03 | Zero-shot sketch-based remote sensing image retrieval based on multi-level and attention-guided tokenization | Bo Yang et.al. | 2402.02141 | link |
2024-02-01 | BrainSLAM: SLAM on Neural Population Activity Data | Kipp Freud et.al. | 2402.00588 | null |
2024-02-01 | Night-Rider: Nocturnal Vision-aided Localization in Streetlight Maps Using Invariant Extended Kalman Filtering | Tianxiao Gao et.al. | 2402.00330 | link |
2024-01-31 | Improved Scene Landmark Detection for Camera Localization | Tien Do et.al. | 2401.18083 | link |
2024-01-31 | Local Feature Matching Using Deep Learning: A Survey | Shibiao Xu et.al. | 2401.17592 | link |
2024-01-29 | Bridging Generative and Discriminative Models for Unified Visual Perception with Diffusion Priors | Shiyin Dong et.al. | 2401.16459 | null |
2024-01-29 | Cross-Modal Coordination Across a Diverse Set of Input Modalities | Jorge Sánchez et.al. | 2401.16347 | null |
2024-01-29 | Regressing Transformers for Data-efficient Visual Place Recognition | María Leyva-Vallina et.al. | 2401.16304 | null |
2024-01-27 | Transformer-based Clipped Contrastive Quantization Learning for Unsupervised Image Retrieval | Ayush Dubey et.al. | 2401.15362 | null |
2024-01-24 | Enhancing Image Retrieval : A Comprehensive Study on Photo Search using the CLIP Mode | Naresh Kumar Lahajal et.al. | 2401.13613 | null |
2024-01-23 | PlaceFormer: Transformer-based Visual Place Recognition using Multi-Scale Patch Selection and Fusion | Shyam Sundar Kannan et.al. | 2401.13082 | null |
2024-01-23 | SemanticSLAM: Learning based Semantic Map Construction and Robust Camera Localization | Mingyang Li et.al. | 2401.13076 | link |
2024-01-25 | CBVS: A Large-Scale Chinese Image-Text Benchmark for Real-World Short Video Search Scenarios | Xiangshuo Qiao et.al. | 2401.10475 | link |
2024-01-19 | PhotoScout: Synthesis-Powered Multi-Modal Image Search | Celeste Barnaby et.al. | 2401.10464 | null |
2024-01-19 | Cross-Modality Perturbation Synergy Attack for Person Re-identification | Yunpeng Gong et.al. | 2401.10090 | null |
2024-01-16 | Siamese Content-based Search Engine for a More Transparent Skin and Breast Cancer Diagnosis through Histological Imaging | Zahra Tabatabaei et.al. | 2401.08272 | null |
2024-01-16 | Multi-Technique Sequential Information Consistency For Dynamic Visual Place Recognition In Changing Environments | Bruno Arcanjo et.al. | 2401.08263 | null |
2024-01-15 | Exploring Masked Autoencoders for Sensor-Agnostic Image Retrieval in Remote Sensing | Jakob Hackstein et.al. | 2401.07782 | link |
2024-01-14 | HiHPQ: Hierarchical Hyperbolic Product Quantization for Unsupervised Image Retrieval | Zexuan Qiu et.al. | 2401.07212 | link |
2024-01-11 | UAVD4L: A Large-Scale Dataset for UAV 6-DoF Localization | Rouwan Wu et.al. | 2401.05971 | link |
2024-01-10 | Modality-Aware Representation Learning for Zero-shot Sketch-based Image Retrieval | Eunyi Lyou et.al. | 2401.04860 | link |
2024-01-05 | Benchmarking PathCLIP for Pathology Image Analysis | Sunyi Zheng et.al. | 2401.02651 | null |
2024-01-03 | DDN-SLAM: Real-time Dense Dynamic Neural Implicit SLAM with Joint Semantic Encoding | Mingrui Li et.al. | 2401.01545 | null |
2024-01-02 | BEV-CLIP: Multi-modal BEV Retrieval Methodology for Complex Scene in Autonomous Driving | Dafeng Wei et.al. | 2401.01065 | null |
2023-12-31 | Multi-Granularity Representation Learning for Sketch-based Dynamic Face Image Retrieval | Liang Wang et.al. | 2401.00371 | link |
2023-12-29 | Bayesian Recursive Information Optical Imaging: A Ghost Imaging Scheme Based on Bayesian Filtering | Long-Kun Du et.al. | 2401.00032 | null |
2023-12-27 | LIP-Loc: LiDAR Image Pretraining for Cross-Modal Localization | Sai Shubodh Puligilla et.al. | 2312.16648 | null |
2023-12-26 | Recursive Distillation for Open-Set Distributed Robot Localization | Kenta Tsukahara et.al. | 2312.15897 | null |
2023-12-24 | Residual Learning for Image Point Descriptors | Rashik Shrestha et.al. | 2312.15471 | null |
2023-12-23 | CaLDiff: Camera Localization in NeRF via Pose Diffusion | Rashik Shrestha et.al. | 2312.15242 | null |
2023-12-20 | Aggregating Multiple Bio-Inspired Image Region Classifiers For Effective And Lightweight Visual Place Recognition | Bruno Arcanjo et.al. | 2312.12995 | null |
2023-12-19 | VQA4CIR: Boosting Composed Image Retrieval with Visual Question Answering | Chun-Mei Feng et.al. | 2312.12273 | link |
2023-12-18 | Advancing Image Retrieval with Few-Shot Learning and Relevance Feedback | Boaz Lerner et.al. | 2312.11078 | link |
2023-12-17 | PNeRFLoc: Visual Localization with Point-based Neural Radiance Fields | Boming Zhao et.al. | 2312.10649 | null |
2023-12-17 | DistilVPR: Cross-Modal Knowledge Distillation for Visual Place Recognition | Sijie Wang et.al. | 2312.10616 | link |
2023-12-16 | Symmetrical Bidirectional Knowledge Alignment for Zero-Shot Sketch-Based Image Retrieval | Decheng Liu et.al. | 2312.10320 | link |
2023-12-15 | Data-Efficient Multimodal Fusion on a Single GPU | Noël Vouitsis et.al. | 2312.10144 | link |
2023-12-13 | Advancements in Content-Based Image Retrieval: A Comprehensive Survey of Relevance Feedback Techniques | Hamed Qazanfari et.al. | 2312.10089 | null |
2023-12-15 | Let All be Whitened: Multi-teacher Distillation for Efficient Visual Retrieval | Zhe Ma et.al. | 2312.09716 | link |
2023-12-14 | Design Space Exploration of Low-Bit Quantized Neural Networks for Visual Place Recognition | Oliver Grainge et.al. | 2312.09028 | null |
2023-12-14 | Training-free Zero-shot Composed Image Retrieval with Local Concept Reranking | Shitong Sun et.al. | 2312.08924 | null |
2023-12-13 | C-BEV: Contrastive Bird’s Eye View Training for Cross-View Image Retrieval and 3-DoF Pose Estimation | Florian Fervers et.al. | 2312.08060 | null |
2023-12-12 | Contextually Affinitive Neighborhood Refinery for Deep Clustering | Chunlin Yu et.al. | 2312.07806 | link |
2023-12-12 | Collapse-Oriented Adversarial Training with Triplet Decoupling for Robust Image Retrieval | Qiwei Tian et.al. | 2312.07364 | link |
2023-12-12 | Attacking the Loop: Adversarial Attacks on Graph-based Loop Closure Detection | Jonathan J. Y. Kim et.al. | 2312.06991 | null |
2023-12-11 | Dynamic Weighted Combiner for Mixed-Modal Image Retrieval | Fuxiang Huang et.al. | 2312.06179 | link |
2023-12-06 | Lite-Mind: Towards Efficient and Versatile Brain Representation Network | Zixuan Gong et.al. | 2312.03781 | link |
2023-12-08 | FreestyleRet: Retrieving Images from Style-Diversified Queries | Hao Li et.al. | 2312.02428 | link |
2023-12-04 | Implicit Learning of Scene Geometry from Poses for Global Localization | Mohammad Altillawi et.al. | 2312.02029 | null |
2023-12-04 | Language-only Efficient Training of Zero-shot Composed Image Retrieval | Geonmo Gu et.al. | 2312.01998 | link |
2023-12-03 | G2D: From Global to Dense Radiography Representation Learning via Vision-Language Pre-training | Che Liu et.al. | 2312.01522 | link |
2023-12-01 | Improve Supervised Representation Learning with Masked Image Modeling | Kaifeng Chen et.al. | 2312.00950 | null |
2023-12-05 | Grounding Everything: Emerging Localization Properties in Vision-Language Transformers | Walid Bousselham et.al. | 2312.00878 | link |
2023-12-01 | Global Localization: Utilizing Relative Spatio-Temporal Geometric Constraints from Adjacent and Distant Cameras | Mohammad Altillawi et.al. | 2312.00500 | null |
2023-11-30 | HKUST at SemEval-2023 Task 1: Visual Word Sense Disambiguation with Context Augmentation and Visual Assistance | Zhuohao Yin et.al. | 2311.18273 | link |
2023-11-30 | Label-efficient Training of Small Task-specific Models by Leveraging Vision Foundation Models | Raviteja Vemulapalli et.al. | 2311.18237 | link |
2023-11-29 | Transformer-empowered Multi-modal Item Embedding for Enhanced Image Search in E-Commerce | Chang Liu et.al. | 2311.17954 | null |
2023-11-28 | Scene Summarization: Clustering Scene Videos into Spatially Diverse Frames | Chao Chen et.al. | 2311.17940 | null |
2023-11-29 | 360Loc: A Dataset and Benchmark for Omnidirectional Visual Localization with Cross-device Queries | Huajian Huang et.al. | 2311.17389 | link |
2023-11-27 | Removing NSFW Concepts from Vision-and-Language Models for Text-to-Image Retrieval and Generation | Samuele Poppi et.al. | 2311.16254 | link |
2023-11-27 | Optimal Transport Aggregation for Visual Place Recognition | Sergio Izquierdo et.al. | 2311.15937 | link |
2023-11-27 | AI-Generated Images Introduce Invisible Relevance Bias to Text-Image Retrieval | Shicheng Xu et.al. | 2311.14084 | link |
2023-11-23 | 3D-MIR: A Benchmark and Empirical Study on 3D Medical Image Retrieval in Radiology | Asma Ben Abacha et.al. | 2311.13752 | link |
2023-11-22 | Medical Image Retrieval Using Pretrained Embeddings | Farnaz Khun Jush et.al. | 2311.13547 | null |
2023-11-22 | Applications of Spiking Neural Networks in Visual Place Recognition | Somayeh Hussaini et.al. | 2311.13186 | link |
2023-11-21 | Attribute-Aware Deep Hashing with Self-Consistency for Large-Scale Fine-Grained Image Retrieval | Xiu-Shen Wei et.al. | 2311.12894 | null |
2023-11-21 | Towards Accurate Loop Closure Detection in Semantic SLAM with 3D Semantic Covisibility Graphs | Zhentian Qian et.al. | 2311.12245 | null |
2023-11-19 | From Categories to Classifier: Name-Only Continual Learning by Exploring the Web | Ameya Prabhu et.al. | 2311.11293 | null |
2023-11-18 | Lesion Search with Self-supervised Learning | Kristin Qi et.al. | 2311.11014 | null |
2023-11-15 | Flow reconstruction and particle characterization from inertial Lagrangian tracks | Ke Zhou et.al. | 2311.09076 | null |
2023-11-15 | Pretrain like Your Inference: Masked Tuning Improves Zero-Shot Composed Image Retrieval | Junyang Chen et.al. | 2311.07622 | null |
2023-11-13 | VGSG: Vision-Guided Semantic-Group Network for Text-based Person Search | Shuting He et.al. | 2311.07514 | null |
2023-11-10 | Attributes Grouping and Mining Hashing for Fine-Grained Image Retrieval | Xin Lu et.al. | 2311.06067 | null |
2023-11-08 | Energy-efficient Wireless Image Retrieval for IoT Devices by Transmitting a TinyML Model | Junya Shiraishi et.al. | 2311.04788 | null |
2023-11-08 | Training CLIP models on Data from Scientific Papers | Calvin Metzger et.al. | 2311.04711 | link |
2023-11-07 | DeepPatent2: A Large-Scale Benchmarking Corpus for Technical Drawing Understanding | Kehinde Ajayi et.al. | 2311.04098 | link |
2023-11-06 | Long-Term Invariant Local Features via Implicit Cross-Domain Correspondences | Zador Pataki et.al. | 2311.03345 | null |
2023-11-06 | FocusTune: Tuning Visual Localization through Focus-Guided Sampling | Son Tung Nguyen et.al. | 2311.02872 | link |
2023-11-01 | DINO-Mix: Enhancing Visual Place Recognition with Foundational Vision Model and Feature Mixing | Gaoshuang Huang et.al. | 2311.00230 | link |
2023-10-29 | Identifiable Contrastive Learning with Automatic Feature Importance Discovery | Qi Zhang et.al. | 2310.18904 | link |
2023-10-27 | LipSim: A Provably Robust Perceptual Similarity Metric | Sara Ghazanfari et.al. | 2310.18274 | link |
2023-10-27 | Split Covariance Intersection Filter Based Visual Localization With Accurate AprilTag Map For Warehouse Robot Navigation | Susu Fang et.al. | 2310.17879 | null |
2023-10-25 | FoundLoc: Vision-based Onboard Aerial Localization in the Wild | Yao He et.al. | 2310.16299 | null |
2023-10-24 | Cross-view Self-localization from Synthesized Scene-graphs | Ryogo Yamamoto et.al. | 2310.15504 | null |
2023-10-23 | Semantic-Aware Adversarial Training for Reliable Deep Hashing Retrieval | Xu Yuan et.al. | 2310.14637 | link |
2023-10-21 | Large Language Models and Multimodal Retrieval for Visual Word Sense Disambiguation | Anastasia Kritharoula et.al. | 2310.14025 | link |
2023-10-20 | FMRT: Learning Accurate Feature Matching with Reconciliatory Transformer | Xinyu Zhang et.al. | 2310.13605 | null |
2023-10-20 | CylinderTag: An Accurate and Flexible Marker for Cylinder-Shape Objects Pose Estimation Based on Projective Invariants | Shaoan Wang et.al. | 2310.13320 | link |
2023-10-27 | Representation Learning via Consistent Assignment of Views over Random Partitions | Thalles Silva et.al. | 2310.12692 | link |
2023-10-18 | Evaluating the Fairness of Discriminative Foundation Models in Computer Vision | Junaid Ali et.al. | 2310.11867 | null |
2023-10-17 | Learning Comprehensive Representations with Richer Self for Text-to-Image Person Re-Identification | Shuanglin Yan et.al. | 2310.11210 | null |
2023-10-16 | Autonomous Mapping and Navigation using Fiducial Markers and Pan-Tilt Camera for Assisting Indoor Mobility of Blind and Visually Impaired People | Dharmateja Adapa et.al. | 2310.10290 | null |
2023-10-16 | EfficientOCR: An Extensible, Open-Source Package for Efficiently Digitizing World Knowledge | Tom Bryan et.al. | 2310.10050 | null |
2023-10-15 | CAPro: Webly Supervised Learning with Cross-Modality Aligned Prototypes | Yulei Qin et.al. | 2310.09761 | link |
2023-10-13 | Pairwise Similarity Learning is SimPLE | Yandong Wen et.al. | 2310.09449 | link |
2023-10-13 | Vision-by-Language for Training-Free Compositional Image Retrieval | Shyamgopal Karthik et.al. | 2310.09291 | link |
2023-10-12 | Hyp-UML: Hyperbolic Image Retrieval with Uncertainty-aware Metric Learning | Shiyang Yan et.al. | 2310.08390 | null |
2023-10-12 | Jointly Optimized Global-Local Visual Localization of UAVs | Haoling Li et.al. | 2310.08082 | null |
2023-10-10 | Leveraging Neural Radiance Fields for Uncertainty-Aware Visual Localization | Le Chen et.al. | 2310.06984 | null |
2023-10-10 | Distillation Improves Visual Place Recognition for Low-Quality Queries | Anbang Yang et.al. | 2310.06906 | link |
2023-10-10 | Efficient Retrieval of Images with Irregular Patterns using Morphological Image Analysis: Applications to Industrial and Healthcare datasets | Jiajun Zhang et.al. | 2310.06566 | null |
2023-10-10 | Topological RANSAC for instance verification and retrieval without fine-tuning | Guoyuan An et.al. | 2310.06486 | null |
2023-10-10 | 3DS-SLAM: A 3D Object Detection based Semantic SLAM towards Dynamic Indoor Environments | Ghanta Sai Krishna et.al. | 2310.06385 | null |
2023-10-09 | Collaborative Visual Place Recognition | Yiming Li et.al. | 2310.05541 | null |
2023-10-09 | Sentence-level Prompts Benefit Composed Image Retrieval | Yang Bai et.al. | 2310.05473 | link |
2023-10-08 | AANet: Aggregation and Alignment Network with Semi-hard Positive Sample Mining for Hierarchical Place Recognition | Feng Lu et.al. | 2310.05184 | link |
2023-10-08 | LocoNeRF: A NeRF-based Approach for Local Structure from Motion for Precise Localization | Artem Nenashev et.al. | 2310.05134 | null |
2023-10-12 | ClusVPR: Efficient Visual Place Recognition with Clustering-based Weighted Transformer | Yifan Xu et.al. | 2310.04099 | null |
2023-10-06 | Sub-token ViT Embedding via Stochastic Resonance Transformers | Dong Lao et.al. | 2310.03967 | link |
2023-10-04 | Active Visual Localization for Multi-Agent Collaboration: A Data-Driven Approach | Matthew Hanlon et.al. | 2310.02650 | null |
2023-10-02 | NEUCORE: Neural Concept Reasoning for Composed Image Retrieval | Shu Zhao et.al. | 2310.01358 | null |
2023-10-02 | Leveraging Cutting Edge Deep Learning Based Image Matching for Reconstructing a Large Scene from Sparse Images | Georg Bökman et.al. | 2310.01092 | null |
2023-10-05 | PlaceNav: Topological Navigation through Place Recognition | Lauri Suomela et.al. | 2309.17260 | null |
2023-09-29 | Segment Anything Model is a Good Teacher for Local Feature Learning | Jingqian Wu et.al. | 2309.16992 | link |
2023-09-28 | Dark Side Augmentation: Generating Diverse Night Examples for Metric Learning | Albert Mohwald et.al. | 2309.16351 | link |
2023-09-28 | FORB: A Flat Object Retrieval Benchmark for Universal Image Embedding | Pengxiang Wu et.al. | 2309.16249 | link |
2023-09-28 | Context-I2W: Mapping Images to Context-dependent Words for Accurate Zero-Shot Composed Image Retrieval | Yuanmin Tang et.al. | 2309.16137 | link |
2023-09-27 | GeoCLIP: Clip-Inspired Alignment between Locations and Images for Effective Worldwide Geo-localization | Vicente Vivanco Cepeda et.al. | 2309.16020 | link |
2023-09-27 | Learning Dense Flow Field for Highly-accurate Cross-view Camera Localization | Zhenbo Song et.al. | 2309.15556 | null |
2023-09-26 | Object-Centric Open-Vocabulary Image-Retrieval with Aggregated Features | Hila Levi et.al. | 2309.14999 | null |
2023-09-23 | Resolving References in Visually-Grounded Dialogue via Text Generation | Bram Willemsen et.al. | 2309.13430 | link |
2023-09-21 | Face Identity-Aware Disentanglement in StyleGAN | Adrian Suwała et.al. | 2309.12033 | null |
2023-09-21 | On-the-Fly SfM: What you capture is What you get | Zongqian Zhan et.al. | 2309.11883 | link |
2023-09-20 | 2D-3D Pose Tracking with Multi-View Constraints | Huai Yu et.al. | 2309.11335 | null |
2023-09-19 | VPRTempo: A Fast Temporally Encoded Spiking Neural Network for Visual Place Recognition | Adam D. Hines et.al. | 2309.10225 | link |
2023-09-18 | DynaPix SLAM: A Pixel-Based Dynamic SLAM Approach | Chenghao Xu et.al. | 2309.09879 | null |
2023-09-18 | Decompose Semantic Shifts for Composed Image Retrieval | Xingyu Yang et.al. | 2309.09531 | null |
2023-09-16 | Efficient Object Rearrangement via Multi-view Fusion | Dehao Huang et.al. | 2309.08994 | null |
2023-09-16 | DynaMoN: Motion-Aware Fast And Robust Camera Localization for Dynamic NeRF | Mert Asim Karaoglu et.al. | 2309.08927 | null |
2023-09-16 | Outram: One-shot Global Localization via Triangulated Scene Graph and Global Outlier Pruning | Pengyu Yin et.al. | 2309.08914 | link |
2023-09-15 | Active Learning for Fine-Grained Sketch-Based Image Retrieval | Himanshu Thakur et.al. | 2309.08743 | null |
2023-09-15 | Optimization of Rank Losses for Image Retrieval | Elias Ramzi et.al. | 2309.08250 | link |
2023-09-18 | Prompting Segmentation with Sound is Generalizable Audio-Visual Source Localizer | Yaoting Wang et.al. | 2309.07929 | link |
2023-09-14 | EP2P-Loc: End-to-End 3D Point to 2D Pixel Localization for Large-Scale Visual Localization | Minjung Kim et.al. | 2309.07471 | link |
2023-09-13 | RadarLCD: Learnable Radar-based Loop Closure Detection Pipeline | Mirko Usuelli et.al. | 2309.07094 | null |
2023-09-11 | Towards Content-based Pixel Retrieval in Revisited Oxford and Paris | Guoyuan An et.al. | 2309.05438 | link |
2023-09-08 | Representation Synthesis by Probabilistic Many-Valued Logic Operation in Self-Supervised Learning | Hiroki Nakamura et.al. | 2309.04148 | null |
2023-09-05 | Magnetic Navigation using Attitude-Invariant Magnetic Field Information for Loop Closure Detection | Natalia Pavlasek et.al. | 2309.02394 | null |
2023-09-05 | Dual Relation Alignment for Composed Image Retrieval | Xintong Jiang et.al. | 2309.02169 | null |
2023-09-04 | NLLB-CLIP – train performant multilingual image retrieval model on a budget | Alexander Visheratin et.al. | 2309.01859 | null |
2023-09-04 | Target-Guided Composed Image Retrieval | Haokun Wen et.al. | 2309.01366 | null |
2023-09-02 | Deep supervised hashing for fast retrieval of radio image cubes | Steven Ndung’u et.al. | 2309.00932 | null |
2023-08-31 | Learning with Multi-modal Gradient Attention for Explainable Composed Image Retrieval | Prateksha Udhayanan et.al. | 2308.16649 | null |
2023-08-28 | Extending Cross-Modal Retrieval with Interactive Learning to Improve Image Retrieval Performance in Forensics | Nils Böhne et.al. | 2308.14786 | null |
2023-08-28 | CoVR: Learning Composed Video Retrieval from Web Video Captions | Lucas Ventura et.al. | 2308.14746 | link |
2023-08-27 | Deep Learning for Visual Localization and Mapping: A Survey | Changhao Chen et.al. | 2308.14039 | null |
2023-08-26 | Learning Efficient Representations for Image-Based Patent Retrieval | Hongsong Wang et.al. | 2308.13749 | null |
2023-08-25 | Enhancing Landmark Detection in Cluttered Real-World Scenarios with Vision Transformers | Mohammad Javad Rajabi et.al. | 2308.13671 | null |
2023-08-24 | Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities | Jinze Bai et.al. | 2308.12966 | link |
2023-08-23 | Progressive Feature Mining and External Knowledge-Assisted Text-Pedestrian Image Retrieval | Huafeng Li et.al. | 2308.11994 | null |
2023-08-23 | OFVL-MS: Once for Visual Localization across Multiple Indoor Scenes | Tao Xie et.al. | 2308.11928 | link |
2023-08-22 | Composed Image Retrieval using Contrastive Learning and Task-oriented CLIP-based Features | Alberto Baldrati et.al. | 2308.11485 | link |
2023-08-22 | GrowCLIP: Data-aware Automatic Model Growing for Large-scale Contrastive Language-Image Pre-training | Xinchi Deng et.al. | 2308.11331 | null |
2023-08-22 | LDP-Feat: Image Features with Local Differential Privacy | Francesco Pittaluga et.al. | 2308.11223 | null |
2023-08-21 | EigenPlaces: Training Viewpoint Robust Models for Visual Place Recognition | Gabriele Berton et.al. | 2308.10832 | link |
2023-08-20 | FashionNTM: Multi-turn Fashion Image Retrieval via Cascaded Memory | Anwesan Pal et.al. | 2308.10170 | null |
2023-08-18 | 3D Model-free Visual localization System from Essential Matrix under Local Planar Motion | Yanmei Jiao et.al. | 2308.09566 | null |
2023-08-17 | FashionLOGO: Prompting Multimodal Large Language Models for Fashion Logo Embeddings | Yulin Su et.al. | 2308.09012 | link |
2023-08-16 | Integrating Visual and Semantic Similarity Using Hierarchies for Image Retrieval | Aishwarya Venkataramanan et.al. | 2308.08431 | link |
2023-08-16 | Ranking-aware Uncertainty for Text-guided Image Retrieval | Junyang Chen et.al. | 2308.08131 | null |
2023-08-19 | Global Features are All You Need for Image Retrieval and Reranking | Shihao Shao et.al. | 2308.06954 | link |
2023-08-14 | MixBCT: Towards Self-Adapting Backward-Compatible Training | Yu Liang et.al. | 2308.06948 | link |
2023-08-10 | KS-APR: Keyframe Selection for Robust Absolute Pose Regression | Changkun Liu et.al. | 2308.05459 | null |
2023-08-09 | AspectMMKG: A Multi-modal Knowledge Graph with Aspect-aware Entities | Jingdan Zhang et.al. | 2308.04992 | link |
2023-08-08 | Unifying Two-Stream Encoders with Transformers for Cross-Modal Retrieval | Yi Bin et.al. | 2308.04343 | link |
2023-08-08 | Coarse-to-Fine: Learning Compact Discriminative Representation for Single-Stage Image Retrieval | Yunquan Zhu et.al. | 2308.04008 | link |
2023-08-05 | A Comprehensive Analysis of Real-World Image Captioning and Scene Identification | Sai Suprabhanu Nallapaneni et.al. | 2308.02833 | null |
2023-08-03 | Similar image retrieval using Autoencoder. I. Automatic morphology classification of galaxies | Eunsuk Seo et.al. | 2308.01871 | null |
2023-08-01 | AnyLoc: Towards Universal Visual Place Recognition | Nikhil Keetha et.al. | 2308.00688 | link |
2023-07-31 | Guiding Image Captioning Models Toward More Specific Captions | Simon Kornblith et.al. | 2307.16686 | null |
2023-07-31 | Bridging the Gap: Exploring the Capabilities of Bridge-Architectures for Complex Visual Reasoning Tasks | Kousik Rajesh et.al. | 2307.16395 | null |
2023-07-28 | D2S: Representing local descriptors and global scene coordinates for camera relocalization | Bach-Thuan Bui et.al. | 2307.15250 | link |
2023-07-26 | Neural-based Cross-modal Search and Retrieval of Artwork | Yan Gong et.al. | 2307.14244 | null |
2023-07-26 | Boon: A Neural Search Engine for Cross-Modal Information Retrieval | Yan Gong et.al. | 2307.14240 | null |
2023-07-25 | Conditional Cross Attention Network for Multi-Space Embedding without Entanglement in Only a SINGLE Network | Chull Hwan Song et.al. | 2307.13254 | null |
2023-07-28 | SACReg: Scene-Agnostic Coordinate Regression for Visual Localization | Jerome Revaud et.al. | 2307.11702 | null |
2023-07-19 | Lazy Visual Localization via Motion Averaging | Siyan Dong et.al. | 2307.09981 | null |
2023-07-19 | Quantum Optics based Algorithm for Measuring the Similarity between Images | Vivek Mehta et.al. | 2307.09789 | null |
2023-07-18 | Jean-Luc Picard at Touché 2023: Comparing Image Generation, Stance Detection and Feature Matching for Image Retrieval for Arguments | Max Moebius et.al. | 2307.09172 | null |
2023-07-18 | 3D-SeqMOS: A Novel Sequential 3D Moving Object Segmentation in Autonomous Driving | Qipeng Li et.al. | 2307.09044 | null |
2023-07-19 | Similarity Min-Max: Zero-Shot Day-Night Domain Adaptation | Rundong Luo et.al. | 2307.08779 | null |
2023-07-17 | Divide&Classify: Fine-Grained Classification for City-Wide Visual Place Recognition | Gabriele Trivigno et.al. | 2307.08417 | link |
2023-07-17 | Bridging the Gap: Multi-Level Cross-Modality Joint Alignment for Visible-Infrared Person Re-Identification | Tengfei Liang et.al. | 2307.08316 | link |
2023-07-17 | NDT-Map-Code: A 3D global descriptor for real-time loop closure detection in lidar SLAM | Lizhou Liao et.al. | 2307.08221 | link |
2023-07-20 | Boosting 3-DoF Ground-to-Satellite Camera Localization Accuracy via Geometry-Guided Cross-View Transformer | Yujiao Shi et.al. | 2307.08015 | link |
2023-07-10 | Phoneme-retrieval; voice recognition; vowels recognition | Brunello Tirozzi et.al. | 2307.07407 | null |
2023-07-14 | Risk Controlled Image Retrieval | Kaiwen Cai et.al. | 2307.07336 | null |
2023-07-11 | ResMatch: Residual Attention Learning for Local Feature Matching | Yuxin Deng et.al. | 2307.05180 | link |
2023-07-11 | Feature Activation Map: Visual Explanation of Deep Learning Models for Image Classification | Yi Liao et.al. | 2307.05017 | null |
2023-07-10 | Efficient Match Pair Retrieval for Large-scale UAV Images via Graph Indexed Global Descriptor | San Jiang et.al. | 2307.04520 | null |
2023-07-10 | RaPlace: Place Recognition for Imaging Radar using Radon Transform and Mutable Threshold | Hyesu Jang et.al. | 2307.04321 | link |
2023-07-08 | Calibration-Aware Margin Loss: Pushing the Accuracy-Calibration Consistency Pareto Frontier for Deep Metric Learning | Qin Zhang et.al. | 2307.04047 | null |
2023-07-04 | Unsupervised Quality Prediction for Improved Single-Frame and Weighted Sequential Visual Place Recognition | Helen Carson et.al. | 2307.01464 | null |
2023-07-04 | Learning Feature Matching via Matchable Keypoint-Assisted Graph Neural Network | Zizhuo Li et.al. | 2307.01447 | null |
2023-07-03 | Cross-modal Place Recognition in Image Databases using Event-based Sensors | Xiang Ji et.al. | 2307.01047 | null |
2023-06-30 | DisPlacing Objects: Improving Dynamic Vehicle Detection via Visual Place Recognition under Adverse Conditions | Stephen Hausler et.al. | 2306.17536 | null |
2023-06-30 | Locking On: Leveraging Dynamic Vehicle-Imposed Motion Constraints to Improve Visual Localization | Stephen Hausler et.al. | 2306.17529 | null |
2023-06-27 | Dental CLAIRES: Contrastive LAnguage Image REtrieval Search for Dental Research | Tanjida Kabir et.al. | 2306.15651 | null |
2023-06-27 | Mean Field Theory in Deep Metric Learning | Takuya Furusawa et.al. | 2306.15368 | null |
2023-06-26 | Hierarchical Matching and Reasoning for Multi-Query Image Retrieval | Zhong Ji et.al. | 2306.14460 | link |
2023-06-25 | Enhancing Dynamic Image Advertising with Vision-Language Pre-training | Zhoufutu Wen et.al. | 2306.14112 | null |
2023-06-23 | Catching Image Retrieval Generalization | Maksim Zhdanov et.al. | 2306.13357 | null |
2023-06-22 | Deep Metric Learning with Soft Orthogonal Proxies | Farshad Saberi-Movahed et.al. | 2306.13055 | null |
2023-06-22 | What to Learn: Features, Image Transformations, or Both? | Yuxuan Chen et.al. | 2306.13040 | null |
2023-06-22 | Critical-Reflective Human-AI Collaboration: Exploring Computational Tools for Art Historical Image Retrieval | Katrin Glinka et.al. | 2306.12843 | null |
2023-06-26 | Annotation Cost Efficient Active Learning for Content Based Image Retrieval | Julia Henkel et.al. | 2306.11605 | null |
2023-06-19 | Cross-Modal Attribute Insertions for Assessing the Robustness of Vision-and-Language Learning | Shivaen Ramshetty et.al. | 2306.11065 | link |
2023-06-18 | LiDAR-Based Place Recognition For Autonomous Driving: A Survey | Pengcheng Shi et.al. | 2306.10561 | link |
2023-06-15 | Yes, we CANN: Constrained Approximate Nearest Neighbors for local feature-based visual localization | Dror Aiger et.al. | 2306.09012 | link |
2023-06-15 | Prompt Performance Prediction for Generative IR | Nicolas Bizzozzero et.al. | 2306.08915 | null |
2023-06-15 | Graph Convolution Based Efficient Re-Ranking for Visual Retrieval | Yuqi Zhang et.al. | 2306.08792 | link |
2023-06-13 | GeneCIS: A Benchmark for General Conditional Image Similarity | Sagar Vaze et.al. | 2306.07969 | null |
2023-06-13 | MOFI: Learning Image Representations from Noisy Entity Annotated Images | Wentao Wu et.al. | 2306.07952 | link |
2023-06-12 | Zero-shot Composed Text-Image Retrieval | Yikun Liu et.al. | 2306.07272 | link |
2023-06-12 | Sticker820K: Empowering Interactive Retrieval with Stickers | Sijie Zhao et.al. | 2306.06870 | null |
2023-06-11 | Self-Enhancement Improves Text-Image Retrieval in Foundation Visual-Language Models | Yuguang Yang et.al. | 2306.06691 | null |
2023-06-03 | Relieving Triplet Ambiguity: Consensus Network for Language-Guided Image Retrieval | Xu Zhang et.al. | 2306.02092 | null |
2023-06-03 | Class Anchor Margin Loss for Content-Based Image Retrieval | Alexandru Ghita et.al. | 2306.00630 | null |
2023-05-31 | Chatting Makes Perfect – Chat-based Image Retrieval | Matan Levy et.al. | 2305.20062 | link |
2023-05-31 | Probabilistic Uncertainty Quantification of Prediction Models with Application to Visual Localization | Junan Chen et.al. | 2305.20044 | null |
2023-05-30 | A Recipe for Efficient SBIR Models: Combining Relative Triplet Loss with Batch Normalization and Knowledge Distillation | Omar Seddati et.al. | 2305.18988 | null |
2023-05-29 | Synfeal: A Data-Driven Simulator for End-to-End Camera Localization | Daniel Coelho et.al. | 2305.18260 | link |
2023-05-29 | Nanoscale visualization of the thermally-driven evolution of antiferromagnetic domains in FeTe thin films | Shrinkhala Sharma et.al. | 2305.18197 | null |
2023-05-29 | TReR: A Lightweight Transformer Re-Ranking Approach for 3D LiDAR Place Recognition | Tiago Barros et.al. | 2305.18013 | null |
2023-05-28 | ConaCLIP: Exploring Distillation of Fully-Connected Knowledge Interaction Graph for Lightweight Text-Image Retrieval | Jiapeng Wang et.al. | 2305.17652 | null |
2023-06-01 | FACTUAL: A Benchmark for Faithful and Consistent Textual Scene Graph Parsing | Zhuang Li et.al. | 2305.17497 | link |
2023-05-27 | Pentagon-Match (PMatch): Identification of View-Invariant Planar Feature for Local Feature Matching-Based Homography Estimation | Yueh-Cheng Huang et.al. | 2305.17463 | null |
2023-05-26 | Generating Images with Multimodal Language Models | Jing Yu Koh et.al. | 2305.17216 | link |
2023-05-25 | Candidate Set Re-ranking for Composed Image Retrieval with Dual Multi-modal Encoder | Zheyuan Liu et.al. | 2305.16304 | link |
2023-05-23 | Leveraging BEV Representation for 360-degree Visual Place Recognition | Xuecheng Xu et.al. | 2305.13814 | link |
2023-05-23 | EDIS: Entity-Driven Image Search over Multimodal Web Content | Siqi Liu et.al. | 2305.13631 | link |
2023-05-20 | DAC: Detector-Agnostic Spatial Covariances for Deep Local Features | Javier Tirado-Garín et.al. | 2305.12250 | link |
2023-05-19 | Towards More Transparent and Accurate Cancer Diagnosis with an Unsupervised CAE Approach | Zahra Tabatabaei et.al. | 2305.11728 | null |
2023-05-19 | Learning Sequence Descriptor based on Spatiotemporal Attention for Visual Place Recognition | Fenglin Zhang et.al. | 2305.11467 | link |
2023-05-12 | IMAGINATOR: Pre-Trained Image+Text Joint Embeddings using Word-Level Grounding of Images | Varuna Krishna et.al. | 2305.10438 | null |
2023-05-17 | Self-Training Boosted Multi-Faceted Matching Network for Composed Image Retrieval | Haokun Wen et.al. | 2305.09979 | null |
2023-05-13 | Illumination-insensitive Binary Descriptor for Visual Measurement Based on Local Inter-patch Invariance | Xinyu Lin et.al. | 2305.07943 | link |
2023-05-11 | Foundations of Spatial Perception for Robotics: Hierarchical Representations and Real-time Systems | Nathan Hughes et.al. | 2305.07154 | link |
2023-05-09 | Visual Place Recognition with Low-Resolution Images | Mihnea-Alexandru Tomita et.al. | 2305.05776 | null |
2023-05-09 | Vision-Language Models in Remote Sensing: Current Progress and Future Trends | Congcong Wen et.al. | 2305.05726 | null |
2023-05-09 | An Evaluation and Ranking of Different Voting Schemes for Improved Visual Place Recognition | Maria Waheed et.al. | 2305.05705 | null |
2023-05-09 | Region-based Contrastive Pretraining for Medical Image Retrieval with Anatomic Query | Ho Hin Lee et.al. | 2305.05598 | null |
2023-05-09 | ColonMapper: topological mapping and localization for colonoscopy | Javier Morlana et.al. | 2305.05546 | null |
2023-05-09 | Eiffel Tower: A Deep-Sea Underwater Dataset for Long-Term Visual Localization | Clémentin Boittiaux et.al. | 2305.05301 | link |
2023-05-09 | Patch-DrosoNet: Classifying Image Partitions With Fly-Inspired Models For Lightweight Visual Place Recognition | Bruno Arcanjo et.al. | 2305.05256 | null |
2023-05-09 | Adapt and Align to Improve Zero-Shot Sketch-Based Image Retrieval | Shiyin Dong et.al. | 2305.05144 | null |
2023-05-08 | Hierarchical Visual Localization Based on Sparse Feature Pyramid for Adaptive Reduction of Keypoint Map Size | Andrei Potapov et.al. | 2305.04856 | null |
2023-05-08 | Privacy-Preserving Representations are not Enough – Recovering Scene Content from Camera Poses | Kunal Chelani et.al. | 2305.04603 | link |
2023-05-06 | Keyword-Based Diverse Image Retrieval by Semantics-aware Contrastive Learning and Transformer | Minyi Zhao et.al. | 2305.04072 | null |
2023-05-06 | Fairness in Image Search: A Study of Occupational Stereotyping in Image Retrieval and its Debiasing | Swagatika Dash et.al. | 2305.03881 | link |
2023-05-05 | COLA: How to adapt vision-language models to Compose Objects Localized with Attributes? | Arijit Ray et.al. | 2305.03689 | link |
2023-05-05 | HSCNet++: Hierarchical Scene Coordinate Classification and Regression for Visual Localization with Transformer | Shuzhe Wang et.al. | 2305.03595 | null |
2023-05-05 | WWFedCBMIR: World-Wide Federated Content-Based Medical Image Retrieval | Zahra Tabatabaei et.al. | 2305.03383 | null |
2023-05-04 | Boundary-aware Backward-Compatible Representation via Adversarial Learning in Image Retrieval | Tan Pan et.al. | 2305.02610 | link |
2023-05-03 | Learning-based Relational Object Matching Across Views | Cathrin Elich et.al. | 2305.02398 | null |
2023-05-05 | A Neural Divide-and-Conquer Reasoning Framework for Image Retrieval from Linguistically Complex Text | Yunxin Li et.al. | 2305.02265 | link |
2023-05-03 | AV-SAM: Segment Anything Model Meets Audio-Visual Localization and Segmentation | Shentong Mo et.al. | 2305.01836 | null |
2023-04-30 | Second-order Anisotropic Gaussian Directional Derivative Filters for Blob Detection | Jie Ren et.al. | 2305.00435 | null |
2023-04-28 | SFD2: Semantic-guided Feature Detection and Description | Fei Xue et.al. | 2304.14845 | link |
2023-04-28 | Quantum enhanced non-interferometric quantitative phase imaging | Giuseppe Ortolano et.al. | 2304.14727 | null |
2023-04-26 | Hydra-Multi: Collaborative Online Construction of 3D Scene Graphs with Multi-Robot Teams | Yun Chang et.al. | 2304.13487 | null |
2023-04-27 | STIR: Siamese Transformer for Image Retrieval Postprocessing | Aleksei Shabanov et.al. | 2304.13393 | null |
2023-04-25 | DualSlide: Global-to-Local Sketching Interface for Slide Content and Layout Design | Jiahao Weng et.al. | 2304.12506 | null |
2023-04-24 | Rank Flow Embedding for Unsupervised and Semi-Supervised Manifold Learning | Lucas Pascotti Valem et.al. | 2304.12448 | link |
2023-04-23 | IDLL: Inverse Depth Line based Visual Localization in Challenging Environments | Wanting Li et.al. | 2304.11748 | null |
2023-04-23 | Class-Specific Variational Auto-Encoder for Content-Based Image Retrieval | Mehdi Rafiei et.al. | 2304.11734 | null |
2023-04-17 | Features-over-the-Air: Contrastive Learning Enabled Cooperative Edge Inference | Haotian Wu et.al. | 2304.08221 | null |
2023-04-17 | NeRF-Loc: Visual Localization with Conditional Neural Radiance Field | Jianlin Liu et.al. | 2304.07979 | link |
2023-04-16 | Bent & Broken Bicycles: Leveraging synthetic data for damaged object re-identification | Luca Piano et.al. | 2304.07883 | null |
2023-04-16 | Language Guided Local Infiltration for Interactive Image Retrieval | Fuxiang Huang et.al. | 2304.07747 | null |
2023-04-16 | Long-term Visual Localization with Mobile Sensors | Shen Yan et.al. | 2304.07691 | null |
2023-04-16 | Multimodal Representation Learning of Cardiovascular Magnetic Resonance Imaging | Jielin Qiu et.al. | 2304.07675 | null |
2023-04-14 | CoPR: Towards Accurate Visual Localization With Continuous Place-descriptor Regression | Mubariz Zaffar et.al. | 2304.07426 | null |
2023-04-14 | FM-Loc: Using Foundation Models for Improved Vision-based Localization | Reihaneh Mirjalili et.al. | 2304.07058 | null |
2023-04-17 | Toward Real-Time Image Annotation Using Marginalized Coupled Dictionary Learning | Seyed Mahdi Roostaiyan et.al. | 2304.06907 | link |
2023-04-17 | You are here! Finding position and orientation on a 2D map from a single image: The Flatlandia localization problem and dataset | Matteo Toso et.al. | 2304.06373 | link |
2023-04-12 | Open-TransMind: A New Baseline and Benchmark for 1st Foundation Model Challenge of Intelligent Transportation | Yifeng Shi et.al. | 2304.06051 | link |
2023-04-12 | Visual Localization using Imperfect 3D Models from the Internet | Vojtech Panek et.al. | 2304.05947 | link |
2023-04-12 | Are Local Features All You Need for Cross-Domain Visual Place Recognition? | Giovanni Barbarani et.al. | 2304.05887 | link |
2023-04-12 | Unicom: Universal and Compact Representation Learning for Image Retrieval | Xiang An et.al. | 2304.05884 | link |
2023-04-12 | SGL: Structure Guidance Learning for Camera Localization | Xudong Zhang et.al. | 2304.05571 | null |
2023-04-14 | Loop Closure Detection Based on Object-level Spatial Layout and Semantic Consistency | Xingwu Ji et.al. | 2304.05146 | link |
2023-04-10 | CAVL: Learning Contrastive and Adaptive Representations of Vision and Language | Shentong Mo et.al. | 2304.04399 | null |
2023-04-09 | Unsupervised Multi-Criteria Adversarial Detection in Deep Image Retrieval | Yanru Xiao et.al. | 2304.04228 | null |
2023-04-08 | SGIDN-LCD: An Appearance-based Loop Closure Detection Algorithm using Superpixel Grids and Incremental Dynamic Nodes | Baosheng Zhang et.al. | 2304.03872 | null |
2023-04-06 | $R^{2}$Former: Unified $R$etrieval and $R$ eranking Transformer for Place Recognition | Sijie Zhu et.al. | 2304.03410 | null |
2023-04-06 | Distributed formation-enforcing control for UAVs robust to observation noise in relative pose measurements | Viktor Walter et.al. | 2304.03057 | link |
2023-04-05 | Efficient OCR for Building a Diverse Digital History | Jacob Carlson et.al. | 2304.02737 | link |
2023-04-05 | LogoNet: a fine-grained network for instance-level logo sketch retrieval | Binbin Feng et.al. | 2304.02214 | link |
2023-04-04 | OrienterNet: Visual Localization in 2D Public Maps with Neural Matching | Paul-Edouard Sarlin et.al. | 2304.02009 | link |
2023-04-04 | Cross-Domain Image Captioning with Discriminative Finetuning | Roberto Dessì et.al. | 2304.01662 | link |
2023-04-02 | Learning Similarity between Scene Graphs and Images with Transformers | Yuren Cong et.al. | 2304.00590 | link |
2023-04-01 | NPR: Nocturnal Place Recognition in Street | Bingxi Liu et.al. | 2304.00276 | null |
2023-03-31 | Unsupervised crack detection on complex stone masonry surfaces | Panagiotis Agrafiotis et.al. | 2303.17989 | null |
2023-03-30 | If At First You Don’t Succeed: Test Time Re-ranking for Zero-shot, Cross-domain Retrieval | Finlay G. C. Hudson et.al. | 2303.17703 | null |
2023-03-30 | Vision-Language Modelling For Radiological Imaging and Reports In The Low Data Regime | Rhydian Windsor et.al. | 2303.17644 | null |
2023-03-30 | 3D Line Mapping Revisited | Shaohui Liu et.al. | 2303.17504 | link |
2023-03-30 | Methods and advancement of content-based fashion image retrieval: A Review | Amin Muhammad Shoib et.al. | 2303.17371 | null |
2023-03-30 | Adaptive Cross Batch Normalization for Metric Learning | Thalaiyasingam Ajanthan et.al. | 2303.17127 | null |
2023-03-30 | MaMMUT: A Simple Architecture for Joint Learning for MultiModal Tasks | Weicheng Kuo et.al. | 2303.16839 | null |
2023-03-29 | Sketch-an-Anchor: Sub-epoch Fast Model Adaptation for Zero-shot Sketch-based Image Retrieval | Leo Sampaio Ferraz Ribeiro et.al. | 2303.16769 | null |
2023-03-29 | Bi-directional Training for Composed Image Retrieval via Text Prompt Learning | Zheyuan Liu et.al. | 2303.16604 | link |
2023-03-27 | Model Cascades for Efficient Image Search | Robert Hönig et.al. | 2303.15595 | null |
2023-03-27 | Zero-Shot Composed Image Retrieval with Textual Inversion | Alberto Baldrati et.al. | 2303.15247 | link |
2023-03-27 | What Can Human Sketches Do for Object Detection? | Pinaki Nath Chowdhury et.al. | 2303.15149 | null |
2023-03-25 | Zero-Shot Everything Sketch-Based Image Retrieval, and in Explainable Style | Fengyin Lin et.al. | 2303.14348 | link |
2023-03-24 | A-MuSIC: An Adaptive Ensemble System For Visual Place Recognition In Changing Environments | Bruno Arcanjo et.al. | 2303.14247 | null |
2023-03-24 | PanoVPR: Towards Unified Perspective-to-Equirectangular Visual Place Recognition via Sliding Windows across the Panoramic View | Ze Shi et.al. | 2303.14095 | link |
2023-03-24 | Exploiting Unlabelled Photos for Stronger Fine-Grained SBIR | Aneeshan Sain et.al. | 2303.13779 | null |
2023-03-28 | CLIP for All Things Zero-Shot Sketch-Based Image Retrieval, Fine-Grained or Not | Aneeshan Sain et.al. | 2303.13440 | null |
2023-03-22 | Reliable and Efficient Evaluation of Adversarial Robustness for Deep Hashing-Based Retrieval | Xunguang Wang et.al. | 2303.12658 | null |
2023-03-21 | CompoDiff: Versatile Composed Image Retrieval With Latent Diffusion | Geonmo Gu et.al. | 2303.11916 | link |
2023-03-21 | LIMITR: Leveraging Local Information for Medical Image-Text Representation | Gefen Dawidowicz et.al. | 2303.11755 | null |
2023-03-25 | Data-efficient Large Scale Place Recognition with Graded Similarity Supervision | Maria Leyva-Vallina et.al. | 2303.11739 | link |
2023-03-20 | Picture that Sketch: Photorealistic Image Generation from Abstract Sketches | Subhadeep Koley et.al. | 2303.11162 | null |
2023-03-19 | Deep Declarative Dynamic Time Warping for End-to-End Learning of Alignment Paths | Ming Xu et.al. | 2303.10778 | link |
2023-03-17 | MRIS: A Multi-modal Retrieval Approach for Image Synthesis on Diverse Modalities | Boqi Chen et.al. | 2303.10249 | null |
2023-03-17 | IRGen: Generative Modeling for Image Retrieval | Yidan Zhang et.al. | 2303.10126 | link |
2023-03-16 | Data Roaming and Early Fusion for Composed Image Retrieval | Matan Levy et.al. | 2303.09429 | link |
2023-03-16 | Towards a Smaller Student: Capacity Dynamic Distillation for Efficient Image Retrieval | Yi Xie et.al. | 2303.09230 | null |
2023-03-16 | Metric-Free Exploration for Topological Mapping by Task and Motion Imitation in Feature Space | Yuhang He et.al. | 2303.09192 | null |
2023-03-16 | Unsupervised Facial Expression Representation Learning with Contrastive Local Warping | Fanglei Xue et.al. | 2303.09034 | null |
2023-03-15 | A Triplet-loss Dilated Residual Network for High-Resolution Representation Learning in Image Retrieval | Saeideh Yousefzadeh et.al. | 2303.08398 | null |
2023-03-14 | Data-Free Sketch-Based Image Retrieval | Abhra Chaudhuri et.al. | 2303.07775 | link |
2023-03-14 | PATS: Patch Area Transportation with Subdivision for Local Feature Matching | Junjie Ni et.al. | 2303.07700 | null |
2023-03-10 | Robotic Applications of Pre-Trained Vision-Language Models to Various Recognition Behaviors | Kento Kawaharazuka et.al. | 2303.05674 | null |
2023-03-09 | Dominating Set Database Selection for Visual Place Recognition | Anastasiia Kornilova et.al. | 2303.05123 | null |
2023-03-07 | Graph Neural Networks in Vision-Language Image Understanding: A Survey | Henry Senior et.al. | 2303.03761 | null |
2023-03-07 | Sketch-based Medical Image Retrieval | Kazuma Kobayashi et.al. | 2303.03633 | link |
2023-03-06 | Visual Place Recognition: A Tutorial | Stefan Schubert et.al. | 2303.03281 | link |
2023-03-06 | MABNet: Master Assistant Buddy Network with Hybrid Learning for Image Retrieval | Rohit Agarwal et.al. | 2303.03050 | link |
2023-03-06 | Improving Transformer-based Image Matching by Cascaded Capturing Spatially Informative Keypoints | Chenjie Cao et.al. | 2303.02885 | link |
2023-03-05 | Composing Mood Board with User Feedback in Concept Space | Shin Sano et.al. | 2303.02547 | null |
2023-03-04 | FAME-ViL: Multi-Tasking Vision-Language Model for Heterogeneous Fashion Tasks | Xiao Han et.al. | 2303.02483 | link |
2023-03-09 | Self-Supervised Learning for Place Representation Generalization across Appearance Changes | Mohamed Adel Musallam et.al. | 2303.02370 | null |
2023-03-03 | MixVPR: Feature Mixing for Visual Place Recognition | Amar Ali-bey et.al. | 2303.02190 | link |
2023-03-01 | A Complementarity-Based Switch-Fuse System for Improved Visual Place Recognition | Maria Waheed et.al. | 2303.00714 | null |
2023-03-01 | ORCHNet: A Robust Global Feature Aggregation approach for 3D LiDAR-based Place recognition in Orchards | T. Barros et.al. | 2303.00477 | link |
2023-03-03 | Renderable Neural Radiance Map for Visual Navigation | Obin Kwon et.al. | 2303.00304 | null |
2023-03-01 | Region Prediction for Efficient Robot Localization on Large Maps | Matteo Scucchia et.al. | 2303.00295 | link |
2023-02-28 | OEKG: The Open Event Knowledge Graph | Simon Gottschalk et.al. | 2302.14688 | null |
2023-02-28 | Global Proxy-based Hard Mining for Visual Place Recognition | Amar Ali-bey et.al. | 2302.14217 | link |
2023-02-27 | Efficient Informed Proposals for Discrete Distributions via Newton’s Series Approximation | Yue Xiang et.al. | 2302.13929 | link |
2023-02-26 | Data-Efficient Sequence-Based Visual Place Recognition with Highly Compressed JPEG Images | Mihnea-Alexandru Tomita et.al. | 2302.13314 | null |
2023-02-26 | Learning cross space mapping via DNN using large scale click-through logs | Wei Yu et.al. | 2302.13275 | null |
2023-02-25 | DeepBrainPrint: A Novel Contrastive Framework for Brain MRI Re-Identification | Lemuel Puglisi et.al. | 2302.13057 | null |
2023-02-23 | Teaching CLIP to Count to Ten | Roni Paiss et.al. | 2302.12066 | null |
2023-02-22 | Steerable Equivariant Representation Learning | Sangnie Bhardwaj et.al. | 2302.11349 | null |
2023-02-21 | iQPP: A Benchmark for Image Query Performance Prediction | Eduard Poesina et.al. | 2302.10126 | link |
2023-02-20 | Ontology-aware Network for Zero-shot Sketch-based Image Retrieval | Haoxiang Zhang et.al. | 2302.10040 | null |
2023-02-20 | TBPos: Dataset for Large-Scale Precision Visual Localization | Masud Fahim et.al. | 2302.09825 | link |
2023-02-17 | Towards Unifying Medical Vision-and-Language Pre-training via Soft Prompts | Zhihong Chen et.al. | 2302.08958 | link |
2023-02-22 | Fashion Image Retrieval with Multi-Granular Alignment | Jinkuan Zhu et.al. | 2302.08902 | null |
2023-02-15 | Unsupervised Hashing via Similarity Distribution Calibration | Kam Woh Ng et.al. | 2302.07669 | link |
2023-02-13 | Render-and-Compare: Cross-View 6 DoF Localization from Noisy Prior | Shen Yan et.al. | 2302.06287 | link |
2023-02-13 | Contour Context: Abstract Structural Distribution for 3D LiDAR Loop Detection and Metric Pose Estimation | Binqian Jiang et.al. | 2302.06149 | link |
2023-02-13 | Correspondence-Free Domain Alignment for Unsupervised Cross-Domain Image Retrieval | Xu Wang et.al. | 2302.06081 | link |
2023-02-11 | Sketch Less Face Image Retrieval: A New Challenge | Dawei Dai et.al. | 2302.05576 | link |
2023-02-10 | Is multi-modal vision supervision beneficial to language? | Avinash Madasu et.al. | 2302.05016 | link |
2023-02-06 | Pic2Word: Mapping Pictures to Words for Zero-shot Composed Image Retrieval | Kuniaki Saito et.al. | 2302.03084 | link |
2023-02-06 | Probabilistic Contrastive Learning Recovers the Correct Aleatoric Uncertainty of Ambiguous Inputs | Michael Kirchhof et.al. | 2302.02865 | link |
2023-02-03 | Simple, Effective and General: A New Backbone for Cross-view Image Geo-localization | Yingying Zhu et.al. | 2302.01572 | link |
2023-02-04 | Bayesian Metric Learning for Uncertainty Quantification in Image Retrieval | Frederik Warburg et.al. | 2302.01332 | link |
2023-01-31 | Grounding Language Models to Images for Multimodal Generation | Jing Yu Koh et.al. | 2301.13823 | link |
2023-01-31 | UPop: Unified and Progressive Pruning for Compressing Vision-Language Transformers | Dachuan Shi et.al. | 2301.13741 | link |
2023-01-23 | Lexi: Self-Supervised Learning of the UI Language | Pratyay Banerjee et.al. | 2301.10165 | link |
2023-01-17 | Distribution Aligned Feature Clustering for Zero-Shot Sketch-Based Image Retrieval | Yuchen Wu et.al. | 2301.06685 | null |
2023-01-19 | High-bandwidth Close-Range Information Transport through Light Pipes | Joowon Lim et.al. | 2301.06496 | null |
2023-01-13 | A LiDAR-Inertial-Visual SLAM System with Loop Detection | Kangcheng Liu et.al. | 2301.05604 | null |
2023-01-12 | GH-Feat: Learning Versatile Generative Hierarchical Features from GANs | Yinghao Xu et.al. | 2301.05315 | null |
2023-01-10 | Pix2Map: Cross-modal Retrieval for Inferring Street Maps from Images | Xindi Wu et.al. | 2301.04224 | null |
2023-01-10 | Collaborative Semantic Communication at the Edge | Wing Fei Lo et.al. | 2301.03996 | null |
2023-01-10 | Online Backfilling with No Regret for Large-Scale Image Retrieval | Seonguk Seo et.al. | 2301.03767 | null |
2023-01-06 | CyberLoc: Towards Accurate Long-term Visual Localization | Liu Liu et.al. | 2301.02403 | null |
2023-01-05 | A Probabilistic Framework for Visual Localization in Ambiguous Scenes | Fereidoon Zangeneh et.al. | 2301.02086 | link |
2022-12-31 | 4Seasons: Benchmarking Visual SLAM and Long-Term Localization for Autonomous Driving in Challenging Conditions | Patrick Wenzel et.al. | 2301.01147 | null |
2022-12-30 | HPointLoc: Point-based Indoor Place Recognition using Synthetic RGB-D Images | Dmitry Yudin et.al. | 2212.14649 | link |
2022-12-27 | Noise-aware Learning from Web-crawled Image-Text Data for Image Captioning | Wooyoung Kang et.al. | 2212.13563 | link |
2022-12-23 | SuperGF: Unifying Local and Global Features for Visual Localization | Wenzheng Song et.al. | 2212.13105 | null |
2022-12-24 | GraffMatch: Global Matching of 3D Lines and Planes for Wide Baseline LiDAR Registration | Parker C. Lusk et.al. | 2212.12745 | null |
2022-12-19 | From a Bird’s Eye View to See: Joint Camera and Subject Registration without the Camera Calibration | Zekun Qian et.al. | 2212.09298 | link |
2022-12-14 | The Infinite Index: Information Retrieval on Generative Text-To-Image Models | Niklas Deckers et.al. | 2212.07476 | null |
2022-12-14 | Shared Coupling-bridge for Weakly Supervised Local Feature Learning | Jiayuan Sun et.al. | 2212.07047 | link |
2022-12-08 | Group Generalized Mean Pooling for Vision Transformer | Byungsoo Ko et.al. | 2212.04114 | null |
2022-12-12 | Diffusion Art or Digital Forgery? Investigating Data Replication in Diffusion Models | Gowthami Somepalli et.al. | 2212.03860 | null |
2022-12-07 | LSVL: Large-scale season-invariant visual localization for UAVs | Jouko Kinnari et.al. | 2212.03581 | null |
2022-12-06 | ADIR: Adaptive Diffusion for Image Reconstruction | Shady Abu-Hussein et.al. | 2212.03221 | null |
2022-12-08 | Privacy-Preserving Visual Localization with Event Cameras | Junho Kim et.al. | 2212.03177 | link |
2022-12-06 | Semantic Communication for Internet of Vehicles: A Multi-User Cooperative Approach | Wenjun Xu et.al. | 2212.03037 | null |
2022-12-06 | Attention-Enhanced Cross-modal Localization Between 360 Images and Point Clouds | Zhipeng Zhao et.al. | 2212.02757 | null |
2022-12-04 | Fast and Lightweight Scene Regressor for Camera Relocalization | Thuan B. Bui et.al. | 2212.01830 | link |
2022-12-02 | Information Retrieval from the Digitized Books | Riya Gupta et.al. | 2212.00999 | null |
2022-12-09 | StructVPR: Distill Structural Knowledge with Weighting Samples for Visual Place Recognition | Yanqing Shen et.al. | 2212.00937 | null |
2022-11-30 | Self-Supervised Feature Learning for Long-Term Metric Visual Localization | Yuxuan Chen et.al. | 2212.00122 | null |
2022-11-30 | SGDraw: Scene Graph Drawing Interface Using Object-Oriented Representation | Tianyu Zhang et.al. | 2211.16697 | link |
2022-11-28 | SLAN: Self-Locator Aided Network for Cross-Modal Understanding | Jiang-Tian Zhai et.al. | 2211.16208 | null |
2022-11-29 | RankDNN: Learning to Rank for Few-shot Learning | Qianyu Guo et.al. | 2211.15320 | link |
2022-11-28 | Safety-quantifiable Line Feature-based Monocular Visual Localization with 3D Prior Map | Xi Zheng et.al. | 2211.15127 | null |
2022-11-28 | FeatureBooster: Boosting Feature Descriptors with a Lightweight Neural Network | Xinjiang Wang et.al. | 2211.15069 | link |
2022-11-27 | BEV-Locator: An End-to-end Visual Semantic Localization Network Using Multi-View Images | Zhihuang Zhang et.al. | 2211.14927 | null |
2022-11-27 | A Faster, Lighter and Stronger Deep Learning-Based Approach for Place Recognition | Rui Huang et.al. | 2211.14864 | null |
2022-11-26 | Visual Place Recognition | Bailu Guo et.al. | 2211.14533 | null |
2022-11-26 | Instance-level Heterogeneous Domain Adaptation for Limited-labeled Sketch-to-Photo Retrieval | Fan Yang et.al. | 2211.14515 | link |
2022-11-30 | Roboflow 100: A Rich, Multi-Domain Object Detection Benchmark | Floriana Ciaglia et.al. | 2211.13523 | link |
2022-11-23 | InDiReCT: Language-Guided Zero-Shot Deep Metric Learning for Images | Konstantin Kobs et.al. | 2211.12760 | link |
2022-11-29 | Wild-Places: A Large-Scale Dataset for Lidar Place Recognition in Unstructured Natural Environments | Joshua Knights et.al. | 2211.12732 | link |
2022-11-23 | FE-Fusion-VPR: Attention-based Multi-Scale Network Architecture for Visual Place Recognition by Fusing Frames and Events | Kuanxu Hou et.al. | 2211.12244 | null |
2022-11-22 | Multimorbidity Content-Based Medical Image Retrieval Using Proxies | Yunyan Xing et.al. | 2211.12185 | null |
2022-11-22 | Vision-based localization methods under GPS-denied conditions | Zihao Lu et.al. | 2211.11988 | null |
2022-11-21 | ESLAM: Efficient Dense SLAM System Based on Hybrid Representation of Signed Distance Fields | Mohammad Mahdi Johari et.al. | 2211.11704 | null |
2022-11-21 | LISA: Localized Image Stylization with Audio via Implicit Neural Representation | Seung Hyun Lee et.al. | 2211.11381 | null |
2022-11-21 | NeuMap: Neural Coordinate Mapping by Auto-Transdecoder for Camera Localization | Shitao Tang et.al. | 2211.11177 | link |
2022-11-16 | Improving Feature-based Visual Localization by Geometry-Aided Matching | Hailin Yu et.al. | 2211.08712 | link |
2022-11-15 | LiePoseNet: Heterogeneous Loss Function Based on Lie Group for Significant Speed-up of PoseNet Training Process | Mikhail Kurenkov et.al. | 2211.08480 | null |
2022-11-14 | Degeneracy removal of spin bands in antiferromagnets with non-interconvertible spin motif pair | Lin-Ding Yuan et.al. | 2211.07803 | null |
2022-11-14 | Supervised Fine-tuning Evaluation for Long-term Visual Place Recognition | Farid Alijani et.al. | 2211.07696 | null |
2022-11-14 | Composed Image Retrieval with Text Feedback via Multi-grained Uncertainty Regularization | Yiyang Chen et.al. | 2211.07394 | link |
2022-11-14 | Zero-shot Image Captioning by Anchor-augmented Vision-Language Space Alignment | Junyang Wang et.al. | 2211.07275 | null |
2022-11-14 | ContextCLIP: Contextual Alignment of Image-Text pairs on CLIP visual representations | Chanda Grover et.al. | 2211.07122 | null |
2022-11-14 | Few-shot Metric Learning: Online Adaptation of Embedding for Retrieval | Deunsol Jung et.al. | 2211.07116 | null |
2022-11-12 | Partial Visual-Semantic Embedding: Fashion Intelligence System with Sensitive Part-by-Part Learning | Ryotaro Shimizu et.al. | 2211.06688 | null |
2022-11-09 | Visual Named Entity Linking: A New Dataset and A Baseline | Wenxiang Sun et.al. | 2211.04872 | link |
2022-11-07 | Ultrafast Image Retrieval from a Holographic Memory Disc for High-Speed Operation of a Shift, Scale, and Rotation Invariant Target Recognition System | Julian Gamboa et.al. | 2211.03881 | null |
2022-11-06 | A Geometrically Constrained Point Matching based on View-invariant Cross-ratios, and Homography | Yueh-Cheng Huang et.al. | 2211.03007 | null |
2022-11-02 | Optimizing Fiducial Marker Placement for Improved Visual Localization | Qiangqiang Huang et.al. | 2211.01513 | link |
2022-11-02 | A comparison of uncertainty estimation approaches for DNN-based camera localization | Matteo Vaghi et.al. | 2211.01234 | null |
2022-11-02 | M-SpeechCLIP: Leveraging Large-Scale, Pre-Trained Models for Multilingual Speech to Image Retrieval | Layne Berry et.al. | 2211.01180 | null |
2022-11-11 | Why is Winoground Hard? Investigating Failures in Visuolinguistic Compositionality | Anuj Diwan et.al. | 2211.00768 | link |
2022-11-07 | Fashion-Specific Attributes Interpretation via Dual Gaussian Visual-Semantic Embedding | Ryotaro Shimizu et.al. | 2210.17417 | null |
2022-10-27 | Structuring User-Generated Content on Social Media with Multimodal Aspect-Based Sentiment Analysis | Miriam Anschütz et.al. | 2210.15377 | link |
2022-10-27 | Leveraging Computer Vision Application in Visual Arts: A Case Study on the Use of Residual Neural Network to Classify and Analyze Baroque Paintings | Daniel Kvak et.al. | 2210.15300 | null |
2022-10-27 | Towards Practicality of Sketch-Based Visual Understanding | Ayan Kumar Bhunia et.al. | 2210.15146 | null |
2022-10-27 | MMFL-Net: Multi-scale and Multi-granularity Feature Learning for Cross-domain Fashion Retrieval | Chen Bao et.al. | 2210.15128 | null |
2022-10-26 | FaD-VLP: Fashion Vision-and-Language Pre-training towards Unified Retrieval and Captioning | Suvir Mirchandani et.al. | 2210.15028 | null |
2022-10-26 | FairCLIP: Social Bias Elimination based on Attribute Prototype Learning and Representation Neutralization | Junyang Wang et.al. | 2210.14562 | null |
2022-11-02 | A Framework for Collaborative Multi-Robot Mapping using Spectral Graph Wavelets | Lukas Bernreiter et.al. | 2210.13856 | null |
2022-10-27 | Learning by Hallucinating: Vision-Language Pre-training with Weak Supervision | Tzu-Jui Julius Wang et.al. | 2210.13591 | null |
2022-10-24 | Reliability-Aware Prediction via Uncertainty Learning for Person Image Retrieval | Zhaopeng Dou et.al. | 2210.13440 | link |
2022-10-23 | Neural Eigenfunctions Are Structured Representation Learners | Zhijie Deng et.al. | 2210.12637 | link |
2022-10-21 | Boosting vision transformers for image retrieval | Chull Hwan Song et.al. | 2210.11909 | link |
2022-10-20 | Communication breakdown: On the low mutual intelligibility between human and neural captioning | Roberto Dessì et.al. | 2210.11512 | link |
2022-10-19 | Image Semantic Relation Generation | Mingzhe Du et.al. | 2210.11253 | null |
2022-10-20 | General Image Descriptors for Open World Image Retrieval using ViT CLIP | Marcos V. Conde et.al. | 2210.11141 | link |
2022-10-20 | DeepRING: Learning Roto-translation Invariant Representation for LiDAR based Place Recognition | Sha Lu et.al. | 2210.11029 | null |
2022-10-19 | Cross-Modal Fusion Distillation for Fine-Grained Sketch-Based Image Retrieval | Abhra Chaudhuri et.al. | 2210.10486 | link |
2022-10-19 | GSV-Cities: Toward Appropriate Supervised Visual Place Recognition | Amar Ali-bey et.al. | 2210.10239 | link |
2022-10-18 | A Real-Time Fusion Framework for Long-term Visual Localization | Yuchen Yang et.al. | 2210.09757 | null |
2022-10-17 | Bridging the Gap between Local Semantic Concepts and Bag of Visual Words for Natural Scene Image Retrieval | Yousef Alqasrawi et.al. | 2210.08875 | null |
2022-10-17 | SGRAM: Improving Scene Graph Parsing via Abstract Meaning Representation | Woo Suk Choi et.al. | 2210.08675 | null |
2022-10-16 | Learning Self-Regularized Adversarial Views for Self-Supervised Vision Transformers | Tao Tang et.al. | 2210.08458 | link |
2022-10-14 | Cross-Scale Context Extracted Hashing for Fine-Grained Image Binary Encoding | Xuetong Xue et.al. | 2210.07572 | link |
2022-10-14 | Boosting Performance of a Baseline Visual Place Recognition Technique by Predicting the Maximally Complementary Technique | Connor Malone et.al. | 2210.07509 | null |
2022-10-11 | Large-to-small Image Resolution Asymmetry in Deep Metric Learning | Pavel Suma et.al. | 2210.05463 | link |
2022-10-09 | Fusing Event-based Camera and Radar for SLAM Using Spiking Neural Networks with Continual STDP Learning | Ali Safa et.al. | 2210.04236 | null |
2022-10-05 | Medical Image Retrieval via Nearest Neighbor Search on Pre-trained Image Features | Deepak Gupta et.al. | 2210.02401 | link |
2022-10-05 | Granularity-aware Adaptation for Image Retrieval over Multiple Tasks | Jon Almazán et.al. | 2210.02254 | null |
2022-10-05 | Improving Visual-Semantic Embedding with Adaptive Pooling and Optimization Objective | Zijian Zhang et.al. | 2210.02206 | link |
2022-10-04 | Supervised Metric Learning for Retrieval via Contextual Similarity Optimization | Christopher Liao et.al. | 2210.01908 | link |
2022-10-04 | Wi-Closure: Reliable and Efficient Search of Inter-robot Loop Closures Using Wireless Sensing | Weiying Wang et.al. | 2210.01320 | null |
2022-10-03 | Merging Classification Predictions with Sequential Information for Lightweight Visual Place Recognition in Changing Environments | Bruno Arcanjo et.al. | 2210.00834 | null |
2022-10-02 | Loc-VAE: Learning Structurally Localized Representation from 3D Brain MR Images for Content-Based Image Retrieval | Kei Nishimaki et.al. | 2210.00506 | null |
2022-09-29 | Guided Unsupervised Learning by Subaperture Decomposition for Ocean SAR Image Retrieval | Nicolae-Cătălin Ristea et.al. | 2209.15034 | null |
2022-09-28 | TVLT: Textless Vision-Language Transformer | Zineng Tang et.al. | 2209.14156 | link |
2022-09-28 | SEMICON: A Learning-to-hash Solution for Large-scale Fine-grained Image Retrieval | Yang Shen et.al. | 2209.13833 | link |
2022-09-28 | Learning Deep Representations via Contrastive Learning for Instance Retrieval | Tao Wu et.al. | 2209.13832 | null |
2022-09-28 | Mr. Right: Multimodal Retrieval on Representation of ImaGe witH Text | Cheng-An Hsieh et.al. | 2209.13764 | link |
2022-09-27 | Learning-Based Dimensionality Reduction for Computing Compact and Effective Local Feature Descriptors | Hao Dong et.al. | 2209.13586 | link |
2022-09-27 | Exploring the Algorithm-Dependent Generalization of AUPRC Optimization with List Stability | Peisong Wen et.al. | 2209.13262 | link |
2022-09-26 | NDD: A 3D Point Cloud Descriptor Based on Normal Distribution for Loop Closure Detection | Ruihao Zhou et.al. | 2209.12513 | link |
2022-09-25 | Personalized Saliency in Task-Oriented Semantic Communications: Image Transmission and Performance Analysis | Jiawen Kang et.al. | 2209.12274 | link |
2022-09-24 | Closing the Loop: Graph Networks to Unify Semantic Objects and Visual Features for Multi-object Scenes | Jonathan J. Y. Kim et.al. | 2209.11894 | null |
2022-09-23 | Image-to-Image Translation for Autonomous Driving from Coarsely-Aligned Image Pairs | Youya Xia et.al. | 2209.11673 | null |
2022-09-23 | Query-based Hard-Image Retrieval for Object Detection at Test Time | Edward Ayers et.al. | 2209.11559 | link |
2022-09-23 | Unsupervised Hashing with Semantic Concept Mining | Rong-Cheng Tu et.al. | 2209.11475 | link |
2022-09-22 | UNav: An Infrastructure-Independent Vision-Based Navigation System for People with Blindness and Low vision | Anbang Yang et.al. | 2209.11336 | null |
2022-09-21 | Visual Localization and Mapping in Dynamic and Changing Environments | João Carlos Virgolino Soares et.al. | 2209.10710 | null |
2022-09-20 | PADLoC: LiDAR-Based Deep Loop Closure Detection and Registration using Panoptic Attention | José Arce et.al. | 2209.09699 | link |
2022-09-19 | Deep Metric Learning with Chance Constraints | Yeti Z. Gurbuz et.al. | 2209.09060 | link |
2022-09-18 | HGI-SLAM: Loop Closure With Human and Geometric Importance Features | Shuhul Mujoo et.al. | 2209.08608 | null |
2022-09-18 | Data-driven Loop Closure Detection in Bathymetric Point Clouds for Underwater SLAM | Jiarui Tan et.al. | 2209.08578 | link |
2022-09-17 | Data Efficient Visual Place Recognition Using Extremely JPEG-Compressed Images | Mihnea-Alexandru Tomita et.al. | 2209.08343 | null |
2022-09-15 | Efficient Planar Pose Estimation via UWB Measurements | Haodong Jiang et.al. | 2209.06779 | link |
2022-09-14 | Transformers and CNNs both Beat Humans on SBIR | Omar Seddati et.al. | 2209.06629 | null |
2022-09-14 | Tac2Structure: Object Surface Reconstruction Only through Multi Times Touch | J. Lu et.al. | 2209.06545 | link |
2022-09-14 | iSimLoc: Visual Global Localization for Previously Unseen Environments with Simulated Images | Peng Yin et.al. | 2209.06376 | null |
2022-09-09 | General Place Recognition Survey: Towards the Real-world Autonomy Age | Peng Yin et.al. | 2209.04497 | link |
2022-09-09 | Retinal Image Restoration and Vessel Segmentation using Modified Cycle-CBAM and CBAM-UNet | Alnur Alimanov et.al. | 2209.04234 | link |
2022-09-13 | Segment Augmentation and Differentiable Ranking for Logo Retrieval | Feyza Yavuz et.al. | 2209.02482 | null |
2022-09-12 | ScaleFace: Uncertainty-aware Deep Metric Learning | Roman Kail et.al. | 2209.01880 | link |
2022-09-04 | CloudVision: DNN-based Visual Localization of Autonomous Robots using Prebuilt LiDAR Point Cloud | Evgeny Yudin et.al. | 2209.01605 | null |
2022-08-31 | EViT: Privacy-Preserving Image Retrieval via Encrypted Vision Transformer in Cloud Computing | Qihua Feng et.al. | 2208.14657 | link |
2022-08-25 | A Deep Perceptual Measure for Lens and Camera Calibration | Yannick Hold-Geoffroy et.al. | 2208.12300 | null |
2022-08-25 | A Privacy-Preserving and End-to-End-Based Encrypted Image Retrieval Scheme | Zhixun Lu et.al. | 2208.11876 | null |
2022-08-23 | Satellite Image Search in AgoraEO | Ahmet Kerem Aksoy et.al. | 2208.10830 | null |
2022-08-20 | Fuse and Attend: Generalized Embedding Learning for Art and Sketches | Ujjal Kr Dutta et.al. | 2208.09698 | null |
2022-08-19 | Self-Supervised Visual Place Recognition by Mining Temporal and Feature Neighborhoods | Chao Chen et.al. | 2208.09315 | link |
2022-08-19 | TTT-UCDR: Test-time Training for Universal Cross-Domain Retrieval | Soumava Paul et.al. | 2208.09198 | link |
2022-08-17 | Visual Cross-View Metric Localization with Dense Uncertainty Estimates | Zimin Xia et.al. | 2208.08519 | link |
2022-08-17 | Understanding Attention for Vision-and-Language Tasks | Feiqi Cao et.al. | 2208.08104 | link |
2022-08-14 | Visual Localization via Few-Shot Scene Region Classification | Siyan Dong et.al. | 2208.06933 | link |
2022-08-14 | HyP $^2$ Loss: Beyond Hypersphere Metric Space for Multi-label Image Retrieval | Chengyin Xu et.al. | 2208.06866 | link |
2022-08-13 | Finding Point with Image: An End-to-End Benchmark for Vision-based UAV Localization | Ming Dai et.al. | 2208.06561 | link |
2022-08-16 | Category-Level Pose Retrieval with Contrastive Features Learnt with Occlusion Augmentation | Georgios Kouros et.al. | 2208.06195 | link |
2022-08-12 | Instance Image Retrieval by Learning Purely From Within the Dataset | Zhongyan Zhang et.al. | 2208.06119 | null |
2022-08-07 | CVLNet: Cross-View Semantic Correspondence Learning for Video-based Camera Localization | Yujiao Shi et.al. | 2208.03660 | null |
2022-08-05 | A Sketch Is Worth a Thousand Words: Image Retrieval with Text and Sketch | Patsorn Sangkloy et.al. | 2208.03354 | null |
2022-08-05 | ChiQA: A Large Scale Image-based Real-World Question Answering Dataset for Multi-Modal Understanding | Bingning Wang et.al. | 2208.03030 | link |
2022-08-04 | Pattern Spotting and Image Retrieval in Historical Documents using Deep Hashing | Caio da S. Dias et.al. | 2208.02397 | null |
2022-07-27 | On the robustness of self-supervised representations for multi-view object classification | David Torpey et.al. | 2208.00787 | null |
2022-07-26 | Multimodal Neural Machine Translation with Search Engine Based Image Retrieval | ZhenHao Tang et.al. | 2208.00767 | null |
2022-07-30 | Towards Privacy-Preserving, Real-Time and Lossless Feature Matching | Qiang Meng et.al. | 2208.00214 | link |
2022-07-30 | DAS: Densely-Anchored Sampling for Deep Metric Learning | Lizhao Liu et.al. | 2208.00119 | link |
2022-07-29 | Curriculum Learning for Data-Efficient Vision-Language Alignment | Tejas Srinivasan et.al. | 2207.14525 | null |
2022-07-29 | Neural Density-Distance Fields | Itsuki Ueda et.al. | 2207.14455 | link |
2022-07-27 | Abstracting Sketches through Simple Primitives | Stephan Alaniz et.al. | 2207.13543 | link |
2022-07-27 | Satellite Image Based Cross-view Localization for Autonomous Vehicle | Shan Wang et.al. | 2207.13506 | null |
2022-07-26 | RenderNet: Visual Relocalization Using Virtual Viewpoints in Large-Scale Indoor Environments | Jiahui Zhang et.al. | 2207.12579 | null |
2022-07-25 | A hybrid-qudit representation of digital RGB images | Sreetama Das et.al. | 2207.12550 | null |
2022-07-19 | ALTO: A Large-Scale Dataset for UAV Visual Place Recognition and Localization | Ivan Cisneros et.al. | 2207.12317 | link |
2022-07-22 | PLD-SLAM: A Real-Time Visual SLAM Using Points and Line Segments in Dynamic Scenes | BaoSheng Zhang et.al. | 2207.10916 | null |
2022-07-25 | MeshLoc: Mesh-Based Visual Localization | Vojtech Panek et.al. | 2207.10762 | link |
2022-07-20 | Revisiting Hotels-50K and Hotel-ID | Aarash Feizi et.al. | 2207.10200 | link |
2022-07-20 | Feature Representation Learning for Unsupervised Cross-domain Image Retrieval | Conghui Hu et.al. | 2207.09721 | link |
2022-07-19 | SeasoNet: A Seasonal Scene Classification, segmentation and Retrieval dataset for satellite Imagery over Germany | Dominik Koßmann et.al. | 2207.09507 | null |
2022-07-19 | Context Unaware Knowledge Distillation for Image Retrieval | Bytasandram Yaswanth Reddy et.al. | 2207.09070 | link |
2022-07-17 | FashionViL: Fashion-Focused Vision-and-Language Representation Learning | Xiao Han et.al. | 2207.08150 | link |
2022-07-14 | AutoMerge: A Framework for Map Assembling and Smoothing in City-scale Environments | Peng Yin et.al. | 2207.06965 | null |
2022-07-14 | Semi-supervised Vector-Quantization in Visual SLAM using HGCN | Amir Zarringhalam et.al. | 2207.06738 | null |
2022-07-14 | Self-supervised Vector-Quantization in Visual SLAM using Deep Convolutional Autoencoders | Amir Zarringhalam et.al. | 2207.06732 | null |
2022-07-19 | Structure PLP-SLAM: Efficient Sparse Mapping and Localization using Point, Line and Plane for Monocular, RGB-D and Stereo Cameras | Fangwen Shu et.al. | 2207.06058 | link |
2022-07-12 | CPO: Change Robust Panorama to Point Cloud Localization | Junho Kim et.al. | 2207.05317 | link |
2022-07-05 | Hierarchical Average Precision Training for Pertinent Image Retrieval | Elias Ramzi et.al. | 2207.04873 | link |
2022-07-11 | A clinically motivated self-supervised approach for content-based image retrieval of CT liver images | Kristoffer Knutsen Wickstrøm et.al. | 2207.04812 | link |
2022-07-09 | BOSS: Bottom-up Cross-modal Semantic Composition with Hybrid Counterfactual Training for Robust Content-based Image Retrieval | Wenqiao Zhang et.al. | 2207.04211 | null |
2022-07-08 | Learning Sequential Descriptors for Sequence-based Visual Place Recognition | Riccardo Mereu et.al. | 2207.03868 | link |
2022-07-08 | GEMS: Scene Expansion using Generative Models of Graphs | Rishi Agarwal et.al. | 2207.03729 | null |
2022-07-05 | Object-Level Targeted Selection via Deep Template Matching | Suraj Kothawade et.al. | 2207.01778 | null |
2022-07-06 | Adaptive Fine-Grained Sketch-Based Image Retrieval | Ayan Kumar Bhunia et.al. | 2207.01723 | link |
2022-07-04 | Embedding contrastive unsupervised features to cluster in- and out-of-distribution noise in corrupted image datasets | Paul Albert et.al. | 2207.01573 | link |
2022-07-08 | Contrastive Cross-Modal Knowledge Sharing Pre-training for Vision-Language Representation Learning and Retrieval | Keyu Wen et.al. | 2207.00733 | null |
2022-07-01 | DALG: Deep Attentive Local and Global Modeling for Image Retrieval | Yuxin Song et.al. | 2207.00287 | null |
2022-07-04 | BadHash: Invisible Backdoor Attacks against Deep Hashing with Clean Label | Shengshan Hu et.al. | 2207.00278 | link |
2022-06-28 | Improving Worst Case Visual Localization Coverage via Place-specific Sub-selection in Multi-camera Systems | Stephen Hausler et.al. | 2206.13883 | null |
2022-07-08 | How Many Events do You Need? Event-based Visual Place Recognition Using Sparse But Varying Pixels | Tobias Fischer et.al. | 2206.13673 | link |
2022-06-25 | FreSCo: Frequency-Domain Scan Context for LiDAR-based Place Recognition with Translation and Rotation Invariance | Yongzhi Fan et.al. | 2206.12628 | link |
2022-06-25 | Inverted Semantic-Index for Image Retrieval | Ying Wang et.al. | 2206.12623 | null |
2022-06-17 | RetrievalGuard: Provably Robust 1-Nearest Neighbor Image Retrieval | Yihan Wu et.al. | 2206.11225 | null |
2022-06-22 | ICC++: Explainable Image Retrieval for Art Historical Corpora using Image Composition Canvas | Prathmesh Madhu et.al. | 2206.11115 | null |
2022-06-20 | Self-Supervised Consistent Quantization for Fully Unsupervised Image Retrieval | Guile Wu et.al. | 2206.09806 | null |
2022-06-18 | Attention-based Dynamic Subspace Learners for Medical Image Analysis | Sukesh Adiga V et.al. | 2206.09068 | null |
2022-06-17 | Efficient WiFi LiDAR SLAM for Autonomous Robots in Large Environments | Khairuldanial Ismail et.al. | 2206.08733 | null |
2022-06-06 | Learning Treatment Plan Representations for Content Based Image Retrieval | Charles Huang et.al. | 2206.02912 | null |
2022-06-19 | NORPPA: NOvel Ringed seal re-identification by Pelage Pattern Aggregation | Ekaterina Nepovinnykh et.al. | 2206.02498 | link |
2022-06-05 | Autoregressive Model for Multi-Pass SAR Change Detection Based on Image Stacks | B. G. Palm et.al. | 2206.02278 | null |
2022-05-28 | FaIRCoP: Facial Image Retrieval using Contrastive Personalization | Devansh Gupta et.al. | 2205.15870 | null |
2022-05-31 | Investigating the Role of Image Retrieval for Visual Localization – An exhaustive benchmark | Martin Humenberger et.al. | 2205.15761 | link |
2022-05-27 | Improving Road Segmentation in Challenging Domains Using Similar Place Priors | Connor Malone et.al. | 2205.14112 | null |
2022-05-31 | LAMP 2.0: A Robust Multi-Robot SLAM System for Operation in Challenging Large-Scale Underground Environments | Yun Chang et.al. | 2205.13135 | link |
2022-05-26 | Fine-grained Image Captioning with CLIP Reward | Jaemin Cho et.al. | 2205.13115 | link |
2022-05-25 | Deep Dense Local Feature Matching and Vehicle Removal for Indoor Visual Localization | Kyung Ho Park et.al. | 2205.12544 | null |
2022-05-24 | OnePose: One-Shot Object Pose Estimation without CAD Models | Jiaming Sun et.al. | 2205.12257 | link |
2022-05-23 | VPAIR – Aerial Visual Place Recognition and Localization in Large-scale Outdoor Environments | Michael Schleiss et.al. | 2205.11567 | link |
2022-05-23 | VQA-GNN: Reasoning with Multimodal Semantic Graph for Visual Question Answering | Yanan Wang et.al. | 2205.11501 | null |
2022-05-23 | Deep Image Retrieval is not Robust to Label Noise | Stanislav Dereka et.al. | 2205.11195 | null |
2022-05-22 | Geo-Localization via Ground-to-Satellite Cross-View Image Retrieval | Zelong Zeng et.al. | 2205.10878 | link |
2022-05-20 | Visually-Augmented Language Modeling | Weizhi Wang et.al. | 2205.10178 | link |
2022-05-18 | Deep Features for CBIR with Scarce Data using Hebbian Learning | Gabriele Lagani et.al. | 2205.08935 | null |
2022-05-19 | Text Detection & Recognition in the Wild for Robot Localization | Zobeir Raisi et.al. | 2205.08565 | null |
2022-05-12 | One Model, Multiple Modalities: A Sparsely Activated Approach for Text, Sound, Image, Video and Code | Yong Dai et.al. | 2205.06126 | null |
2022-05-11 | Review on Panoramic Imaging and Its Applications in Scene Understanding | Shaohua Gao et.al. | 2205.05570 | null |
2022-05-18 | Identical Image Retrieval using Deep Learning | Sayan Nath et.al. | 2205.04883 | link |
2022-05-09 | Introspective Deep Metric Learning | Chengkun Wang et.al. | 2205.04449 | link |
2022-05-11 | Improved Evaluation and Generation of Grid Layouts using Distance Preservation Quality and Linear Assignment Sorting | Kai Uwe Barthel et.al. | 2205.04255 | link |
2022-05-08 | Adversarial Learning of Hard Positives for Place Recognition | Wenxuan Fang et.al. | 2205.03871 | null |
2022-05-10 | AdaTriplet: Adaptive Gradient Triplet Loss with Automatic Margin Learning for Forensic Medical Image Matching | Khanh Nguyen et.al. | 2205.02849 | link |
2022-04-29 | Privacy-Preserving Model Upgrades with Bidirectional Compatible Training in Image Retrieval | Shupeng Su et.al. | 2204.13919 | null |
2022-04-29 | Leaner and Faster: Two-Stage Model Compression for Lightweight Text-Image Retrieval | Siyu Ren et.al. | 2204.13913 | link |
2022-04-28 | Spatio-Temporal Graph Localization Networks for Image-based Navigation | Takahiro Niwa et.al. | 2204.13237 | null |
2022-04-27 | The Revisiting Problem in Simultaneous Localization and Mapping: A Survey on Visual Loop Closure Detection | Konstantinos A. Tsintotas et.al. | 2204.12831 | null |
2022-04-25 | SceneTrilogy: On Scene Sketches and its Relationship with Text and Photo | Pinaki Nath Chowdhury et.al. | 2204.11964 | null |
2022-04-23 | On Leveraging Variational Graph Embeddings for Open World Compositional Zero-Shot Learning | Muhammad Umer Anwaar et.al. | 2204.11848 | null |
2022-04-24 | Progressive Learning for Image Retrieval with Hybrid-Modality Queries | Yida Zhao et.al. | 2204.11212 | null |
2022-04-23 | Training and challenging models for text-guided fashion image retrieval | Eric Dodds et.al. | 2204.11004 | link |
2022-04-18 | Centralized Adversarial Learning for Robust Deep Hashing | Xunguang Wang et.al. | 2204.10779 | link |
2022-04-22 | Transferring ConvNet Features from Passive to Active Robot Self-Localization: The Use of Ego-Centric and World-Centric Views | Kanya Kurauchi et.al. | 2204.10497 | null |
2022-04-21 | Exploring a Fine-Grained Multiscale Method for Cross-Modal Remote Sensing Image Retrieval | Zhiqiang Yuan et.al. | 2204.09868 | link |
2022-04-21 | Remote Sensing Cross-Modal Text-Image Retrieval Based on Global and Local Information | Zhiqiang Yuan et.al. | 2204.09860 | link |
2022-04-20 | Uncertainty-based Cross-Modal Retrieval with Probabilistic Representations | Leila Pishdad et.al. | 2204.09268 | null |
2022-04-19 | Unsupervised Contrastive Hashing for Cross-Modal Retrieval in Remote Sensing | Georgii Mikriukov et.al. | 2204.08707 | null |
2022-04-18 | Multiple-environment Self-adaptive Network for Aerial-view Geo-localization | Tingyu Wang et.al. | 2204.08381 | link |
2022-04-15 | Condition-Invariant and Compact Visual Place Description by Convolutional Autoencoder | Hanjing Ye et.al. | 2204.07350 | link |
2022-04-14 | Composite Code Sparse Autoencoders for first stage retrieval | Carlos Lassance et.al. | 2204.07023 | null |
2022-04-13 | Reuse your features: unifying retrieval and feature-metric alignment | Javier Morlana et.al. | 2204.06292 | link |
2022-04-12 | Probabilistic Compositional Embeddings for Multimodal Image Retrieval | Andrei Neculai et.al. | 2204.05845 | link |
2022-04-12 | Three-Stream Joint Network for Zero-Shot Sketch-Based Image Retrieval | Yu-Wei Zhan et.al. | 2204.05666 | null |
2022-04-12 | HiTPR: Hierarchical Transformer for Place Recognition in Point Cloud | Zhixing Hou et.al. | 2204.05481 | null |
2022-04-11 | Optimized SC-F-LOAM: Optimized Fast LiDAR Odometry and Mapping Using Scan Context | Lizhou Liao et.al. | 2204.04932 | link |
2022-04-10 | Beyond Cross-view Image Retrieval: Highly Accurate Vehicle Localization Using Satellite Image | Yujiao Shi et.al. | 2204.04752 | link |
2022-04-08 | A Generic Image Retrieval Method for Date Estimation of Historical Document Collections | Adrià Molina et.al. | 2204.04028 | null |
2022-04-08 | SnapMode: An Intelligent and Distributed Large-Scale Fashion Image Retrieval Platform Based On Big Data and Deep Generative Adversarial Network Technologies | Narges Norouzi et.al. | 2204.03998 | null |
2022-04-05 | Leveraging Equivariant Features for Absolute Pose Regression | Mohamed Adel Musallam et.al. | 2204.02163 | null |
2022-04-04 | “This is my unicorn, Fluffy”: Personalizing frozen vision-language representations | Niv Cohen et.al. | 2204.01694 | link |
2022-04-01 | Bi-directional Loop Closure for Visual SLAM | Ihtisham Ali et.al. | 2204.01524 | null |
2022-04-01 | LASER: LAtent SpacE Rendering for 2D Visual Localization | Zhixiang Min et.al. | 2204.00157 | link |
2022-03-31 | Semantic Pose Verification for Outdoor Visual Localization with Self-supervised Contrastive Learning | Semih Orhan et.al. | 2203.16945 | null |
2022-03-30 | AmsterTime: A Visual Place Recognition Benchmark Dataset for Severe Domain Shift | Burak Yildiz et.al. | 2203.16291 | link |
2022-03-29 | Long-term Visual Map Sparsification with Heterogeneous GNN | Ming-Fang Chang et.al. | 2203.15182 | null |
2022-04-01 | A Simulation Benchmark for Vision-based Autonomous Navigation | Lauri Suomela et.al. | 2203.13048 | link |
2022-03-24 | Is Geometry Enough for Matching in Visual Localization? | Qunjie Zhou et.al. | 2203.12979 | link |
2022-03-21 | MatchFormer: Interleaving Attention in Transformers for Feature Matching | Qing Wang et.al. | 2203.09645 | link |
2022-03-10 | ReF – Rotation Equivariant Features for Local Feature Matching | Abhishek Peri et.al. | 2203.05206 | null |
2022-03-09 | Object-Based Visual Camera Pose Estimation From Ellipsoidal Model and 3D-Aware Ellipse Prediction | Matthieu Zins et.al. | 2203.04613 | null |
2022-03-08 | Tune your Place Recognition: Self-Supervised Domain Calibration via Robust SLAM | Pierre-Yves Lajoie et.al. | 2203.04446 | link |
2022-03-07 | ZippyPoint: Fast Interest Point Detection, Description, and Matching through Mixed Precision Discretization | Simon Maurer et.al. | 2203.03610 | link |
2022-03-07 | Multi-Modal Lidar Dataset for Benchmarking General-Purpose Localization and Mapping Algorithms | Qingqing Li et.al. | 2203.03454 | link |
2022-03-01 | SwitchHit: A Probabilistic, Complementarity-Based Switching System for Improved Visual Place Recognition in Changing Environments | Maria Waheed et.al. | 2203.00591 | null |
2022-02-28 | Deep Camera Pose Regression Using Pseudo-LiDAR | Ali Raza et.al. | 2203.00080 | null |
2022-02-25 | RELMOBNET: A Robust Two-Stage End-To-End Training Approach For MOBILENETV3 Based Relative Camera Pose Estimation | Praveen Kumar Rajendran et.al. | 2202.12838 | null |
2022-02-24 | Highly-Efficient Binary Neural Networks for Visual Place Recognition | Bruno Ferrarini et.al. | 2202.12375 | null |
2022-02-18 | MultiRes-NetVLAD: Augmenting Place Recognition Training with Low-Resolution Imagery | Ahmad Khaliq et.al. | 2202.09146 | link |
2022-02-14 | Tightly Coupled Learning Strategy for Weakly Supervised Hierarchical Place Recognition | Y. Shen et.al. | 2202.06470 | null |
2022-02-11 | Patch-NetVLAD+: Learned patch descriptor and weighted matching strategy for place recognition | Yingfeng Cai et.al. | 2202.05738 | null |
2022-02-09 | Object-Guided Day-Night Visual Localization in Urban Scenes | Assia Benbihi et.al. | 2202.04445 | null |
2022-02-08 | A Novel Image Descriptor with Aggregated Semantic Skeleton Representation for Long-term Visual Place Recognition | Nie Jiwei et.al. | 2202.03677 | null |
2022-02-25 | CFP-SLAM: A Real-time Visual SLAM Based on Coarse-to-Fine Probability in Dynamic Environments | Xinggang Hu et.al. | 2202.01938 | null |
2022-02-03 | Danish Airs and Grounds: A Dataset for Aerial-to-Street-Level Place Recognition and Localization | Andrea Vallone et.al. | 2202.01821 | null |
2022-02-02 | Training Semantic Descriptors for Image-Based Localization | Ibrahim Cinaroglu et.al. | 2202.01212 | null |
2022-01-31 | Hydra: A Real-time Spatial Perception Engine for 3D Scene Graph Construction and Optimization | Nathan Hughes et.al. | 2201.13360 | null |
2022-01-31 | Rigidity Preserving Image Transformations and Equivariance in Perspective | Lucas Brynte et.al. | 2201.13065 | null |
2022-01-25 | Learning Semantics for Visual Place Recognition through Multi-Scale Attention | Valerio Paolicelli et.al. | 2201.09701 | link |
2022-01-22 | Phase-SLAM: Phase Based Simultaneous Localization and Mapping for Mobile Structured Light Illumination Systems | Xi Zheng et.al. | 2201.09048 | link |
2022-01-15 | A Critical Analysis of Image-based Camera Pose Estimation Techniques | Meng Xu et.al. | 2201.05816 | null |
2022-01-14 | SRVIO: Super Robust Visual Inertial Odometry for dynamic environments and challenging Loop-closure conditions | Ali Samadzadeh et.al. | 2201.05386 | link |
2021-12-23 | NinjaDesc: Content-Concealing Visual Descriptors via Adversarial Learning | Tony Ng et.al. | 2112.12785 | null |
2021-12-16 | CrossLoc: Scalable Aerial Localization Assisted by Multimodal Synthetic Data | Qi Yan et.al. | 2112.09081 | link |
2021-12-05 | RADA: Robust Adversarial Data Augmentation for Camera Localization in Challenging Weather | Jialu Wang et.al. | 2112.02469 | null |
2021-11-25 | MegLoc: A Robust and Accurate Visual Localization Pipeline | Shuxue Peng et.al. | 2111.13063 | null |
2021-10-08 | Semantic Image Alignment for Vehicle Localization | Markus Herb et.al. | 2110.04162 | null |
2021-10-05 | Season-invariant GNSS-denied visual localization for UAVs | Jouko Kinnari et.al. | 2110.01967 | link |
2021-09-30 | Forming a sparse representation for visual place recognition using a neurorobotic approach | Sylvain Colomer et.al. | 2109.14916 | null |
2021-09-22 | Audio-Visual Grounding Referring Expression for Robotic Manipulation | Yefei Wang et.al. | 2109.10571 | null |
2021-09-20 | Efficient shape mapping through dense touch and vision | Sudharshan Suresh et.al. | 2109.09884 | link |
2021-09-15 | S3LAM: Structured Scene SLAM | Mathieu Gonzalez et.al. | 2109.07339 | null |
2021-09-13 | Monocular Camera Localization for Automated Vehicles Using Image Retrieval | Eunhyek Joa et.al. | 2109.06296 | null |
2021-09-10 | Line as a Visual Sentence: Context-aware Line Descriptor for Visual Localization | Sungho Yoon et.al. | 2109.04753 | link |
2021-09-09 | CrowdDriven: A New Challenging Dataset for Outdoor Visual Localization | Ara Jafarzadeh et.al. | 2109.04527 | null |
2021-09-09 | Keeping an Eye on Things: Deep Learned Features for Long-Term Visual Localization | Mona Gridseth et.al. | 2109.04041 | link |
Keypoint Detection
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-12-19 | Corn Ear Detection and Orientation Estimation Using Deep Learning | Nathan Sprague et.al. | 2412.14954 | null |
2024-12-12 | Agtech Framework for Cranberry-Ripening Analysis Using Vision Foundation Models | Faith Johnson et.al. | 2412.09739 | null |
2024-12-09 | An Efficient Scene Coordinate Encoding and Relocalization Method | Kuan Xu et.al. | 2412.06488 | link |
2024-12-09 | ZeroKey: Point-Level Reasoning and Zero-Shot 3D Keypoint Detection from Large Language Models | Bingchen Gong et.al. | 2412.06292 | null |
2024-12-07 | Securing Social Media Against Deepfakes using Identity, Behavioral, and Geometric Signatures | Muhammad Umar Farooq et.al. | 2412.05487 | null |
2024-12-04 | Measure Anything: Real-time, Multi-stage Vision-based Dimensional Measurement using Segment Anything | Yongkyu Lee et.al. | 2412.03472 | null |
2024-12-02 | MamKPD: A Simple Mamba Baseline for Real-Time 2D Keypoint Detection | Yonghao Dang et.al. | 2412.01422 | null |
2024-11-23 | OCDet: Object Center Detection via Bounding Box-Aware Heatmap Prediction on Edge Devices with NPUs | Chen Xin et.al. | 2411.15653 | link |
2024-11-19 | IoT-Based 3D Pose Estimation and Motion Optimization for Athletes: Application of C3D and OpenPose | Fei Ren et.al. | 2411.12676 | null |
2024-11-04 | Silver medal Solution for Image Matching Challenge 2024 | Yian Wang et.al. | 2411.01851 | null |
2024-11-04 | KptLLM: Unveiling the Power of Large Language Model for Keypoint Comprehension | Jie Yang et.al. | 2411.01846 | null |
2024-10-31 | From Web Data to Real Fields: Low-Cost Unsupervised Domain Adaptation for Agricultural Robots | Vasileios Tzouras et.al. | 2410.23906 | null |
2024-10-04 | Self-Supervised Keypoint Detection with Distilled Depth Keypoint Representation | Aman Anand et.al. | 2410.14700 | null |
2024-11-27 | Sim2real Cattle Joint Estimation in 3D point clouds | Mohammad Okour et.al. | 2410.14419 | null |
2024-10-16 | PND-Net: Plant Nutrition Deficiency and Disease Classification using Graph Convolutional Network | Asish Bera et.al. | 2410.12742 | null |
2024-10-16 | RAFA-Net: Region Attention Network For Food Items And Agricultural Stress Recognition | Asish Bera et.al. | 2410.12718 | null |
2024-10-01 | A Robust Multisource Remote Sensing Image Matching Method Utilizing Attention and Feature Enhancement Against Noise Interference | Yuan Li et.al. | 2410.11848 | null |
2024-10-11 | Facial Chick Sexing: An Automated Chick Sexing System From Chick Facial Image | Marta Veganzones Rodriguez et.al. | 2410.09155 | null |
2024-10-08 | Unsupervised Model Diagnosis | Yinong Oliver Wang et.al. | 2410.06243 | null |
2024-10-08 | Equi-GSPR: Equivariant SE(3) Graph Network Model for Sparse Point Cloud Registration | Xueyang Kang et.al. | 2410.05729 | link |
2024-10-16 | Key-Grid: Unsupervised 3D Keypoints Detection using Grid Heatmap Features | Chengkai Hou et.al. | 2410.02237 | null |
2024-10-02 | Gaussian-Det: Learning Closed-Surface Gaussians for 3D Object Detection | Hongru Yan et.al. | 2410.01404 | null |
2024-09-30 | OpenKD: Opening Prompt Diversity for Zero- and Few-shot Keypoint Detection | Changsheng Lu et.al. | 2409.19899 | link |
2024-10-07 | SKT: Integrating State-Aware Keypoint Trajectories with Vision-Language Models for Robotic Garment Manipulation | Xin Li et.al. | 2409.18082 | null |
2024-09-24 | GSplatLoc: Grounding Keypoint Descriptors into 3D Gaussian Splatting for Improved Visual Localization | Gennady Sidorov et.al. | 2409.16502 | link |
2024-09-20 | Keypoint Detection Technique for Image-Based Visual Servoing of Manipulators | Niloufar Amiri et.al. | 2409.13668 | null |
2024-09-25 | Precision Aquaculture: An Integrated Computer Vision and IoT Approach for Optimized Tilapia Feeding | Rania Hossam et.al. | 2409.08695 | link |
2024-09-06 | D4: Text-guided diffusion model-based domain adaptive data augmentation for vineyard shoot detection | Kentaro Hirahara et.al. | 2409.04060 | null |
2024-10-01 | Towards Practical Human Motion Prediction with LiDAR Point Clouds | Xiao Han et.al. | 2408.08202 | null |
2024-07-31 | Certifying Robustness of Learning-Based Keypoint Detection and Pose Estimation Methods | Xusheng Luo et.al. | 2408.00117 | null |
2024-07-26 | SHIC: Shape-Image Correspondences with no Keypoint Supervision | Aleksandar Shtedritski et.al. | 2407.18907 | null |
2024-07-25 | LION: Linear Group RNN for 3D Object Detection in Point Clouds | Zhe Liu et.al. | 2407.18232 | link |
2024-07-22 | RADA: Robust and Accurate Feature Learning with Domain Adaptation | Jingtai He et.al. | 2407.15791 | null |
2024-07-09 | LVLM-empowered Multi-modal Representation Learning for Visual Place Recognition | Teng Wang et.al. | 2407.06730 | null |
2024-07-04 | PFGS: High Fidelity Point Cloud Rendering via Feature Splatting | Jiaxu Wang et.al. | 2407.03857 | link |
2024-07-03 | A Radiometric Correction based Optical Modeling Approach to Removing Reflection Noise in TLS Point Clouds of Urban Scenes | Li Fang et.al. | 2407.02830 | link |
2024-07-02 | Multi-Grained Contrast for Data-Efficient Unsupervised Representation Learning | Chengchao Shen et.al. | 2407.02014 | link |
2024-06-28 | Beyond First-Order: A Multi-Scale Approach to Finger Knuckle Print Biometrics | Chengrui Gao et.al. | 2406.19672 | null |
2024-07-23 | A Certifiable Algorithm for Simultaneous Shape Estimation and Object Tracking | Lorenzo Shaikewitz et.al. | 2406.16837 | link |
2024-06-03 | Scale-Free Image Keypoints Using Differentiable Persistent Homology | Giovanni Barbarani et.al. | 2406.01315 | link |
2024-06-23 | W-Net: A Facial Feature-Guided Face Super-Resolution Network | Hao Liu et.al. | 2406.00676 | null |
2024-05-25 | Deep-PE: A Learning-Based Pose Evaluator for Point Cloud Registration | Junjie Gao et.al. | 2405.16085 | null |
2024-06-01 | Benchmarking Fish Dataset and Evaluation Metric in Keypoint Detection – Towards Precise Fish Morphological Assessment in Aquaculture Breeding | Weizhen Liu et.al. | 2405.12476 | link |
2024-05-14 | TP3M: Transformer-based Pseudo 3D Image Matching with Reference | Liming Han et.al. | 2405.08434 | null |
2024-05-15 | Vector-Symbolic Architecture for Event-Based Optical Flow | Hongzhi You et.al. | 2405.08300 | null |
2024-05-13 | RGBD-Glue: General Feature Combination for Robust RGB-D Point Cloud Registration | Congjia Chen et.al. | 2405.07594 | null |
2024-05-08 | Unsupervised Skin Feature Tracking with Deep Neural Networks | Jose Chang et.al. | 2405.04943 | null |
2024-05-07 | A Self-Supervised Method for Body Part Segmentation and Keypoint Detection of Rat Images | László Kopácsi et.al. | 2405.04650 | null |
2024-04-30 | A Light-weight Transformer-based Self-supervised Matching Network for Heterogeneous Images | Wang Zhang et.al. | 2404.19311 | null |
2024-04-25 | Adaptive Local Binary Pattern: A Novel Feature Descriptor for Enhanced Analysis of Kidney Abnormalities in CT Scan Images using ensemble based Machine Learning Approach | Tahmim Hossain et.al. | 2404.14560 | null |
2024-04-19 | SkelFormer: Markerless 3D Pose and Shape Estimation using Skeletal Transformers | Vandad Davoodnia et.al. | 2404.12625 | null |
2024-04-17 | Pixel-Wise Symbol Spotting via Progressive Points Location for Parsing CAD Images | Junbiao Pang et.al. | 2404.10985 | null |
2024-03-28 | Towards Long Term SLAM on Thermal Imagery | Colin Keil et.al. | 2403.19885 | link |
2024-03-28 | Instance-Adaptive and Geometric-Aware Keypoint Learning for Category-Level 6D Object Pose Estimation | Xiao Lin et.al. | 2403.19527 | link |
2024-03-27 | RoboKeyGen: Robot Pose and Joint Angles Estimation via Diffusion-based 3D Keypoint Generation | Yang Tian et.al. | 2403.18259 | null |
2024-03-18 | FE-DeTr: Keypoint Detection and Tracking in Low-quality Image Frames with Events | Xiangyuan Wang et.al. | 2403.11662 | link |
2024-03-05 | Self-supervised 3D Patient Modeling with Multi-modal Attentive Fusion | Meng Zheng et.al. | 2403.03217 | null |
2024-02-22 | A Self-supervised Pressure Map human keypoint Detection Approch: Optimizing Generalization and Computational Efficiency Across Datasets | Chengzhang Yu et.al. | 2402.14241 | null |
2024-02-25 | A Feature Matching Method Based on Multi-Level Refinement Strategy | Shaojie Zhang et.al. | 2402.13488 | null |
2024-03-05 | 3D Kinematics Estimation from Video with a Biomechanical Model and Synthetic Training Data | Zhi-Yi Lin et.al. | 2402.13172 | null |
2024-02-25 | Region Feature Descriptor Adapted to High Affine Transformations | Shaojie Zhang et.al. | 2402.09724 | null |
2024-01-29 | Reconstructing Close Human Interactions from Multiple Views | Qing Shuai et.al. | 2401.16173 | link |
2024-01-17 | To deform or not: treatment-aware longitudinal registration for breast DCE-MRI during neoadjuvant chemotherapy via unsupervised keypoints detection | Luyi Han et.al. | 2401.09336 | link |
2024-01-08 | Flowmind2Digital: The First Comprehensive Flowmind Recognition and Conversion Approach | Huanyu Liu et.al. | 2401.03742 | link |
2024-03-22 | 6D-Diff: A Keypoint Diffusion Framework for 6D Object Pose Estimation | Li Xu et.al. | 2401.00029 | null |
2023-12-27 | Bezier-based Regression Feature Descriptor for Deformable Linear Objects | Fangqing Chen et.al. | 2312.16502 | null |
2023-12-24 | Residual Learning for Image Point Descriptors | Rashik Shrestha et.al. | 2312.15471 | null |
2023-12-22 | BonnBeetClouds3D: A Dataset Towards Point Cloud-based Organ-level Phenotyping of Sugar Beet Plants under Field Conditions | Elias Marks et.al. | 2312.14706 | null |
2023-12-19 | Adaptive Distribution Masked Autoencoders for Continual Test-Time Adaptation | Jiaming Liu et.al. | 2312.12480 | null |
2023-12-19 | An effective image copy-move forgery detection using entropy image | Zhaowei Lu et.al. | 2312.11793 | link |
2023-12-11 | VoxelKP: A Voxel-based Network Architecture for Human Keypoint Estimation in LiDAR Data | Jian Shi et.al. | 2312.08871 | link |
2023-12-11 | Keypoint-based Stereophotoclinometry for Characterizing and Navigating Small Bodies: A Factor Graph Approach | Travis Driver et.al. | 2312.06865 | link |
2023-12-01 | Tracking Object Positions in Reinforcement Learning: A Metric for Keypoint Detection (extended version) | Emma Cramer et.al. | 2312.00592 | link |
2023-11-30 | Utilizing Radiomic Feature Analysis For Automated MRI Keypoint Detection: Enhancing Graph Applications | Sahar Almahfouz Nasser et.al. | 2311.18281 | null |
2023-11-29 | Back to 3D: Few-Shot 3D Keypoint Detection with Back-Projected 2D Features | Thomas Wimmer et.al. | 2311.18113 | link |
2023-11-28 | Diffusion 3D Features (Diff3F): Decorating Untextured Shapes with Distilled Semantic Features | Niladri Shekhar Dutt et.al. | 2311.17024 | link |
2023-11-28 | Riemannian Self-Attention Mechanism for SPD Networks | Rui Wang et.al. | 2311.16738 | null |
2023-11-27 | A manometric feature descriptor with linear-SVM to distinguish esophageal contraction vigor | Jialin Liu et.al. | 2311.15609 | null |
2023-11-21 | Instance-aware 3D Semantic Segmentation powered by Shape Generators and Classifiers | Bo Sun et.al. | 2311.12291 | null |
2023-11-20 | CurriculumLoc: Enhancing Cross-Domain Geolocalization through Multi-Stage Refinement | Boni Hu et.al. | 2311.11604 | link |
2023-11-17 | Video-based Sequential Bayesian Homography Estimation for Soccer Field Registration | Paul J. Claasen et.al. | 2311.10361 | link |
2023-11-13 | Processing and Segmentation of Human Teeth from 2D Images using Weakly Supervised Learning | Tomáš Kunzo et.al. | 2311.07398 | null |
2023-11-11 | CVTHead: One-shot Controllable Head Avatar with Vertex-feature Transformer | Haoyu Ma et.al. | 2311.06443 | link |
2023-11-08 | 3D Pose Estimation of Tomato Peduncle Nodes using Deep Keypoint Detection and Point Cloud | Jianchao Ci et.al. | 2311.04699 | null |
2023-11-06 | TAMPAR: Visual Tampering Detection for Parcel Logistics in Postal Supply Chains | Alexander Naumann et.al. | 2311.03124 | link |
2023-11-06 | An invariant feature extraction for multi-modal images matching | Chenzhong Gao et.al. | 2311.02842 | null |
2023-10-20 | Feature Selection and Hyperparameter Fine-tuning in Artificial Neural Networks for Wood Quality Classification | Mateus Roder et.al. | 2310.13490 | null |
2023-10-12 | UniPose: Detecting Any Keypoints | Jie Yang et.al. | 2310.08530 | link |
2023-10-10 | l-dyno: framework to learn consistent visual features using robot’s motion | Kartikeya Singh et.al. | 2310.06249 | link |
2023-10-10 | Language-driven Open-Vocabulary Keypoint Detection for Animal Body and Face | Hao Zhang et.al. | 2310.05056 | link |
2023-10-13 | H-InDex: Visual Reinforcement Learning with Hand-Informed Representations for Dexterous Manipulation | Yanjie Ze et.al. | 2310.01404 | link |
2023-10-04 | Self-supervised Learning of Contextualized Local Visual Embeddings | Thalles Santos Silva et.al. | 2310.00527 | link |
2023-10-22 | ObVi-SLAM: Long-Term Object-Visual SLAM | Amanda Adkins et.al. | 2309.15268 | link |
2023-09-19 | LiDAR-Generated Images Derived Keypoints Assisted Point Cloud Registration Scheme in Odometry Estimation | Haizhou Zhang et.al. | 2309.10436 | link |
2023-09-18 | RIDE: Self-Supervised Learning of Rotation-Equivariant Keypoint Detection and Invariant Description for Endoscopy | Mert Asim Karaoglu et.al. | 2309.09563 | null |
2023-09-17 | CryoAlign: feature-based method for global and local 3D alignment of EM density maps | Bintao He et.al. | 2309.09217 | null |
2023-09-14 | EP2P-Loc: End-to-End 3D Point to 2D Pixel Localization for Large-Scale Visual Localization | Minjung Kim et.al. | 2309.07471 | link |
2023-09-09 | Mirror-Aware Neural Humans | Daniel Ajisafe et.al. | 2309.04750 | link |
2023-09-07 | InstructDiffusion: A Generalist Modeling Interface for Vision Tasks | Zigang Geng et.al. | 2309.03895 | null |
2023-09-04 | SKoPe3D: A Synthetic Dataset for Vehicle Keypoint Perception in 3D from Traffic Monitoring Cameras | Himanshu Pahadia et.al. | 2309.01324 | null |
2023-09-12 | Improving the matching of deformable objects by learning to detect keypoints | Felipe Cadar et.al. | 2309.00434 | link |
2023-08-31 | SportsSloMo: A New Benchmark and Baselines for Human-centric Video Frame Interpolation | Jiaben Chen et.al. | 2308.16876 | null |
2023-08-30 | Learning Structure-from-Motion with Graph Attention Networks | Lucas Brynte et.al. | 2308.15984 | link |
2023-08-29 | A lightweight 3D dense facial landmark estimation model from position map data | Shubhajit Basak et.al. | 2308.15170 | link |
2023-08-27 | Automatic coarse co-registration of point clouds from diverse scan geometries: a test of detectors and descriptors | Francesco Pirotti et.al. | 2308.14047 | null |
2023-08-24 | VNI-Net: Vector Neurons-based Rotation-Invariant Descriptor for LiDAR Place Recognition | Gengxuan Tian et.al. | 2308.12870 | null |
2023-08-22 | LDP-Feat: Image Features with Local Differential Privacy | Francesco Pittaluga et.al. | 2308.11223 | null |
2023-08-20 | Neural Interactive Keypoint Detection | Jie Yang et.al. | 2308.10174 | link |
2023-08-19 | ClothesNet: An Information-Rich 3D Garment Model Repository with Simulated Clothes Environment | Bingyang Zhou et.al. | 2308.09987 | null |
2023-09-03 | DeDoDe: Detect, Don’t Describe – Describe, Don’t Detect for Local Feature Matching | Johan Edstedt et.al. | 2308.08479 | link |
2023-08-15 | CoDeF: Content Deformation Fields for Temporally Consistent Video Processing | Hao Ouyang et.al. | 2308.07926 | link |
2023-08-15 | ChartDETR: A Multi-shape Detection Network for Visual Chart Recognition | Wenyuan Xue et.al. | 2308.07743 | null |
2023-08-14 | DELO: Deep Evidential LiDAR Odometry using Partial Optimal Transport | Sk Aziz Ali et.al. | 2308.07153 | null |
2023-08-14 | 2D3D-MATR: 2D-3D Matching Transformer for Detection-free Registration between Images and Point Clouds | Minhao Li et.al. | 2308.05667 | link |
2023-08-02 | Automated Hit-frame Detection for Badminton Match Analysis | Yu-Hang Chien et.al. | 2307.16000 | link |
2023-07-25 | Mini-PointNetPlus: a local feature descriptor in deep learning model for 3d environment perception | Chuanyu Luo et.al. | 2307.13300 | null |
2023-07-21 | Reverse Knowledge Distillation: Training a Large Model using a Small One for Retinal Image Matching on Limited Data | Sahar Almahfouz Nasser et.al. | 2307.10698 | link |
2023-07-19 | SAMConvex: Fast Discrete Optimization for CT Registration using Self-supervised Anatomical Embedding and Correlation Pyramid | Zi Li et.al. | 2307.09727 | link |
2023-07-01 | SyMFM6D: Symmetry-aware Multi-directional Fusion for Multi-View 6D Object Pose Estimation | Fabian Duffhauss et.al. | 2307.00306 | link |
2023-06-27 | Detector-Free Structure from Motion | Xingyi He et.al. | 2306.15669 | link |
2023-06-26 | CLERA: A Unified Model for Joint Cognitive Load and Eye Region Analysis in the Wild | Li Ding et.al. | 2306.15073 | null |
2023-06-28 | Topology Repairing of Disconnected Pulmonary Airways and Vessels: Baselines and a Dataset | Ziqiao Weng et.al. | 2306.07089 | link |
2023-06-07 | Learning Probabilistic Coordinate Fields for Robust Correspondences | Weiyue Zhao et.al. | 2306.04231 | null |
2023-06-03 | LDEB – Label Digitization with Emotion Binarization and Machine Learning for Emotion Recognition in Conversational Dialogues | Amitabha Dey et.al. | 2306.02193 | null |
2023-06-02 | Self-supervised Interest Point Detection and Description for Fisheye and Perspective Images | Marcela Mera-Trujillo et.al. | 2306.01938 | null |
2023-06-01 | A Probabilistic Relaxation of the Two-Stage Object Pose Estimation Paradigm | Onur Beker et.al. | 2306.00892 | null |
2023-05-30 | Align, Perturb and Decouple: Toward Better Leverage of Difference Information for RSI Change Detection | Supeng Wang et.al. | 2305.18714 | link |
2023-05-23 | Diffusion Hyperfeatures: Searching Through Time and Space for Semantic Correspondence | Grace Luo et.al. | 2305.14334 | null |
2023-05-15 | Non-Separable Multi-Dimensional Network Flows for Visual Computing | Viktoria Ehm et.al. | 2305.08628 | null |
2023-05-13 | Illumination-insensitive Binary Descriptor for Visual Measurement Based on Local Inter-patch Invariance | Xinyu Lin et.al. | 2305.07943 | link |
2023-05-05 | HD2Reg: Hierarchical Descriptors and Detectors for Point Cloud Registration | Canhui Tang et.al. | 2305.03487 | link |
2023-04-17 | Human Pose Estimation in Monocular Omnidirectional Top-View Images | Jingrui Yu et.al. | 2304.08186 | null |
2023-04-14 | CoPR: Towards Accurate Visual Localization With Continuous Place-descriptor Regression | Mubariz Zaffar et.al. | 2304.07426 | null |
2023-04-12 | SiLK – Simple Learned Keypoints | Pierre Gleize et.al. | 2304.06194 | link |
2023-04-06 | From Saliency to DINO: Saliency-guided Vision Transformer for Few-shot Keypoint Detection | Changsheng Lu et.al. | 2304.03140 | null |
2023-03-29 | NerVE: Neural Volumetric Edges for Parametric Curve Extraction from Point Cloud | Xiangyu Zhu et.al. | 2303.16465 | null |
2023-03-24 | PanoVPR: Towards Unified Perspective-to-Equirectangular Visual Place Recognition via Sliding Windows across the Panoramic View | Ze Shi et.al. | 2303.14095 | link |
2023-03-23 | Semantic Image Attack for Visual Model Diagnosis | Jinqi Luo et.al. | 2303.13010 | null |
2023-03-22 | Object Pose Estimation with Statistical Guarantees: Conformal Keypoint Detection and Geometric Uncertainty Propagation | Heng Yang et.al. | 2303.12246 | link |
2023-03-21 | RN-Net: Reservoir Nodes-Enabled Neuromorphic Vision Sensing Network | Sangmin Yoo et.al. | 2303.10770 | null |
2023-03-17 | ShaRPy: Shape Reconstruction and Hand Pose Estimation from RGB-D with Uncertainty | Vanessa Wirth et.al. | 2303.10042 | null |
2023-03-15 | Descriptor Distillation for Efficient Multi-Robot SLAM | Xiyue Guo et.al. | 2303.08420 | null |
2023-03-15 | From Local Binary Patterns to Pixel Difference Networks for Efficient Visual Representation Learning | Zhuo Su et.al. | 2303.08414 | null |
2023-03-16 | KGNv2: Separating Scale and Pose Prediction for Keypoint-based 6-DoF Grasp Synthesis on RGB-D input | Yiye Chen et.al. | 2303.05617 | link |
2023-03-07 | External Camera-based Mobile Robot Pose Estimation for Collaborative Perception with Smart Edge Sensors | Simon Bultmann et.al. | 2303.03797 | null |
2023-02-26 | PaRK-Detect: Towards Efficient Multi-Task Satellite Imagery Road Extraction via Patch-Wise Keypoints Detection | Shenwei Xie et.al. | 2302.13263 | null |
2023-02-24 | Hybrid machine-learned homogenization: Bayesian data mining and convolutional neural networks | Julian Lißner et.al. | 2302.12545 | null |
2023-02-21 | Deep Reinforcement Learning Based on Local GNN for Goal-conditioned Deformable Object Rearranging | Yuhong Deng et.al. | 2302.10446 | null |
2023-02-12 | A Correct-and-Certify Approach to Self-Supervise Object Pose Estimators via Ensemble Self-Training | Jingnan Shi et.al. | 2302.06019 | null |
2023-02-11 | Rethinking Vision Transformer and Masked Autoencoder in Multimodal Face Anti-Spoofing | Zitong Yu et.al. | 2302.05744 | null |
2023-02-09 | MAPS: A Noise-Robust Progressive Learning Approach for Source-Free Domain Adaptive Keypoint Detection | Yuhe Ding et.al. | 2302.04589 | link |
2023-02-03 | Explicit Box Detection Unifies End-to-End Multi-Person Pose Estimation | Jie Yang et.al. | 2302.01593 | link |
2023-02-03 | Simple, Effective and General: A New Backbone for Cross-view Image Geo-localization | Yingying Zhu et.al. | 2302.01572 | link |
2023-01-21 | Vision Aided Environment Semantics Extraction and Its Application in mmWave Beam Selection | Feiyang Wen et.al. | 2301.08973 | null |
2023-01-18 | OnePose++: Keypoint-Free One-Shot Object Pose Estimation without CAD Models | Xingyi He et.al. | 2301.07673 | null |
2023-01-12 | Towards High Performance One-Stage Human Pose Estimation | Ling Li et.al. | 2301.04842 | null |
2022-12-31 | Rethinking Rotation Invariance with Point Cloud Registration | Jianhui Yu et.al. | 2301.00149 | null |
2023-02-06 | Fruit Ripeness Classification: a Survey | Matteo Rizzo et.al. | 2212.14441 | null |
2022-12-28 | NeMo: 3D Neural Motion Fields from Multiple Video Instances of the Same Action | Kuan-Chieh Wang et.al. | 2212.13660 | link |
2022-12-24 | HandsOff: Labeled Dataset Generation With No Additional Human Annotations | Austin Xu et.al. | 2212.12645 | null |
2022-12-13 | Learning to Detect Good Keypoints to Match Non-Rigid Objects in RGB Images | Welerson Melo et.al. | 2212.09589 | link |
2022-12-15 | Learning Markerless Robot-Depth Camera Calibration and End-Effector Pose Estimation | Bugra C. Sefercik et.al. | 2212.07567 | null |
2023-02-01 | DDM-NET: End-to-end learning of keypoint feature Detection, Description and Matching for 3D localization | Xiangyu Xu et.al. | 2212.04575 | null |
2022-12-07 | ViTPose+: Vision Transformer Foundation Model for Generic Body Pose Estimation | Yufei Xu et.al. | 2212.04246 | link |
2022-12-15 | Designing Feature Vector Representations: A case study from Chemistry | Signe Sidwall Thygesen et.al. | 2212.03731 | null |
2022-12-09 | DiffuPose: Monocular 3D Human Pose Estimation via Denoising Diffusion Probabilistic Model | Jeongjun Choi et.al. | 2212.02796 | link |
2022-12-05 | Images Speak in Images: A Generalist Painter for In-Context Visual Learning | Xinlong Wang et.al. | 2212.02499 | link |
2022-12-06 | R2FD2: Fast and Robust Matching of Multimodal Remote Sensing Image via Repeatable Feature Detector and Rotation-invariant Feature Descriptor | Bai Zhu et.al. | 2212.02277 | null |
2022-11-28 | FeatureBooster: Boosting Feature Descriptors with a Lightweight Neural Network | Xinjiang Wang et.al. | 2211.15069 | link |
2022-11-29 | BALF: Simple and Efficient Blur Aware Local Feature Detector | Zhenjun Zhao et.al. | 2211.14731 | null |
2022-11-21 | Conjugate Product Graphs for Globally Optimal 2D-3D Shape Matching | Paul Roetzer et.al. | 2211.11589 | link |
2022-11-07 | Learning Feature Descriptors for Pre- and Intra-operative Point Cloud Matching for Laparoscopic Liver Registration | Zixin Yang et.al. | 2211.03688 | null |
2022-10-31 | Tree Detection and Diameter Estimation Based on Deep Learning | Vincent Grondin et.al. | 2210.17424 | link |
2022-10-26 | Learning a Task-specific Descriptor for Robust Matching of 3D Point Clouds | Zhiyuan Zhang et.al. | 2210.14899 | null |
2022-10-23 | Few-Shot Meta Learning for Recognizing Facial Phenotypes of Genetic Disorders | Ömer Sümer et.al. | 2210.12705 | null |
2022-10-21 | Real-time Detection of 2D Tool Landmarks with Synthetic Training Data | Bram Vanherle et.al. | 2210.11991 | null |
2022-10-09 | Fusing Event-based Camera and Radar for SLAM Using Spiking Neural Networks with Continual STDP Learning | Ali Safa et.al. | 2210.04236 | null |
2022-10-04 | Centroid Distance Keypoint Detector for Colored Point Clouds | Hanzhe Teng et.al. | 2210.01298 | link |
2022-09-28 | Category-Level Global Camera Pose Estimation with Multi-Hypothesis Point Cloud Correspondences | Jun-Jee Chao et.al. | 2209.14419 | null |
2022-09-28 | USEEK: Unsupervised SE(3)-Equivariant 3D Keypoints for Generalizable Manipulation | Zhengrong Xue et.al. | 2209.13864 | null |
2022-10-16 | Suture Thread Spline Reconstruction from Endoscopic Images for Robotic Surgery with Reliability-driven Keypoint Detection | Neelay Joglekar et.al. | 2209.13657 | link |
2022-09-27 | Learning-Based Dimensionality Reduction for Computing Compact and Effective Local Feature Descriptors | Hao Dong et.al. | 2209.13586 | link |
2022-09-26 | Performance Evaluation of 3D Keypoint Detectors and Descriptors on Coloured Point Clouds in Subsea Environments | Kyungmin Jung et.al. | 2209.12881 | null |
2022-10-07 | Long-Lived Accurate Keypoints in Event Streams | Philippe Chiberre et.al. | 2209.10385 | null |
2022-09-20 | Integrative Feature and Cost Aggregation with Transformers for Dense Correspondence | Sunghwan Hong et.al. | 2209.08742 | null |
2022-09-15 | Online Marker-free Extrinsic Camera Calibration using Person Keypoint Detections | Bastian Pätzold et.al. | 2209.07393 | link |
2022-09-07 | Deep Learning-Based Automatic Diagnosis System for Developmental Dysplasia of the Hip | Yang Li et.al. | 2209.03440 | null |
2022-08-27 | Learning to SLAM on the Fly in Unknown Environments: A Continual Learning Approach for Drones in Visually Ambiguous Scenes | Ali Safa et.al. | 2208.12997 | null |
2022-08-24 | Self-Supervised Endoscopic Image Key-Points Matching | Manel Farhat et.al. | 2208.11424 | link |
2022-08-19 | Blind-Spot Collision Detection System for Commercial Vehicles Using Multi Deep CNN Architecture | Muhammad Muzammel et.al. | 2208.08224 | null |
2022-08-08 | MetaGraspNet: A Large-Scale Benchmark Dataset for Scene-Aware Ambidextrous Bin Picking via Physics-based Metaverse Synthesis | Maximilian Gilles et.al. | 2208.03963 | null |
2022-08-07 | CVLNet: Cross-View Semantic Correspondence Learning for Video-based Camera Localization | Yujiao Shi et.al. | 2208.03660 | null |
2022-07-29 | Explicit Occlusion Reasoning for Multi-person 3D Human Pose Estimation | Qihao Liu et.al. | 2208.00090 | null |
2022-07-25 | Translating a Visual LEGO Manual to a Machine-Executable Plan | Ruocheng Wang et.al. | 2207.12572 | null |
2022-07-21 | Multi-modal Retinal Image Registration Using a Keypoint-Based Vessel Structure Aligning Network | Aline Sindel et.al. | 2207.10506 | null |
2022-07-15 | Human keypoint detection for close proximity human-robot interaction | Jan Docekal et.al. | 2207.07742 | null |
2022-07-15 | Adversarial Focal Loss: Asking Your Discriminator for Hard Examples | Chen Liu et.al. | 2207.07739 | null |
2022-07-13 | Rapid Person Re-Identification via Sub-space Consistency Regularization | Qingze Yin et.al. | 2207.05933 | null |
2022-07-07 | RWT-SLAM: Robust Visual SLAM for Highly Weak-textured Environments | Qihao Peng et.al. | 2207.03539 | null |
2022-08-15 | Semi-supervised Human Pose Estimation in Art-historical Images | Matthias Springstein et.al. | 2207.02976 | link |
2022-07-01 | Weakly-supervised High-fidelity Ultrasound Video Synthesis with Feature Decoupling | Jiamin Liang et.al. | 2207.00474 | null |
2022-06-24 | Motion Estimation for Large Displacements and Deformations | Qiao Chen et.al. | 2206.12464 | null |
2022-06-24 | Deep embedded clustering algorithm for clustering PACS repositories | Teo Manojlović et.al. | 2206.12417 | null |
2022-06-21 | KTN: Knowledge Transfer Network for Learning Multi-person 2D-3D Correspondences | Xuanhan Wang et.al. | 2206.10090 | link |
2022-06-20 | Self-Supervised Consistent Quantization for Fully Unsupervised Image Retrieval | Guile Wu et.al. | 2206.09806 | null |
2022-06-15 | A Unified Sequence Interface for Vision Tasks | Ting Chen et.al. | 2206.07669 | link |
2022-06-09 | Beyond RGB: Scene-Property Synthesis with Neural Radiance Fields | Mingtong Zhang et.al. | 2206.04669 | null |
2022-06-03 | SNAKE: Shape-aware Neural 3D Keypoint Field | Chengliang Zhong et.al. | 2206.01724 | link |
2022-05-17 | MulT: An End-to-End Multitask Learning Transformer | Deblina Bhattacharjee et.al. | 2205.08303 | null |
2022-05-10 | ConfLab: A Rich Multimodal Multisensor Dataset of Free-Standing Social Interactions In-the-Wild | Chirag Raman et.al. | 2205.05177 | link |
2022-04-28 | Polarimetric imaging for the detection of synthetic models of SARS-CoV-2: a proof of concept | Emilio Gomez-Gonzalez et.al. | 2204.14050 | null |
2022-05-02 | GRIT: General Robust Image Task Benchmark | Tanmay Gupta et.al. | 2204.13653 | link |
2022-05-24 | ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation | Yufei Xu et.al. | 2204.12484 | link |
2022-04-26 | Unified GCNs: Towards Connecting GCNs with CNNs | Ziyan Zhang et.al. | 2204.12300 | null |
2022-04-19 | Self-Supervised Equivariant Learning for Oriented Keypoint Detection | Jongmin Lee et.al. | 2204.08613 | link |
2022-04-17 | The Z-axis, X-axis, Weight and Disambiguation Methods for Constructing Local Reference Frame in 3D Registration: An Evaluation | Bao Zhao et.al. | 2204.08024 | null |
2022-04-15 | 2D Human Pose Estimation: A Survey | Haoming Chen et.al. | 2204.07370 | null |
2022-04-11 | Towards Homogeneous Modality Learning and Multi-Granularity Information Exploration for Visible-Infrared Person Re-Identification | Haojie Liu et.al. | 2204.04842 | null |
2022-04-07 | Cloning Outfits from Real-World Images to 3D Characters for Generalizable Person Re-Identification | Yanan Wang et.al. | 2204.02611 | link |
2022-04-02 | SkeleVision: Towards Adversarial Resiliency of Person Tracking with Multi-Task Learning | Nilaksh Das et.al. | 2204.00734 | link |
2022-04-01 | MS-HLMO: Multi-scale Histogram of Local Main Orientation for Remote Sensing Image Registration | Chenzhong Gao et.al. | 2204.00260 | null |
2022-03-29 | Assessing Evolutionary Terrain Generation Methods for Curriculum Reinforcement Learning | David Howard et.al. | 2203.15172 | null |
2022-03-28 | REGTR: End-to-end Point Cloud Correspondences with Transformers | Zi Jian Yew et.al. | 2203.14517 | link |
2022-03-27 | UMT: Unified Multi-modal Transformers for Joint Video Moment Retrieval and Highlight Detection | Ye Liu et.al. | 2203.12745 | link |
2022-03-21 | MatchFormer: Interleaving Attention in Transformers for Feature Matching | Qing Wang et.al. | 2203.09645 | link |
2022-03-16 | PosePipe: Open-Source Human Pose Estimation Pipeline for Clinical Research | R. James Cotton et.al. | 2203.08792 | link |
2022-03-11 | DRTAM: Dual Rank-1 Tensor Attention Module | Hanxing Chi et.al. | 2203.05893 | null |
2022-03-07 | Weakly Supervised Learning of Keypoints for 6D Object Pose Estimation | Meng Tian et.al. | 2203.03498 | null |
2022-02-10 | Motion-Aware Transformer For Occluded Person Re-identification | Mi Zhou et.al. | 2202.04243 | null |
2022-02-03 | Sim2Real Object-Centric Keypoint Detection and Description | Chengliang Zhong et.al. | 2202.00448 | null |
2022-01-16 | Cross-Centroid Ripple Pattern for Facial Expression Recognition | Monu Verma et.al. | 2201.05958 | null |
2022-01-14 | Reproducing BowNet: Learning Representations by Predicting Bags of Visual Words | Harry Nguyen et.al. | 2201.03556 | link |
2022-01-10 | TFS Recognition: Investigating MPH]{Thai Finger Spelling Recognition: Investigating MediaPipe Hands Potentials | Jinnavat Sanalohit et.al. | 2201.03170 | null |
2022-01-06 | A Keypoint Detection and Description Network Based on the Vessel Structure for Multi-Modal Retinal Image Registration | Aline Sindel et.al. | 2201.02242 | null |
2021-12-28 | Skin feature point tracking using deep feature encodings | Jose Ramon Chang et.al. | 2112.14159 | null |
2021-12-23 | Data-efficient learning for 3D mirror symmetry detection | Yancong Lin et.al. | 2112.12579 | null |
2021-12-22 | Improved 2D Keypoint Detection in Out-of-Balance and Fall Situations – combining input rotations and a kinematic model | Michael Zwölfer et.al. | 2112.12193 | null |
2021-12-22 | Looking Beyond Corners: Contrastive Learning of Visual Representations for Keypoint Detection and Description Extraction | Henrique Siqueira et.al. | 2112.12002 | link |
2021-12-19 | Parallel Multi-Scale Networks with Deep Supervision for Hand Keypoint Detection | Renjie Li et.al. | 2112.10275 | null |
2021-12-19 | GPU optimization of the 3D Scale-invariant Feature Transform Algorithm and a Novel BRIEF-inspired 3D Fast Descriptor | Jean-Baptiste Carluer et.al. | 2112.10258 | link |
2021-12-16 | Masked Feature Prediction for Self-Supervised Visual Pre-Training | Chen Wei et.al. | 2112.09133 | link |
2021-12-13 | DenseGAP: Graph-Structured Dense Correspondence Learning with Anchor Points | Zhengfei Kuang et.al. | 2112.06910 | null |
2021-12-12 | Few-shot Keypoint Detection with Uncertainty Learning for Unseen Species | Changsheng Lu et.al. | 2112.06183 | link |
2021-12-13 | Few-Shot Keypoint Detection as Task Adaptation via Latent Embeddings | Mel Vecerik et.al. | 2112.04910 | null |
2021-12-06 | ALIKE: Accurate and Lightweight Keypoint Detection and Descriptor Extraction | Xiaoming Zhao et.al. | 2112.02906 | link |
2021-11-25 | Attend to Who You Are: Supervising Self-Attention for Keypoint Detection and Instance-Aware Association | Sen Yang et.al. | 2111.12892 | link |
2021-11-08 | Template NeRF: Towards Modeling Dense Shape Correspondences from Category-Specific Object Images | Jianfei Guo et.al. | 2111.04237 | null |
2021-11-04 | Voxel-based 3D Detection and Reconstruction of Multiple Objects from a Single Image | Feng Liu et.al. | 2111.03098 | null |
2021-11-01 | Learning Event-based Spatio-Temporal Feature Descriptors via Local Synaptic Plasticity: A Biologically-realistic Perspective of Computer Vision | Ali Safa et.al. | 2111.00791 | null |
2021-10-30 | Geometry-Aware Hierarchical Bayesian Learning on Manifolds | Yonghui Fan et.al. | 2111.00184 | null |
2021-10-26 | CoFiNet: Reliable Coarse-to-fine Correspondences for Robust Point Cloud Registration | Hao Yu et.al. | 2110.14076 | link |
2021-10-23 | HWTool: Fully Automatic Mapping of an Extensible C++ Image Processing Language to Hardware | James Hegarty et.al. | 2110.12106 | null |
2021-10-18 | Keypoint-Based Bimanual Shaping of Deformable Linear Objects under Environmental Constraints using Hierarchical Action Planning | Shengzeng Huo et.al. | 2110.08962 | null |
2021-10-11 | High-order Tensor Pooling with Attention for Action Recognition | Piotr Koniusz et.al. | 2110.05216 | null |
2021-10-10 | Digging Into Self-Supervised Learning of Feature Descriptors | Iaroslav Melekhov et.al. | 2110.04773 | null |
2021-10-04 | BPFNet: A Unified Framework for Bimodal Palmprint Alignment and Fusion | Zhaoqun Li et.al. | 2110.01179 | link |
2021-10-01 | Machine learning aided noise filtration and signal classification for CREDO experiment | Łukasz Bibrzycki et.al. | 2110.00297 | null |
2021-09-28 | PDC-Net+: Enhanced Probabilistic Dense Correspondence Network | Prune Truong et.al. | 2109.13912 | link |
2021-09-27 | HarrisZ $^+$ : Harris Corner Selection for Next-Gen Image Matching Pipelines | Fabio Bellavia et.al. | 2109.12925 | null |
2021-09-24 | Catadioptric Stereo on a Smartphone | Kristijan Bartol et.al. | 2109.11872 | null |
2021-09-20 | Semi-supervised Dense Keypointsusing Unlabeled Multiview Images | Zhixuan Yu et.al. | 2109.09299 | null |
2021-08-31 | A Novel Dataset for Keypoint Detection of quadruped Animals from Images | Prianka Banik et.al. | 2108.13958 | link |
2021-08-27 | A Matching Algorithm based on Image Attribute Transfer and Local Features for Underwater Acoustic and Optical Images | Xiaoteng Zhou et.al. | 2108.12151 | null |
Image Matching
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-12-17 | Bringing Multimodality to Amazon Visual Search System | Xinliang Zhu et.al. | 2412.13364 | null |
2024-12-04 | Appearance Matching Adapter for Exemplar-based Semantic Image Synthesis | Siyoon Jin et.al. | 2412.03150 | null |
2024-11-20 | DT-LSD: Deformable Transformer-based Line Segment Detection | Sebastian Janampa et.al. | 2411.13005 | link |
2024-11-15 | Image Matching Filtering and Refinement by Planes and Beyond | Fabio Bellavia et.al. | 2411.09484 | link |
2024-11-11 | XPoint: A Self-Supervised Visual-State-Space based Architecture for Multispectral Image Registration | Ismail Can Yagmur et.al. | 2411.07430 | link |
2024-11-07 | The Impact of Semi-Supervised Learning on Line Segment Detection | Johanna Engman et.al. | 2411.04596 | link |
2024-11-04 | Silver medal Solution for Image Matching Challenge 2024 | Yian Wang et.al. | 2411.01851 | null |
2024-10-30 | Variable Resolution Sampling and Deep Learning Image Recovery for Accelerated Multi-Spectral MRI Near Metal Implants | Azadeh Sharafi et.al. | 2410.23329 | null |
2024-11-05 | RelationBooth: Towards Relation-Aware Customized Object Generation | Qingyu Shi et.al. | 2410.23280 | null |
2024-10-31 | ETO:Efficient Transformer-based Local Feature Matching by Organizing Multiple Homography Hypotheses | Junjie Ni et.al. | 2410.22733 | null |
2024-10-30 | LoFLAT: Local Feature Matching using Focused Linear Attention Transformer | Naijian Cao et.al. | 2410.22710 | null |
2024-10-26 | Generative Adversarial Patches for Physical Attacks on Cross-Modal Pedestrian Re-Identification | Yue Su et.al. | 2410.20097 | null |
2024-10-01 | A Robust Multisource Remote Sensing Image Matching Method Utilizing Attention and Feature Enhancement Against Noise Interference | Yuan Li et.al. | 2410.11848 | null |
2024-10-15 | LoGS: Visual Localization via Gaussian Splatting with Fewer Training Images | Yuzhou Cheng et.al. | 2410.11505 | null |
2024-10-12 | Leveraging Semantic Cues from Foundation Vision Models for Enhanced Local Feature Correspondence | Felipe Cadar et.al. | 2410.09533 | link |
2024-09-27 | Exploiting Motion Prior for Accurate Pose Estimation of Dashboard Cameras | Yipeng Lu et.al. | 2409.18673 | null |
2024-09-25 | Game4Loc: A UAV Geo-Localization Benchmark from Game Data | Yuxiang Ji et.al. | 2409.16925 | link |
2024-09-24 | Automatic Registration of SHG and H&E Images with Feature-based Initial Alignment and Intensity-based Instance Optimization: Contribution to the COMULIS Challenge | Marek Wodzinski et.al. | 2409.15931 | null |
2024-09-10 | Weakly-supervised Camera Localization by Ground-to-satellite Image Registration | Yujiao Shi et.al. | 2409.06471 | link |
2024-09-05 | Enabling Practical and Privacy-Preserving Image Processing | Chao Wang et.al. | 2409.03568 | null |
2024-09-20 | A General Albedo Recovery Approach for Aerial Photogrammetric Images through Inverse Rendering | Shuang Song et.al. | 2409.03032 | link |
2024-08-29 | Super-Resolution works for coastal simulations | Zhi-Song Liu et.al. | 2408.16553 | null |
2024-09-15 | Mismatched: Evaluating the Limits of Image Matching Approaches and Benchmarks | Sierra Bonilla et.al. | 2408.16445 | link |
2024-08-26 | Affine steerers for structured keypoint description | Georg Bökman et.al. | 2408.14186 | link |
2024-08-25 | TranSplat: Generalizable 3D Gaussian Splatting from Sparse Multi-View Images with Transformers | Chuanrui Zhang et.al. | 2408.13770 | null |
2024-09-11 | Coarse-to-fine Alignment Makes Better Speech-image Retrieval | Lifeng Zhou et.al. | 2408.13119 | null |
2024-08-19 | BrewCLIP: A Bifurcated Representation Learning Framework for Audio-Visual Retrieval | Zhenyu Lu et.al. | 2408.10383 | null |
2024-08-14 | RSD-DOG : A New Image Descriptor based on Second Order Derivatives | Darshan Venkatrayappa et.al. | 2408.07687 | null |
2024-08-09 | One Shot is Enough for Sequential Infrared Small Target Segmentation | Bingbing Dan et.al. | 2408.04823 | link |
2024-08-07 | PRISM: PRogressive dependency maxImization for Scale-invariant image Matching | Xudong Cai et.al. | 2408.03598 | null |
2024-08-05 | ConDL: Detector-Free Dense Image Matching | Monika Kwiatkowski et.al. | 2408.02766 | null |
2024-08-04 | Improving Neural Surface Reconstruction with Feature Priors from Multi-View Image | Xinlin Ren et.al. | 2408.02079 | link |
2024-07-29 | Image-text matching for large-scale book collections | Artemis Llabrés et.al. | 2407.19812 | link |
2024-07-26 | PIV3CAMS: a multi-camera dataset for multiple computer vision problems and its application to novel view-point synthesis | Sohyeong Kim et.al. | 2407.18695 | null |
2024-07-22 | RADA: Robust and Accurate Feature Learning with Domain Adaptation | Jingtai He et.al. | 2407.15791 | null |
2024-07-17 | GV-Bench: Benchmarking Local Feature Matching for Geometric Verification of Long-term Loop Closure Detection | Jingwen Yu et.al. | 2407.11736 | link |
2024-07-16 | REMM:Rotation-Equivariant Framework for End-to-End Multimodal Image Matching | Han Nie et.al. | 2407.11637 | link |
2024-07-16 | A Self-Correcting Strategy of the Digital Volume Correlation Displacement Field Based on Image Matching: Application to Poor Speckles Quality and Complex-Large Deformation | Chengsheng Li et.al. | 2407.11287 | null |
2024-07-14 | Raising the Ceiling: Conflict-Free Local Feature Matching with Dynamic View Switching | Xiaoyong Lu et.al. | 2407.07789 | null |
2024-07-10 | Mutual Information calculation on different appearances | Jiecheng Liao et.al. | 2407.07410 | null |
2024-07-15 | SfM on-the-fly: Get better 3D from What You Capture | Zongqian Zhan et.al. | 2407.03939 | null |
2024-07-03 | IMC 2024 Methods & Solutions Review | Shyam Gupta et.al. | 2407.03172 | null |
2024-06-21 | High Resolution Surface Reconstruction of Cultural Heritage Objects Using Shape from Polarization Method | F. S. Mortazavi et.al. | 2406.15121 | null |
2024-06-16 | Light Up the Shadows: Enhance Long-Tailed Entity Grounding with Concept-Guided Vision-Language Models | Yikai Zhang et.al. | 2406.10902 | link |
2024-06-14 | Grounding Image Matching in 3D with MASt3R | Vincent Leroy et.al. | 2406.09756 | link |
2024-06-05 | A Self-Supervised Denoising Strategy for Underwater Acoustic Camera Imageries | Xiaoteng Zhou et.al. | 2406.02914 | null |
2024-05-22 | Affine-based Deformable Attention and Selective Fusion for Semi-dense Matching | Hongkai Chen et.al. | 2405.13874 | null |
2024-05-21 | OmniGlue: Generalizable Feature Matching with Foundation Model Guidance | Hanwen Jiang et.al. | 2405.12979 | link |
2024-07-09 | Shape-aware synthesis of pathological lung CT scans using CycleGAN for enhanced semi-supervised lung segmentation | Rezkellah Noureddine Khiati et.al. | 2405.08556 | link |
2024-05-14 | TP3M: Transformer-based Pseudo 3D Image Matching with Reference | Liming Han et.al. | 2405.08434 | null |
2024-05-13 | Authentic Hand Avatar from a Phone Scan via Universal Hand Model | Gyeongsik Moon et.al. | 2405.07933 | null |
2024-04-30 | A Light-weight Transformer-based Self-supervised Matching Network for Heterogeneous Images | Wang Zhang et.al. | 2404.19311 | null |
2024-04-30 | XFeat: Accelerated Features for Lightweight Image Matching | Guilherme Potje et.al. | 2404.19174 | null |
2024-06-10 | MinBackProp – Backpropagating through Minimal Solvers | Diana Sungatullina et.al. | 2404.17993 | link |
2024-04-25 | Transformer-Based Local Feature Matching for Multimodal Image Registration | Remi Delaunay et.al. | 2404.16802 | null |
2024-04-23 | FINEMATCH: Aspect-based Fine-grained Image and Text Mismatch Detection and Correction | Hang Hua et.al. | 2404.14715 | null |
2024-04-22 | Scene Coordinate Reconstruction: Posing of Image Collections via Incremental Learning of a Relocalizer | Eric Brachmann et.al. | 2404.14351 | null |
2024-04-17 | A Semantic Segmentation-guided Approach for Ground-to-Aerial Image Matching | Francesco Pro et.al. | 2404.11302 | link |
2024-04-16 | Exploring selective image matching methods for zero-shot and few-sample unsupervised domain adaptation of urban canopy prediction | John Francis et.al. | 2404.10626 | null |
2024-04-15 | XoFTR: Cross-modal Feature Matching Transformer | Önder Tuzcuoğlu et.al. | 2404.09692 | link |
2024-04-13 | DeDoDe v2: Analyzing and Improving the DeDoDe Keypoint Detector | Johan Edstedt et.al. | 2404.08928 | link |
2024-04-09 | Matching 2D Images in 3D: Metric Relative Pose from Metric Correspondences | Axel Barroso-Laguna et.al. | 2404.06337 | link |
2024-04-01 | Marrying NeRF with Feature Matching for One-step Pose Estimation | Ronghan Chen et.al. | 2404.00891 | null |
2024-04-01 | 3MOS: Multi-sources, Multi-resolutions, and Multi-scenes dataset for Optical-SAR image matching | Yibin Ye et.al. | 2404.00838 | null |
2024-03-31 | On the Estimation of Image-matching Uncertainty in Visual Place Recognition | Mubariz Zaffar et.al. | 2404.00546 | null |
2024-03-30 | Image-to-Image Matching via Foundation Models: A New Perspective for Open-Vocabulary Semantic Segmentation | Yuan Wang et.al. | 2404.00262 | null |
2024-03-26 | Staircase Localization for Autonomous Exploration in Urban Environments | Jinrae Kim et.al. | 2403.17330 | null |
2024-03-23 | MatchSeg: Towards Better Segmentation via Reference Image Matching | Ruiqiang Xiao et.al. | 2403.15901 | link |
2024-03-20 | Unifying Local and Global Multimodal Features for Place Recognition in Aliased and Low-Texture Environments | Alberto García-Hernández et.al. | 2403.13395 | link |
2024-03-19 | HCPM: Hierarchical Candidates Pruning for Efficient Detector-Free Matching | Ying Chen et.al. | 2403.12543 | null |
2024-03-16 | Refining Knowledge Transfer on Audio-Image Temporal Agreement for Audio-Text Cross Retrieval | Shunsuke Tsubaki et.al. | 2403.10756 | null |
2024-03-16 | Vector search with small radiuses | Gergely Szilvasy et.al. | 2403.10746 | null |
2024-03-15 | Local positional graphs and attentive local features for a data and runtime-efficient hierarchical place recognition pipeline | Fangming Yuan et.al. | 2403.10283 | null |
2024-03-15 | Region-aware Distribution Contrast: A Novel Approach to Multi-Task Partially Supervised Learning | Meixuan Li et.al. | 2403.10252 | null |
2024-03-14 | Virtual birefringence imaging and histological staining of amyloid deposits in label-free tissue using autofluorescence microscopy and deep learning | Xilin Yang et.al. | 2403.09100 | null |
2024-03-18 | Matching Non-Identical Objects | Yusuke Marumo et.al. | 2403.08227 | null |
2024-03-11 | Efficient LoFTR: Semi-Dense Local Feature Matching with Sparse-Like Speed | Yifan Wang et.al. | 2403.04765 | null |
2024-03-07 | Scene Depth Estimation from Traditional Oriental Landscape Paintings | Sungho Kang et.al. | 2403.03408 | null |
2024-02-21 | Visual Style Prompting with Swapping Self-Attention | Jaeseok Jeong et.al. | 2402.12974 | link |
2024-02-16 | GIM: Learning Generalizable Image Matcher From Internet Videos | Xuelun Shen et.al. | 2402.11095 | link |
2024-02-13 | Are Semi-Dense Detector-Free Methods Good at Matching Local Features? | Matthieu Vilain et.al. | 2402.08671 | null |
2024-02-13 | Learning to Produce Semi-dense Correspondences for Visual Localization | Khang Truong Giang et.al. | 2402.08359 | link |
2024-01-31 | Improved Scene Landmark Detection for Camera Localization | Tien Do et.al. | 2401.18083 | link |
2024-03-11 | Local Feature Matching Using Deep Learning: A Survey | Shibiao Xu et.al. | 2401.17592 | link |
2024-01-24 | Linear Relative Pose Estimation Founded on Pose-only Imaging Geometry | Qi Cai et.al. | 2401.13357 | null |
2024-01-19 | SCENES: Subpixel Correspondence Estimation With Epipolar Supervision | Dominik A. Kloepfer et.al. | 2401.10886 | null |
2024-01-18 | Question-Answer Cross Language Image Matching for Weakly Supervised Semantic Segmentation | Songhe Deng et.al. | 2401.09883 | link |
2024-01-26 | RomniStereo: Recurrent Omnidirectional Stereo Matching | Hualie Jiang et.al. | 2401.04345 | link |
2024-01-05 | CoCoT: Contrastive Chain-of-Thought Prompting for Large Multimodal Models with Multiple Image Inputs | Daoan Zhang et.al. | 2401.02582 | null |
2024-01-03 | Local Adaptive Clustering Based Image Matching for Automatic Visual Identification | Zhizhen Wang et.al. | 2401.01720 | null |
2024-01-03 | A Transformer-Based Adaptive Semantic Aggregation Method for UAV Visual Geo-Localization | Shishen Li et.al. | 2401.01574 | null |
2023-12-23 | BEV-CV: Birds-Eye-View Transform for Cross-View Geo-Localisation | Tavis Shore et.al. | 2312.15363 | link |
2023-12-22 | Harnessing Diffusion Models for Visual Perception with Meta Prompts | Qiang Wan et.al. | 2312.14733 | link |
2024-01-05 | MatchDet: A Collaborative Framework for Image Matching and Object Detection | Jinxiang Lai et.al. | 2312.10983 | null |
2023-12-07 | Visual Geometry Grounded Deep Structure From Motion | Jianyuan Wang et.al. | 2312.04563 | null |
2023-12-04 | Steerers: A framework for rotation equivariant keypoint descriptors | Georg Bökman et.al. | 2312.02152 | link |
2023-11-30 | DSeg: Direct Line Segments Detection | Berger Cyrille et.al. | 2311.18344 | null |
2023-11-30 | Utilizing Radiomic Feature Analysis For Automated MRI Keypoint Detection: Enhancing Graph Applications | Sahar Almahfouz Nasser et.al. | 2311.18281 | null |
2023-11-29 | LGFCTR: Local and Global Feature Convolutional Transformer for Image Matching | Wenhao Zhong et.al. | 2311.17571 | link |
2023-11-08 | Zero-shot Translation of Attention Patterns in VQA Models to Natural Language | Leonard Salewski et.al. | 2311.05043 | link |
2023-11-06 | An invariant feature extraction for multi-modal images matching | Chenzhong Gao et.al. | 2311.02842 | null |
2023-10-23 | RD-VIO: Robust Visual-Inertial Odometry for Mobile Augmented Reality in Dynamic Environments | Jinyu Li et.al. | 2310.15072 | link |
2023-10-23 | Player Re-Identification Using Body Part Appearences | Mahesh Bhosale et.al. | 2310.14469 | null |
2023-10-20 | FMRT: Learning Accurate Feature Matching with Reconciliatory Transformer | Xinyu Zhang et.al. | 2310.13605 | null |
2023-11-14 | RGM: A Robust Generalist Matching Model | Songyan Zhang et.al. | 2310.11755 | link |
2023-10-07 | UFD-PRiME: Unsupervised Joint Learning of Optical Flow and Stereo Depth through Pixel-Level Rigid Motion Estimation | Shuai Yuan et.al. | 2310.04712 | null |
2023-10-02 | Leveraging Cutting Edge Deep Learning Based Image Matching for Reconstructing a Large Scene from Sparse Images | Georg Bökman et.al. | 2310.01092 | null |
2023-09-29 | Segment Anything Model is a Good Teacher for Local Feature Learning | Jingqian Wu et.al. | 2309.16992 | link |
2023-09-27 | KDD-LOAM: Jointly Learned Keypoint Detector and Descriptors Assisted LiDAR Odometry and Mapping | Renlang Huang et.al. | 2309.15394 | null |
2023-10-13 | A Critical Analysis of Internal Reliability for Uncertainty Quantification of Dense Image Matching in Multi-view Stereo | Debao Huang et.al. | 2309.09379 | null |
2023-09-11 | Towards Content-based Pixel Retrieval in Revisited Oxford and Paris | Guoyuan An et.al. | 2309.05438 | link |
2023-09-09 | Neural Semantic Surface Maps | Luca Morreale et.al. | 2309.04836 | null |
2023-09-05 | Doppelgangers: Learning to Disambiguate Images of Similar Structures | Ruojin Cai et.al. | 2309.02420 | link |
2023-08-14 | Occ $^2$ Net: Robust Image Matching Based on 3D Occupancy Estimation for Occluded Regions | Miao Fan et.al. | 2308.16160 | null |
2023-08-29 | TKwinFormer: Top k Window Attention in Vision Transformers for Feature Matching | Yun Liao et.al. | 2308.15144 | null |
2023-08-27 | LDL: Line Distance Functions for Panoramic Localization | Junho Kim et.al. | 2308.13989 | link |
2023-08-22 | Scene-Aware Feature Matching | Xiaoyong Lu et.al. | 2308.09949 | null |
2023-09-03 | DeDoDe: Detect, Don’t Describe – Describe, Don’t Detect for Local Feature Matching | Johan Edstedt et.al. | 2308.08479 | link |
2023-08-19 | Global Features are All You Need for Image Retrieval and Reranking | Shihao Shao et.al. | 2308.06954 | link |
2023-08-02 | ZRIGF: An Innovative Multimodal Framework for Zero-Resource Image-Grounded Dialogue Generation | Bo Zhang et.al. | 2308.00400 | link |
2023-07-28 | Cross-Modal Concept Learning and Inference for Vision-Language Models | Yi Zhang et.al. | 2307.15460 | null |
2023-07-22 | CryptoMask : Privacy-preserving Face Recognition | Jianli Bai et.al. | 2307.12010 | null |
2023-07-22 | A Stronger Stitching Algorithm for Fisheye Images based on Deblurring and Registration | Jing Hao et.al. | 2307.11997 | null |
2023-07-21 | Reverse Knowledge Distillation: Training a Large Model using a Small One for Retinal Image Matching on Limited Data | Sahar Almahfouz Nasser et.al. | 2307.10698 | link |
2023-08-08 | Balancing Privacy and Progress in Artificial Intelligence: Anonymization in Histopathology for Biomedical Research and Education | Neel Kanwal et.al. | 2307.09426 | null |
2023-08-01 | Unsupervised Deep Graph Matching Based on Cycle Consistency | Siddharth Tourani et.al. | 2307.08930 | link |
2023-07-15 | Tightly-Coupled LiDAR-Visual SLAM Based on Geometric Features for Mobile Agents | Ke Cao et.al. | 2307.07763 | null |
2023-07-09 | Augmenters at SemEval-2023 Task 1: Enhancing CLIP in Handling Compositionality and Ambiguity for Zero-Shot Visual WSD through Prompt Augmentation and Text-To-Image Diffusion | Jie S. Li et.al. | 2307.05564 | null |
2023-07-11 | ResMatch: Residual Attention Learning for Local Feature Matching | Yuxin Deng et.al. | 2307.05180 | link |
2023-07-11 | TIAM – A Metric for Evaluating Alignment in Text-to-Image Generation | Paul Grimal et.al. | 2307.05134 | link |
2023-07-02 | TopicFM+: Boosting Accuracy and Efficiency of Topic-Assisted Feature Matching | Khang Truong Giang et.al. | 2307.00485 | link |
2023-06-27 | Detector-Free Structure from Motion | Xingyi He et.al. | 2306.15669 | link |
2023-06-28 | PoseDiffusion: Solving Pose Estimation via Diffusion-aided Bundle Adjustment | Jianyuan Wang et.al. | 2306.15667 | null |
2023-06-25 | Enhancing Dynamic Image Advertising with Vision-Language Pre-training | Zhoufutu Wen et.al. | 2306.14112 | null |
2023-06-23 | LightGlue: Local Feature Matching at Light Speed | Philipp Lindenberger et.al. | 2306.13643 | link |
2023-06-19 | Graph Self-Supervised Learning for Endoscopic Image Matching | Manel Farhat et.al. | 2306.11141 | link |
2023-06-09 | Leaving the Lines Behind: Vision-Based Crop Row Exit for Agricultural Robot Navigation | Rajitha de Silva et.al. | 2306.05869 | null |
2023-06-07 | A2B: Anchor to Barycentric Coordinate for Robust Correspondence | Weiyue Zhao et.al. | 2306.02760 | null |
2023-05-27 | Pentagon-Match (PMatch): Identification of View-Invariant Planar Feature for Local Feature Matching-Based Homography Estimation | Yueh-Cheng Huang et.al. | 2305.17463 | null |
2023-05-19 | SIDAR: Synthetic Image Dataset for Alignment & Restoration | Monika Kwiatkowski et.al. | 2305.12036 | link |
2023-05-18 | LLMScore: Unveiling the Power of Large Language Models in Text-to-Image Synthesis Evaluation | Yujie Lu et.al. | 2305.11116 | link |
2023-05-16 | A Method for Training-free Person Image Picture Generation | Tianyu Chen et.al. | 2305.09817 | null |
2023-05-15 | Image Matching by Bare Homography | Fabio Bellavia et.al. | 2305.08946 | null |
2023-05-12 | CLIP-Count: Towards Text-Guided Zero-Shot Object Counting | Ruixiang Jiang et.al. | 2305.07304 | link |
2023-05-10 | SENDD: Sparse Efficient Neural Depth and Deformation for Tissue Tracking | Adam Schmidt et.al. | 2305.06477 | null |
2023-05-10 | Level-line Guided Edge Drawing for Robust Line Segment Detection | Xinyu Lin et.al. | 2305.05883 | link |
2023-05-09 | ColonMapper: topological mapping and localization for colonoscopy | Javier Morlana et.al. | 2305.05546 | null |
2023-04-29 | A Comprehensive Review of Image Line Segment Detection and Description: Taxonomies, Comparisons, and Challenges | Xinyu Lin et.al. | 2305.00264 | link |
2023-04-28 | SFD2: Semantic-guided Feature Detection and Description | Fei Xue et.al. | 2304.14845 | link |
2023-04-17 | DeepSim-Nets: Deep Similarity Networks for Stereo Image Matching | Mohamed Ali Chebbi et.al. | 2304.08056 | link |
2023-04-16 | Long-term Visual Localization with Mobile Sensors | Shen Yan et.al. | 2304.07691 | null |
2023-04-12 | SiLK – Simple Learned Keypoints | Pierre Gleize et.al. | 2304.06194 | link |
2023-04-16 | ALIKED: A Lighter Keypoint and Descriptor Extraction Network via Deformable Transformation | Xiaoming Zhao et.al. | 2304.03608 | link |
2023-04-04 | GlueStick: Robust Image Matching by Sticking Points and Lines Together | Rémi Pautrat et.al. | 2304.02008 | link |
2023-04-03 | PoseMatcher: One-shot 6D Object Pose Estimation by Deep Feature Matching | Pedro Castro et.al. | 2304.01382 | null |
2023-04-02 | Enhancing Deformable Local Features by Jointly Learning to Detect and Describe Keypoints | Guilherme Potje et.al. | 2304.00583 | link |
2023-04-13 | Structured Epipolar Matcher for Local Feature Matching | Jiahao Chang et.al. | 2303.16646 | null |
2023-03-29 | Adaptive Spot-Guided Transformer for Consistent Local Feature Matching | Jiahuan Yu et.al. | 2303.16624 | null |
2023-03-28 | ASIC: Aligning Sparse in-the-wild Image Collections | Kamal Gupta et.al. | 2303.16201 | null |
2023-03-25 | Learning Rotation-Equivariant Features for Visual Correspondence | Jongmin Lee et.al. | 2303.15472 | null |
2023-03-27 | Learnable Graph Matching: A Practical Paradigm for Data Association | Jiawei He et.al. | 2303.15414 | link |
2023-03-24 | Efficient and Accurate Co-Visible Region Localization with Matching Key-Points Crop (MKPC): A Two-Stage Pipeline for Enhancing Image Matching Performance | Hongjian Song et.al. | 2303.13794 | null |
2023-03-15 | Rethinking Optical Flow from Geometric Matching Consistent Perspective | Qiaole Dong et.al. | 2303.08384 | link |
2023-04-04 | PATS: Patch Area Transportation with Subdivision for Local Feature Matching | Junjie Ni et.al. | 2303.07700 | null |
2023-03-07 | Parsing Line Segments of Floor Plan Images Using Graph Neural Networks | Mingxiang Chen et.al. | 2303.03851 | null |
2023-03-06 | Improving Transformer-based Image Matching by Cascaded Capturing Spatially Informative Keypoints | Chenjie Cao et.al. | 2303.02885 | link |
2023-03-10 | ParaFormer: Parallel Attention Transformer for Efficient Feature Matching | Xiaoyong Lu et.al. | 2303.00941 | null |
2023-03-01 | RIFT2: Speeding-up RIFT with A New Rotation-Invariance Technique | Jiayuan Li et.al. | 2303.00319 | link |
2023-02-28 | Nonlinear Intensity, Scale and Rotation Invariant Matching for Multimodal Images | Zhongli Fan et.al. | 2302.14239 | link |
2023-02-25 | BrainCLIP: Bridging Brain and Visual-Linguistic Representation via CLIP for Generic Natural Visual Stimulus Decoding from fMRI | Yulong Liu et.al. | 2302.12971 | link |
2023-02-24 | Classification of structural building damage grades from multi-temporal photogrammetric point clouds using a machine learning model trained on virtual laser scanning data | Vivien Zahs et.al. | 2302.12591 | null |
2023-02-20 | A Large Scale Homography Benchmark | Daniel Barath et.al. | 2302.09997 | link |
2023-02-12 | OAMatcher: An Overlapping Areas-based Network for Accurate Local Feature Matching | Kun Dai et.al. | 2302.05846 | link |
2023-02-10 | General, Single-shot, Target-less, and Automatic LiDAR-Camera Extrinsic Calibration Toolbox | Kenji Koide et.al. | 2302.05094 | link |
2023-02-03 | Simple, Effective and General: A New Backbone for Cross-view Image Geo-localization | Yingying Zhu et.al. | 2302.01572 | link |
2023-01-27 | Harmonizing Flows: Unsupervised MR harmonization based on normalizing flows | Farzad Beizaee et.al. | 2301.11551 | link |
2023-01-25 | Local Feature Extraction from Salient Regions by Feature Map Transformation | Yerim Jung et.al. | 2301.10413 | null |
2023-01-24 | Feature-based Image Matching for Identifying Individual Kākā | Fintan O’Sullivan et.al. | 2301.06678 | null |
2023-01-18 | Instance Segmentation Based Graph Extraction for Handwritten Circuit Diagram Images | Johannes Bayer et.al. | 2301.03155 | null |
2023-01-08 | DeepMatcher: A Deep Transformer-based Network for Robust and Accurate Local Feature Matching | Tao Xie et.al. | 2301.02993 | link |
2023-01-07 | Deep Learning-Based UAV Aerial Triangulation without Image Control Points | Jiageng Zhong et.al. | 2301.02869 | null |
2023-01-06 | The UNCOVER Survey: A first-look HST+JWST catalog of 50,000 galaxies near Abell 2744 and beyond | John R. Weaver et.al. | 2301.02671 | link |
2023-02-13 | Translating Text Synopses to Video Storyboards | Xu Gu et.al. | 2301.00135 | link |
2022-12-23 | SuperGF: Unifying Local and Global Features for Visual Localization | Wenzheng Song et.al. | 2212.13105 | null |
2022-12-26 | Transformer and GAN Based Super-Resolution Reconstruction Network for Medical Images | Weizhi Du et.al. | 2212.13068 | null |
2022-12-20 | Seafloor-Invariant Caustics Removal from Underwater Imagery | Panagiotis Agrafiotis et.al. | 2212.10167 | null |
2022-12-15 | DeepLSD: Line Segment Detection and Refinement with Deep Image Gradients | Rémi Pautrat et.al. | 2212.07766 | link |
2022-12-14 | Shared Coupling-bridge for Weakly Supervised Local Feature Learning | Jiayuan Sun et.al. | 2212.07047 | link |
2022-12-05 | Real Time Incremental Image Mosaicking Without Use of Any Camera Parameter | Suleyman Melih Portakal et.al. | 2212.02302 | null |
2022-12-05 | ObjectMatch: Robust Registration using Canonical Object Correspondences | Can Gümeli et.al. | 2212.01985 | null |
2022-12-07 | Universe Points Representation Learning for Partial Multi-Graph Matching | Zhakshylyk Nurlanov et.al. | 2212.00780 | null |
2022-11-30 | Self-Supervised Feature Learning for Long-Term Metric Visual Localization | Yuxuan Chen et.al. | 2212.00122 | null |
2022-11-28 | FeatureBooster: Boosting Feature Descriptors with a Lightweight Neural Network | Xinjiang Wang et.al. | 2211.15069 | link |
2022-11-19 | Person Text-Image Matching via Text-Feature Interpretability Embedding and External Attack Node Implantation | Fan Li et.al. | 2211.08657 | link |
2022-11-20 | Detecting Line Segments in Motion-blurred Images with Events | Huai Yu et.al. | 2211.07365 | link |
2022-11-15 | Fast Key Points Detection and Matching for Tree-Structured Images | Hao Wang et.al. | 2211.03242 | null |
2022-10-25 | A Comparative Study on Deep-Learning Methods for Dense Image Matching of Multi-angle and Multi-date Remote Sensing Stereo Images | Hessah Albanwan et.al. | 2210.14031 | null |
2022-10-11 | DeepMLE: A Robust Deep Maximum Likelihood Estimator for Two-view Structure from Motion | Yuxi Xiao et.al. | 2210.05517 | null |
2022-10-07 | Mars Rover Localization Based on A2G Obstacle Distribution Pattern Matching | Lang Zhou et.al. | 2210.03398 | link |
2022-09-27 | Learning-Based Dimensionality Reduction for Computing Compact and Effective Local Feature Descriptors | Hao Dong et.al. | 2209.13586 | link |
2022-09-25 | ECO-TR: Efficient Correspondences Finding Via Coarse-to-Fine Refinement | Dongli Tan et.al. | 2209.12213 | null |
2022-09-22 | DRKF: Distilled Rotated Kernel Fusion for Efficiently Boosting Rotation Invariance in Image Matching | Chao Li et.al. | 2209.10907 | null |
2022-11-15 | Uncertainty-aware Efficient Subgraph Isomorphism using Graph Topology | Arpan Kusari et.al. | 2209.09090 | null |
2022-09-16 | SRFeat: Learning Locally Accurate and Globally Consistent Non-Rigid Shape Correspondence | Lei Li et.al. | 2209.07806 | link |
2022-08-30 | ASpanFormer: Detector-Free Image Matching with Adaptive Span Transformer | Hongkai Chen et.al. | 2208.14201 | link |
2022-08-25 | A Gis Aided Approach for Geolocalizing an Unmanned Aerial System Using Deep Learning | Jianli Wei et.al. | 2208.12251 | link |
2022-08-25 | UAS Navigation in the Real World Using Visual Observation | Yuci Han et.al. | 2208.12125 | null |
2022-08-24 | Self-Supervised Endoscopic Image Key-Points Matching | Manel Farhat et.al. | 2208.11424 | link |
2022-08-22 | Equivariant Hypergraph Neural Networks | Jinwoo Kim et.al. | 2208.10428 | link |
2022-09-22 | Understanding Attention for Vision-and-Language Tasks | Feiqi Cao et.al. | 2208.08104 | link |
2022-08-16 | Hierarchical Attention Network for Few-Shot Object Detection via Meta-Contrastive Learning | Dongwoo Park et.al. | 2208.07039 | link |
2022-08-04 | Learning Modal-Invariant and Temporal-Memory for Video-based Visible-Infrared Person Re-Identification | Xinyu Lin et.al. | 2208.02450 | link |
2022-08-04 | OmniCity: Omnipotent City Understanding with Multi-level and Multi-view Images | Weijia Li et.al. | 2208.00928 | null |
2022-07-29 | Testing Relational Understanding in Text-Guided Image Generation | Colin Conwell et.al. | 2208.00005 | null |
2022-07-21 | Pose for Everything: Towards Category-Agnostic Pose Estimation | Lumin Xu et.al. | 2207.10387 | link |
2022-07-20 | Explaining Deepfake Detection by Analysing Image Matching | Shichao Dong et.al. | 2207.09679 | link |
2022-07-18 | Adaptive Assignment for Geometry Aware Local Feature Matching | Dihe Huang et.al. | 2207.08427 | link |
2022-07-16 | Semi-Supervised Keypoint Detector and Descriptor for Retinal Image Matching | Jiazhen Liu et.al. | 2207.07932 | link |
2022-07-06 | Virtual staining of defocused autofluorescence images of unlabeled tissue using deep neural networks | Yijie Zhang et.al. | 2207.02946 | null |
2022-07-01 | TopicFM: Robust and Interpretable Feature Matching with Topic-assisted | Khang Truong Giang et.al. | 2207.00328 | link |
2022-06-16 | Virtual Correspondence: Humans as a Cue for Extreme-View Geometry | Wei-Chiu Ma et.al. | 2206.08365 | null |
2022-06-15 | Self-Supervised Learning of Image Scale and Orientation | Jongmin Lee et.al. | 2206.07259 | link |
2022-05-27 | Image Keypoint Matching using Graph Neural Networks | Nancy Xu et.al. | 2205.14275 | null |
2022-05-27 | Fine-tuning deep learning models for stereo matching using results from semi-global matching | Hessah Albanwan et.al. | 2205.14051 | null |
2022-05-23 | TransforMatcher: Match-to-Match Attention for Semantic Correspondence | Seungwook Kim et.al. | 2205.11634 | link |
2022-05-16 | ReDFeat: Recoupling Detection and Description for Multimodal Feature Learning | Yuxin Deng et.al. | 2205.07439 | null |
2022-05-06 | BDIS: Bayesian Dense Inverse Searching Method for Real-Time Stereo Surgical Image Matching | Jingwei Song et.al. | 2205.03133 | link |
2022-05-10 | AdaTriplet: Adaptive Gradient Triplet Loss with Automatic Margin Learning for Forensic Medical Image Matching | Khanh Nguyen et.al. | 2205.02849 | link |
2022-04-27 | Gleo-Det: Deep Convolution Feature-Guided Detector with Local Entropy Optimization for Salient Points | Chao Li et.al. | 2204.12884 | null |
2022-04-22 | SUES-200: A Multi-height Multi-scene Cross-view Image Benchmark Across Drone and Satellite | Runzhe Zhu et.al. | 2204.10704 | link |
2022-04-20 | Uncertainty-based Cross-Modal Retrieval with Probabilistic Representations | Leila Pishdad et.al. | 2204.09268 | null |
2022-04-19 | OpenGlue: Open Source Graph Neural Net Based Pipeline for Image Matching | Ostap Viniavskyi et.al. | 2204.08870 | link |
2022-04-19 | Self-Supervised Equivariant Learning for Oriented Keypoint Detection | Jongmin Lee et.al. | 2204.08613 | link |
2022-04-22 | Efficient Linear Attention for Fast and Accurate Keypoint Matching | Suwichaya Suwanwimolkul et.al. | 2204.07731 | null |
2022-04-08 | Lightweight starshade position sensing with convolutional neural networks and simulation-based inference | Andrew Chen et.al. | 2204.03853 | link |
2022-03-30 | AmsterTime: A Visual Place Recognition Benchmark Dataset for Severe Domain Shift | Burak Yildiz et.al. | 2203.16291 | link |
2022-03-29 | Photographic Visualization of Weather Forecasts with Generative Adversarial Networks | Christian Sigg et.al. | 2203.15601 | link |
2022-03-29 | Sparse Image based Navigation Architecture to Mitigate the need of precise Localization in Mobile Robots | Pranay Mathur et.al. | 2203.15272 | null |
2022-03-28 | Optimizing Elimination Templates by Greedy Parameter Search | Evgeniy Martyushev et.al. | 2203.14901 | link |
2022-03-28 | S2-Net: Self-supervision Guided Feature Representation Learning for Cross-Modality Images | Shasha Mei et.al. | 2203.14581 | null |
2022-03-26 | Accurate 3-DoF Camera Geo-Localization via Ground-to-Satellite Image Matching | Yujiao Shi et.al. | 2203.14148 | link |
2022-03-24 | Keypoints Tracking via Transformer Networks | Oleksii Nasypanyi et.al. | 2203.12848 | link |
2022-03-21 | MatchFormer: Interleaving Attention in Transformers for Feature Matching | Qing Wang et.al. | 2203.09645 | link |
2022-03-14 | There’s no difference: Convolutional Neural Networks for transient detection without template subtraction | Tatiana Acero-Cuellar et.al. | 2203.07390 | link |
2022-03-25 | Cross Language Image Matching for Weakly Supervised Semantic Segmentation | Jinheng Xie et.al. | 2203.02668 | link |
2022-03-01 | CLIP-GEN: Language-Free Training of a Text-to-Image Generator with CLIP | Zihao Wang et.al. | 2203.00386 | null |
2022-03-09 | Time-resolved Imaging of Stochastic Cascade Reactions over a Submillisecond to Second Time Range at the Angstrom Level | Toshiki Shimizu et.al. | 2202.13332 | null |
2022-02-16 | Cross-view and Cross-domain Underwater Localization based on Optical Aerial and Acoustic Underwater Images | Matheus M. Dos Santos et.al. | 2202.07817 | null |
2022-02-14 | CATs++: Boosting Cost Aggregation with Convolutions and Transformers | Seokju Cho et.al. | 2202.06817 | link |
2022-02-11 | Improving Image-recognition Edge Caches with a Generative Adversarial Network | Guilherme B. Souza et.al. | 2202.05929 | null |
2022-02-08 | Learning Optical Flow with Adaptive Graph Reasoning | Ao Luo et.al. | 2202.03857 | link |
2022-02-03 | Sim2Real Object-Centric Keypoint Detection and Description | Chengliang Zhong et.al. | 2202.00448 | null |
2022-01-27 | Efficient divide-and-conquer registration of UAV and ground LiDAR point clouds through canopy shape context | Jie Shao et.al. | 2201.11296 | null |
2021-12-24 | Multi-initialization Optimization Network for Accurate 3D Human Pose and Shape Estimation | Zhiwei Liu et.al. | 2112.12917 | null |
2021-12-20 | Scale-Net: Learning to Reduce Scale Differences for Large-Scale Invariant Image Matching | Yujie Fu et.al. | 2112.10485 | null |
2021-12-19 | GPU optimization of the 3D Scale-invariant Feature Transform Algorithm and a Novel BRIEF-inspired 3D Fast Descriptor | Jean-Baptiste Carluer et.al. | 2112.10258 | link |
2021-12-14 | More Control for Free! Image Synthesis with Semantic Diffusion Guidance | Xihui Liu et.al. | 2112.05744 | null |
2021-12-08 | Label-free virtual HER2 immunohistochemical staining of breast tissue using deep learning | Bijie Bai et.al. | 2112.05240 | null |
2021-12-01 | FaSS-MVS – Fast Multi-View Stereo with Surface-Aware Semi-Global Matching from UAV-borne Monocular Imagery | Boitumelo Ruf et.al. | 2112.00821 | null |
2021-12-01 | CLIPstyler: Image Style Transfer with a Single Text Condition | Gihyun Kwon et.al. | 2112.00374 | link |
2021-11-29 | Nonlinear Intensity Underwater Sonar Image Matching Method Based on Phase Information and Deep Convolution Features | Xiaoteng Zhou et.al. | 2111.15514 | null |
2021-11-29 | Zero-Shot Image-to-Text Generation for Visual-Semantic Arithmetic | Yoad Tewel et.al. | 2111.14447 | link |
2021-11-29 | Heterogeneous Visible-Thermal and Visible-Infrared Face Recognition using Unit-Class Loss and Cross-Modality Discriminator | Usman Cheema et.al. | 2111.14339 | null |
2021-11-17 | Probabilistic Spatial Distribution Prior Based Attentional Keypoints Matching Network | Xiaoming Zhao et.al. | 2111.09006 | null |
2021-11-17 | Nonlinear Intensity Sonar Image Matching based on Deep Convolution Features | Xiaoteng Zhou et.al. | 2111.08994 | null |
2021-10-30 | A Deep Search for Faint Chandra X-ray Sources, Radio Sources, and Optical Counterparts in NGC 6752 | Haldan N. Cohn et.al. | 2111.00357 | null |
2021-10-01 | Robustly Removing Deep Sea Lighting Effects for Visual Mapping of Abyssal Plains | Kevin Köser et.al. | 2110.00480 | null |
2021-09-29 | Visually Grounded Concept Composition | Bowen Zhang et.al. | 2109.14115 | null |
2021-09-27 | HarrisZ $^+$ : Harris Corner Selection for Next-Gen Image Matching Pipelines | Fabio Bellavia et.al. | 2109.12925 | null |
2021-09-20 | Viewpoint Invariant Dense Matching for Visual Geolocalization | Gabriele Berton et.al. | 2109.09827 | link |
2021-09-20 | Image Subtraction in Fourier Space | Lei Hu et.al. | 2109.09334 | link |
2021-09-10 | Line as a Visual Sentence: Context-aware Line Descriptor for Visual Localization | Sungho Yoon et.al. | 2109.04753 | link |
2021-09-08 | Matching in the Dark: A Dataset for Matching Image Pairs of Low-light Scenes | Wenzheng Song et.al. | 2109.03585 | null |
2021-08-27 | A Matching Algorithm based on Image Attribute Transfer and Local Features for Underwater Acoustic and Optical Images | Xiaoteng Zhou et.al. | 2108.12151 | null |
2021-08-27 | Matching Underwater Sonar Images by the Learned Descriptor Based on Style Transfer Method | Xiaoteng Zhou et.al. | 2108.12072 | null |
2021-08-26 | Efficient Joint Object Matching via Linear Programming | Antonio De Rosa et.al. | 2108.11911 | null |
NeRF
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-12-19 | GSRender: Deduplicated Occupancy Prediction via Weakly Supervised 3D Gaussian Splatting | Qianpu Sun et.al. | 2412.14579 | null |
2024-12-19 | Bright-NeRF:Brightening Neural Radiance Field with Color Restoration from Low-light Raw Images | Min Wang et.al. | 2412.14547 | null |
2024-12-18 | GraphAvatar: Compact Head Avatars with GNN-Generated 3D Gaussians | Xiaobao Wei et.al. | 2412.13983 | link |
2024-12-17 | EOGS: Gaussian Splatting for Earth Observation | Luca Savant Aira et.al. | 2412.13047 | null |
2024-12-18 | Optimize the Unseen – Fast NeRF Cleanup with Free Space Prior | Leo Segre et.al. | 2412.12772 | null |
2024-12-17 | Towards a Training Free Approach for 3D Scene Editing | Vivek Madhavaram et.al. | 2412.12766 | null |
2024-12-16 | GS-ProCams: Gaussian Splatting-based Projector-Camera Systems | Qingyue Deng et.al. | 2412.11762 | null |
2024-12-18 | Sequence Matters: Harnessing Video Models in 3D Super-Resolution | Hyun-kyu Ko et.al. | 2412.11525 | null |
2024-12-16 | VRVVC: Variable-Rate NeRF-Based Volumetric Video Compression | Qiang Hu et.al. | 2412.11362 | null |
2024-12-13 | NeRF-Texture: Synthesizing Neural Radiance Field Textures | Yi-Hua Huang et.al. | 2412.10004 | null |
2024-12-13 | Sharpening Your Density Fields: Spiking Neuron Aided Fast Geometry Learning | Yi Gu et.al. | 2412.09881 | null |
2024-12-12 | PBR-NeRF: Inverse Rendering with Physics-Based Neural Fields | Sean Wu et.al. | 2412.09680 | link |
2024-12-11 | GN-FR:Generalizable Neural Radiance Fields for Flare Removal | Gopi Raju Matta et.al. | 2412.08200 | null |
2024-12-11 | NeRF-NQA: No-Reference Quality Assessment for Scenes Generated by NeRF and Neural View Synthesis Methods | Qiang Qu et.al. | 2412.08029 | link |
2024-12-10 | EventSplat: 3D Gaussian Splatting from Moving Event Cameras for Real-time Rendering | Toshiya Yura et.al. | 2412.07293 | null |
2024-12-09 | Diffusing Differentiable Representations | Yash Savani et.al. | 2412.06981 | null |
2024-12-09 | Dynamic EventNeRF: Reconstructing General Dynamic Scenes from Multi-view Event Cameras | Viktor Rudnev et.al. | 2412.06770 | null |
2024-12-09 | Deblur4DGS: 4D Gaussian Splatting from Blurry Monocular Video | Renlong Wu et.al. | 2412.06424 | link |
2024-12-09 | Splatter-360: Generalizable 360 $^{\circ}$ Gaussian Splatting for Wide-baseline Panoramic Images | Zheng Chen et.al. | 2412.06250 | link |
2024-12-07 | WATER-GS: Toward Copyright Protection for 3D Gaussian Splatting via Universal Watermarking | Yuqi Tan et.al. | 2412.05695 | null |
2024-12-06 | Perturb-and-Revise: Flexible 3D Editing with Generative Trajectories | Susung Hong et.al. | 2412.05279 | null |
2024-12-11 | MixedGaussianAvatar: Realistically and Geometrically Accurate Head Avatar via Mixed 2D-3D Gaussian Splatting | Peng Chen et.al. | 2412.04955 | link |
2024-12-04 | NeRF and Gaussian Splatting SLAM in the Wild | Fabian Schmidt et.al. | 2412.03263 | link |
2024-12-01 | SAGA: Surface-Aligned Gaussian Avatar | Ronghan Chen et.al. | 2412.00845 | null |
2024-12-01 | CtrlNeRF: The Generative Neural Radiation Fields for the Controllable Synthesis of High-fidelity 3D-Aware Images | Jian Liu et.al. | 2412.00754 | null |
2024-11-30 | Speedy-Splat: Fast 3D Gaussian Splatting with Sparse Pixels and Sparse Primitives | Alex Hanson et.al. | 2412.00578 | null |
2024-11-30 | Instant3dit: Multiview Inpainting for Fast Editing of 3D Objects | Amir Barda et.al. | 2412.00518 | null |
2024-11-29 | $C^{3}$ -NeRF: Modeling Multiple Scenes via Conditional-cum-Continual Neural Radiance Fields | Prajwal Singh et.al. | 2411.19903 | null |
2024-11-29 | Gaussian Splashing: Direct Volumetric Rendering Underwater | Nir Mualem et.al. | 2411.19588 | null |
2024-11-29 | ReconDreamer: Crafting World Models for Driving Scene Reconstruction via Online Restoration | Chaojun Ni et.al. | 2411.19548 | null |
2024-11-29 | LokiTalk: Learning Fine-Grained and Generalizable Correspondences to Enhance NeRF-based Talking Head Synthesis | Tianqi Li et.al. | 2411.19525 | null |
2024-11-28 | SAMa: Material-aware 3D Selection and Segmentation | Michael Fischer et.al. | 2411.19322 | null |
2024-11-27 | Surf-NeRF: Surface Regularised Neural Radiance Fields | Jack Naylor et.al. | 2411.18652 | null |
2024-11-26 | MLI-NeRF: Multi-Light Intrinsic-Aware Neural Radiance Fields | Yixiong Yang et.al. | 2411.17235 | link |
2024-11-25 | The Radiance of Neural Fields: Democratizing Photorealistic and Dynamic Robotic Simulation | Georgina Nuthall et.al. | 2411.16940 | null |
2024-11-27 | SplatAD: Real-Time Lidar and Camera Rendering with 3D Gaussian Splatting for Autonomous Driving | Georg Hess et.al. | 2411.16816 | link |
2024-11-25 | Quadratic Gaussian Splatting for Efficient and Detailed Surface Reconstruction | Ziyu Zhang et.al. | 2411.16392 | null |
2024-11-25 | U2NeRF: Unsupervised Underwater Image Restoration and Neural Radiance Fields | Vinayak Gupta et.al. | 2411.16172 | null |
2024-11-24 | ZeroGS: Training 3D Gaussian Splatting from Unposed Images | Yu Chen et.al. | 2411.15779 | null |
2024-11-24 | GSurf: 3D Reconstruction via Signed Distance Fields with Direct Gaussian Supervision | Xu Baixin et.al. | 2411.15723 | link |
2024-11-23 | NeRF Inpainting with Geometric Diffusion Prior and Balanced Score Distillation | Menglin Zhang et.al. | 2411.15551 | null |
2024-11-23 | SplatSDF: Boosting Neural Implicit SDF via Gaussian Splatting Fusion | Runfa Blark Li et.al. | 2411.15468 | null |
2024-11-20 | Sparse Input View Synthesis: 3D Representations and Reliable Priors | Nagabhushan Somraj et.al. | 2411.13631 | null |
2024-11-20 | Robust SG-NeRF: Robust Scene Graph Aided Neural Surface Reconstruction | Yi Gu et.al. | 2411.13620 | null |
2024-11-20 | GazeGaussian: High-Fidelity Gaze Redirection with 3D Gaussian Splatting | Xiaobao Wei et.al. | 2411.12981 | null |
2024-11-25 | SCIGS: 3D Gaussians Splatting from a Snapshot Compressive Image | Zixu Wang et.al. | 2411.12471 | null |
2024-11-19 | GaussianPretrain: A Simple Unified 3D Gaussian Representation for Visual Pre-training in Autonomous Driving | Shaoqing Xu et.al. | 2411.12452 | link |
2024-11-18 | Towards Degradation-Robust Reconstruction in Generalizable NeRF | Chan Ho Park et.al. | 2411.11691 | null |
2024-11-18 | LeC $^2$ O-NeRF: Learning Continuous and Compact Large-Scale Occupancy for Urban Scenes | Zhenxing Mi et.al. | 2411.11374 | null |
2024-11-15 | The Oxford Spires Dataset: Benchmarking Large-Scale LiDAR-Visual Localisation, Reconstruction and Radiance Field Methods | Yifu Tao et.al. | 2411.10546 | null |
2024-11-15 | USP-Gaussian: Unifying Spike-based Image Reconstruction, Pose Correction and Gaussian Splatting | Kang Chen et.al. | 2411.10504 | link |
2024-11-15 | GSEditPro: 3D Gaussian Splatting Editing with Attention-based Progressive Localization | Yanhao Sun et.al. | 2411.10033 | null |
2024-11-22 | BillBoard Splatting (BBSplat): Learnable Textured Primitives for Novel View Synthesis | David Svitov et.al. | 2411.08508 | link |
2024-11-13 | Biomass phenotyping of oilseed rape through UAV multi-view oblique imaging with 3DGS and SAM model | Yutao Shen et.al. | 2411.08453 | null |
2024-11-13 | MBA-SLAM: Motion Blur Aware Dense Visual SLAM with Radiance Fields Representation | Peng Wang et.al. | 2411.08279 | link |
2024-11-12 | TomoGRAF: A Robust and Generalizable Reconstruction Network for Single-View Computed Tomography | Di Xu et.al. | 2411.08158 | null |
2024-11-12 | Material Transforms from Disentangled NeRF Representations | Ivan Lopes et.al. | 2411.08037 | link |
2024-11-11 | LuSh-NeRF: Lighting up and Sharpening NeRFs for Low-light Scenes | Zefan Qu et.al. | 2411.06757 | null |
2024-11-10 | Through the Curved Cover: Synthesizing Cover Aberrated Scenes with Refractive Field | Liuyue Xie et.al. | 2411.06365 | null |
2024-11-09 | AI-Driven Stylization of 3D Environments | Yuanbo Chen et.al. | 2411.06067 | null |
2024-11-08 | A Nerf-Based Color Consistency Method for Remote Sensing Images | Zongcheng Zuo et.al. | 2411.05557 | null |
2024-11-08 | Rate-aware Compression for NeRF-based Volumetric Video | Zhiyu Zhang et.al. | 2411.05322 | null |
2024-11-07 | Planar Reflection-Aware Neural Radiance Fields | Chen Gao et.al. | 2411.04984 | null |
2024-11-07 | GANESH: Generalizable NeRF for Lensless Imaging | Rakesh Raj Madavan et.al. | 2411.04810 | null |
2024-11-08 | SuperQ-GRASP: Superquadrics-based Grasp Pose Estimation on Larger Objects for Mobile-Manipulation | Xun Tu et.al. | 2411.04386 | null |
2024-11-06 | Structure Consistent Gaussian Splatting with Matching Prior for Few-shot Novel View Synthesis | Rui Peng et.al. | 2411.03637 | link |
2024-11-05 | Enhancing Exploratory Capability of Visual Navigation Using Uncertainty of Implicit Scene Representation | Yichen Wang et.al. | 2411.03487 | link |
2024-11-05 | CAD-NeRF: Learning NeRFs from Uncalibrated Few-view Images by CAD Model Retrieval | Xin Wen et.al. | 2411.02979 | null |
2024-11-05 | Exploring Seasonal Variability in the Context of Neural Radiance Fields for 3D Reconstruction on Satellite Imagery | Liv Kåreborn et.al. | 2411.02972 | null |
2024-11-05 | Multi-modal NeRF Self-Supervision for LiDAR Semantic Segmentation | Xavier Timoneda et.al. | 2411.02969 | null |
2024-11-04 | NeRF-Aug: Data Augmentation for Robotics with Neural Radiance Fields | Eric Zhu et.al. | 2411.02482 | null |
2024-11-05 | FewViewGS: Gaussian Splatting with Few View Matching and Multi-stage Training | Ruihong Yin et.al. | 2411.02229 | null |
2024-11-06 | GVKF: Gaussian Voxel Kernel Functions for Highly Efficient Surface Reconstruction in Open Scenes | Gaochao Song et.al. | 2411.01853 | null |
2024-11-04 | A Probabilistic Formulation of LiDAR Mapping with Neural Radiance Fields | Matthew McDermott et.al. | 2411.01725 | link |
2024-11-01 | ZIM: Zero-Shot Image Matting for Anything | Beomyoung Kim et.al. | 2411.00626 | link |
2024-10-31 | Scaled Inverse Graphics: Efficiently Learning Large Sets of 3D Scenes | Karim Kassab et.al. | 2410.23742 | null |
2024-10-31 | Get a Grip: Multi-Finger Grasp Evaluation at Scale Enables Robust Sim-to-Real Transfer | Tyler Ga Wei Lum et.al. | 2410.23701 | null |
2024-10-31 | XRDSLAM: A Flexible and Modular Framework for Deep Learning based SLAM | Xiaomeng Wang et.al. | 2410.23690 | link |
2024-10-30 | Bringing NeRFs to the Latent Space: Inverse Graphics Autoencoder | Antoine Schnepf et.al. | 2410.22936 | null |
2024-10-28 | MVSDet: Multi-View Indoor 3D Object Detection via Efficient Plane Sweeps | Yating Xu et.al. | 2410.21566 | link |
2024-10-29 | EEG-Driven 3D Object Reconstruction with Color Consistency and Diffusion Prior | Xin Xiang et.al. | 2410.20981 | null |
2024-10-28 | ODGS: 3D Scene Reconstruction from Omnidirectional Images with 3D Gaussian Splattings | Suyoung Lee et.al. | 2410.20686 | link |
2024-10-27 | GUMBEL-NERF: Representing Unseen Objects as Part-Compositional Neural Radiance Fields | Yusuke Sekikawa et.al. | 2410.20306 | null |
2024-10-25 | Content-Aware Radiance Fields: Aligning Model Complexity with Scene Intricacy Through Learned Bitwidth Quantization | Weihang Liu et.al. | 2410.19483 | link |
2024-10-25 | Evaluation of strategies for efficient rate-distortion NeRF streaming | Pedro Martin et.al. | 2410.19459 | null |
2024-10-27 | Binocular-Guided 3D Gaussian Splatting with View Consistency for Sparse View Synthesis | Liang Han et.al. | 2410.18822 | null |
2024-10-24 | Real-time 3D-aware Portrait Video Relighting | Ziqi Cai et.al. | 2410.18355 | link |
2024-10-22 | Advancing Super-Resolution in Neural Radiance Fields via Variational Diffusion Strategies | Shrey Vishen et.al. | 2410.18137 | link |
2024-10-23 | VR-Splatting: Foveated Radiance Field Rendering via 3D Gaussian Splatting and Neural Points | Linus Franke et.al. | 2410.17932 | null |
2024-10-23 | Few-shot NeRF by Adaptive Rendering Loss Regularization | Qingshan Xu et.al. | 2410.17839 | null |
2024-10-23 | Efficient Neural Implicit Representation for 3D Human Reconstruction | Zexu Huang et.al. | 2410.17741 | link |
2024-10-23 | PLGS: Robust Panoptic Lifting with 3D Gaussian Splatting | Yu Wang et.al. | 2410.17505 | null |
2024-10-22 | LVSM: A Large View Synthesis Model with Minimal 3D Inductive Bias | Haian Jin et.al. | 2410.17242 | null |
2024-10-18 | GS-LIVM: Real-Time Photo-Realistic LiDAR-Inertial-Visual Mapping with Gaussian Splatting | Yusen Xie et.al. | 2410.17084 | null |
2024-10-22 | E-3DGS: Gaussian Splatting with Exposure and Motion Events | Xiaoting Yin et.al. | 2410.16995 | link |
2024-10-21 | Joker: Conditional 3D Head Synthesis with Extreme Facial Expressions | Malte Prinzler et.al. | 2410.16395 | null |
2024-10-21 | FrugalNeRF: Fast Convergence for Few-shot Novel View Synthesis without Learned Priors | Chin-Yang Lin et.al. | 2410.16271 | null |
2024-10-22 | EF-3DGS: Event-Aided Free-Trajectory 3D Gaussian Splatting | Bohao Liao et.al. | 2410.15392 | null |
2024-10-19 | Neural Radiance Field Image Refinement through End-to-End Sampling Point Optimization | Kazuhiro Ohta et.al. | 2410.14958 | null |
2024-10-18 | Learning autonomous driving from aerial imagery | Varun Murali et.al. | 2410.14177 | null |
2024-10-18 | DaRePlane: Direction-aware Representations for Dynamic Scene Reconstruction | Ange Lou et.al. | 2410.14169 | null |
2024-10-17 | DN-4DGS: Denoised Deformable Network with Temporal-Spatial Aggregation for Dynamic Scene Rendering | Jiahao Lu et.al. | 2410.13607 | link |
2024-10-21 | DriveDreamer4D: World Models Are Effective Data Machines for 4D Driving Scene Representation | Guosheng Zhao et.al. | 2410.13571 | null |
2024-10-17 | Object Pose Estimation Using Implicit Representation For Transparent Objects | Varun Burde et.al. | 2410.13465 | null |
2024-10-17 | GlossyGS: Inverse Rendering of Glossy Objects with 3D Gaussian Splatting | Shuichang Lai et.al. | 2410.13349 | null |
2024-10-16 | 3D Gaussian Splatting in Robotics: A Survey | Siting Zhu et.al. | 2410.12262 | null |
2024-10-16 | EG-HumanNeRF: Efficient Generalizable Human NeRF Utilizing Human Prior for Sparse View | Zhaorong Wang et.al. | 2410.12242 | null |
2024-10-14 | 3DArticCyclists: Generating Simulated Dynamic 3D Cyclists for Human-Object Interaction (HOI) and Autonomous Driving Applications | Eduardo R. Corral-Soto et.al. | 2410.10782 | null |
2024-10-14 | NeRF-enabled Analysis-Through-Synthesis for ISAR Imaging of Small Everyday Objects with Sparse and Noisy UWB Radar Data | Md Farhan Tasnim Oshim et.al. | 2410.10085 | null |
2024-10-13 | Magnituder Layers for Implicit Neural Representations in 3D | Sang Min Kim et.al. | 2410.09771 | null |
2024-10-12 | Improving 3D Finger Traits Recognition via Generalizable Neural Rendering | Hongbin Xu et.al. | 2410.09582 | null |
2024-10-11 | SceneCraft: Layout-Guided 3D Scene Generation | Xiuyu Yang et.al. | 2410.09049 | link |
2024-10-11 | MeshGS: Adaptive Mesh-Aligned Gaussian Splatting for High-Quality Rendering | Jaehoon Choi et.al. | 2410.08941 | null |
2024-10-11 | Optimizing NeRF-based SLAM with Trajectory Smoothness Constraints | Yicheng He et.al. | 2410.08780 | null |
2024-10-10 | RGM: Reconstructing High-fidelity 3D Car Assets with Relightable 3D-GS Generative Model from a Single Image | Xiaoxue Chen et.al. | 2410.08181 | null |
2024-10-10 | IncEventGS: Pose-Free Gaussian Splatting from a Single Event Camera | Jian Huang et.al. | 2410.08107 | link |
2024-10-11 | NeRF-Accelerated Ecological Monitoring in Mixed-Evergreen Redwood Forest | Adam Korycki et.al. | 2410.07418 | link |
2024-10-09 | DreamMesh4D: Video-to-4D Generation with Sparse-Controlled Gaussian-Mesh Hybrid Representation | Zhiqi Li et.al. | 2410.06756 | null |
2024-10-09 | MimicTalk: Mimicking a personalized and expressive 3D talking face in minutes | Zhenhui Ye et.al. | 2410.06734 | null |
2024-10-09 | 3D Representation Methods: A Survey | Zhengren Wang et.al. | 2410.06475 | null |
2024-10-08 | Comparative Analysis of Novel View Synthesis and Photogrammetry for 3D Forest Stand Reconstruction and extraction of individual tree parameters | Guoji Tian et.al. | 2410.05772 | null |
2024-10-07 | Toward General Object-level Mapping from Sparse Views with 3D Diffusion Priors | Ziwei Liao et.al. | 2410.05514 | link |
2024-10-07 | PH-Dropout: Prctical Epistemic Uncertainty Quantification for View Synthesis | Chuanhao Sun et.al. | 2410.05468 | link |
2024-10-07 | LiDAR-GS:Real-time LiDAR Re-Simulation using Gaussian Splatting | Qifeng Chen et.al. | 2410.05111 | null |
2024-10-07 | 6DGS: Enhanced Direction-Aware Gaussian Splatting for Volumetric Rendering | Zhongpai Gao et.al. | 2410.04974 | null |
2024-10-07 | TeX-NeRF: Neural Radiance Fields from Pseudo-TeX Vision | Chonghao Zhong et.al. | 2410.04873 | null |
2024-10-06 | Deformable NeRF using Recursively Subdivided Tetrahedra | Zherui Qiu et.al. | 2410.04402 | null |
2024-10-05 | Hybrid NeRF-Stereo Vision: Pioneering Depth Estimation and 3D Reconstruction in Endoscopy | Pengcheng Chen et.al. | 2410.04041 | null |
2024-10-02 | MVGS: Multi-view-regulated Gaussian Splatting for Novel View Synthesis | Xiaobiao Du et.al. | 2410.02103 | link |
2024-10-03 | EVER: Exact Volumetric Ellipsoid Rendering for Real-time View Synthesis | Alexander Mai et.al. | 2410.01804 | null |
2024-10-02 | 3DGS-DET: Empower 3D Gaussian Splatting with Boundary Guidance and Box-Focused Sampling for 3D Object Detection | Yang Cao et.al. | 2410.01647 | link |
2024-10-02 | Gaussian Splatting in Mirrors: Reflection-Aware Rendering via Virtual Camera Optimization | Zihan Wang et.al. | 2410.01614 | null |
2024-10-02 | Gaussian-Det: Learning Closed-Surface Gaussians for 3D Object Detection | Hongru Yan et.al. | 2410.01404 | null |
2024-10-01 | GMT: Enhancing Generalizable Neural Rendering via Geometry-Driven Multi-Reference Texture Transfer | Youngho Yoon et.al. | 2410.00672 | link |
2024-09-30 | Distributed NeRF Learning for Collaborative Multi-Robot Perception | Hongrui Zhao et.al. | 2409.20289 | null |
2024-09-30 | Active Neural Mapping at Scale | Zijia Kuang et.al. | 2409.20276 | null |
2024-09-30 | OPONeRF: One-Point-One NeRF for Robust Neural Rendering | Yu Zheng et.al. | 2409.20043 | link |
2024-09-28 | G3R: Gradient Guided Generalizable Reconstruction | Yun Chen et.al. | 2409.19405 | null |
2024-09-26 | LightAvatar: Efficient Head Avatar as Dynamic Neural Light Field | Huan Wang et.al. | 2409.18057 | link |
2024-09-26 | Deblur e-NeRF: NeRF from Motion-Blurred Events under High-speed or Low-light Conditions | Weng Fei Low et.al. | 2409.17988 | null |
2024-09-26 | Neural Implicit Representation for Highly Dynamic LiDAR Mapping and Odometry | Qi Zhang et.al. | 2409.17729 | null |
2024-09-26 | TFS-NeRF: Template-Free NeRF for Semantic 3D Reconstruction of Dynamic Scene | Sandika Biswas et.al. | 2409.17459 | link |
2024-09-25 | SeaSplat: Representing Underwater Scenes with 3D Gaussian Splatting and a Physically Grounded Image Formation Model | Daniel Yang et.al. | 2409.17345 | null |
2024-09-25 | TalkinNeRF: Animatable Neural Fields for Full-Body Talking Humans | Aggelina Chatziagapi et.al. | 2409.16666 | null |
2024-09-26 | Gaussian Deja-vu: Creating Controllable 3D Gaussian Head-Avatars with Enhanced Generalization and Personalization Abilities | Peizhi Yan et.al. | 2409.16147 | link |
2024-09-24 | Disentangled Generation and Aggregation for Robust Radiance Fields | Shihe Shen et.al. | 2409.15715 | null |
2024-09-24 | Plenoptic PNG: Real-Time Neural Radiance Fields in 150 KB | Jae Yong Lee et.al. | 2409.15689 | null |
2024-09-23 | AgriNeRF: Neural Radiance Fields for Agriculture in Challenging Lighting Conditions | Samarth Chopra et.al. | 2409.15487 | null |
2024-09-22 | MVPGS: Excavating Multi-view Priors for Gaussian Splatting from Sparse Input Views | Wangze Xu et.al. | 2409.14316 | null |
2024-09-21 | MOSE: Monocular Semantic Reconstruction Using NeRF-Lifted Noisy Priors | Zhenhua Du et.al. | 2409.14019 | null |
2024-09-19 | CrossRT: A cross platform programming technology for hardware-accelerated ray tracing in CG and CV applications | Vladimir Frolov et.al. | 2409.12617 | null |
2024-09-18 | JEAN: Joint Expression and Audio-guided NeRF-based Talking Face Generation | Sai Tanmay Reddy Chakkera et.al. | 2409.12156 | null |
2024-09-25 | BRDF-NeRF: Neural Radiance Fields with Optical Satellite Images and BRDF Modelling | Lulin Zhang et.al. | 2409.12014 | link |
2024-09-17 | RenderWorld: World Model with Self-Supervised 3D Label | Ziyang Yan et.al. | 2409.11356 | null |
2024-09-21 | HGSLoc: 3DGS-based Heuristic Camera Pose Refinement | Zhongyan Niu et.al. | 2409.10925 | null |
2024-09-16 | Baking Relightable NeRF for Real-time Direct/Indirect Illumination Rendering | Euntae Choi et.al. | 2409.10327 | null |
2024-09-16 | DENSER: 3D Gaussians Splatting for Scene Reconstruction of Dynamic Urban Environments | Mahmud A. Mohamad et.al. | 2409.10041 | link |
2024-09-15 | NARF24: Estimating Articulated Object Structure for Implicit Rendering | Stanley Lewis et.al. | 2409.09829 | null |
2024-09-12 | DreamHOI: Subject-Driven Generation of 3D Human-Object Interactions with Diffusion Priors | Thomas Hanwen Zhu et.al. | 2409.08278 | null |
2024-09-11 | DreamMesh: Jointly Manipulating and Texturing Triangle Meshes for Text-to-3D Generation | Haibo Yang et.al. | 2409.07454 | null |
2024-09-11 | ThermalGaussian: Thermal 3D Gaussian Splatting | Rongfeng Lu et.al. | 2409.07200 | null |
2024-09-10 | LEIA: Latent View-invariant Embeddings for Implicit 3D Articulation | Archana Swaminathan et.al. | 2409.06703 | null |
2024-09-10 | Sources of Uncertainty in 3D Scene Reconstruction | Marcus Klasson et.al. | 2409.06407 | link |
2024-09-09 | LSE-NeRF: Learning Sensor Modeling Errors for Deblured Neural Radiance Fields with RGB-Event Stereo | Wei Zhi Tang et.al. | 2409.06104 | link |
2024-09-09 | G-NeLF: Memory- and Data-Efficient Hybrid Neural Light Field for Novel View Synthesis | Lutao Jiang et.al. | 2409.05617 | null |
2024-09-09 | From Words to Poses: Enhancing Novel Object Pose Estimation with Vision Language Models | Tessa Pulli et.al. | 2409.05413 | null |
2024-09-09 | KRONC: Keypoint-based Robust Camera Optimization for 3D Car Reconstruction | Davide Di Nucci et.al. | 2409.05407 | null |
2024-09-09 | Lagrangian Hashing for Compressed Neural Field Representations | Shrisudhan Govindarajan et.al. | 2409.05334 | null |
2024-09-09 | Neural Surface Reconstruction and Rendering for LiDAR-Visual Systems | Jianheng Liu et.al. | 2409.05310 | null |
2024-09-06 | SCARF: Scalable Continual Learning Framework for Memory-efficient Multiple Neural Radiance Fields | Yuze Wang et.al. | 2409.04482 | null |
2024-09-05 | Weight Conditioning for Smooth Optimization of Neural Networks | Hemanth Saratchandran et.al. | 2409.03424 | null |
2024-09-05 | Optimizing 3D Gaussian Splatting for Sparse Viewpoint Scene Reconstruction | Shen Chen et.al. | 2409.03213 | null |
2024-09-04 | UC-NeRF: Uncertainty-aware Conditional Neural Radiance Fields from Endoscopic Sparse Views | Jiaxin Guo et.al. | 2409.02917 | link |
2024-09-03 | GraspSplats: Efficient Manipulation with 3D Feature Splatting | Mazeyu Ji et.al. | 2409.02084 | null |
2024-09-03 | $S^2$ NeRF: Privacy-preserving Training Framework for NeRF | Bokang Zhang et.al. | 2409.01661 | link |
2024-08-30 | ConDense: Consistent 2D/3D Pre-training for Dense and Sparse Features from Multi-View Images | Xiaoshuai Zhang et.al. | 2408.17027 | null |
2024-08-29 | GameIR: A Large-Scale Synthesized Ground-Truth Dataset for Image Restoration over Gaming Content | Lebin Zhou et.al. | 2408.16866 | null |
2024-09-01 | Generic Objects as Pose Probes for Few-Shot View Synthesis | Zhirui Gao et.al. | 2408.16690 | null |
2024-08-29 | Spurfies: Sparse Surface Reconstruction using Local Geometry Priors | Kevin Raj et.al. | 2408.16544 | null |
2024-08-29 | NeRF-CA: Dynamic Reconstruction of X-ray Coronary Angiography with Extremely Sparse-views | Kirsten W. H. Maas et.al. | 2408.16355 | link |
2024-08-28 | Towards Realistic Example-based Modeling via 3D Gaussian Stitching | Xinyu Gao et.al. | 2408.15708 | null |
2024-08-27 | Learning-based Multi-View Stereo: A Survey | Fangjinhua Wang et.al. | 2408.15235 | null |
2024-08-27 | GeoTransfer : Generalizable Few-Shot Multi-View Reconstruction via Transfer Learning | Shubhendu Jena et.al. | 2408.14724 | null |
2024-08-28 | FAST-LIVO2: Fast, Direct LiDAR-Inertial-Visual Odometry | Chunran Zheng et.al. | 2408.14035 | link |
2024-08-25 | TranSplat: Generalizable 3D Gaussian Splatting from Sparse Multi-View Images with Transformers | Chuanrui Zhang et.al. | 2408.13770 | null |
2024-08-24 | G3DST: Generalizing 3D Style Transfer with Neural Radiance Fields across Scenes and Styles | Adil Meric et.al. | 2408.13508 | null |
2024-08-23 | SIn-NeRF2NeRF: Editing 3D Scenes with Instructions through Segmentation and Inpainting | Jiseung Hong et.al. | 2408.13285 | link |
2024-08-21 | Visual Localization in 3D Maps: Comparing Point Cloud, Mesh, and NeRF Representations | Lintong Zhang et.al. | 2408.11966 | null |
2024-08-21 | Irregularity Inspection using Neural Radiance Field | Tianqi Ding et.al. | 2408.11251 | null |
2024-08-20 | GSLoc: Efficient Camera Pose Refinement via 3D Gaussian Splatting | Changkun Liu et.al. | 2408.11085 | null |
2024-08-20 | Learning Part-aware 3D Representations by Fusing 2D Gaussians and Superquadrics | Zhirui Gao et.al. | 2408.10789 | null |
2024-08-20 | TrackNeRF: Bundle Adjusting NeRF from Sparse and Noisy Views via Feature Tracks | Jinjie Mai et.al. | 2408.10739 | null |
2024-08-19 | $R^2$ -Mesh: Reinforcement Learning Powered Mesh Reconstruction via Geometry and Appearance Refinement | Haoyang Wang et.al. | 2408.10135 | null |
2024-08-19 | DiscoNeRF: Class-Agnostic Object Field for 3D Object Discovery | Corentin Dumery et.al. | 2408.09928 | null |
2024-08-20 | CHASE: 3D-Consistent Human Avatars with Sparse Inputs via Gaussian Splatting and Contrastive Learning | Haoyu Zhao et.al. | 2408.09663 | null |
2024-08-18 | S^3D-NeRF: Single-Shot Speech-Driven Neural Radiance Field for High Fidelity Talking Head Synthesis | Dongze Li et.al. | 2408.09347 | null |
2024-08-17 | SSNeRF: Sparse View Semi-supervised Neural Radiance Fields with Augmentation | Xiao Cao et.al. | 2408.09144 | null |
2024-08-17 | HybridOcc: NeRF Enhanced Transformer-based Multi-Camera 3D Occupancy Prediction | Xiao Zhao et.al. | 2408.09104 | null |
2024-08-16 | VF-NeRF: Learning Neural Vector Fields for Indoor Scene Reconstruction | Albert Gassol Puigjaner et.al. | 2408.08766 | link |
2024-08-15 | WaterSplatting: Fast Underwater 3D Scene Reconstruction Using Gaussian Splatting | Huapeng Li et.al. | 2408.08206 | null |
2024-08-18 | Rethinking Open-Vocabulary Segmentation of Radiance Fields in 3D Space | Hyunjee Lee et.al. | 2408.07416 | null |
2024-08-13 | Potamoi: Accelerating Neural Rendering via a Unified Streaming Architecture | Yu Feng et.al. | 2408.06608 | null |
2024-08-13 | ActiveNeRF: Learning Accurate 3D Geometry by Active Pattern Projection | Jianyu Tao et.al. | 2408.06592 | link |
2024-08-13 | HDRGS: High Dynamic Range Gaussian Splatting | Jiahao Wu et.al. | 2408.06543 | link |
2024-08-12 | Mipmap-GS: Let Gaussians Deform with Scale-specific Mipmap for Anti-aliasing Rendering | Jiameng Li et.al. | 2408.06286 | link |
2024-08-12 | 3D Reconstruction of Protein Structures from Multi-view AFM Images using Neural Radiance Fields (NeRFs) | Jaydeep Rade et.al. | 2408.06244 | null |
2024-08-10 | Radiance Field Learners As UAV First-Person Viewers | Liqi Yan et.al. | 2408.05533 | null |
2024-08-09 | DreamCouple: Exploring High Quality Text-to-3D Generation Via Rectified Flow | Hangyu Li et.al. | 2408.05008 | null |
2024-08-09 | FewShotNeRF: Meta-Learning-based Novel View Synthesis for Rapid Scene-Specific Adaptation | Piraveen Sivakumar et.al. | 2408.04803 | null |
2024-08-06 | LumiGauss: High-Fidelity Outdoor Relighting with 2D Gaussian Splatting | Joanna Kaleta et.al. | 2408.04474 | link |
2024-08-08 | A Review of 3D Reconstruction Techniques for Deformable Tissues in Robotic Surgery | Mengya Xu et.al. | 2408.04426 | link |
2024-08-08 | Evaluating Modern Approaches in 3D Scene Reconstruction: NeRF vs Gaussian-Based Methods | Yiming Zhou et.al. | 2408.04268 | null |
2024-08-07 | Goal-oriented Semantic Communication for the Metaverse Application | Zhe Wang et.al. | 2408.03646 | null |
2024-08-06 | RayGauss: Volumetric Gaussian-Based Ray Casting for Photorealistic Novel View Synthesis | Hugo Blanc et.al. | 2408.03356 | null |
2024-08-06 | Efficient NeRF Optimization – Not All Samples Remain Equally Hard | Juuso Korhonen et.al. | 2408.03193 | null |
2024-08-06 | MGFs: Masked Gaussian Fields for Meshing Building based on Multi-View Images | Tengfei Wang et.al. | 2408.03060 | null |
2024-08-04 | PanicleNeRF: low-cost, high-precision in-field phenotypingof rice panicles with smartphone | Xin Yang et.al. | 2408.02053 | null |
2024-08-03 | FBINeRF: Feature-Based Integrated Recurrent Network for Pinhole and Fisheye Neural Radiance Fields | Yifan Wu et.al. | 2408.01878 | null |
2024-08-03 | E $^3$ NeRF: Efficient Event-Enhanced Neural Radiance Fields from Blurry Images | Yunshan Qi et.al. | 2408.01840 | null |
2024-08-02 | NeRFoot: Robot-Footprint Estimation for Image-Based Visual Servoing | Daoxin Zhong et.al. | 2408.01251 | null |
2024-08-05 | UlRe-NeRF: 3D Ultrasound Imaging through Neural Rendering with Ultrasound Reflection Direction Parameterization | Ziwen Guo et.al. | 2408.00860 | null |
2024-07-31 | StyleRF-VolVis: Style Transfer of Neural Radiance Fields for Expressive Volume Visualization | Kaiyuan Tang et.al. | 2408.00150 | null |
2024-07-22 | PAV: Personalized Head Avatar from Unstructured Video Collection | Akin Caliskan et.al. | 2407.21047 | null |
2024-07-30 | Dynamic Scene Understanding through Object-Centric Voxelization and Neural Rendering | Yanpeng Zhao et.al. | 2407.20908 | link |
2024-07-29 | Radiance Fields for Robotic Teleoperation | Maximum Wilder-Smith et.al. | 2407.20194 | link |
2024-07-29 | Garment Animation NeRF with Color Editing | Renke Wang et.al. | 2407.19774 | link |
2024-07-27 | Revisit Self-supervised Depth Estimation with Local Structure-from-Motion | Shengjie Zhu et.al. | 2407.19166 | null |
2024-07-26 | IOVS4NeRF:Incremental Optimal View Selection for Large-Scale NeRFs | Jingpeng Xie et.al. | 2407.18611 | null |
2024-07-24 | SV4D: Dynamic 3D Content Generation with Multi-Frame and Multi-View Consistency | Yiming Xie et.al. | 2407.17470 | null |
2024-07-23 | HDRSplat: Gaussian Splatting for High Dynamic Range 3D Scene Reconstruction from Raw Images | Shreyas Singh et.al. | 2407.16503 | link |
2024-07-23 | DreamDissector: Learning Disentangled Text-to-3D Generation from 2D Diffusion Priors | Zizheng Yan et.al. | 2407.16260 | null |
2024-07-22 | BoostMVSNeRFs: Boosting MVS-based NeRFs to Generalizable View Synthesis in Large-scale Scenes | Chih-Hai Su et.al. | 2407.15848 | null |
2024-07-22 | Enhancement of 3D Gaussian Splatting using Raw Mesh for Photorealistic Recreation of Architectures | Ruizhe Wang et.al. | 2407.15435 | null |
2024-07-19 | HOTS3D: Hyper-Spherical Optimal Transport for Semantic Alignment of Text-to-3D Generation | Zezeng Li et.al. | 2407.14419 | null |
2024-07-19 | DirectL: Efficient Radiance Fields Rendering for 3D Light Field Displays | Zongyuan Yang et.al. | 2407.14053 | null |
2024-07-19 | Semantic Communications for 3D Human Face Transmission with Neural Radiance Fields | Guanlin Wu et.al. | 2407.13992 | null |
2024-07-18 | EaDeblur-GS: Event assisted 3D Deblur Reconstruction with Gaussian Splatting | Yuchen Weng et.al. | 2407.13520 | null |
2024-07-18 | GeometrySticker: Enabling Ownership Claim of Recolorized Neural Radiance Fields | Xiufeng Huang et.al. | 2407.13390 | null |
2024-07-18 | KFD-NeRF: Rethinking Dynamic NeRF with Kalman Filter | Yifan Zhan et.al. | 2407.13185 | null |
2024-07-17 | Generalizable Human Gaussians for Sparse View Synthesis | Youngjoong Kwon et.al. | 2407.12777 | link |
2024-07-17 | SG-NeRF: Neural Surface Reconstruction with Scene Graph Optimization | Yiyang Chen et.al. | 2407.12667 | link |
2024-07-17 | InfoNorm: Mutual Information Shaping of Normals for Sparse-View Reconstruction | Xulong Wang et.al. | 2407.12661 | link |
2024-07-17 | Invertible Neural Warp for NeRF | Shin-Fang Chng et.al. | 2407.12354 | null |
2024-07-17 | Splatfacto-W: A Nerfstudio Implementation of Gaussian Splatting for Unconstrained Photo Collections | Congrong Xu et.al. | 2407.12306 | null |
2024-07-18 | Motion-Oriented Compositional Neural Radiance Fields for Monocular Dynamic Human Modeling | Jaehyeok Kim et.al. | 2407.11962 | null |
2024-07-18 | IPA-NeRF: Illusory Poisoning Attack Against Neural Radiance Fields | Wenxiang Jiang et.al. | 2407.11921 | link |
2024-07-16 | DreamCatalyst: Fast and High-Quality 3D Editing via Controlling Editability and Identity Preservation | Jiwook Kim et.al. | 2407.11394 | link |
2024-07-15 | Evaluating geometric accuracy of NeRF reconstructions compared to SLAM method | Adam Korycki et.al. | 2407.11238 | null |
2024-07-15 | AirNeRF: 3D Reconstruction of Human with Drone and NeRF for Future Communication Systems | Alexey Kotcov et.al. | 2407.10865 | null |
2024-07-15 | Domain Generalization for 6D Pose Estimation Through NeRF-based Image Synthesis | Antoine Legrand et.al. | 2407.10762 | null |
2024-07-15 | IE-NeRF: Inpainting Enhanced Neural Radiance Fields in the Wild | Shuaixian Wang et.al. | 2407.10695 | null |
2024-07-15 | NGP-RT: Fusing Multi-Level Hash Features with Lightweight Attention for Real-Time Novel View Synthesis | Yubin Hu et.al. | 2407.10482 | null |
2024-07-15 | Boost Your NeRF: A Model-Agnostic Mixture of Experts Framework for High Quality and Efficient Rendering | Francesco Di Sario et.al. | 2407.10389 | null |
2024-07-14 | RS-NeRF: Neural Radiance Fields from Rolling Shutter Images | Muyao Niu et.al. | 2407.10267 | link |
2024-07-14 | SpikeGS: 3D Gaussian Splatting from Spike Streams with High-Speed Camera Motion | Jiyuan Zhang et.al. | 2407.10062 | null |
2024-07-12 | Physics-Informed Learning of Characteristic Trajectories for Smoke Reconstruction | Yiming Wang et.al. | 2407.09679 | link |
2024-07-12 | Radiance Fields from Photons | Sacha Jungerman et.al. | 2407.09386 | null |
2024-07-12 | HPC: Hierarchical Progressive Coding Framework for Volumetric Video | Zihan Zheng et.al. | 2407.09026 | null |
2024-07-11 | Feasibility of Neural Radiance Fields for Crime Scene Video Reconstruction | Shariq Nadeem Malik et.al. | 2407.08795 | null |
2024-07-11 | WildGaussians: 3D Gaussian Splatting in the Wild | Jonas Kulhanek et.al. | 2407.08447 | link |
2024-07-11 | MeshAvatar: Learning High-quality Triangular Human Avatars from Multi-view Videos | Yushuo Chen et.al. | 2407.08414 | link |
2024-07-11 | Explicit_NeRF_QA: A Quality Assessment Database for Explicit NeRF Model Compression | Yuke Xing et.al. | 2407.08165 | null |
2024-07-11 | Bayesian uncertainty analysis for underwater 3D reconstruction with neural radiance fields | Haojie Lian et.al. | 2407.08154 | null |
2024-07-11 | Survey on Fundamental Deep Learning 3D Reconstruction Techniques | Yonge Bai et.al. | 2407.08137 | null |
2024-07-10 | Protecting NeRFs’ Copyright via Plug-And-Play Watermarking Base Model | Qi Song et.al. | 2407.07735 | null |
2024-07-10 | Drantal-NeRF: Diffusion-Based Restoration for Anti-aliasing Neural Radiance Field | Ganlin Yang et.al. | 2407.07461 | null |
2024-07-09 | Reference-based Controllable Scene Stylization with Gaussian Splatting | Yiqun Mei et.al. | 2407.07220 | null |
2024-07-09 | Sparse-DeRF: Deblurred Neural Radiance Fields from Sparse View | Dogyoon Lee et.al. | 2407.06613 | null |
2024-07-08 | RRM: Relightable assets using Radiance guided Material extraction | Diego Gomez et.al. | 2407.06397 | null |
2024-07-08 | PanDORA: Casual HDR Radiance Acquisition for Indoor Scenes | Mohammad Reza Karimi Dastjerdi et.al. | 2407.06150 | null |
2024-07-08 | Enhancing Neural Radiance Fields with Depth and Normal Completion Priors from Sparse Views | Jiawei Guo et.al. | 2407.05666 | null |
2024-07-08 | GeoNLF: Geometry guided Pose-Free Neural LiDAR Fields | Weiyi Xue et.al. | 2407.05597 | null |
2024-07-08 | Dynamic Neural Radiance Field From Defocused Monocular Video | Xianrui Luo et.al. | 2407.05586 | null |
2024-07-07 | GaussReg: Fast 3D Registration with Gaussian Splatting | Jiahao Chang et.al. | 2407.05254 | null |
2024-07-06 | SurgicalGaussian: Deformable 3D Gaussians for High-Fidelity Surgical Scene Reconstruction | Weixing Xie et.al. | 2407.05023 | null |
2024-07-04 | CRiM-GS: Continuous Rigid Motion-Aware Gaussian Splatting from Motion Blur Images | Junghe Lee et.al. | 2407.03923 | null |
2024-07-02 | MomentsNeRF: Leveraging Orthogonal Moments for Few-Shot Neural Rendering | Ahmad AlMughrabi et.al. | 2407.02668 | null |
2024-07-03 | BeNeRF: Neural Radiance Fields from a Single Blurry Image and Event Stream | Wenpu Li et.al. | 2407.02174 | link |
2024-07-01 | Active Human Pose Estimation via an Autonomous UAV Agent | Jingxi Chen et.al. | 2407.01811 | null |
2024-07-01 | DRAGON: Drone and Ground Gaussian Splatting for 3D Building Reconstruction | Yujin Ham et.al. | 2407.01761 | null |
2024-07-01 | Fast and Efficient: Mask Neural Fields for 3D Scene Segmentation | Zihan Gao et.al. | 2407.01220 | null |
2024-06-29 | Intrinsic PAPR for Point-level 3D Scene Albedo and Shading Editing | Alireza Moazeni et.al. | 2407.00500 | null |
2024-06-28 | ASSR-NeRF: Arbitrary-Scale Super-Resolution on Voxel Grid for High-Quality Radiance Fields Reconstruction | Ding-Jiun Huang et.al. | 2406.20066 | null |
2024-06-28 | EgoGaussian: Dynamic Scene Understanding from Egocentric Video with 3D Gaussian Splatting | Daiwei Zhang et.al. | 2406.19811 | null |
2024-06-27 | Shorter SPECT Scans Using Self-supervised Coordinate Learning to Synthesize Skipped Projection Views | Zongyu Li et.al. | 2406.18840 | null |
2024-06-25 | Implicit-Zoo: A Large-Scale Dataset of Neural Implicit Functions for 2D Images and 3D Scenes | Qi Ma et.al. | 2406.17438 | link |
2024-06-25 | NerfBaselines: Consistent and Reproducible Evaluation of Novel View Synthesis Methods | Jonas Kulhanek et.al. | 2406.17345 | null |
2024-06-24 | From Perfect to Noisy World Simulation: Customizable Embodied Multi-modal Perturbations for SLAM Robustness Benchmarking | Xiaohao Xu et.al. | 2406.16850 | link |
2024-06-24 | Articulate your NeRF: Unsupervised articulated object modeling via conditional view synthesis | Jianning Deng et.al. | 2406.16623 | null |
2024-06-24 | Crowd-Sourced NeRF: Collecting Data from Production Vehicles for 3D Street View Reconstruction | Tong Qin et.al. | 2406.16289 | null |
2024-06-23 | Towards Real-Time Neural Volumetric Rendering on Mobile Devices: A Measurement Study | Zhe Wang et.al. | 2406.16068 | null |
2024-06-23 | Learning with Noisy Ground Truth: From 2D Classification to 3D Reconstruction | Yangdi Lu et.al. | 2406.15982 | null |
2024-06-22 | psPRF:Pansharpening Planar Neural Radiance Field for Generalized 3D Reconstruction Satellite Imagery | Tongtong Zhang et.al. | 2406.15707 | null |
2024-06-21 | A3D: Does Diffusion Dream about 3D Alignment? | Savva Ignatyev et.al. | 2406.15020 | null |
2024-06-21 | E2GS: Event Enhanced Gaussian Splatting | Hiroyuki Deguchi et.al. | 2406.14978 | link |
2024-06-21 | Relighting Scenes with Object Insertions in Neural Radiance Fields | Xuening Zhu et.al. | 2406.14806 | null |
2024-06-20 | Deblurring Neural Radiance Fields with Event-driven Bundle Adjustment | Yunshan Qi et.al. | 2406.14360 | null |
2024-06-19 | NeRF-Feat: 6D Object Pose Estimation using Feature Rendering | Shishir Reddy Vutukur et.al. | 2406.13796 | null |
2024-06-19 | Style-NeRF2NeRF: 3D Style Transfer From Style-Aligned Multi-View Images | Haruo Fujiwara et.al. | 2406.13393 | null |
2024-06-19 | Freq-Mip-AA : Frequency Mip Representation for Anti-Aliasing Neural Radiance Fields | Youngin Park et.al. | 2406.13251 | link |
2024-06-18 | Sampling 3D Gaussian Scenes in Seconds with Latent Diffusion Models | Paul Henderson et.al. | 2406.13099 | null |
2024-06-18 | Head Pose Estimation and 3D Neural Surface Reconstruction via Monocular Camera in situ for Navigation and Safe Insertion into Natural Openings | Ruijie Tang et.al. | 2406.13048 | null |
2024-06-18 | Fast Global Localization on Neural Radiance Field | Mangyu Kong et.al. | 2406.12202 | null |
2024-06-20 | TutteNet: Injective 3D Deformations by Composition of 2D Mesh Deformations | Bo Sun et.al. | 2406.12121 | null |
2024-06-17 | DistillNeRF: Perceiving 3D Scenes from Single-Glance Images by Distilling Neural Fields and Foundation Model Features | Letian Wang et.al. | 2406.12095 | null |
2024-06-17 | Uncertainty modeling for fine-tuned implicit functions | Anna Susmelj et.al. | 2406.12082 | null |
2024-06-17 | LLaNA: Large Language and NeRF Assistant | Andrea Amaduzzi et.al. | 2406.11840 | null |
2024-06-17 | Matching Query Image Against Selected NeRF Feature for Efficient and Scalable Localization | Huaiji Zhou et.al. | 2406.11766 | null |
2024-06-17 | InterNeRF: Scaling Radiance Fields via Parameter Interpolation | Clinton Wang et.al. | 2406.11737 | null |
2024-06-17 | NLDF: Neural Light Dynamic Fields for Efficient 3D Talking Head Generation | Niu Guanchen et.al. | 2406.11259 | null |
2024-06-15 | NeRFDeformer: NeRF Transformation from a Single View via 3D Scene Flows | Zhenggang Tang et.al. | 2406.10543 | link |
2024-06-15 | Federated Neural Radiance Field for Distributed Intelligence | Yintian Zhang et.al. | 2406.10474 | null |
2024-06-14 | Wild-GS: Real-Time Novel View Synthesis from Unconstrained Photo Collections | Jiacong Xu et.al. | 2406.10373 | null |
2024-06-14 | PUP 3D-GS: Principled Uncertainty Pruning for 3D Gaussian Splatting | Alex Hanson et.al. | 2406.10219 | link |
2024-06-14 | GaussianSR: 3D Gaussian Super-Resolution with 2D Diffusion Priors | Xiqian Yu et.al. | 2406.10111 | null |
2024-06-14 | OrientDream: Streamlining Text-to-3D Generation with Explicit Orientation Control | Yuzhong Huang et.al. | 2406.10000 | null |
2024-06-14 | dGrasp: NeRF-Informed Implicit Grasp Policies with Supervised Optimization Slopes | Gergely Sóti et.al. | 2406.09939 | null |
2024-06-14 | RaNeuS: Ray-adaptive Neural Surface Reconstruction | Yida Wang et.al. | 2406.09801 | link |
2024-06-13 | Rethinking Score Distillation as a Bridge Between Image Distributions | David McAllister et.al. | 2406.09417 | null |
2024-06-13 | Preserving Identity with Variational Score for General-purpose 3D Editing | Duong H. Le et.al. | 2406.08953 | null |
2024-06-13 | Neural NeRF Compression | Tuan Pham et.al. | 2406.08943 | null |
2024-06-14 | AV-GS: Learning Material and Geometry Aware Priors for Novel View Acoustic Synthesis | Swapnil Bhosale et.al. | 2406.08920 | null |
2024-06-13 | NeRF Director: Revisiting View Selection in Neural Volume Rendering | Wenhui Xiao et.al. | 2406.08839 | null |
2024-06-12 | ICE-G: Image Conditional Editing of 3D Gaussian Splats | Vishnu Jaganathan et.al. | 2406.08488 | null |
2024-06-12 | OpenObj: Open-Vocabulary Object-Level Neural Radiance Fields with Fine-Grained Understanding | Yinan Deng et.al. | 2406.08009 | link |
2024-06-12 | Spatial Annealing Smoothing for Efficient Few-shot Neural Rendering | Yuru Xiao et.al. | 2406.07828 | link |
2024-06-11 | C3DAG: Controlled 3D Animal Generation using 3D pose guidance | Sandeep Mishra et.al. | 2406.07742 | null |
2024-06-11 | M-LRM: Multi-view Large Reconstruction Model | Mengfei Li et.al. | 2406.07648 | null |
2024-06-11 | Active Scout: Multi-Target Tracking Using Neural Radiance Fields in Dense Urban Environments | Christopher D. Hsu et.al. | 2406.07431 | null |
2024-06-11 | Generative Lifting of Multiview to 3D from Unknown Pose: Wrapping NeRF inside Diffusion | Xin Yuan et.al. | 2406.06972 | null |
2024-06-11 | Neural Visibility Field for Uncertainty-Driven Active Mapping | Shangjie Xue et.al. | 2406.06948 | null |
2024-06-10 | IllumiNeRF: 3D Relighting without Inverse Rendering | Xiaoming Zhao et.al. | 2406.06527 | null |
2024-06-10 | GaussianCity: Generative Gaussian Splatting for Unbounded 3D City Generation | Haozhe Xie et.al. | 2406.06526 | link |
2024-06-10 | PGSR: Planar-based Gaussian Splatting for Efficient and High-Fidelity Surface Reconstruction | Danpeng Chen et.al. | 2406.06521 | null |
2024-06-10 | Lighting Every Darkness with 3DGS: Fast Training and Real-Time Rendering for HDR View Synthesis | Xin Jin et.al. | 2406.06216 | link |
2024-06-10 | ExtraNeRF: Visibility-Aware View Extrapolation of Neural Radiance Fields with Diffusion Models | Meng-Li Shih et.al. | 2406.06133 | null |
2024-06-09 | GTR: Improving Large 3D Reconstruction Models through Geometry and Texture Refinement | Peiye Zhuang et.al. | 2406.05649 | null |
2024-06-07 | Multiplane Prior Guided Few-Shot Aerial Scene Rendering | Zihan Gao et.al. | 2406.04961 | null |
2024-06-07 | Multi-style Neural Radiance Field with AdaIN | Yu-Wen Pao et.al. | 2406.04960 | link |
2024-06-06 | Improving Physics-Augmented Continuum Neural Radiance Field-Based Geometry-Agnostic System Identification with Lagrangian Particle Optimization | Takuhiro Kaneko et.al. | 2406.04155 | null |
2024-06-06 | How Far Can We Compress Instant-NGP-Based NeRF? | Yihang Chen et.al. | 2406.04101 | link |
2024-06-06 | Gear-NeRF: Free-Viewpoint Rendering and Tracking with Motion-aware Spatio-Temporal Sampling | Xinhang Liu et.al. | 2406.03723 | null |
2024-06-06 | Superpoint Gaussian Splatting for Real-Time High-Fidelity Dynamic Scene Reconstruction | Diwen Wan et.al. | 2406.03697 | link |
2024-06-04 | 3D-HGS: 3D Half-Gaussian Splatting | Haolin Li et.al. | 2406.02720 | link |
2024-06-06 | Enhancing Temporal Consistency in Video Editing by Reconstructing Videos with 3D Gaussian Splatting | Inkyu Shin et.al. | 2406.02541 | null |
2024-06-04 | Query-based Semantic Gaussian Field for Scene Representation in Reinforcement Learning | Jiaxu Wang et.al. | 2406.02370 | null |
2024-06-03 | Reconstructing and Simulating Dynamic 3D Objects with Mesh-adsorbed Gaussian Splatting | Shaojie Ma et.al. | 2406.01593 | null |
2024-06-03 | Tetrahedron Splatting for 3D Generation | Chun Gu et.al. | 2406.01579 | link |
2024-06-03 | Self-Calibrating 4D Novel View Synthesis from Monocular Videos Using Gaussian Splatting | Fang Li et.al. | 2406.01042 | link |
2024-06-02 | PruNeRF: Segment-Centric Dataset Pruning via 3D Spatial Consistency | Yeonsung Jung et.al. | 2406.00798 | null |
2024-06-02 | Representing Animatable Avatar via Factorized Neural Fields | Chunjin Song et.al. | 2406.00637 | null |
2024-06-04 | SuperGaussian: Repurposing Video Models for 3D Super Resolution | Yuan Shen et.al. | 2406.00609 | null |
2024-06-02 | Efficient Neural Light Fields (ENeLF) for Mobile Devices | Austin Peng et.al. | 2406.00598 | null |
2024-06-01 | Bilateral Guided Radiance Field Processing | Yuehao Wang et.al. | 2406.00448 | null |
2024-05-31 | R $^2$ -Gaussian: Rectifying Radiative Gaussian Splatting for Tomographic Reconstruction | Ruyi Zha et.al. | 2405.20693 | link |
2024-05-31 | 4Diffusion: Multi-view Video Diffusion Model for 4D Generation | Haiyu Zhang et.al. | 2405.20674 | null |
2024-05-30 | $\textit{S}^3$ Gaussian: Self-Supervised Street Gaussians for Autonomous Driving | Nan Huang et.al. | 2405.20323 | link |
2024-05-30 | TetSphere Splatting: Representing High-Quality Geometry with Lagrangian Volumetric Meshes | Minghao Guo et.al. | 2405.20283 | null |
2024-05-31 | NeRF View Synthesis: Subjective Quality Assessment and Objective Metrics Evaluation | Pedro Martin et.al. | 2405.20078 | null |
2024-05-30 | IReNe: Instant Recoloring in Neural Radiance Fields | Alessio Mazzucchelli et.al. | 2405.19876 | null |
2024-05-30 | HINT: Learning Complete Human Neural Representations from Limited Viewpoints | Alessandro Sanvito et.al. | 2405.19712 | null |
2024-05-30 | View-Consistent Hierarchical 3D SegmentationUsing Ultrametric Feature Fields | Haodi He et.al. | 2405.19678 | link |
2024-05-29 | Neural Radiance Fields for Novel View Synthesis in Monocular Gastroscopy | Zijie Jiang et.al. | 2405.18863 | null |
2024-06-02 | NeRF On-the-go: Exploiting Uncertainty for Distractor-free NeRFs in the Wild | Weining Ren et.al. | 2405.18715 | link |
2024-05-28 | Self-supervised Pre-training for Transferable Multi-modal Perception | Xiaohao Xu et.al. | 2405.17942 | link |
2024-05-28 | A Refined 3D Gaussian Representation for High-Quality Dynamic Scene Reconstruction | Bin Zhang et.al. | 2405.17891 | null |
2024-05-29 | HFGS: 4D Gaussian Splatting with Emphasis on Spatial and Temporal High-Frequency Components for Endoscopic Scene Reconstruction | Haoyu Zhao et.al. | 2405.17872 | link |
2024-05-28 | Mani-GS: Gaussian Splatting Manipulation with Triangular Mesh | Xiangjun Gao et.al. | 2405.17811 | null |
2024-05-28 | F-3DGS: Factorized Coordinates and Representations for 3D Gaussian Splatting | Xiangyu Sun et.al. | 2405.17083 | null |
2024-05-29 | PyGS: Large-scale Scene Representation with Pyramidal 3D Gaussian Splatting | Zipeng Wang et.al. | 2405.16829 | null |
2024-05-26 | Sp2360: Sparse-view 360 Scene Reconstruction using Cascaded 2D Diffusion Priors | Soumava Paul et.al. | 2405.16517 | null |
2024-05-24 | Neural Elevation Models for Terrain Mapping and Path Planning | Adam Dai et.al. | 2405.15227 | link |
2024-05-27 | HDR-GS: Efficient High Dynamic Range Novel View Synthesis at 1000x Speed via Gaussian Splatting | Yuanhao Cai et.al. | 2405.15125 | link |
2024-05-24 | GS-Hider: Hiding Messages into 3D Gaussian Splatting | Xuanyu Zhang et.al. | 2405.15118 | null |
2024-05-23 | NeRF-Casting: Improved View-Dependent Appearance with Consistent Reflections | Dor Verbin et.al. | 2405.14871 | null |
2024-05-23 | Neural Directional Encoding for Efficient and Accurate View-Dependent Appearance Modeling | Liwen Wu et.al. | 2405.14847 | null |
2024-05-23 | Camera Relocalization in Shadow-free Neural Radiance Fields | Shiyao Xu et.al. | 2405.14824 | link |
2024-05-23 | LDM: Large Tensorial SDF Model for Textured Mesh Generation | Rengan Xie et.al. | 2405.14580 | link |
2024-05-23 | JointRF: End-to-End Joint Optimization for Dynamic Neural Radiance Field Representation and Compression | Zihan Zheng et.al. | 2405.14452 | null |
2024-05-22 | DoGaussian: Distributed-Oriented Gaussian Splatting for Large-Scale 3D Reconstruction Via Gaussian Consensus | Yu Chen et.al. | 2405.13943 | link |
2024-05-22 | Gaussian Time Machine: A Real-Time Rendering Methodology for Time-Variant Appearances | Licheng Shen et.al. | 2405.13694 | null |
2024-05-21 | MOSS: Motion-based 3D Clothed Human Synthesis from Monocular Video | Hongsheng Wang et.al. | 2405.12806 | null |
2024-05-21 | Leveraging Neural Radiance Fields for Pose Estimation of an Unknown Space Object during Proximity Operations | Antoine Legrand et.al. | 2405.12728 | null |
2024-05-20 | Fast Generalizable Gaussian Splatting Reconstruction from Multi-View Stereo | Tianqi Liu et.al. | 2405.12218 | link |
2024-05-20 | Embracing Radiance Field Rendering in 6G: Over-the-Air Training and Inference with 3D Contents | Guanlin Wu et.al. | 2405.12155 | null |
2024-05-20 | NPLMV-PS: Neural Point-Light Multi-View Photometric Stereo | Fotios Logothetis et.al. | 2405.12057 | null |
2024-05-19 | Searching Realistic-Looking Adversarial Objects For Autonomous Driving Systems | Shengxiang Sun et.al. | 2405.11629 | null |
2024-05-19 | R-NeRF: Neural Radiance Fields for Modeling RIS-enabled Wireless Environments | Huiying Yang et.al. | 2405.11541 | link |
2024-05-18 | MotionGS : Compact Gaussian Splatting SLAM by Motion Filter | Xinli Guo et.al. | 2405.11129 | link |
2024-05-16 | When LLMs step into the 3D World: A Survey and Meta-Analysis of 3D Tasks via Multi-modal Large Language Models | Xianzheng Ma et.al. | 2405.10255 | link |
2024-05-15 | From NeRFs to Gaussian Splats, and Back | Siming He et.al. | 2405.09717 | link |
2024-05-14 | Dynamic NeRF: A Review | Jinwei Lin et.al. | 2405.08609 | null |
2024-05-13 | Synergistic Integration of Coordinate Network and Tensorial Feature for Improving Neural Radiance Fields from Sparse Inputs | Mingyu Kim et.al. | 2405.07857 | link |
2024-05-12 | Point Resampling and Ray Transformation Aid to Editable NeRF Models | Zhenyang Li et.al. | 2405.07306 | null |
2024-05-12 | Hologram: Realtime Holographic Overlays via LiDAR Augmented Reconstruction | Ekansh Agrawal et.al. | 2405.07178 | null |
2024-05-11 | TD-NeRF: Novel Truncated Depth Prior for Joint Camera Pose and Neural Radiance Field Optimization | Zhen Tan et.al. | 2405.07027 | link |
2024-05-10 | LIVE: LaTex Interactive Visual Editing | Jinwei Lin et.al. | 2405.06762 | null |
2024-05-14 | SketchDream: Sketch-based Text-to-3D Generation and Editing | Feng-Lin Liu et.al. | 2405.06461 | null |
2024-05-10 | Aerial-NeRF: Adaptive Spatial Partitioning and Sampling for Large-Scale Aerial Rendering | Xiaohan Zhang et.al. | 2405.06214 | null |
2024-05-10 | Residual-NeRF: Learning Residual NeRFs for Transparent Object Manipulation | Bardienus P. Duisterhof et.al. | 2405.06181 | null |
2024-05-09 | DragGaussian: Enabling Drag-style Manipulation on 3D Gaussian Representation | Sitian Shen et.al. | 2405.05800 | null |
2024-05-10 | NeRFFaceSpeech: One-shot Audio-driven 3D Talking Head Synthesis via Generative Prior | Gihoon Kim et.al. | 2405.05749 | null |
2024-05-09 | RPBG: Towards Robust Neural Point-based Graphics in the Wild | Qingtian Zhu et.al. | 2405.05663 | link |
2024-05-09 | Benchmarking Neural Radiance Fields for Autonomous Robots: An Overview | Yuhang Ming et.al. | 2405.05526 | null |
2024-05-08 | ${M^2D}$ NeRF: Multi-Modal Decomposition NeRF with 3D Feature Fields | Ning Wang et.al. | 2405.05010 | null |
2024-05-08 | DistGrid: Scalable Scene Reconstruction with Distributed Multi-resolution Hash Grid | Sidun Liu et.al. | 2405.04416 | null |
2024-05-07 | Novel View Synthesis with Neural Radiance Fields for Industrial Robot Applications | Markus Hillemann et.al. | 2405.04345 | null |
2024-05-05 | Blending Distributed NeRFs with Tri-stage Robust Pose Optimization | Baijun Ye et.al. | 2405.02880 | null |
2024-05-05 | MVIP-NeRF: Multi-view 3D Inpainting on NeRF Scenes via Diffusion Prior | Honghua Chen et.al. | 2405.02859 | null |
2024-05-04 | TK-Planes: Tiered K-Planes with High Dimensional Feature Vectors for Dynamic UAV-based Scenes | Christopher Maxey et.al. | 2405.02762 | null |
2024-05-04 | ActiveNeuS: Active 3D Reconstruction using Neural Implicit Surface Uncertainty | Hyunseo Kim et.al. | 2405.02568 | null |
2024-05-03 | Learning Robot Soccer from Egocentric Vision with Deep Reinforcement Learning | Dhruva Tirumala et.al. | 2405.02425 | null |
2024-05-03 | Rip-NeRF: Anti-aliasing Radiance Fields with Ripmap-Encoded Platonic Solids | Junchen Liu et.al. | 2405.02386 | link |
2024-05-03 | WateRF: Robust Watermarks in Radiance Fields for Protection of Copyrights | Youngdong Jang et.al. | 2405.02066 | null |
2024-05-02 | NeRF in Robotics: A Survey | Guangming Wang et.al. | 2405.01333 | null |
2024-05-04 | LidaRF: Delving into Lidar for Neural Radiance Field on Street Scenes | Shanlin Sun et.al. | 2405.00900 | null |
2024-05-01 | Depth Priors in Removal Neural Radiance Fields | Zhihao Guo et.al. | 2405.00630 | null |
2024-05-01 | NeRF-Guided Unsupervised Learning of RGB-D Registration | Zhinan Yu et.al. | 2405.00507 | null |
2024-05-01 | RTG-SLAM: Real-time 3D Reconstruction at Scale using Gaussian Splatting | Zhexi Peng et.al. | 2404.19706 | null |
2024-04-30 | NeRF-Insert: 3D Local Editing with Multimodal Control Signals | Benet Oriol Sabat et.al. | 2404.19204 | null |
2024-04-29 | SAGS: Structure-Aware 3D Gaussian Splatting | Evangelos Ververas et.al. | 2404.19149 | null |
2024-04-29 | GSTalker: Real-time Audio-Driven Talking Face Generation via Deformable Gaussian Splatting | Bo Chen et.al. | 2404.19040 | null |
2024-04-29 | Embedded Representation Learning Network for Animating Styled Video Portrait | Tianyong Wang et.al. | 2404.19038 | null |
2024-04-29 | Simple-RF: Regularizing Sparse Input Radiance Fields with Simpler Solutions | Nagabhushan Somraj et.al. | 2404.19015 | null |
2024-04-28 | S3-SLAM: Sparse Tri-plane Encoding for Neural Implicit SLAM | Zhiyao Zhang et.al. | 2404.18284 | null |
2024-04-27 | DPER: Diffusion Prior Driven Neural Representation for Limited Angle and Sparse View CT Reconstruction | Chenhe Du et.al. | 2404.17890 | null |
2024-04-26 | Geometry-aware Reconstruction and Fusion-refined Rendering for Generalizable Neural Radiance Fields | Tianqi Liu et.al. | 2404.17528 | link |
2024-04-25 | Depth Supervised Neural Surface Reconstruction from Airborne Imagery | Vincent Hackstein et.al. | 2404.16429 | null |
2024-04-24 | NeRF-XL: Scaling NeRFs with Multiple GPUs | Ruilong Li et.al. | 2404.16221 | null |
2024-04-24 | ESR-NeRF: Emissive Source Reconstruction Using LDR Multi-view Images | Jinseo Jeong et.al. | 2404.15707 | null |
2024-04-23 | DreamCraft: Text-Guided Generation of Functional 3D Environments in Minecraft | Sam Earle et.al. | 2404.15538 | null |
2024-04-28 | GaussianTalker: Speaker-specific Talking Head Synthesis via 3D Gaussian Splatting | Hongyun Yu et.al. | 2404.14037 | null |
2024-04-22 | NeRF-DetS: Enhancing Multi-View 3D Object Detection with Sampling-adaptive Network of Continuous NeRF-based Representation | Chi Huang et.al. | 2404.13921 | null |
2024-04-23 | CT-NeRF: Incremental Optimizing Neural Radiance Field and Poses with Complex Trajectory | Yunlong Ran et.al. | 2404.13896 | null |
2024-04-26 | Neural Radiance Field in Autonomous Driving: A Survey | Lei He et.al. | 2404.13816 | null |
2024-04-26 | ArtNeRF: A Stylized Neural Field for 3D-Aware Cartoonized Face Synthesis | Zichen Tang et.al. | 2404.13711 | link |
2024-04-21 | Generalizable Novel-View Synthesis using a Stereo Camera | Haechan Lee et.al. | 2404.13541 | null |
2024-04-20 | High-fidelity Endoscopic Image Synthesis by Utilizing Depth-guided Neural Surfaces | Baoru Huang et.al. | 2404.13437 | null |
2024-04-20 | EC-SLAM: Real-time Dense Neural RGB-D SLAM System with Effectively Constrained Global Bundle Adjustment | Guanghao Li et.al. | 2404.13346 | link |
2024-04-19 | FlyNeRF: NeRF-Based Aerial Mapping for High-Quality 3D Scene Reconstruction | Maria Dronova et.al. | 2404.12970 | null |
2024-04-22 | Does Gaussian Splatting need SFM Initialization? | Yalda Foroutan et.al. | 2404.12547 | null |
2024-04-18 | MeshLRM: Large Reconstruction Model for High-Quality Mesh | Xinyue Wei et.al. | 2404.12385 | null |
2024-04-18 | AG-NeRF: Attention-guided Neural Radiance Fields for Multi-height Large-scale Outdoor Scene Rendering | Jingfeng Guo et.al. | 2404.11897 | link |
2024-04-18 | Cicero: Addressing Algorithmic and Architectural Bottlenecks in Neural Rendering by Radiance Warping and Memory Optimizations | Yu Feng et.al. | 2404.11852 | null |
2024-04-17 | SLAIM: Robust Dense Neural SLAM for Online Tracking and Mapping | Vincent Cartillier et.al. | 2404.11419 | null |
2024-04-16 | Gaussian Splatting Decoder for 3D-aware Generative Adversarial Networks | Florian Barthel et.al. | 2404.10625 | null |
2024-04-16 | Enhancing 3D Fidelity of Text-to-3D using Cross-View Correspondences | Seungwook Kim et.al. | 2404.10603 | null |
2024-04-16 | 1st Place Solution for ICCV 2023 OmniObject3D Challenge: Sparse-View Reconstruction | Hang Du et.al. | 2404.10441 | null |
2024-04-16 | SRGS: Super-Resolution 3D Gaussian Splatting | Xiang Feng et.al. | 2404.10318 | link |
2024-04-16 | Plug-and-Play Acceleration of Occupancy Grid-based NeRF Rendering using VDB Grid and Hierarchical Ray Traversal | Yoshio Kato et.al. | 2404.10272 | link |
2024-04-15 | Taming Latent Diffusion Model for Neural Radiance Field Inpainting | Chieh Hubert Lin et.al. | 2404.09995 | null |
2024-04-15 | Video2Game: Real-time, Interactive, Realistic and Browser-Compatible Environment from a Single Video | Hongchi Xia et.al. | 2404.09833 | null |
2024-04-15 | DeferredGS: Decoupled and Editable Gaussian Splatting with Deferred Shading | Tong Wu et.al. | 2404.09412 | null |
2024-04-14 | VRS-NeRF: Visual Relocalization with Sparse Neural Radiance Field | Fei Xue et.al. | 2404.09271 | link |
2024-04-15 | OccGaussian: 3D Gaussian Splatting for Occluded Human Rendering | Jingrui Ye et.al. | 2404.08449 | null |
2024-04-12 | GPN: Generative Point-based NeRF | Haipeng Wang et.al. | 2404.08312 | link |
2024-04-12 | MonoPatchNeRF: Improving Neural Radiance Fields with Patch-based Monocular Guidance | Yuqun Wu et.al. | 2404.08252 | null |
2024-04-11 | Connecting NeRFs, Images, and Text | Francesco Ballerini et.al. | 2404.07993 | null |
2024-04-11 | Boosting Self-Supervision for Single-View Scene Completion via Knowledge Distillation | Keonhee Han et.al. | 2404.07933 | link |
2024-04-12 | NeuroNCAP: Photorealistic Closed-loop Safety Testing for Autonomous Driving | William Ljungbergh et.al. | 2404.07762 | link |
2024-04-11 | G-NeRF: Geometry-enhanced Novel View Synthesis from Single-View Images | Zixiong Huang et.al. | 2404.07474 | link |
2024-04-10 | SplatPose & Detect: Pose-Agnostic 3D Anomaly Detection | Mathis Kruse et.al. | 2404.06832 | link |
2024-04-10 | MonoSelfRecon: Purely Self-Supervised Explicit Generalizable 3D Reconstruction of Indoor Scenes from Monocular RGB Views | Runfa Li et.al. | 2404.06753 | null |
2024-04-10 | Bayesian NeRF: Quantifying Uncertainty with Volume Density in Neural Radiance Fields | Sibeak Lee et.al. | 2404.06727 | link |
2024-04-11 | SpikeNVS: Enhancing Novel View Synthesis from Blurry Images via Spike Camera | Gaole Dai et.al. | 2404.06710 | null |
2024-04-09 | Magic-Boost: Boost 3D Generation with Mutli-View Conditioned Diffusion | Fan Yang et.al. | 2404.06429 | null |
2024-04-09 | 3D Geometry-aware Deformable Gaussian Splatting for Dynamic View Synthesis | Zhicheng Lu et.al. | 2404.06270 | null |
2024-04-09 | GHNeRF: Learning Generalizable Human Features with Efficient Neural Radiance Fields | Arnab Dey et.al. | 2404.06246 | null |
2024-04-09 | HFNeRF: Learning Human Biomechanic Features with Neural Radiance Fields | Arnab Dey et.al. | 2404.06152 | null |
2024-04-08 | Stylizing Sparse-View 3D Scenes with Hierarchical Neural Representation | Y. Wang et.al. | 2404.05236 | null |
2024-04-08 | StylizedGS: Controllable Stylization for 3D Gaussian Splatting | Dingxi Zhang et.al. | 2404.05220 | null |
2024-04-08 | Semantic Flow: Learning Semantic Field of Dynamic Scenes from Monocular Videos | Fengrui Tian et.al. | 2404.05163 | link |
2024-04-07 | CodecNeRF: Toward Fast Encoding and Decoding, Compact, and High-quality Novel-view Synthesis | Gyeongjin Kang et.al. | 2404.04913 | null |
2024-04-07 | GauU-Scene V2: Expanse Lidar Image Dataset Shows Unreliable Geometric Reconstruction Using Gaussian Splatting and NeRF | Butian Xiong et.al. | 2404.04880 | null |
2024-04-07 | NeRF2Points: Large-Scale Point Cloud Generation From Street Views’ Radiance Field Optimization | Peng Tu et.al. | 2404.04875 | null |
2024-04-06 | DATENeRF: Depth-Aware Text-based Editing of NeRFs | Sara Rojas et.al. | 2404.04526 | null |
2024-04-05 | Robust Gaussian Splatting | François Darmon et.al. | 2404.04211 | null |
2024-04-04 | SC4D: Sparse-Controlled Video-to-4D Generation and Motion Transfer | Zijie Wu et.al. | 2404.03736 | link |
2024-04-07 | RaFE: Generative Radiance Fields Restoration | Zhongkai Wu et.al. | 2404.03654 | null |
2024-04-04 | OpenNeRF: Open Set 3D Neural Scene Segmentation with Pixel-Wise Features and Rendered Novel Views | Francis Engelmann et.al. | 2404.03650 | null |
2024-04-04 | VF-NeRF: Viewshed Fields for Rigid NeRF Registration | Leo Segre et.al. | 2404.03349 | null |
2024-04-03 | GenN2N: Generative NeRF2NeRF Translation | Xiangyue Liu et.al. | 2404.02788 | null |
2024-04-03 | LiDAR4D: Dynamic Neural Fields for Novel Space-time View LiDAR Synthesis | Zehan Zheng et.al. | 2404.02742 | link |
2024-04-03 | Neural Radiance Fields with Torch Units | Bingnan Ni et.al. | 2404.02617 | null |
2024-04-03 | Freditor: High-Fidelity and Transferable NeRF Editing by Frequency Decomposition | Yisheng He et.al. | 2404.02514 | null |
2024-04-02 | NeRFCodec: Neural Feature Compression Meets Neural Radiance Fields for Memory-Efficient Scene Representation | Sicheng Li et.al. | 2404.02185 | null |
2024-04-02 | Alpha Invariance: On Inverse Scaling Between Distance and Volume Density in Neural Radiance Fields | Joshua Ahn et.al. | 2404.02155 | null |
2024-04-02 | Uncertainty-aware Active Learning of NeRF-based Object Models for Robot Manipulators using Visual and Re-orientation Actions | Saptarshi Dasgupta et.al. | 2404.01812 | null |
2024-04-01 | NVINS: Robust Visual Inertial Navigation Fused with NeRF-augmented Camera Pose Regressor and Uncertainty Quantification | Juyeop Han et.al. | 2404.01400 | null |
2024-04-01 | NeRF-MAE : Masked AutoEncoders for Self Supervised 3D representation Learning for Neural Radiance Fields | Muhammad Zubair Irshad et.al. | 2404.01300 | link |
2024-04-01 | MagicMirror: Fast and High-Quality Avatar Generation with a Constrained Search Space | Armand Comas-Massagué et.al. | 2404.01296 | null |
2024-04-02 | StructLDM: Structured Latent Diffusion for 3D Human Generation | Tao Hu et.al. | 2404.01241 | null |
2024-04-01 | Mirror-3DGS: Incorporating Mirror Reflections into 3D Gaussian Splatting | Jiarui Meng et.al. | 2404.01168 | null |
2024-04-01 | SGCNeRF: Few-Shot Neural Rendering via Sparse Geometric Consistency Guidance | Yuru Xiao et.al. | 2404.00992 | null |
2024-04-01 | FlexiDreamer: Single Image-to-3D Generation with FlexiCubes | Ruowen Zhao et.al. | 2404.00987 | link |
2024-04-01 | Marrying NeRF with Feature Matching for One-step Pose Estimation | Ronghan Chen et.al. | 2404.00891 | null |
2024-03-29 | HGS-Mapping: Online Dense Mapping Using Hybrid Gaussian Representation in Urban Scenes | Ke Wu et.al. | 2403.20159 | null |
2024-03-29 | Talk3D: High-Fidelity Talking Portrait Synthesis via Personalized 3D Generative Prior | Jaehoon Ko et.al. | 2403.20153 | link |
2024-03-29 | SGD: Street View Synthesis with Gaussian Splatting and Diffusion Prior | Zhongrui Yu et.al. | 2403.20079 | null |
2024-03-29 | NeSLAM: Neural Implicit Mapping and Self-Supervised Feature Tracking With Depth Completion and Denoising | Tianchen Deng et.al. | 2403.20034 | link |
2024-03-29 | SCINeRF: Neural Radiance Fields from a Snapshot Compressive Image | Yunhao Li et.al. | 2403.20018 | link |
2024-03-29 | DerainNeRF: 3D Scene Estimation with Adhesive Waterdrop Removal | Yunhao Li et.al. | 2403.20013 | link |
2024-03-29 | Stable Surface Regularization for Fast Few-Shot NeRF | Byeongin Joung et.al. | 2403.19985 | null |
2024-03-29 | MI-NeRF: Learning a Single Face NeRF from Multiple Identities | Aggelina Chatziagapi et.al. | 2403.19920 | null |
2024-03-28 | Mitigating Motion Blur in Neural Radiance Fields with Events and Frames | Marco Cannici et.al. | 2403.19780 | link |
2024-03-28 | SAID-NeRF: Segmentation-AIDed NeRF for Depth Completion of Transparent Objects | Avinash Ummadisingu et.al. | 2403.19607 | null |
2024-03-28 | CoherentGS: Sparse Novel View Synthesis with Coherent 3D Gaussians | Avinash Paliwal et.al. | 2403.19495 | link |
2024-03-28 | Mesh2NeRF: Direct Mesh Supervision for Neural Radiance Field Representation and Generation | Yujin Chen et.al. | 2403.19319 | null |
2024-03-28 | Sine Activated Low-Rank Matrices for Parameter Efficient Learning | Yiping Ji et.al. | 2403.19243 | null |
2024-03-29 | Gamba: Marry Gaussian Splatting with Mamba for single view 3D reconstruction | Qiuhong Shen et.al. | 2403.18795 | link |
2024-03-27 | SAT-NGP : Unleashing Neural Graphics Primitives for Fast Relightable Transient-Free 3D reconstruction from Satellite Imagery | Camille Billouard et.al. | 2403.18711 | link |
2024-03-27 | Modeling uncertainty for Gaussian Splatting | Luca Savant et.al. | 2403.18476 | null |
2024-03-26 | Octree-GS: Towards Consistent Real-time Rendering with LOD-Structured 3D Gaussians | Kerui Ren et.al. | 2403.17898 | link |
2024-03-26 | NeRF-HuGS: Improved Neural Radiance Fields in Non-static Scenes Using Heuristics-Guided Segmentation | Jiahao Chen et.al. | 2403.17537 | null |
2024-03-25 | VP3D: Unleashing 2D Visual Prompt for Text-to-3D Generation | Yang Chen et.al. | 2403.17001 | null |
2024-03-25 | CVT-xRF: Contrastive In-Voxel Transformer for 3D Consistent Radiance Fields from Sparse Inputs | Yingji Zhong et.al. | 2403.16885 | null |
2024-03-25 | Spike-NeRF: Neural Radiance Field Based On Spike Camera | Yijia Guo et.al. | 2403.16410 | null |
2024-03-24 | Inverse Rendering of Glossy Objects via the Neural Plenoptic Function and Radiance Fields | Haoyuan Wang et.al. | 2403.16224 | null |
2024-03-24 | Entity-NeRF: Detecting and Removing Moving Entities in Urban Scenes | Takashi Otonari et.al. | 2403.16141 | null |
2024-03-24 | CG-SLAM: Efficient Dense RGB-D SLAM in a Consistent Uncertainty-aware 3D Gaussian Field | Jiarui Hu et.al. | 2403.16095 | null |
2024-03-24 | Are NeRFs ready for autonomous driving? Towards closing the real-to-simulation gap | Carl Lindström et.al. | 2403.16092 | null |
2024-03-26 | PKU-DyMVHumans: A Multi-View Video Benchmark for High-Fidelity Dynamic Human Modeling | Xiaoyun Zheng et.al. | 2403.16080 | link |
2024-03-24 | Semantic Is Enough: Only Semantic Information For NeRF Reconstruction | Ruibo Wang et.al. | 2403.16043 | null |
2024-03-24 | Exploring Accurate 3D Phenotyping in Greenhouse through Neural Radiance Fields | unhong Zhao et.al. | 2403.15981 | null |
2024-03-23 | DriveEnv-NeRF: Exploration of A NeRF-Based Autonomous Driving Environment for Real-World Performance Validation | Mu-Yi Shen et.al. | 2403.15791 | link |
2024-03-23 | UPNeRF: A Unified Framework for Monocular 3D Object Reconstruction and Pose Estimation | Yuliang Guo et.al. | 2403.15705 | link |
2024-03-22 | WSCLoc: Weakly-Supervised Sparse-View Camera Relocalization | Jialu Wang et.al. | 2403.15272 | null |
2024-03-21 | Hyperspectral Neural Radiance Fields | Gerry Chen et.al. | 2403.14839 | null |
2024-03-21 | ClusteringSDF: Self-Organized Neural Implicit Surfaces for 3D Decomposition | Tianhao Wu et.al. | 2403.14619 | null |
2024-03-21 | CombiNeRF: A Combination of Regularization Techniques for Few-Shot Neural Radiance Field View Synthesis | Matteo Bonotto et.al. | 2403.14412 | link |
2024-03-21 | InfNeRF: Towards Infinite Scale NeRF Rendering with O(log n) Space Complexity | Jiabin Liang et.al. | 2403.14376 | null |
2024-03-21 | Leveraging Thermal Modality to Enhance Reconstruction in Low-Light Conditions | Jiacong Xu et.al. | 2403.14053 | link |
2024-03-20 | MULAN-WC: Multi-Robot Localization Uncertainty-aware Active NeRF with Wireless Coordination | Weiying Wang et.al. | 2403.13348 | null |
2024-03-19 | Depth-guided NeRF Training via Earth Mover’s Distance | Anita Rau et.al. | 2403.13206 | null |
2024-03-19 | DecentNeRFs: Decentralized Neural Radiance Fields from Crowdsourced Images | Zaid Tasneem et.al. | 2403.13199 | null |
2024-03-19 | Global-guided Focal Neural Radiance Field for Large-scale Scene Rendering | Mingqi Shao et.al. | 2403.12839 | null |
2024-03-19 | Learning Neural Volumetric Pose Features for Camera Localization | Jingyu Lin et.al. | 2403.12800 | null |
2024-03-19 | IFFNeRF: Initialisation Free and Fast 6DoF pose estimation from a single image and a NeRF model | Matteo Bortolon et.al. | 2403.12682 | null |
2024-03-18 | FLex: Joint Pose and Dynamic Radiance Fields Optimization for Stereo Endoscopic Videos | Florian Philipp Stilz et.al. | 2403.12198 | null |
2024-03-18 | ThermoNeRF: Multimodal Neural Radiance Fields for Thermal Novel View Synthesis | Mariam Hassan et.al. | 2403.12154 | link |
2024-03-18 | RoGUENeRF: A Robust Geometry-Consistent Universal Enhancer for NeRF | Sibi Catley-Chandar et.al. | 2403.11909 | null |
2024-03-18 | GNeRP: Gaussian-guided Neural Reconstruction of Reflective Objects with Noisy Polarization Priors | LI Yang et.al. | 2403.11899 | null |
2024-03-18 | Exploring Multi-modal Neural Scene Representations With Applications on Thermal Imaging | Mert Özer et.al. | 2403.11865 | null |
2024-03-19 | BAD-Gaussians: Bundle Adjusted Deblur Gaussian Splatting | Lingzhe Zhao et.al. | 2403.11831 | link |
2024-03-18 | Aerial Lifting: Neural Urban Semantic and Building Instance Lifting from Aerial Imagery | Yuqi Zhang et.al. | 2403.11812 | link |
2024-03-18 | DVN-SLAM: Dynamic Visual Neural SLAM Based on Local-Global Encoding | Wenhua Wu et.al. | 2403.11776 | null |
2024-03-18 | Exploring 3D-aware Latent Spaces for Efficiently Learning Numerous Scenes | Antoine Schnepf et.al. | 2403.11678 | null |
2024-03-18 | UV Gaussians: Joint Learning of Mesh Deformation and Gaussian Textures for Human Avatar Modeling | Yujiao Jiang et.al. | 2403.11589 | null |
2024-03-18 | Just Add $100 More: Augmenting NeRF-based Pseudo-LiDAR Point Cloud for Resolving Class-imbalance Problem | Mincheol Chang et.al. | 2403.11573 | null |
2024-03-17 | Creating Seamless 3D Maps Using Radiance Fields | Sai Tarun Sathyan et.al. | 2403.11364 | null |
2024-03-17 | SpikeNeRF: Learning Neural Radiance Fields from Continuous Spike Stream | Lin Zhu et.al. | 2403.11222 | link |
2024-03-17 | Recent Advances in 3D Gaussian Splatting | Tong Wu et.al. | 2403.11134 | null |
2024-03-17 | Omni-Recon: Towards General-Purpose Neural Radiance Fields for Versatile 3D Applications | Yonggan Fu et.al. | 2403.11131 | link |
2024-03-16 | Fast Sparse View Guided NeRF Update for Object Reconfigurations | Ziqi Lu et.al. | 2403.11024 | null |
2024-03-16 | HourglassNeRF: Casting an Hourglass as a Bundle of Rays for Few-shot Neural Rendering | Seunghyeon Seo et.al. | 2403.10906 | null |
2024-03-15 | FeatUp: A Model-Agnostic Framework for Features at Any Resolution | Stephanie Fu et.al. | 2403.10516 | link |
2024-03-15 | Thermal-NeRF: Neural Radiance Fields from an Infrared Camera | Tianxiang Ye et.al. | 2403.10340 | link |
2024-03-15 | Leveraging Neural Radiance Field in Descriptor Synthesis for Keypoints Scene Coordinate Regression | Huy-Hoang Bui et.al. | 2403.10297 | link |
2024-03-15 | GGRt: Towards Generalizable 3D Gaussians without Pose Priors in Real-Time | Hao Li et.al. | 2403.10147 | null |
2024-03-15 | URS-NeRF: Unordered Rolling Shutter Bundle Adjustment for Neural Radiance Fields | Bo Xu et.al. | 2403.10119 | null |
2024-03-15 | DyBluRF: Dynamic Neural Radiance Fields from Blurry Monocular Video | Huiqiang Sun et.al. | 2403.10103 | null |
2024-03-15 | Den-SOFT: Dense Space-Oriented Light Field DataseT for 6-DOF Immersive Experience | Xiaohang Yu et.al. | 2403.09973 | null |
2024-03-14 | GaussianGrasper: 3D Language Gaussian Splatting for Open-vocabulary Robotic Grasping | Yuhang Zheng et.al. | 2403.09637 | link |
2024-03-14 | The NeRFect Match: Exploring NeRF Features for Visual Localization | Qunjie Zhou et.al. | 2403.09577 | null |
2024-03-14 | VIRUS-NeRF – Vision, InfraRed and UltraSonic based Neural Radiance Fields | Nicolaj Schmid et.al. | 2403.09477 | link |
2024-03-14 | 3D-SceneDreamer: Text-Driven 3D-Consistent Scene Generation | Frank Zhang et.al. | 2403.09439 | null |
2024-03-14 | RoDUS: Robust Decomposition of Static and Dynamic Elements in Urban Scenes | Thang-Anh-Quan Nguyen et.al. | 2403.09419 | null |
2024-03-14 | PreSight: Enhancing Autonomous Vehicle Perception with City-Scale NeRF Priors | Tianyuan Yuan et.al. | 2403.09079 | link |
2024-03-13 | Gaussian Splatting in Style | Abhishek Saroha et.al. | 2403.08498 | null |
2024-03-13 | StyleDyRF: Zero-shot 4D Style Transfer for Dynamic Neural Radiance Fields | Hongbin Xu et.al. | 2403.08310 | link |
2024-03-13 | NeRF-Supervised Feature Point Detection and Description | Ali Youssef et.al. | 2403.08156 | link |
2024-03-12 | Q-SLAM: Quadric Representations for Monocular SLAM | Chensheng Peng et.al. | 2403.08125 | null |
2024-03-12 | SMURF: Continuous Dynamics for Motion-Deblurring Radiance Fields | Jungho Lee et.al. | 2403.07547 | link |
2024-03-11 | SiLVR: Scalable Lidar-Visual Reconstruction with Neural Radiance Fields for Robotic Inspection | Yifu Tao et.al. | 2403.06877 | null |
2024-03-11 | Vosh: Voxel-Mesh Hybrid Representation for Real-Time View Synthesis | Chenhao Zhang et.al. | 2403.06505 | null |
2024-03-13 | FSViewFusion: Few-Shots View Generation of Novel Objects | Rukhshanda Hussain et.al. | 2403.06394 | null |
2024-03-10 | Is Vanilla MLP in Neural Radiance Field Enough for Few-shot View Synthesis? | Hanxin Zhu et.al. | 2403.06092 | null |
2024-03-09 | Lightning NeRF: Efficient Hybrid Scene Representation for Autonomous Driving | Junyi Cao et.al. | 2403.05907 | link |
2024-03-09 | Large Generative Model Assisted 3D Semantic Communication | Feibo Jiang et.al. | 2403.05783 | null |
2024-03-08 | GSEdit: Efficient Text-Guided Editing of 3D Objects via Gaussian Splatting | Francesco Palandra et.al. | 2403.05154 | null |
2024-03-08 | Finding Waldo: Towards Efficient Exploration of NeRF Scene Spaces | Evangelos Skartados et.al. | 2403.04508 | null |
2024-03-07 | Radiative Gaussian Splatting for Efficient X-ray Novel View Synthesis | Yuanhao Cai et.al. | 2403.04116 | link |
2024-03-08 | DNAct: Diffusion Guided Multi-Task 3D Policy Learning | Ge Yan et.al. | 2403.04115 | null |
2024-03-07 | Closing the Visual Sim-to-Real Gap with Object-Composable NeRFs | Nikhil Mishra et.al. | 2403.04114 | link |
2024-03-06 | GSNeRF: Generalizable Semantic Neural Radiance Fields with Enhanced 3D Scene Understanding | Zi-Ting Chou et.al. | 2403.03608 | null |
2024-03-05 | A Deep Learning Framework for Wireless Radiation Field Reconstruction and Channel Prediction | Haofan Lu et.al. | 2403.03241 | null |
2024-03-05 | Splat-Nav: Safe Real-Time Robot Navigation in Gaussian Splatting Maps | Timothy Chen et.al. | 2403.02751 | null |
2024-03-04 | DaReNeRF: Direction-aware Representation for Dynamic Scenes | Ange Lou et.al. | 2403.02265 | null |
2024-03-04 | Depth-Guided Robust and Fast Point Cloud Fusion NeRF for Sparse Input Views | Shuai Guo et.al. | 2403.02063 | null |
2024-03-02 | NeRF-VPT: Learning Novel View Representations with Neural Radiance Fields via View Prompt Tuning | Linsheng Chen et.al. | 2403.01325 | link |
2024-03-02 | Neural radiance fields-based holography [Invited] | Minsung Kang et.al. | 2403.01137 | null |
2024-03-02 | Neural Field Classifiers via Target Encoding and Classification Loss | Xindi Yang et.al. | 2403.01058 | null |
2024-03-01 | DISORF: A Distributed Online NeRF Training and Rendering Framework for Mobile Robots | Chunlin Li et.al. | 2403.00228 | link |
2024-02-28 | NToP: NeRF-Powered Large-scale Dataset Generation for 2D and 3D Human Pose Estimation in Top-View Fisheye Images | Jingrui Yu et.al. | 2402.18196 | link |
2024-02-26 | Neural Radiance Fields in Medical Imaging: Challenges and Next Steps | Xin Wang et.al. | 2402.17797 | null |
2024-02-27 | Diffusion Meets DAgger: Supercharging Eye-in-hand Imitation Learning | Xiaoyu Zhang et.al. | 2402.17768 | null |
2024-02-27 | VastGaussian: Vast 3D Gaussians for Large Scene Reconstruction | Jiaqi Lin et.al. | 2402.17427 | null |
2024-02-27 | Learning Dynamic Tetrahedra for High-Quality Talking Head Synthesis | Zicheng Zhang et.al. | 2402.17364 | link |
2024-02-27 | DivAvatar: Diverse 3D Avatar Generation with a Single Prompt | Weijing Tao et.al. | 2402.17292 | null |
2024-02-27 | CharNeRF: 3D Character Generation from Concept Art | Eddy Chu et.al. | 2402.17115 | null |
2024-02-26 | Disentangled 3D Scene Generation with Layout Learning | Dave Epstein et.al. | 2402.16936 | null |
2024-02-26 | CMC: Few-shot Novel View Synthesis via Cross-view Multiplane Consistency | Hanxin Zhu et.al. | 2402.16407 | null |
2024-02-26 | SPC-NeRF: Spatial Predictive Compression for Voxel Based Radiance Field | Zetian Song et.al. | 2402.16366 | null |
2024-02-26 | DreamUp3D: Object-Centric Generative Models for Single-View 3D Scene Understanding and Real-to-Sim Transfer | Yizhe Wu et.al. | 2402.16308 | null |
2024-02-22 | Consolidating Attention Features for Multi-view Image Editing | Or Patashnik et.al. | 2402.14792 | null |
2024-02-26 | FrameNeRF: A Simple and Efficient Framework for Few-shot Novel View Synthesis | Yan Xing et.al. | 2402.14586 | null |
2024-02-22 | NeRF-Det++: Incorporating Semantic Cues and Perspective-aware Depth Supervision for Indoor Multi-View 3D Detection | Chenxi Huang et.al. | 2402.14464 | link |
2024-02-22 | TaylorGrid: Towards Fast and High-Quality Implicit Field Learning via Direct Taylor-based Grid Optimization | Renyi Mao et.al. | 2402.14415 | null |
2024-02-22 | Mip-Grid: Anti-aliased Grid Representations for Neural Radiance Fields | Seungtae Nam et.al. | 2402.14196 | null |
2024-02-21 | Identifying Unnecessary 3D Gaussians using Clustering for Fast Rendering of 3D Gaussian Splatting | Joongho Jo et.al. | 2402.13827 | null |
2024-02-21 | SealD-NeRF: Interactive Pixel-Level Editing for Dynamic Scenes by Neural Radiance Fields | Zhentao Huang et.al. | 2402.13510 | null |
2024-02-20 | How NeRFs and 3D Gaussian Splatting are Reshaping SLAM: a Survey | Fabio Tosi et.al. | 2402.13255 | link |
2024-02-20 | Improving Robustness for Joint Optimization of Camera Poses and Decomposed Low-Rank Tensorial Radiance Fields | Bo-Yu Cheng et.al. | 2402.13252 | link |
2024-02-20 | NeRF Solves Undersampled MRI Reconstruction | Tae Jun Jang et.al. | 2402.13226 | null |
2024-02-20 | OccFlowNet: Towards Self-supervised Occupancy Estimation via Differentiable Rendering and Occupancy Flow | Simon Boeder et.al. | 2402.12792 | null |
2024-02-19 | Binary Opacity Grids: Capturing Fine Geometric Detail for Mesh-Based View Synthesis | Christian Reiser et.al. | 2402.12377 | null |
2024-02-19 | Colorizing Monochromatic Radiance Fields | Yean Cheng et.al. | 2402.12184 | null |
2024-02-17 | Semantically-aware Neural Radiance Fields for Visual Scene Understanding: A Comprehensive Review | Thang-Anh-Quan Nguyen et.al. | 2402.11141 | link |
2024-02-15 | Evaluating NeRFs for 3D Plant Geometry Reconstruction in Field Conditions | Muhammad Arbab Arshad et.al. | 2402.10344 | null |
2024-02-14 | PC-NeRF: Parent-Child Neural Radiance Fields Using Sparse LiDAR Frames in Autonomous Driving Environments | Xiuzhong Hu et.al. | 2402.09325 | link |
2024-02-13 | Preconditioners for the Stochastic Training of Implicit Neural Representations | Shin-Fang Chng et.al. | 2402.08784 | null |
2024-02-13 | NeRF Analogies: Example-Based Visual Attribute Transfer for NeRFs | Michael Fischer et.al. | 2402.08622 | null |
2024-02-13 | H2O-SDF: Two-phase Learning for 3D Indoor Reconstruction using Object Surface Fields | Minyoung Park et.al. | 2402.08138 | null |
2024-02-12 | DeformNet: Latent Space Modeling and Dynamics Prediction for Deformable Object Manipulation | Chenchang Li et.al. | 2402.07648 | null |
2024-02-11 | BioNeRF: Biologically Plausible Neural Radiance Fields for View Synthesis | Leandro A. Passos et.al. | 2402.07310 | link |
2024-02-11 | 3D Gaussian as a New Vision Era: A Survey | Ben Fei et.al. | 2402.07181 | null |
2024-02-09 | ImplicitDeepfake: Plausible Face-Swapping through Implicit Deepfake Generation using NeRF and Gaussian Splatting | Georgii Stanishevskii et.al. | 2402.06390 | link |
2024-02-07 | NeRF as Non-Distant Environment Emitter in Physics-based Inverse Rendering | Jingwang Ling et.al. | 2402.04829 | null |
2024-02-07 | OV-NeRF: Open-vocabulary Neural Radiance Fields with Vision and Language Foundation Models for 3D Semantic Understanding | Guibiao Liao et.al. | 2402.04648 | link |
2024-02-11 | BirdNeRF: Fast Neural Reconstruction of Large-Scale Scenes From Aerial Imagery | Huiqing Zhang et.al. | 2402.04554 | null |
2024-02-06 | Improved Generalization of Weight Space Networks via Augmentations | Aviv Shamsian et.al. | 2402.04081 | link |
2024-02-05 | ViewFusion: Learning Composable Diffusion Models for Novel View Synthesis | Bernard Spiegl et.al. | 2402.02906 | link |
2024-02-02 | ConRF: Zero-shot Stylization of 3D Scenes with Conditioned Radiation Fields | Xingyu Miao et.al. | 2402.01950 | link |
2024-02-02 | Robust Inverse Graphics via Probabilistic Inference | Tuan Anh Le et.al. | 2402.01915 | link |
2024-02-02 | HyperPlanes: Hypernetwork Approach to Rapid NeRF Adaptation | Paweł Batorski et.al. | 2402.01524 | link |
2024-02-02 | Di-NeRF: Distributed NeRF for Collaborative Learning with Unknown Relative Poses | Mahboubeh Asadi et.al. | 2402.01485 | null |
2024-02-06 | GaMeS: Mesh-Based Adapting and Modification of Gaussian Splatting | Joanna Waczyńska et.al. | 2402.01459 | link |
2024-02-02 | Efficient Dynamic-NeRF Based Volumetric Video Coding with Rate Distortion Optimization | Zhiyu Zhang et.al. | 2402.01380 | null |
2024-02-06 | Taming Uncertainty in Sparse-view Generalizable NeRF via Indirect Diffusion Guidance | Yaokun Li et.al. | 2402.01217 | null |
2024-02-01 | ViCA-NeRF: View-Consistency-Aware 3D Editing of Neural Radiance Fields | Jiahua Dong et.al. | 2402.00864 | link |
2024-02-01 | Emo-Avatar: Efficient Monocular Video Style Avatar through Texture Rendering | Pinxin Liu et.al. | 2402.00827 | link |
2024-01-31 | CARFF: Conditional Auto-encoded Radiance Field for 3D Scene Forecasting | Jiezhi Yang et.al. | 2401.18075 | null |
2024-02-01 | Segment Anything in 3D Gaussians | Xu Hu et.al. | 2401.17857 | link |
2024-01-30 | Physical Priors Augmented Event-Based 3D Reconstruction | Jiaxu Wang et.al. | 2401.17121 | link |
2024-01-31 | Endo-4DGS: Endoscopic Monocular Scene Reconstruction with 4D Gaussian Splatting | Yiming Huang et.al. | 2401.16416 | link |
2024-01-29 | Divide and Conquer: Rethinking the Training Paradigm of Neural Radiance Fields | Rongkai Ma et.al. | 2401.16144 | null |
2024-01-26 | 3D Reconstruction and New View Synthesis of Indoor Environments based on a Dual Neural Radiance Field | Zhenyu Bao et.al. | 2401.14726 | link |
2024-01-25 | Learning Robust Generalizable Radiance Field with Visibility and Feature Augmented Point Representation | Jiaxu Wang et.al. | 2401.14354 | null |
2024-01-27 | Sketch2NeRF: Multi-view Sketch-guided Text-to-3D Generation | Minglin Chen et.al. | 2401.14257 | null |
2024-01-24 | EndoGaussians: Single View Dynamic Gaussian Splatting for Deformable Endoscopic Tissues Reconstruction | Yangsen Chen et.al. | 2401.13352 | null |
2024-01-23 | NeRF-AD: Neural Radiance Field with Attention-based Disentanglement for Talking Face Synthesis | Chongke Bi et.al. | 2401.12568 | null |
2024-01-23 | Exploration and Improvement of Nerf-based 3D Scene Editing Techniques | Shun Fang et.al. | 2401.12456 | null |
2024-01-23 | Methods and strategies for improving the novel view synthesis quality of neural radiation field | Shun Fang et.al. | 2401.12451 | null |
2024-01-22 | Single-View 3D Human Digitalization with Large Reconstruction Models | Zhenzhen Weng et.al. | 2401.12175 | null |
2024-01-22 | Scaling Face Interaction Graph Networks to Real World Scenes | Tatiana Lopez-Guevara et.al. | 2401.11985 | null |
2024-01-22 | HG3-NeRF: Hierarchical Geometric, Semantic, and Photometric Guided Neural Radiance Fields for Sparse View Inputs | Zelin Gao et.al. | 2401.11711 | null |
2024-01-23 | IPR-NeRF: Ownership Verification meets Neural Radiance Field | Win Kent Ong et.al. | 2401.09495 | null |
2024-01-17 | ICON: Incremental CONfidence for Joint Pose and Radiance Field Optimization | Weiyao Wang et.al. | 2401.08937 | null |
2024-01-18 | ProvNeRF: Modeling per Point Provenance in NeRFs as a Stochastic Process | Kiyohiro Nakayama et.al. | 2401.08140 | null |
2024-01-16 | Forging Vision Foundation Models for Autonomous Driving: Challenges, Methodologies, and Opportunities | Xu Yan et.al. | 2401.08045 | link |
2024-01-15 | 6-DoF Grasp Pose Evaluation and Optimization via Transfer Learning from NeRFs | Gergely Sóti et.al. | 2401.07935 | null |
2024-01-11 | TriNeRFLet: A Wavelet Based Multiscale Triplane NeRF Representation | Rajaei Khatib et.al. | 2401.06191 | null |
2024-01-11 | Fast High Dynamic Range Radiance Fields for Dynamic Scenes | Guanjun Wu et.al. | 2401.06052 | null |
2024-01-11 | CoSSegGaussians: Compact and Swift Scene Segmenting 3D Gaussians | Bin Dou et.al. | 2401.05925 | null |
2024-01-11 | GO-NeRF: Generating Virtual Objects in Neural Radiance Fields | Peng Dai et.al. | 2401.05750 | null |
2024-01-10 | Diffusion Priors for Dynamic View Synthesis from Monocular Videos | Chaoyang Wang et.al. | 2401.05583 | null |
2024-01-10 | InseRF: Text-Driven Generative Object Insertion in Neural 3D Scenes | Mohamad Shahbazi et.al. | 2401.05335 | null |
2024-01-10 | CTNeRF: Cross-Time Transformer for Dynamic Neural Radiance Field from Monocular Video | Xingyu Miao et.al. | 2401.04861 | link |
2024-01-08 | A Survey on 3D Gaussian Splatting | Guikun Chen et.al. | 2401.03890 | null |
2024-01-08 | NeRFmentation: NeRF-based Augmentation for Monocular Depth Estimation | Casimir Feldmann et.al. | 2401.03771 | null |
2024-01-06 | RustNeRF: Robust Neural Radiance Field with Low-Quality Images | Mengfei Li et.al. | 2401.03257 | null |
2024-01-06 | Hi-Map: Hierarchical Factorized Radiance Field for High-Fidelity Monocular Dense Mapping | Tongyan Hua et.al. | 2401.03203 | null |
2024-01-05 | Progress and Prospects in 3D Generative AI: A Technical Overview including 3D human | Song Bai et.al. | 2401.02620 | null |
2024-01-05 | FED-NeRF: Achieve High 3D Consistency and Temporal Coherence for Face Video Editing on Dynamic NeRF | Hao Zhang et.al. | 2401.02616 | link |
2024-01-05 | Characterizing Satellite Geometry via Accelerated 3D Gaussian Splatting | Van Minh Nguyen et.al. | 2401.02588 | null |
2024-01-03 | SIGNeRF: Scene Integrated Generation for Neural Radiance Fields | Jan-Niklas Dihlmann et.al. | 2401.01647 | null |
2024-01-02 | Street Gaussians for Modeling Dynamic Urban Scenes | Yunzhi Yan et.al. | 2401.01339 | link |
2024-01-02 | Noise-NeRF: Hide Information in Neural Radiance Fields using Trainable Noise | Qinglong Huang et.al. | 2401.01216 | null |
2024-01-02 | 3D Visibility-aware Generalizable Neural Radiance Fields for Interacting Hands | Xuan Huang et.al. | 2401.00979 | link |
2024-01-01 | Sharp-NeRF: Grid-based Fast Deblurring Neural Radiance Fields Using Sharpness Prior | Byeonghyeon Lee et.al. | 2401.00825 | link |
2024-01-02 | GD^2-NeRF: Generative Detail Compensation via GAN and Diffusion for One-shot Generalizable Neural Radiance Fields | Xiao Pan et.al. | 2401.00616 | null |
2023-12-30 | Inpaint4DNeRF: Promptable Spatio-Temporal NeRF Inpainting with Generative Diffusion Models | Han Jiang et.al. | 2401.00208 | null |
2023-12-29 | Informative Rays Selection for Few-Shot Neural Radiance Fields | Marco Orsingher et.al. | 2312.17561 | null |
2023-12-27 | City-on-Web: Real-time Neural Rendering of Large-scale Scenes on the Web | Kaiwen Song et.al. | 2312.16457 | null |
2023-12-26 | DL3DV-10K: A Large-Scale Scene Dataset for Deep Learning-based 3D Vision | Lu Ling et.al. | 2312.16256 | null |
2023-12-24 | SUNDIAL: 3D Satellite Understanding through Direct, Ambient, and Complex Lighting Decomposition | Nikhil Behari et.al. | 2312.16215 | null |
2023-12-23 | INFAMOUS-NeRF: ImproviNg FAce MOdeling Using Semantically-Aligned Hypernetworks with Neural Radiance Fields | Andrew Hou et.al. | 2312.16197 | null |
2023-12-26 | LangSplat: 3D Language Gaussian Splatting | Minghan Qin et.al. | 2312.16084 | link |
2023-12-26 | 2D-Guided 3D Gaussian Segmentation | Kun Lan et.al. | 2312.16047 | null |
2023-12-26 | Pano-NeRF: Synthesizing High Dynamic Range Novel Views with Geometry from Sparse Low Dynamic Range Panoramic Images | Zhan Lu et.al. | 2312.15942 | null |
2023-12-23 | Human101: Training 100+FPS Human Gaussians in 100s from 1 View | Mingwei Li et.al. | 2312.15258 | link |
2023-12-23 | Efficient Deformable Tissue Reconstruction via Orthogonal Neural Plane | Chen Yang et.al. | 2312.15253 | link |
2023-12-23 | CaLDiff: Camera Localization in NeRF via Pose Diffusion | Rashik Shrestha et.al. | 2312.15242 | null |
2023-12-22 | PoseGen: Learning to Generate 3D Human Pose Dataset with NeRF | Mohsen Gholami et.al. | 2312.14915 | link |
2023-12-22 | Density Uncertainty Quantification with NeRF-Ensembles: Impact of Data and Scene Constraints | Miriam Jäger et.al. | 2312.14664 | null |
2023-12-21 | PlatoNeRF: 3D Reconstruction in Plato’s Cave via Single-View Two-Bounce Lidar | Tzofi Klinghoffer et.al. | 2312.14239 | null |
2023-12-21 | Virtual Pets: Animatable Animal Generation in 3D Scenes | Yen-Chi Cheng et.al. | 2312.14154 | null |
2023-12-21 | Carve3D: Improving Multi-view Reconstruction Consistency for Diffusion Models with RL Finetuning | Desai Xie et.al. | 2312.13980 | null |
2023-12-21 | SyncDreamer for 3D Reconstruction of Endangered Animal Species with NeRF and NeuS | Ahmet Haydar Ornek et.al. | 2312.13832 | null |
2023-12-22 | Gaussian Splatting with NeRF-based Color and Opacity | Dawid Malarz et.al. | 2312.13729 | link |
2023-12-21 | DyBluRF: Dynamic Deblurring Neural Radiance Fields for Blurry Monocular Video | Minh-Quan Viet Bui et.al. | 2312.13528 | null |
2023-12-21 | Visual Tomography: Physically Faithful Volumetric Models of Partially Translucent Objects | David Nakath et.al. | 2312.13494 | null |
2023-12-20 | NeRF-VO: Real-Time Sparse Visual Odometry with Neural Radiance Fields | Jens Naumann et.al. | 2312.13471 | null |
2023-12-20 | Ternary-type Opacity and Hybrid Odometry for RGB-only NeRF-SLAM | Junru Lin et.al. | 2312.13332 | null |
2023-12-20 | ShowRoom3D: Text to High-Quality 3D Room Generation Using 3D Priors | Weijia Mao et.al. | 2312.13324 | null |
2023-12-20 | UniSDF: Unifying Neural Representations for High-Fidelity 3D Reconstruction of Complex Scenes with Reflections | Fangjinhua Wang et.al. | 2312.13285 | null |
2023-12-19 | ZS-SRT: An Efficient Zero-Shot Super-Resolution Training Method for Neural Radiance Fields | Xiang Feng et.al. | 2312.12122 | null |
2023-12-19 | LHManip: A Dataset for Long-Horizon Language-Grounded Manipulation Tasks in Cluttered Tabletop Environments | Federico Ceola et.al. | 2312.12036 | link |
2023-12-19 | MixRT: Mixed Neural Representations For Real-Time NeRF Rendering | Chaojian Li et.al. | 2312.11841 | null |
2023-12-19 | Text-Image Conditioned Diffusion for Consistent Text-to-3D Generation | Yuze He et.al. | 2312.11774 | null |
2023-12-15 | FastSR-NeRF: Improving NeRF Efficiency on Consumer Devices with A Simple Super-Resolution Pipeline | Chien-Yu Lin et.al. | 2312.11537 | null |
2023-12-15 | Customize-It-3D: High-Quality 3D Creation from A Single Image Using Subject-Specific Knowledge Prior | Nan Huang et.al. | 2312.11535 | null |
2023-12-18 | GAvatar: Animatable 3D Gaussian Avatars with Implicit Mesh Learning | Ye Yuan et.al. | 2312.11461 | null |
2023-12-18 | AE-NeRF: Audio Enhanced Neural Radiance Field for Few Shot Talking Head Synthesis | Dongze Li et.al. | 2312.10921 | null |
2023-12-17 | PNeRFLoc: Visual Localization with Point-based Neural Radiance Fields | Boming Zhao et.al. | 2312.10649 | null |
2023-12-19 | Learning Dense Correspondence for NeRF-Based Face Reenactment | Songlin Yang et.al. | 2312.10422 | null |
2023-12-15 | SlimmeRF: Slimmable Radiance Fields | Shiran Yuan et.al. | 2312.10034 | link |
2023-12-15 | LAENeRF: Local Appearance Editing for Neural Radiance Fields | Lukas Radl et.al. | 2312.09913 | null |
2023-12-15 | SLS4D: Sparse Latent Space for 4D Novel View Synthesis | Qi-Yuan Feng et.al. | 2312.09743 | null |
2023-12-15 | Towards Transferable Targeted 3D Adversarial Attack in the Physical World | Yao Huang et.al. | 2312.09558 | link |
2023-12-14 | LatentEditor: Text Driven Local Editing of 3D Scenes | Umar Khalid et.al. | 2312.09313 | link |
2023-12-14 | Stable Score Distillation for High-Quality 3D Generation | Boshi Tang et.al. | 2312.09305 | null |
2023-12-14 | ZeroRF: Fast Sparse View 360° Reconstruction with Zero Pretraining | Ruoxi Shi et.al. | 2312.09249 | null |
2023-12-15 | 3DGS-Avatar: Animatable Avatars via Deformable 3D Gaussian Splatting | Zhiyin Qian et.al. | 2312.09228 | null |
2023-12-15 | ColNeRF: Collaboration for Generalizable Sparse Input Neural Radiance Field | Zhangkai Ni et.al. | 2312.09095 | link |
2023-12-15 | Aleth-NeRF: Illumination Adaptive NeRF with Concealing Field Assumption | Ziteng Cui et.al. | 2312.09093 | link |
2023-12-14 | iComMa: Inverting 3D Gaussians Splatting for Camera Pose Estimation via Comparing and Matching | Yuan Sun et.al. | 2312.09031 | null |
2023-12-14 | Scene 3-D Reconstruction System in Scattering Medium | Zhuoyifan Zhang et.al. | 2312.09005 | null |
2023-12-14 | CF-NeRF: Camera Parameter Free Neural Radiance Fields with Incremental Learning | Qingsong Yan et.al. | 2312.08760 | null |
2023-12-14 | SpectralNeRF: Physically Based Spectral Rendering with Neural Radiance Field | Ru Li et.al. | 2312.08692 | link |
2023-12-13 | ProNeRF: Learning Efficient Projection-Aware Ray Sampling for Fine-Grained Implicit Neural Radiance Fields | Juan Luis Gonzalez Bello et.al. | 2312.08136 | null |
2023-12-13 | Neural Radiance Fields for Transparent Object Using Visual Hull | Heechan Yoon et.al. | 2312.08118 | null |
2023-12-13 | uSF: Learning Neural Semantic Field with Uncertainty | Vsevolod Skorokhodov et.al. | 2312.08012 | link |
2023-12-12 | COLMAP-Free 3D Gaussian Splatting | Yang Fu et.al. | 2312.07504 | null |
2023-12-12 | Unifying Correspondence, Pose and NeRF for Pose-Free Novel View Synthesis from Stereo Pairs | Sunghwan Hong et.al. | 2312.07246 | link |
2023-12-12 | WaterHE-NeRF: Water-ray Tracing Neural Radiance Fields for Underwater Scene Reconstruction | Jingchun Zhou et.al. | 2312.06946 | null |
2023-12-10 | TeTriRF: Temporal Tri-Plane Radiance Fields for Efficient Free-Viewpoint Video | Minye Wu et.al. | 2312.06713 | null |
2023-12-11 | CorresNeRF: Image Correspondence Priors for Neural Radiance Fields | Yixing Lao et.al. | 2312.06642 | link |
2023-12-11 | DreamControl: Control-Based Text-to-3D Generation with 3D Self-Prior | Tianyu Huang et.al. | 2312.06439 | link |
2023-12-10 | NeVRF: Neural Video-based Radiance Fields for Long-duration Sequences | Minye Wu et.al. | 2312.05855 | null |
2023-12-10 | IL-NeRF: Incremental Learning for Neural Radiance Fields with Camera Pose Alignment | Letian Zhang et.al. | 2312.05748 | null |
2023-12-09 | CoGS: Controllable Gaussian Splatting | Heng Yu et.al. | 2312.05664 | null |
2023-12-09 | R2-Talker: Realistic Real-Time Talking Head Synthesis with Hash Grid Landmarks Encoding and Progressive Multilayer Conditioning | Zhiling Ye et.al. | 2312.05572 | null |
2023-12-08 | Multi-view Inversion for 3D-aware Generative Adversarial Networks | Florian Barthel et.al. | 2312.05330 | link |
2023-12-08 | TriHuman : A Real-time and Controllable Tri-plane Representation for Detailed Human Geometry and Appearance Synthesis | Heming Zhu et.al. | 2312.05161 | null |
2023-12-08 | Learn to Optimize Denoising Scores for 3D Generation: A Unified and Improved Diffusion Prior on NeRF and 3D Gaussian Splatting | Xiaofeng Yang et.al. | 2312.04820 | null |
2023-12-08 | Reality’s Canvas, Language’s Brush: Crafting 3D Avatars from Monocular Video | Yuchen Rao et.al. | 2312.04784 | null |
2023-12-07 | MuRF: Multi-Baseline Radiance Fields | Haofei Xu et.al. | 2312.04565 | link |
2023-12-07 | EAGLES: Efficient Accelerated 3D Gaussians with Lightweight EncodingS | Sharath Girish et.al. | 2312.04564 | link |
2023-12-07 | Correspondences of the Third Kind: Camera Pose Estimation from Object Reflection | Kohei Yamashita et.al. | 2312.04527 | null |
2023-12-07 | Multi-View Unsupervised Image Generation with Cross Attention Guidance | Llukman Cerkezi et.al. | 2312.04337 | null |
2023-12-07 | Towards 4D Human Video Stylization | Tiantian Wang et.al. | 2312.04143 | link |
2023-12-07 | Identity-Obscured Neural Radiance Fields: Privacy-Preserving 3D Facial Reconstruction | Jiayi Kong et.al. | 2312.04106 | null |
2023-12-06 | Inpaint3D: 3D Scene Content Generation using 2D Inpainting Diffusion | Kira Prabhu et.al. | 2312.03869 | null |
2023-12-06 | Gaussian-Flow: 4D Reconstruction with Dynamic 3D Gaussian Particle | Youtian Lin et.al. | 2312.03431 | null |
2023-12-06 | Artist-Friendly Relightable and Animatable Neural Heads | Yingyan Xu et.al. | 2312.03420 | null |
2023-12-06 | Evaluating the point cloud of individual trees generated from images based on Neural Radiance fields (NeRF) method | Hongyu Huang et.al. | 2312.03372 | null |
2023-12-06 | RING-NeRF: A Versatile Architecture based on Residual Implicit Neural Grids | Doriand Petit et.al. | 2312.03357 | null |
2023-12-06 | SO-NeRF: Active View Planning for NeRF using Surrogate Objectives | Keifer Lee et.al. | 2312.03266 | null |
2023-12-06 | Feature 3DGS: Supercharging 3D Gaussian Splatting to Enable Distilled Feature Fields | Shijie Zhou et.al. | 2312.03203 | link |
2023-12-05 | HybridNeRF: Efficient Neural Rendering via Adaptive Volumetric Surfaces | Haithem Turki et.al. | 2312.03160 | null |
2023-12-05 | ReconFusion: 3D Reconstruction with Diffusion Priors | Rundi Wu et.al. | 2312.02981 | null |
2023-12-05 | GauHuman: Articulated Gaussian Splatting from Monocular Human Videos | Shoukang Hu et.al. | 2312.02973 | link |
2023-12-05 | Alchemist: Parametric Control of Material Properties with Diffusion Models | Prafull Sharma et.al. | 2312.02970 | null |
2023-12-05 | MVHumanNet: A Large-scale Dataset of Multi-view Daily Dressing Human Captures | Zhangyang Xiong et.al. | 2312.02963 | null |
2023-12-05 | C-NERF: Representing Scene Changes as Directional Consistency Difference-based NeRF | Rui Huang et.al. | 2312.02751 | link |
2023-12-05 | Prompt2NeRF-PIL: Fast NeRF Generation via Pretrained Implicit Latent | Jianmeng Liu et.al. | 2312.02568 | null |
2023-12-04 | PointNeRF++: A multi-scale, point-based Neural Radiance Field | Weiwei Sun et.al. | 2312.02362 | null |
2023-12-04 | Calibrated Uncertainties for Neural Radiance Fields | Niki Amini-Naieni et.al. | 2312.02350 | null |
2023-12-04 | Re-Nerfing: Enforcing Geometric Constraints on Neural Radiance Fields through Novel Views Synthesis | Felix Tristram et.al. | 2312.02255 | null |
2023-12-04 | ColonNeRF: Neural Radiance Fields for High-Fidelity Long-Sequence Colonoscopy Reconstruction | Yufei Shi et.al. | 2312.02015 | null |
2023-12-04 | Customize your NeRF: Adaptive Source Driven 3D Scene Editing via Local-Global Iterative Training | Runze He et.al. | 2312.01663 | null |
2023-12-03 | SANeRF-HQ: Segment Anything for NeRF in High Quality | Yichen Liu et.al. | 2312.01531 | null |
2023-12-03 | VideoRF: Rendering Dynamic Radiance Fields as 2D Feature Video Streams | Liao Wang et.al. | 2312.01407 | null |
2023-12-02 | Self-Evolving Neural Radiance Fields | Jaewoo Jung et.al. | 2312.01003 | link |
2023-12-01 | Gaussian Grouping: Segment and Edit Anything in 3D Scenes | Mingqiao Ye et.al. | 2312.00732 | link |
2023-11-30 | LucidDreaming: Controllable Object-Centric 3D Generation | Zhaoning Wang et.al. | 2312.00588 | null |
2023-12-01 | FSGS: Real-Time Few-shot View Synthesis using Gaussian Splatting | Zehao Zhu et.al. | 2312.00451 | null |
2023-11-30 | PyNeRF: Pyramidal Neural Radiance Fields | Haithem Turki et.al. | 2312.00252 | link |
2023-11-30 | SparseGS: Real-Time 360° Sparse View Synthesis using Gaussian Splatting | Haolin Xiong et.al. | 2312.00206 | link |
2023-11-30 | Contrastive Denoising Score for Text-guided Latent Diffusion Image Editing | Hyelin Nam et.al. | 2311.18608 | null |
2023-11-30 | ZeST-NeRF: Using temporal aggregation for Zero-Shot Temporal NeRFs | Violeta Menéndez González et.al. | 2311.18491 | null |
2023-11-30 | Anisotropic Neural Representation Learning for High-Quality Neural Rendering | Y. Wang et.al. | 2311.18311 | null |
2023-11-30 | CosAvatar: Consistent and Animatable Portrait Video Tuning with Text Prompt | Haiyao Xiao et.al. | 2311.18288 | null |
2023-11-30 | Compact3D: Compressing Gaussian Splat Radiance Field Models with Vector Quantization | KL Navaneet et.al. | 2311.18159 | link |
2023-11-29 | GaussianShader: 3D Gaussian Splatting with Shading Functions for Reflective Surfaces | Yingwenqi Jiang et.al. | 2311.17977 | null |
2023-11-29 | AvatarStudio: High-fidelity and Animatable 3D Avatar Creation from Text | Jianfeng Zhang et.al. | 2311.17917 | null |
2023-11-29 | FisherRF: Active View Selection and Uncertainty Quantification for Radiance Fields using Fisher Information | Wen Jiang et.al. | 2311.17874 | link |
2023-11-29 | Cinematic Behavior Transfer via NeRF-based Differentiable Filming | Xuekun Jiang et.al. | 2311.17754 | null |
2023-11-29 | SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis | Ziqiao Peng et.al. | 2311.17590 | link |
2023-11-29 | NeRFTAP: Enhancing Transferability of Adversarial Patches on Face Recognition using Neural Radiance Fields | Xiaoliang Liu et.al. | 2311.17332 | null |
2023-11-28 | LightGaussian: Unbounded 3D Gaussian Compression with 15x Reduction and 200+ FPS | Zhiwen Fan et.al. | 2311.17245 | link |
2023-11-28 | Continuous Pose for Monocular Cameras in Neural Implicit Representation | Qi Ma et.al. | 2311.17119 | link |
2023-11-28 | UC-NeRF: Neural Radiance Field for Under-Calibrated multi-view cameras in autonomous driving | Kai Cheng et.al. | 2311.16945 | null |
2023-11-28 | The Sky’s the Limit: Re-lightable Outdoor Scenes via a Sky-pixel Constrained Illumination Prior and Outside-In Visibility | James A. D. Gardner et.al. | 2311.16937 | link |
2023-11-28 | SplitNeRF: Split Sum Approximation Neural Field for Joint Geometry, Illumination, and Material Estimation | Jesus Zarzar et.al. | 2311.16671 | link |
2023-11-28 | DGNR: Density-Guided Neural Point Rendering of Large Driving Scenes | Zhuopeng Li et.al. | 2311.16664 | null |
2023-11-28 | SCALAR-NeRF: SCAlable LARge-scale Neural Radiance Fields for Scene Reconstruction | Yu Chen et.al. | 2311.16657 | null |
2023-11-28 | Rethinking Directional Integration in Neural Radiance Fields | Congyue Deng et.al. | 2311.16504 | null |
2023-11-27 | Deceptive-Human: Prompt-to-NeRF 3D Human Generation with 3D-Consistent Synthetic Images | Shiu-hong Kao et.al. | 2311.16499 | link |
2023-11-27 | Animatable Gaussians: Learning Pose-dependent Gaussian Maps for High-fidelity Human Avatar Modeling | Zhe Li et.al. | 2311.16096 | link |
2023-11-27 | SOAC: Spatio-Temporal Overlap-Aware Multi-Sensor Calibration using Neural Radiance Fields | Quentin Herau et.al. | 2311.15803 | null |
2023-11-27 | CaesarNeRF: Calibrated Semantic Representation for Few-shot Generalizable Neural Rendering | Haidong Zhu et.al. | 2311.15510 | link |
2023-11-26 | Efficient Encoding of Graphics Primitives with Simplex-based Structures | Yibo Wen et.al. | 2311.15439 | null |
2023-11-26 | Obj-NeRF: Extract Object NeRFs from Multi-view Images | Zhiyi Li et.al. | 2311.15291 | null |
2023-11-26 | NeuRAD: Neural Rendering for Autonomous Driving | Adam Tonderski et.al. | 2311.15260 | link |
2023-11-24 | Animate124: Animating One Image to 4D Dynamic Scene | Yuyang Zhao et.al. | 2311.14603 | null |
2023-11-24 | GaussianEditor: Swift and Controllable 3D Editing with Gaussian Splatting | Yiwen Chen et.al. | 2311.14521 | link |
2023-11-23 | ECRF: Entropy-Constrained Neural Radiance Fields Compression with Frequency Domain Optimization | Soonbin Lee et.al. | 2311.14208 | null |
2023-11-23 | Tube-NeRF: Efficient Imitation Learning of Visuomotor Policies from MPC using Tube-Guided Data Augmentation and NeRFs | Andrea Tagliabue et.al. | 2311.14153 | null |
2023-11-23 | Towards Transferable Multi-modal Perception Representation Learning for Autonomy: NeRF-Supervised Masked AutoEncoder | Xiaohao Xu et.al. | 2311.13750 | null |
2023-11-22 | Compact 3D Gaussian Representation for Radiance Field | Joo Chan Lee et.al. | 2311.13681 | link |
2023-11-22 | Boosting3D: High-Fidelity Image-to-3D by Boosting 2D Diffusion Prior to 3D Prior with Progressive Learning | Kai Yu et.al. | 2311.13617 | null |
2023-11-22 | Animatable 3D Gaussians for High-fidelity Synthesis of Human Motions | Keyang Ye et.al. | 2311.13404 | null |
2023-11-22 | Depth-Regularized Optimization for 3D Gaussian Splatting in Few-Shot Images | Jaeyoung Chung et.al. | 2311.13398 | link |
2023-11-22 | 3D Face Style Transfer with a Hybrid Solution of NeRF and Mesh Rasterization | Jianwei Feng et.al. | 2311.13168 | null |
2023-11-22 | PIE-NeRF: Physics-based Interactive Elastodynamics with NeRF | Yutao Feng et.al. | 2311.13099 | null |
2023-11-21 | SuGaR: Surface-Aligned Gaussian Splatting for Efficient 3D Mesh Reconstruction and High-Quality Mesh Rendering | Antoine Guédon et.al. | 2311.12775 | link |
2023-11-21 | Hyb-NeRF: A Multiresolution Hybrid Encoding for Neural Radiance Fields | Yifan Wang et.al. | 2311.12490 | null |
2023-11-18 | Towards Function Space Mesh Watermarking: Protecting the Copyright of Signed Distance Fields | Xingyu Zhu et.al. | 2311.12059 | null |
2023-11-20 | GP-NeRF: Generalized Perception NeRF for Context-Aware 3D Scene Understanding | Hao Li et.al. | 2311.11863 | null |
2023-11-20 | Entangled View-Epipolar Information Aggregation for Generalizable Neural Radiance Fields | Zhiyuan Min et.al. | 2311.11845 | link |
2023-11-19 | GaussianDiffusion: 3D Gaussian Splatting for Denoising Diffusion Probabilistic Models with Structured Noise | Xinhai Li et.al. | 2311.11221 | null |
2023-11-18 | SNI-SLAM: Semantic Neural Implicit SLAM | Siting Zhu et.al. | 2311.11016 | link |
2023-11-18 | Structure-Aware Sparse-View X-ray 3D Reconstruction | Yuanhao Cai et.al. | 2311.10959 | link |
2023-11-17 | Removing Adverse Volumetric Effects From Trained Neural Radiance Fields | Andreas L. Teigen et.al. | 2311.10523 | null |
2023-11-18 | EvaSurf: Efficient View-Aware Implicit Textured Surface Reconstruction on Mobile Devices | Jingnan Gao et.al. | 2311.09806 | null |
2023-11-16 | Reconstructing Continuous Light Field From Single Coded Image | Yuya Ishikawa et.al. | 2311.09646 | null |
2023-11-15 | Single-Image 3D Human Digitization with Shape-Guided Diffusion | Badour AlBahar et.al. | 2311.09221 | null |
2023-11-15 | DMV3D: Denoising Multi-View Diffusion using 3D Large Reconstruction Model | Yinghao Xu et.al. | 2311.09217 | null |
2023-11-15 | Spiking NeRF: Representing the Real-World Geometry by a Discontinuous Representation | Zhanfeng Liao et.al. | 2311.09077 | link |
2023-11-13 | $L_0$-Sampler: An $L_{0}$ Model Guided Volume Sampling for NeRF | Liangchen Li et.al. | 2311.07044 | null |
2023-11-11 | Aria-NeRF: Multimodal Egocentric View Synthesis | Jiankai Sun et.al. | 2311.06455 | null |
2023-11-10 | Instant3D: Fast Text-to-3D with Sparse-View Generation and Large Reconstruction Model | Jiahao Li et.al. | 2311.06214 | null |
2023-11-10 | A Neural Height-Map Approach for the Binocular Photometric Stereo Problem | Fotios Logothetis et.al. | 2311.05958 | null |
2023-11-09 | BakedAvatar: Baking Neural Fields for Real-Time Head Avatar Synthesis | Hao-Bin Duan et.al. | 2311.05521 | link |
2023-11-09 | Control3D: Towards Controllable Text-to-3D Generation | Yang Chen et.al. | 2311.05461 | null |
2023-11-08 | LRM: Large Reconstruction Model for Single Image to 3D | Yicong Hong et.al. | 2311.04400 | null |
2023-11-07 | ADFactory: Automated Data Factory for Optical Flow Tasks | Han Ling et.al. | 2311.04246 | null |
2023-11-07 | High-fidelity 3D Reconstruction of Plants using Neural Radiance Field | Kewei Hu et.al. | 2311.04154 | null |
2023-11-07 | Fast Sun-aligned Outdoor Scene Relighting based on TensoRF | Yeonjin Chang et.al. | 2311.03965 | null |
2023-11-08 | UP-NeRF: Unconstrained Pose-Prior-Free Neural Radiance Fields | Injae Kim et.al. | 2311.03784 | link |
2023-11-06 | Osprey: Multi-Session Autonomous Aerial Mapping with LiDAR-based SLAM and Next Best View Planning | Rowan Border et.al. | 2311.03484 | null |
2023-11-06 | Animating NeRFs from Texture Space: A Framework for Pose-Dependent Rendering of Human Performances | Paul Knoll et.al. | 2311.03140 | null |
2023-11-06 | InstructPix2NeRF: Instructed 3D Portrait Editing from a Single Image | Jianhui Li et.al. | 2311.02826 | link |
2023-11-03 | Estimating 3D Uncertainty Field: Quantifying Uncertainty for Neural Radiance Fields | Jianxiong Shen et.al. | 2311.01815 | null |
2023-11-03 | PDF: Point Diffusion Implicit Function for Large-scale Scene Neural Representation | Yuhan Ding et.al. | 2311.01773 | null |
2023-11-03 | Efficient Cloud Pipelines for Neural Radiance Fields | Derek Jacoby et.al. | 2311.01659 | null |
2023-11-02 | Novel View Synthesis from a Single RGBD Image for Indoor Scenes | Congrui Hetang et.al. | 2311.01065 | null |
2023-10-31 | FPO++: Efficient Encoding and Rendering of Dynamic Neural Radiance Fields by Analyzing and Enhancing Fourier PlenOctrees | Saskia Rabich et.al. | 2310.20710 | null |
2023-10-31 | NeRF Revisited: Fixing Quadrature Instability in Volume Rendering | Mikaela Angelina Uy et.al. | 2310.20685 | null |
2023-10-30 | Generative Neural Fields by Mixtures of Neural Implicit Functions | Tackgeun You et.al. | 2310.19464 | null |
2023-11-04 | TiV-NeRF: Tracking and Mapping via Time-Varying Representation with Dynamic Neural Radiance Fields | Chengyao Duan et.al. | 2310.18917 | null |
2023-10-28 | INCODE: Implicit Neural Conditioning with Prior Knowledge Embeddings | Amirhossein Kazerouni et.al. | 2310.18846 | link |
2023-10-27 | ZeroNVS: Zero-Shot 360-Degree View Synthesis from a Single Real Image | Kyle Sargent et.al. | 2310.17994 | link |
2023-10-27 | Reconstructive Latent-Space Neural Radiance Fields for Efficient 3D Scene Representations | Tristan Aumentado-Armstrong et.al. | 2310.17880 | null |
2023-10-27 | HyperFields: Towards Zero-Shot Generation of NeRFs from Text | Sudarshan Babu et.al. | 2310.17075 | null |
2023-10-25 | 4D-Editor: Interactive Object-level Editing in Dynamic Neural Radiance Fields via 4D Semantic Segmentation | Dadong Jiang et.al. | 2310.16858 | null |
2023-10-26 | LightSpeed: Light and Fast Neural Light Fields on Mobile Devices | Aarush Gupta et.al. | 2310.16832 | link |
2023-10-28 | PERF: Panoramic Neural Radiance Field from a Single Panorama | Guangcong Wang et.al. | 2310.16831 | link |
2023-10-25 | Open-NeRF: Towards Open Vocabulary NeRF Decomposition | Hao Zhang et.al. | 2310.16383 | null |
2023-10-25 | UAV-Sim: NeRF-based Synthetic Data Generation for UAV-based Perception | Christopher Maxey et.al. | 2310.16255 | null |
2023-10-24 | Cross-view Self-localization from Synthesized Scene-graphs | Ryogo Yamamoto et.al. | 2310.15504 | null |
2023-10-23 | CAwa-NeRF: Instant Learning of Compression-Aware NeRF Features | Omnia Mahmoud et.al. | 2310.14695 | null |
2023-10-23 | VQ-NeRF: Vector Quantization Enhances Implicit Neural Representations | Yiying Yang et.al. | 2310.14487 | null |
2023-10-20 | ManifoldNeRF: View-dependent Image Feature Supervision for Few-shot Neural Radiance Fields | Daiju Kanaoka et.al. | 2310.13670 | null |
2023-10-20 | Sync-NeRF: Generalizing Dynamic NeRFs to Unsynchronized Videos | Seoha Kim et.al. | 2310.13356 | link |
2023-10-20 | UE4-NeRF:Neural Radiance Field for Real-Time Rendering of Large-Scale Scene | Jiaming Gu et.al. | 2310.13263 | null |
2023-10-18 | VQ-NeRF: Neural Reflectance Decomposition and Editing with Vector Quantization | Hongliang Zhong et.al. | 2310.11864 | null |
2023-10-18 | Towards Abdominal 3-D Scene Rendering from Laparoscopy Surgical Videos using NeRFs | Khoa Tuan Nguyen et.al. | 2310.11645 | null |
2023-10-16 | TraM-NeRF: Tracing Mirror and Near-Perfect Specular Reflections through Neural Radiance Fields | Leif Van Holland et.al. | 2310.10650 | link |
2023-10-16 | DynVideo-E: Harnessing Dynamic NeRF for Large-Scale Motion- and View-Change Human-Centric Video Editing | Jia-Wei Liu et.al. | 2310.10624 | null |
2023-10-16 | Self-supervised Fetal MRI 3D Reconstruction Based on Radiation Diffusion Generation Model | Junpeng Tan et.al. | 2310.10209 | null |
2023-10-15 | ProteusNeRF: Fast Lightweight NeRF Editing using 3D-Aware Image Context | Binglun Wang et.al. | 2310.09965 | null |
2023-10-15 | Active Perception using Neural Radiance Fields | Siming He et.al. | 2310.09892 | link |
2023-10-15 | CBARF: Cascaded Bundle-Adjusting Neural Radiance Fields from Imperfect Camera Poses | Hongyu Fu et.al. | 2310.09776 | null |
2023-10-11 | Dynamic Appearance Particle Neural Radiance Field | Ancheng Lin et.al. | 2310.07916 | null |
2023-10-12 | PoRF: Pose Residual Field for Accurate Neural Surface Reconstruction | Jia-Wang Bian et.al. | 2310.07449 | link |
2023-10-11 | rpcPRF: Generalizable MPI Neural Radiance Field for Satellite Camera | Tongtong Zhang et.al. | 2310.07179 | null |
2023-10-10 | Leveraging Neural Radiance Fields for Uncertainty-Aware Visual Localization | Le Chen et.al. | 2310.06984 | null |
2023-10-10 | High-Fidelity 3D Head Avatars Reconstruction through Spatially-Varying Expression Conditioned Neural Radiance Field | Minghan Qin et.al. | 2310.06275 | null |
2023-10-09 | A Real-time Method for Inserting Virtual Objects into Neural Radiance Fields | Keyang Ye et.al. | 2310.05837 | null |
2023-10-09 | Neural Impostor: Editing Neural Radiance Fields with Explicit Shape Manipulation | Ruiyang Liu et.al. | 2310.05391 | null |
2023-10-08 | LocoNeRF: A NeRF-based Approach for Local Structure from Motion for Precise Localization | Artem Nenashev et.al. | 2310.05134 | null |
2023-10-08 | Geometry Aware Field-to-field Transformations for 3D Semantic Segmentation | Dominik Hollidt et.al. | 2310.05133 | null |
2023-10-06 | Improving Neural Radiance Field using Near-Surface Sampling with Point Cloud Generation | Hye Bin Yoo et.al. | 2310.04152 | null |
2023-10-05 | Drag View: Generalizable Novel View Synthesis with Unposed Imagery | Zhiwen Fan et.al. | 2310.03704 | link |
2023-10-05 | Targeted Adversarial Attacks on Generalizable Neural Radiance Fields | Andras Horvath et.al. | 2310.03578 | null |
2023-10-05 | BID-NeRF: RGB-D image pose estimation with inverted Neural Radiance Fields | Ágoston István Csehi et.al. | 2310.03563 | null |
2023-10-04 | Shielding the Unseen: Privacy Protection through Poisoning NeRF with Spatial Deformation | Yihan Wu et.al. | 2310.03125 | null |
2023-10-04 | T $^3$ Bench: Benchmarking Current Progress in Text-to-3D Generation | Yuze He et.al. | 2310.02977 | link |
2023-10-04 | ED-NeRF: Efficient Text-Guided Editing of 3D Scene using Latent Space NeRF | Jangho Park et.al. | 2310.02712 | null |
2023-10-05 | USB-NeRF: Unrolling Shutter Bundle Adjusted Neural Radiance Fields | Moyang Li et.al. | 2310.02687 | link |
2023-10-03 | EvDNeRF: Reconstructing Event Data with Dynamic Neural Radiance Fields | Anish Bhattacharya et.al. | 2310.02437 | link |
2023-10-03 | Adaptive Multi-NeRF: Exploit Efficient Parallelism in Adaptive Multiple Scale Neural Radiance Field Rendering | Tong Wang et.al. | 2310.01881 | null |
2023-10-03 | MIMO-NeRF: Fast Neural Rendering with Multi-input Multi-output Neural Radiance Fields | Takuhiro Kaneko et.al. | 2310.01821 | null |
2023-10-02 | PC-NeRF: Parent-Child Neural Radiance Fields under Partial Sensor Data Loss in Autonomous Driving Environments | Xiuzhong Hu et.al. | 2310.00874 | link |
2023-10-01 | How Many Views Are Needed to Reconstruct an Unknown Object Using NeRF? | Sicong Pan et.al. | 2310.00684 | link |
2023-10-01 | Enabling Neural Radiance Fields (NeRF) for Large-scale Aerial Images – A Multi-tiling Approaching and the Geometry Assessment of NeRF | Ningli Xu et.al. | 2310.00530 | null |
2023-09-30 | MMPI: a Flexible Radiance Field Representation by Multiple Multi-plane Images Blending | Yuze He et.al. | 2310.00249 | null |
2023-09-29 | Multi-task View Synthesis with Neural Radiance Fields | Shuhong Zheng et.al. | 2309.17450 | link |
2023-09-29 | Forward Flow for Novel View Synthesis of Dynamic Scenes | Xiang Guo et.al. | 2309.17390 | null |
2023-09-29 | HAvatar: High-fidelity Head Avatar via Facial Model Conditioned Neural Radiance Field | Xiaochen Zhao et.al. | 2309.17128 | null |
2023-09-28 | Preface: A Data-driven Volumetric Prior for Few-shot Ultra High-resolution Face Synthesis | Marcel C. Bühler et.al. | 2309.16859 | null |
2023-09-28 | MatrixCity: A Large-scale City Dataset for City-scale Neural Rendering and Beyond | Yixuan Li et.al. | 2309.16553 | null |
2023-09-28 | FG-NeRF: Flow-GAN based Probabilistic Neural Radiance Field for Independence-Assumption-Free Uncertainty Estimation | Songlin Wei et.al. | 2309.16364 | null |
2023-09-28 | Learning Effective NeRFs and SDFs Representations with 3D Generative Adversarial Networks for 3D Object Generation: Technical Report for ICCV 2023 OmniObject3D Challenge | Zheyuan Yang et.al. | 2309.16110 | null |
2023-09-27 | P2I-NET: Mapping Camera Pose to Image via Adversarial Learning for New View Synthesis in Real Indoor Environments | Xujie Kang et.al. | 2309.15526 | null |
2023-09-27 | BASED: Bundle-Adjusting Surgical Endoscopic Dynamic Video Reconstruction using Neural Radiance Fields | Shreya Saha et.al. | 2309.15329 | null |
2023-09-26 | 3D Density-Gradient based Edge Detection on Neural Radiance Fields (NeRFs) for Geometric Reconstruction | Miriam Jäger et.al. | 2309.14800 | null |
2023-09-25 | NAS-NeRF: Generative Neural Architecture Search for Neural Radiance Fields | Saeejith Nair et.al. | 2309.14293 | null |
2023-09-25 | Variational Inference for Scalable 3D Object-centric Learning | Tianyu Wang et.al. | 2309.14010 | null |
2023-09-24 | MM-NeRF: Multimodal-Guided 3D Multi-Style Transfer of Neural Radiance Field | Zijiang Yang et.al. | 2309.13607 | null |
2023-09-23 | NeRF-Enhanced Outpainting for Faithful Field-of-View Extrapolation | Rui Yu et.al. | 2309.13240 | null |
2023-09-22 | NeRRF: 3D Reconstruction and View Synthesis for Transparent and Specular Objects with Neural Refractive-Reflective Fields | Xiaoxue Chen et.al. | 2309.13039 | link |
2023-09-21 | ORTexME: Occlusion-Robust Human Shape and Pose via Temporal Average Texture and Mesh Encoding | Yu Cheng et.al. | 2309.12183 | null |
2023-09-21 | NeuralLabeling: A versatile toolset for labeling vision datasets using Neural Radiance Fields | Floris Erich et.al. | 2309.11966 | link |
2023-09-21 | Fast Satellite Tensorial Radiance Field for Multi-date Satellite Imagery of Large Size | Tongtong Zhang et.al. | 2309.11767 | null |
2023-09-21 | MarkNerf:Watermarking for Neural Radiance Field | Lifeng Chen et.al. | 2309.11747 | null |
2023-09-21 | Rendering stable features improves sampling-based localisation with Neural radiance fields | Boxuan Zhang et.al. | 2309.11698 | null |
2023-09-20 | GenLayNeRF: Generalizable Layered Representations with 3D Model Alignment for Multi-Human View Synthesis | Youssef Abdelkareem et.al. | 2309.11627 | null |
2023-09-20 | Light Field Diffusion for Single-View Novel View Synthesis | Yifeng Xiong et.al. | 2309.11525 | null |
2023-09-21 | Controllable Dynamic Appearance for Neural 3D Portraits | ShahRukh Athar et.al. | 2309.11009 | null |
2023-09-20 | Spiking NeRF: Making Bio-inspired Neural Networks See through the Real World | Xingting Yao et.al. | 2309.10987 | link |
2023-09-19 | Locally Stylized Neural Radiance Fields | Hong-Wing Pang et.al. | 2309.10684 | null |
2023-09-19 | Steganography for Neural Radiance Fields by Backdooring | Weina Dong et.al. | 2309.10503 | null |
2023-09-18 | Instant Photorealistic Style Transfer: A Lightweight and Adaptive Approach | Rong Liu et.al. | 2309.10011 | null |
2023-09-18 | RenderOcc: Vision-Centric 3D Occupancy Prediction with 2D Rendering Supervision | Mingjie Pan et.al. | 2309.09502 | link |
2023-09-17 | NeRF-VINS: A Real-time Neural Radiance Field Map-based Visual-Inertial Navigation System | Saimouli Katragadda et.al. | 2309.09295 | null |
2023-09-16 | DynaMoN: Motion-Aware Fast And Robust Camera Localization for Dynamic NeRF | Mert Asim Karaoglu et.al. | 2309.08927 | null |
2023-09-15 | Robust e-NeRF: NeRF from Sparse & Noisy Events under Non-Uniform Motion | Weng Fei Low et.al. | 2309.08596 | link |
2023-09-14 | Gradient based Grasp Pose Optimization on a NeRF that Approximates Grasp Success | Gergely Sóti et.al. | 2309.08040 | null |
2023-09-14 | MC-NeRF: Muti-Camera Neural Radiance Fields for Muti-Camera Image Acquisition Systems | Yu Gao et.al. | 2309.07846 | null |
2023-09-14 | DT-NeRF: Decomposed Triplane-Hash Neural Radiance Fields for High-Fidelity Talking Portrait Synthesis | Yaoyu Su et.al. | 2309.07752 | null |
2023-09-14 | CoRF : Colorizing Radiance Fields using Knowledge Distillation | Ankit Dhiman et.al. | 2309.07668 | null |
2023-09-13 | Text-Guided Generation and Editing of Compositional 3D Avatars | Hao Zhang et.al. | 2309.07125 | null |
2023-09-13 | Dynamic NeRFs for Soccer Scenes | Sacha Lewin et.al. | 2309.06802 | link |
2023-09-12 | Federated Learning for Large-Scale Scene Modeling with Neural Radiance Fields | Teppei Suzuki et.al. | 2309.06030 | null |
2023-09-11 | PAg-NeRF: Towards fast and efficient end-to-end panoptic 3D representations for agricultural robotics | Claus Smitt et.al. | 2309.05339 | null |
2023-09-10 | Text-driven Editing of 3D Scenes without Retraining | Shuangkang Fang et.al. | 2309.04917 | link |
2023-09-09 | Mirror-Aware Neural Humans | Daniel Ajisafe et.al. | 2309.04750 | link |
2023-09-08 | Dynamic Mesh-Aware Radiance Fields | Yi-Ling Qiao et.al. | 2309.04581 | null |
2023-09-08 | DeformToon3D: Deformable 3D Toonification from Neural Radiance Fields | Junzhe Zhang et.al. | 2309.04410 | link |
2023-09-14 | SimpleNeRF: Regularizing Sparse Input Neural Radiance Fields with Simpler Solutions | Nagabhushan Somraj et.al. | 2309.03955 | null |
2023-09-07 | BluNF: Blueprint Neural Field | Robin Courant et.al. | 2309.03933 | null |
2023-09-07 | Text2Control3D: Controllable 3D Avatar Generation in Neural Radiance Fields using Geometry-Guided Text-to-Image Diffusion Model | Sungwon Hwang et.al. | 2309.03550 | null |
2023-09-06 | Bayes’ Rays: Uncertainty Quantification for Neural Radiance Fields | Lily Goli et.al. | 2309.03185 | link |
2023-09-06 | ResFields: Residual Neural Fields for Spatiotemporal Signals | Marko Mihajlovic et.al. | 2309.03160 | link |
2023-09-06 | Instant Continual Learning of Neural Radiance Fields | Ryan Po et.al. | 2309.01811 | null |
2023-09-04 | Adv3D: Generating 3D Adversarial Examples in Driving Scenarios with NeRF | Leheng Li et.al. | 2309.01351 | null |
2023-09-01 | SparseSat-NeRF: Dense Depth Supervised Neural Radiance Fields for Sparse Satellite Images | Lulin Zhang et.al. | 2309.00277 | link |
2023-08-24 | Improving NeRF Quality by Progressive Camera Placement for Unrestricted Navigation in Complex Environments | Georgios Kopanas et.al. | 2309.00014 | null |
2023-09-03 | GHuNeRF: Generalizable Human NeRF from a Monocular Video | Chen Li et.al. | 2308.16576 | link |
2023-08-30 | From Pixels to Portraits: A Comprehensive Survey of Talking Head Generation Techniques and Applications | Shreyank N Gowda et.al. | 2308.16041 | null |
2023-08-30 | Drone-NeRF: Efficient NeRF Based 3D Scene Reconstruction for Large-Scale Drone Survey | Zhihao Jia et.al. | 2308.15733 | null |
2023-08-29 | Efficient Ray Sampling for Radiance Fields Reconstruction | Shilei Sun et.al. | 2308.15547 | null |
2023-08-29 | Pose-Free Neural Radiance Fields via Implicit Pose Regularization | Jiahui Zhang et.al. | 2308.15049 | null |
2023-08-28 | CLNeRF: Continual Learning Meets NeRF | Zhipeng Cai et.al. | 2308.14816 | link |
2023-08-26 | InsertNeRF: Instilling Generalizability into NeRF with HyperNet Modules | Yanqi Bao et.al. | 2308.13897 | link |
2023-08-24 | NOVA: NOvel View Augmentation for Neural Composition of Dynamic Objects | Dakshit Agrawal et.al. | 2308.12560 | link |
2023-08-23 | Blending-NeRF: Text-Driven Localized Editing in Neural Radiance Fields | Hyeonseop Song et.al. | 2308.11974 | null |
2023-08-25 | Pose Modulated Avatars from Video | Chunjin Song et.al. | 2308.11951 | null |
2023-08-22 | Enhancing NeRF akin to Enhancing LLMs: Generalizable NeRF Transformer with Mixture-of-View-Experts | Wenyan Cong et.al. | 2308.11793 | link |
2023-08-22 | SAMSNeRF: Segment Anything Model (SAM) Guides Dynamic Surgical Scene Reconstruction by Neural Radiance Field (NeRF) | Ange Lou et.al. | 2308.11774 | null |
2023-08-22 | Novel-view Synthesis and Pose Estimation for Hand-Object Interaction from Sparse Views | Wentian Qu et.al. | 2308.11198 | null |
2023-08-22 | Efficient View Synthesis with Neural Radiance Distribution Field | Yushuang Wu et.al. | 2308.11130 | null |
2023-08-21 | CamP: Camera Preconditioning for Neural Radiance Fields | Keunhong Park et.al. | 2308.10902 | null |
2023-08-20 | Strata-NeRF : Neural Radiance Fields for Stratified Scenes | Ankit Dhiman et.al. | 2308.10337 | null |
2023-08-19 | HollowNeRF: Pruning Hashgrid-Based NeRFs with Trainable Collision Mitigation | Xiufeng Xie et.al. | 2308.10122 | null |
2023-08-19 | AltNeRF: Learning Robust Neural Radiance Field via Alternating Depth-Pose Optimization | Kun Wang et.al. | 2308.10001 | null |
2023-08-19 | Semantic-Human: Neural Rendering of Humans from Monocular Video with Human Parsing | Jie Zhang et.al. | 2308.09894 | null |
2023-08-18 | MonoNeRD: NeRF-like Representations for Monocular 3D Object Detection | Junkai Xu et.al. | 2308.09421 | link |
2023-08-18 | DReg-NeRF: Deep Registration for Neural Radiance Fields | Yu Chen et.al. | 2308.09386 | link |
2023-08-17 | Watch Your Steps: Local Image and Scene Editing by Text Instructions | Ashkan Mirzaei et.al. | 2308.08947 | null |
2023-08-21 | Ref-DVGO: Reflection-Aware Direct Voxel Grid Optimization for an Improved Quality-Efficiency Trade-Off in Reflective Scene Reconstruction | Georgios Kouros et.al. | 2308.08530 | link |
2023-08-16 | SceNeRFlow: Time-Consistent Reconstruction of General Dynamic Scenes | Edith Tretschk et.al. | 2308.08258 | null |
2023-08-16 | Neural radiance fields in the industrial and robotics domain: applications, research opportunities and use cases | Eugen Šlapak et.al. | 2308.07118 | link |
2023-08-14 | S3IM: Stochastic Structural SIMilarity and Its Unreasonable Effectiveness for Neural Fields | Zeke Xie et.al. | 2308.07032 | link |
2023-08-11 | Focused Specific Objects NeRF | Yuesong Li et.al. | 2308.05970 | null |
2023-08-11 | VERF: Runtime Monitoring of Pose Estimation with Neural Radiance Fields | Dominic Maggio et.al. | 2308.05939 | null |
2023-08-09 | WaveNeRF: Wavelet-based Generalizable Neural Radiance Fields | Muyu Xu et.al. | 2308.04826 | null |
2023-08-14 | A General Implicit Framework for Fast NeRF Composition and Rendering | Xinyu Gao et.al. | 2308.04669 | null |
2023-08-08 | Digging into Depth Priors for Outdoor Neural Radiance Fields | Chen Wang et.al. | 2308.04413 | null |
2023-08-07 | Mirror-NeRF: Learning Neural Radiance Fields for Mirrors with Whitted-Style Ray Tracing | Junyi Zeng et.al. | 2308.03280 | null |
2023-08-05 | Where and How: Mitigating Confusion in Neural Radiance Fields from Sparse Inputs | Yanqi Bao et.al. | 2308.02908 | link |
2023-08-05 | Learning Unified Decompositional and Compositional NeRF for Editable Novel View Synthesis | Yuxin Wang et.al. | 2308.02840 | null |
2023-08-05 | NeRFs: The Search for the Best 3D Representation | Ravi Ramamoorthi et.al. | 2308.02751 | null |
2023-08-04 | ES-MVSNet: Efficient Framework for End-to-end Self-supervised Multi-View Stereo | Qiang Zhou et.al. | 2308.02191 | null |
2023-08-02 | Incorporating Season and Solar Specificity into Renderings made by a NeRF Architecture using Satellite Images | Michael Gableman et.al. | 2308.01262 | link |
2023-08-01 | High-Fidelity Eye Animatable Neural Radiance Fields for Human Face | Hengfei Wang et.al. | 2308.00773 | null |
2023-08-01 | Context-Aware Talking-Head Video Editing | Songlin Yang et.al. | 2308.00462 | null |
2023-07-28 | Dynamic PlenOctree for Adaptive Sampling Refinement in Explicit NeRF | Haotian Bai et.al. | 2307.15333 | null |
2023-07-27 | Seal-3D: Interactive Pixel-Level Editing for Neural Radiance Fields | Xiangyu Wang et.al. | 2307.15131 | link |
2023-07-27 | MARS: An Instance-aware, Modular and Realistic Simulator for Autonomous Driving | Zirui Wu et.al. | 2307.15058 | link |
2023-07-27 | NeRF-Det: Learning Geometry-Aware Volumetric Representation for Multi-View 3D Object Detection | Chenfeng Xu et.al. | 2307.14620 | link |
2023-07-26 | Points-to-3D: Bridging the Gap between Sparse Points and Shape-Controllable Text-to-3D Generation | Chaohui Yu et.al. | 2307.13908 | null |
2023-07-24 | Dyn-E: Local Appearance Editing of Dynamic Neural Radiance Fields | Shangzhan Zhang et.al. | 2307.12909 | null |
2023-07-24 | CarPatch: A Synthetic Benchmark for Radiance Field Evaluation on Vehicle Components | Davide Di Nucci et.al. | 2307.12718 | null |
2023-07-23 | TransHuman: A Transformer-based Human Representation for Generalizable Neural Human Rendering | Xiao Pan et.al. | 2307.12291 | null |
2023-07-29 | CopyRNeRF: Protecting the CopyRight of Neural Radiance Fields | Ziyuan Luo et.al. | 2307.11526 | link |
2023-07-21 | FaceCLIPNeRF: Text-driven 3D Face Manipulation using Deformable Neural Radiance Fields | Sungwon Hwang et.al. | 2307.11418 | null |
2023-07-21 | Tri-MipRF: Tri-Mip Representation for Efficient Anti-Aliasing Neural Radiance Fields | Wenbo Hu et.al. | 2307.11335 | null |
2023-07-20 | Urban Radiance Field Representation with Deformable Neural Mesh Primitives | Fan Lu et.al. | 2307.10776 | null |
2023-07-20 | Lighting up NeRF via Unsupervised Decomposition and Enhancement | Haoyuan Wang et.al. | 2307.10664 | link |
2023-07-19 | An Improved NeuMIP with Better Accuracy | Bowen Xue et.al. | 2307.10135 | null |
2023-07-19 | Magic NeRF Lens: Interactive Fusion of Neural Radiance Fields for Virtual Facility Inspection | Ke Li et.al. | 2307.09860 | link |
2023-07-14 | Transient Neural Radiance Fields for Lidar View Synthesis and 3D Reconstruction | Anagh Malik et.al. | 2307.09555 | null |
2023-07-18 | Efficient Region-Aware Neural Radiance Fields for High-Fidelity Talking Portrait Synthesis | Jiahe Li et.al. | 2307.09323 | link |
2023-07-16 | Cross-Ray Neural Radiance Fields for Novel-view Synthesis from Unconstrained Image Collections | Yifan Yang et.al. | 2307.08093 | link |
2023-07-15 | Improving NeRF with Height Data for Utilization of GIS Data | Hinata Aoki et.al. | 2307.07729 | null |
2023-07-11 | SAR-NeRF: Neural Radiance Fields for Synthetic Aperture Radar Multi-View Representation | Zhengxin Lei et.al. | 2307.05087 | null |
2023-07-07 | NOFA: NeRF-based One-shot Facial Avatar Reconstruction | Wangbo Yu et.al. | 2307.03441 | null |
2023-07-07 | RGB-D Mapping and Tracking in a Plenoxel Radiance Field | Andreas L. Teigen et.al. | 2307.03404 | link |
2023-07-16 | FlipNeRF: Flipped Reflection Rays for Few-shot Novel View Synthesis | Seunghyeon Seo et.al. | 2306.17723 | link |
2023-07-03 | Sphere2Vec: A General-Purpose Location Representation Learning over a Spherical Surface for Large-Scale Geospatial Predictions | Gengchen Mai et.al. | 2306.17624 | null |
2023-06-28 | Envisioning a Next Generation Extended Reality Conferencing System with Efficient Photorealistic Human Rendering | Chuanyue Shen et.al. | 2306.16541 | null |
2023-06-27 | Unsupervised Polychromatic Neural Representation for CT Metal Artifact Reduction | Qing Wu et.al. | 2306.15203 | link |
2023-06-22 | Blended-NeRF: Zero-Shot Object Generation and Blending in Existing Neural Radiance Fields | Ori Gordon et.al. | 2306.12760 | link |
2023-06-21 | Local 3D Editing via 3D Distillation of CLIP Knowledge | Junha Hyung et.al. | 2306.12570 | null |
2023-06-21 | Benchmarking and Analyzing 3D-aware Image Synthesis with a Modularized Codebase | Qiuyu Wang et.al. | 2306.12423 | link |
2023-06-21 | DreamTime: An Improved Optimization Strategy for Text-to-3D Content Creation | Yukun Huang et.al. | 2306.12422 | null |
2023-06-20 | NeRF synthesis with shading guidance | Chenbin Li et.al. | 2306.11556 | null |
2023-06-24 | MA-NeRF: Motion-Assisted Neural Radiance Fields for Face Synthesis from Sparse Images | Weichen Zhang et.al. | 2306.10350 | null |
2023-06-15 | Edit-DiffNeRF: Editing 3D Neural Radiance Fields using 2D Diffusion Model | Lu Yu et.al. | 2306.09551 | null |
2023-06-16 | UrbanIR: Large-Scale Urban Scene Inverse Rendering from a Single Video | Zhi-Hao Lin et.al. | 2306.09349 | null |
2023-06-13 | DORSal: Diffusion for Object-centric Representations of Scenes $\textit{et al.}$ | Allan Jabri et.al. | 2306.08068 | null |
2023-06-13 | Binary Radiance Fields | Seungjoo Shin et.al. | 2306.07581 | null |
2023-06-10 | From NeRFLiX to NeRFLiX++: A General NeRF-Agnostic Restorer Paradigm | Kun Zhou et.al. | 2306.06388 | null |
2023-06-15 | NERFBK: A High-Quality Benchmark for NERF-Based 3D Reconstruction | Ali Karami et.al. | 2306.06300 | link |
2023-06-09 | HyP-NeRF: Learning Improved NeRF Priors using a HyperNetwork | Bipasha Sen et.al. | 2306.06093 | null |
2023-06-09 | GANeRF: Leveraging Discriminators to Optimize Neural Radiance Fields | Barbara Roessle et.al. | 2306.06044 | null |
2023-06-09 | RePaint-NeRF: NeRF Editting via Semantic Masks and Diffusion Models | Xingchen Zhou et.al. | 2306.05668 | null |
2023-06-08 | LU-NeRF: Scene and Pose Estimation by Synchronizing Local Unposed NeRFs | Zezhou Cheng et.al. | 2306.05410 | null |
2023-06-08 | Enhance-NeRF: Multiple Performance Evaluation for Neural Radiance Fields | Qianqiu Tan et.al. | 2306.05303 | link |
2023-06-06 | Towards Visual Foundational Models of Physical Scenes | Chethan Parameshwara et.al. | 2306.03727 | null |
2023-06-06 | Human 3D Avatar Modeling with Implicit Neural Representation: A Brief Survey | Mingyang Sun et.al. | 2306.03576 | null |
2023-06-05 | H2-Mapping: Real-time Dense Mapping Using Hierarchical Hybrid Representation | Chenxing Jiang et.al. | 2306.03207 | link |
2023-06-05 | BeyondPixels: A Comprehensive Review of the Evolution of Neural Radiance Fields | AKM Shahariar Azad Rabby et.al. | 2306.03000 | null |
2023-06-05 | ZIGNeRF: Zero-shot 3D Scene Representation with Invertible Generative Neural Radiance Fields | Kanghyeok Ko et.al. | 2306.02741 | null |
2023-06-01 | FDNeRF: Semantics-Driven Face Reconstruction, Prompt Editing and Relighting with Diffusion Models | Hao Zhang et.al. | 2306.00783 | link |
2023-06-01 | Analyzing the Internals of Neural Radiance Fields | Lukas Radl et.al. | 2306.00696 | link |
2023-06-02 | AvatarStudio: Text-driven Editing of 3D Dynamic Human Head Avatars | Mohit Mendiratta et.al. | 2306.00547 | null |
2023-05-30 | DäRF: Boosting Radiance Fields from Sparse Inputs with Monocular Depth Adaptation | Jiuhn Song et.al. | 2305.19201 | link |
2023-05-30 | Template-free Articulated Neural Point Clouds for Reposable View Synthesis | Lukas Uzolas et.al. | 2305.19065 | link |
2023-05-31 | HiFA: High-fidelity Text-to-3D with Advanced Diffusion Guidance | Junzhe Zhu et.al. | 2305.18766 | link |
2023-05-31 | Towards a Robust Framework for NeRF Evaluation | Adrian Azzarelli et.al. | 2305.18079 | link |
2023-05-31 | Volume Feature Rendering for Fast Neural Radiance Field Reconstruction | Kang Han et.al. | 2305.17916 | null |
2023-05-30 | PlaNeRF: SVD Unsupervised 3D Plane Regularization for NeRF Large-Scale Scene Reconstruction | Fusang Wang et.al. | 2305.16914 | null |
2023-05-25 | ZeroAvatar: Zero-shot 3D Avatar Generation from a Single Image | Zhenzhen Weng et.al. | 2305.16411 | null |
2023-05-25 | Interactive Segment Anything NeRF with Feature Imitation | Xiaokang Chen et.al. | 2305.16233 | null |
2023-05-25 | ProlificDreamer: High-Fidelity and Diverse Text-to-3D Generation with Variational Score Distillation | Zhengyi Wang et.al. | 2305.16213 | link |
2023-05-31 | Deceptive-NeRF: Enhancing NeRF Reconstruction using Pseudo-Observations from Diffusion Models | Xinhang Liu et.al. | 2305.15171 | null |
2023-05-24 | InpaintNeRF360: Text-Guided 3D Inpainting on Unbounded Neural Radiance Fields | Dongqing Wang et.al. | 2305.15094 | null |
2023-05-24 | OD-NeRF: Efficient Training of On-the-Fly Dynamic Neural Radiance Fields | Zhiwen Yan et.al. | 2305.14831 | null |
2023-05-24 | 3D Open-vocabulary Segmentation with Foundation Models | Kunhao Liu et.al. | 2305.14093 | link |
2023-05-22 | NeRFuser: Large-Scale Scene Representation by NeRF Fusion | Jiading Fang et.al. | 2305.13307 | link |
2023-05-22 | Registering Neural Radiance Fields as 3D Density Images | Han Jiang et.al. | 2305.12843 | null |
2023-05-19 | Text2NeRF: Text-Driven 3D Scene Generation with Neural Radiance Fields | Jingbo Zhang et.al. | 2305.11588 | link |
2023-05-18 | MVPSNet: Fast Generalizable Multi-view Photometric Stereo | Dongxu Zhao et.al. | 2305.11167 | null |
2023-05-18 | ConsistentNeRF: Enhancing Neural Radiance Fields with 3D Consistency for Sparse View Synthesis | Shoukang Hu et.al. | 2305.11031 | link |
2023-05-17 | MultiPlaneNeRF: Neural Radiance Field with Non-Trainable Representation | Dominik Zimny et.al. | 2305.10579 | link |
2023-05-24 | OR-NeRF: Object Removing from 3D Scenes Guided by Multiview Segmentation with Neural Radiance Fields | Youtan Yin et.al. | 2305.10503 | link |
2023-05-16 | NerfBridge: Bringing Real-time, Online Neural Radiance Field Training to Robotics | Javier Yu et.al. | 2305.09761 | link |
2023-05-15 | MV-Map: Offboard HD-Map Generation with Multi-view Consistency | Ziyang Xie et.al. | 2305.08851 | link |
2023-05-12 | BundleRecon: Ray Bundle-Based 3D Neural Reconstruction | Weikun Zhang et.al. | 2305.07342 | null |
2023-05-10 | Generative AI meets 3D: A Survey on Text-to-3D in AIGC Era | Chenghao Li et.al. | 2305.06131 | null |
2023-05-10 | NeRF $^\textbf{2}$ : Neural Radio-Frequency Radiance Fields | Xiaopeng Zhao et.al. | 2305.06118 | null |
2023-05-09 | Instant-NeRF: Instant On-Device Neural Radiance Field Training via Algorithm-Accelerator Co-Designed Near-Memory Processing | Yang Zhao et.al. | 2305.05766 | null |
2023-05-09 | PET-NeuS: Positional Encoding Tri-Planes for Neural Surfaces | Yiqun Wang et.al. | 2305.05594 | link |
2023-05-08 | NerfAcc: Efficient Sampling Accelerates NeRFs | Ruilong Li et.al. | 2305.04966 | null |
2023-05-08 | AvatarReX: Real-time Expressive Full-body Avatars | Zerong Zheng et.al. | 2305.04789 | null |
2023-05-07 | HashCC: Lightweight Method to Improve the Quality of the Camera-less NeRF Scene Generation | Jan Olszewski et.al. | 2305.04296 | null |
2023-05-07 | Multi-Space Neural Radiance Fields | Ze-Xin Yin et.al. | 2305.04268 | null |
2023-05-04 | NeRF-QA: Neural Radiance Fields Quality Assessment Database | Pedro Martin et.al. | 2305.03176 | null |
2023-05-04 | NeuralEditor: Editing Neural Radiance Fields via Manipulating Point Clouds | Jun-Kun Chen et.al. | 2305.03049 | null |
2023-05-04 | Radiance Field Gradient Scaling for Unbiased Near-Camera Training | Julien Philip et.al. | 2305.02756 | link |
2023-05-04 | Semantic-aware Generation of Multi-view Portrait Drawings | Biao Ma et.al. | 2305.02618 | link |
2023-05-02 | Neural LiDAR Fields for Novel View Synthesis | Shengyu Huang et.al. | 2305.01643 | null |
2023-05-03 | LatentAvatar: Learning Latent Expression Code for Expressive Neural Head Avatar | Yuelang Xu et.al. | 2305.01190 | null |
2023-05-02 | Federated Neural Radiance Fields | Lachlan Holden et.al. | 2305.01163 | link |
2023-05-01 | GeneFace++: Generalized and Stable Real-Time Audio-Driven 3D Talking Face Generation | Zhenhui Ye et.al. | 2305.00787 | null |
2023-04-30 | Neural Radiance Fields (NeRFs): A Review and Some Recent Developments | Mohamed Debbagh et.al. | 2305.00375 | null |
2023-04-28 | ViP-NeRF: Visibility Prior for Sparse Input Neural Radiance Fields | Nagabhushan Somraj et.al. | 2305.00041 | link |
2023-04-28 | NeRF-LiDAR: Generating Realistic LiDAR Point Clouds with Neural Radiance Fields | Junge Zhang et.al. | 2304.14811 | link |
2023-04-27 | Learning a Diffusion Prior for NeRFs | Guandao Yang et.al. | 2304.14473 | null |
2023-04-27 | ActorsNeRF: Animatable Few-shot Human Rendering with Generalizable NeRFs | Jiteng Mu et.al. | 2304.14401 | null |
2023-05-03 | Combining HoloLens with Instant-NeRFs: Advanced Real-Time 3D Mobile Mapping | Dennis Haitz et.al. | 2304.14301 | null |
2023-04-27 | Compositional 3D Human-Object Neural Animation | Zhi Hou et.al. | 2304.14070 | null |
2023-04-26 | Super-NeRF: View-consistent Detail Generation for NeRF super-resolution | Yuqi Han et.al. | 2304.13518 | null |
2023-04-26 | VGOS: Voxel Grid Optimization for View Synthesis from Sparse Inputs | Jiakai Sun et.al. | 2304.13386 | link |
2023-04-25 | Local Implicit Ray Function for Generalizable Radiance Field Representation | Xin Huang et.al. | 2304.12746 | null |
2023-04-27 | MF-NeRF: Memory Efficient NeRF with Mixed-Feature Hash Table | Yongjae Lee et.al. | 2304.12587 | link |
2023-04-24 | Instant-3D: Instant Neural Radiance Field Training Towards On-Device AR/VR 3D Reconstruction | Sixu Li et.al. | 2304.12467 | null |
2023-04-24 | TextMesh: Generation of Realistic 3D Meshes From Text Prompts | Christina Tsalicoglou et.al. | 2304.12439 | null |
2023-04-26 | Segment Anything in 3D with NeRFs | Jiazhong Cen et.al. | 2304.12308 | link |
2023-04-24 | Explicit Correspondence Matching for Generalizable Neural Radiance Fields | Yuedong Chen et.al. | 2304.12294 | link |
2023-04-25 | Gen-NeRF: Efficient and Generalizable Neural Radiance Fields via Algorithm-Hardware Co-Design | Yonggan Fu et.al. | 2304.11842 | null |
2023-04-22 | 3D-IntPhys: Towards More Generalized 3D-grounded Visual Intuitive Physics under Challenging Scenes | Haotian Xue et.al. | 2304.11470 | null |
2023-04-22 | Dehazing-NeRF: Neural Radiance Fields from Hazy Images | Tian Li et.al. | 2304.11448 | null |
2023-04-22 | NaviNeRF: NeRF-based 3D Representation Disentanglement by Latent Semantic Navigation | Baao Xie et.al. | 2304.11342 | link |
2023-04-21 | AutoNeRF: Training Implicit Scene Representations with Autonomous Agents | Pierre Marza et.al. | 2304.11241 | link |
2023-04-21 | Omni-Line-of-Sight Imaging for Holistic Shape Reconstruction | Binbin Huang et.al. | 2304.10780 | null |
2023-04-20 | A Comparative Neural Radiance Field (NeRF) 3D Analysis of Camera Poses from HoloLens Trajectories and Structure from Motion | Miriam Jäger et.al. | 2304.10664 | null |
2023-04-20 | Learning Neural Duplex Radiance Fields for Real-Time View Synthesis | Ziyu Wan et.al. | 2304.10537 | null |
2023-04-21 | Nerfbusters: Removing Ghostly Artifacts from Casually Captured NeRFs | Frederik Warburg et.al. | 2304.10532 | link |
2023-04-20 | ReLight My NeRF: A Dataset for Novel View Synthesis and Relighting of Real World Objects | Marco Toschi et.al. | 2304.10448 | null |
2023-04-20 | LiDAR-NeRF: Novel LiDAR View Synthesis via Neural Radiance Fields | Tang Tao et.al. | 2304.10406 | link |
2023-04-20 | Revisiting Implicit Neural Representations in Low-Level Vision | Wentian Xu et.al. | 2304.10250 | link |
2023-04-20 | Multiscale Representation for Real-Time Anti-Aliasing Neural Rendering | Dongting Hu et.al. | 2304.10075 | null |
2023-04-20 | Neural Radiance Fields: Past, Present, and Future | Ansh Mittal et.al. | 2304.10050 | link |
2023-04-19 | Tetra-NeRF: Representing Neural Radiance Fields Using Tetrahedra | Jonas Kulhanek et.al. | 2304.09987 | link |
2023-04-20 | Reference-guided Controllable Inpainting of Neural Radiance Fields | Ashkan Mirzaei et.al. | 2304.09677 | null |
2023-04-18 | SurfelNeRF: Neural Surfel Radiance Fields for Online Photorealistic Reconstruction of Indoor Scenes | Yiming Gao et.al. | 2304.08971 | null |
2023-04-18 | NeAI: A Pre-convoluted Representation for Plug-and-Play Neural Ambient Illumination | Yiyu Zhuang et.al. | 2304.08757 | null |
2023-04-17 | MoDA: Modeling Deformable 3D Objects from Casual Videos | Chaoyue Song et.al. | 2304.08279 | link |
2023-04-17 | NeRF-Loc: Visual Localization with Conditional Neural Radiance Field | Jianlin Liu et.al. | 2304.07979 | link |
2023-04-16 | Likelihood-Based Generative Radiance Field with Latent Space Energy-Based Model for 3D-Aware Disentangled Image Representation | Yaxuan Zhu et.al. | 2304.07918 | null |
2023-04-16 | CAT-NeRF: Constancy-Aware Tx $^2$ Former for Dynamic Body Modeling | Haidong Zhu et.al. | 2304.07915 | link |
2023-04-16 | SeaThru-NeRF: Neural Radiance Fields in Scattering Media | Deborah Levy et.al. | 2304.07743 | link |
2023-04-14 | UVA: Towards Unified Volumetric Avatar for View Synthesis, Pose rendering, Geometry and Texture Editing | Jinlong Fan et.al. | 2304.06969 | null |
2023-04-17 | Single-Stage Diffusion NeRF: A Unified Approach to 3D Generation and Reconstruction | Hansheng Chen et.al. | 2304.06714 | link |
2023-04-13 | Zip-NeRF: Anti-Aliased Grid-Based Neural Radiance Fields | Jonathan T. Barron et.al. | 2304.06706 | null |
2023-04-13 | NeRFVS: Neural Radiance Fields for Free View Synthesis via Geometry Scaffolds | Chen Yang et.al. | 2304.06287 | null |
2023-04-12 | NutritionVerse-Thin: An Optimized Strategy for Enabling Improved Rendering of 3D Thin Food Models | Chi-en Amy Tai et.al. | 2304.05620 | null |
2023-04-11 | Improving Neural Radiance Fields with Depth-aware Optimization for Novel View Synthesis | Shu Chen et.al. | 2304.05218 | link |
2023-04-11 | One-Shot High-Fidelity Talking-Head Synthesis with Deformable Neural Radiance Field | Weichuang Li et.al. | 2304.05097 | null |
2023-04-11 | MRVM-NeRF: Mask-Based Pretraining for Neural Radiance Fields | Ganlin Yang et.al. | 2304.04962 | link |
2023-04-10 | Neural Image-based Avatars: Generalizable Radiance Fields for Human Avatar Modeling | Youngjoong Kwon et.al. | 2304.04897 | null |
2023-04-07 | Event-based Camera Tracker by $\nabla$ t NeRF | Mana Masuda et.al. | 2304.04559 | null |
2023-04-10 | Neural Residual Radiance Fields for Streamably Free-Viewpoint Videos | Liao Wang et.al. | 2304.04452 | null |
2023-04-10 | Inferring Fluid Dynamics via Inverse Rendering | Jinxian Liu et.al. | 2304.04446 | null |
2023-04-10 | Instance Neural Radiance Field | Benran Hu et.al. | 2304.04395 | link |
2023-04-12 | NeRF applied to satellite imagery for surface reconstruction | Federico Semeraro et.al. | 2304.04133 | link |
2023-04-08 | PVD-AL: Progressive Volume Distillation with Active Learning for Efficient Conversion Between Different NeRF Architectures | Shuangkang Fang et.al. | 2304.04012 | link |
2023-04-07 | Lift3D: Synthesize 3D Training Data by Lifting 2D GAN to 3D Generative Radiance Field | Leheng Li et.al. | 2304.03526 | null |
2023-04-06 | Beyond NeRF Underwater: Learning Neural Reflectance Fields for True Color Correction of Marine Imagery | Tianyi Zhang et.al. | 2304.03384 | link |
2023-04-06 | LANe: Lighting-Aware Neural Fields for Compositional Scene Synthesis | Akshay Krishnan et.al. | 2304.03280 | null |
2023-04-06 | Neural Fields meet Explicit Geometric Representation for Inverse Rendering of Urban Scenes | Zian Wang et.al. | 2304.03266 | null |
2023-04-06 | DITTO-NeRF: Diffusion-based Iterative Text To Omni-directional 3D Model | Hoigi Seo et.al. | 2304.02827 | null |
2023-04-05 | Image Stabilization for Hololens Camera in Remote Collaboration | Gowtham Senthil et.al. | 2304.02736 | null |
2023-04-04 | Generating Continual Human Motion in Diverse 3D Scenes | Aymen Mir et.al. | 2304.02061 | null |
2023-04-04 | MonoHuman: Animatable Human Neural Field from Monocular Video | Zhengming Yu et.al. | 2304.02001 | null |
2023-04-06 | DreamAvatar: Text-and-Shape Guided 3D Human Avatar Generation via Diffusion Models | Yukang Cao et.al. | 2304.00916 | link |
2023-04-01 | JacobiNeRF: NeRF Shaping with Mutual Information Gradients | Xiaomeng Xu et.al. | 2304.00341 | link |
2023-03-31 | VDN-NeRF: Resolving Shape-Radiance Ambiguity via View-Dependence Normalization | Bingfan Zhu et.al. | 2303.17968 | link |
2023-03-30 | NeRF-Supervised Deep Stereo | Fabio Tosi et.al. | 2303.17603 | link |
2023-03-30 | SynBody: Synthetic Dataset with Layered Human Models for 3D Human Perception and Modeling | Zhitao Yang et.al. | 2303.17368 | link |
2023-03-30 | NeILF++: Inter-Reflectable Light Fields for Geometry and Material Estimation | Jingyang Zhang et.al. | 2303.17147 | null |
2023-03-30 | Enhanced Stable View Synthesis | Nishant Jain et.al. | 2303.17094 | null |
2023-03-29 | TriVol: Point Cloud Rendering via Triple Volumes | Tao Hu et.al. | 2303.16485 | link |
2023-03-29 | Point2Pix: Photo-Realistic Point Cloud Rendering via Neural Radiance Fields | Tao Hu et.al. | 2303.16482 | null |
2023-03-28 | Flow supervision for Deformable NeRF | Chaoyang Wang et.al. | 2303.16333 | null |
2023-03-28 | SparseNeRF: Distilling Depth Ranking for Few-shot Novel View Synthesis | Guangcong Wang et.al. | 2303.16196 | link |
2023-03-28 | VMesh: Hybrid Volume-Mesh Representation for Efficient View Synthesis | Yuan-Chen Guo et.al. | 2303.16184 | null |
2023-03-30 | Adaptive Voronoi NeRFs | Tim Elsner et.al. | 2303.16001 | null |
2023-03-28 | F $^{2}$ -NeRF: Fast Neural Radiance Field Training with Free Camera Trajectories | Peng Wang et.al. | 2303.15951 | link |
2023-03-27 | JAWS: Just A Wild Shot for Cinematic Transfer in Neural Radiance Fields | Xi Wang et.al. | 2303.15427 | link |
2023-03-27 | Generalizable Neural Voxels for Fast Human Radiance Fields | Taoran Yi et.al. | 2303.15387 | null |
2023-03-27 | NeUDF: Learning Unsigned Distance Fields from Multi-view Images for Reconstructing Non-watertight Models | Fei Hou et.al. | 2303.15368 | link |
2023-03-24 | Perceptual Quality Assessment of NeRF and Neural View Synthesis Methods for Front-Facing Views | Hanxue Liang et.al. | 2303.15206 | null |
2023-03-27 | 3D-Aware Multi-Class Image-to-Image Translation with NeRFs | Senmao Li et.al. | 2303.15012 | link |
2023-03-26 | Clean-NeRF: Reformulating NeRF to account for View-Dependent Observations | Xinhang Liu et.al. | 2303.14707 | null |
2023-03-25 | SUDS: Scalable Urban Dynamic Scenes | Haithem Turki et.al. | 2303.14536 | null |
2023-03-25 | DBARF: Deep Bundle-Adjusting Generalizable Neural Radiance Fields | Yu Chen et.al. | 2303.14478 | null |
2023-03-25 | NeRF-DS: Neural Radiance Fields for Dynamic Specular Objects | Zhiwen Yan et.al. | 2303.14435 | link |
2023-03-24 | Grid-guided Neural Radiance Fields for Large Urban Scenes | Linning Xu et.al. | 2303.14001 | null |
2023-03-24 | CompoNeRF: Text-guided Multi-object Compositional NeRF with Editable 3D Scene Layout | Yiqi Lin et.al. | 2303.13843 | null |
2023-03-24 | HandNeRF: Neural Radiance Fields for Animatable Interacting Hands | Zhiyang Guo et.al. | 2303.13825 | null |
2023-03-24 | ABLE-NeRF: Attention-Based Rendering with Learnable Embeddings for Neural Radiance Field | Zhe Jun Tang et.al. | 2303.13817 | link |
2023-03-24 | GM-NeRF: Learning Generalizable Model-based Neural Radiance Fields from Multi-view Images | Jianchuan Chen et.al. | 2303.13777 | null |
2023-03-24 | TEGLO: High Fidelity Canonical Texture Mapping from Single-View Images | Vishal Vinod et.al. | 2303.13743 | null |
2023-03-23 | SCADE: NeRFs from Space Carving with Ambiguity-Aware Depth Estimates | Mikaela Angelina Uy et.al. | 2303.13582 | null |
2023-03-23 | TriPlaneNet: An Encoder for EG3D Inversion | Ananta R. Bhattarai et.al. | 2303.13497 | null |
2023-03-23 | Plotting Behind the Scenes: Towards Learnable Game Engines | Willi Menapace et.al. | 2303.13472 | null |
2023-03-23 | Set-the-Scene: Global-Local Training for Generating Controllable NeRF Scenes | Dana Cohen-Bar et.al. | 2303.13450 | link |
2023-03-23 | SINE: Semantic-driven Image-based NeRF Editing with Prior-guided Editing Field | Chong Bao et.al. | 2303.13277 | link |
2023-03-23 | Transforming Radiance Field with Lipschitz Network for Photorealistic 3D Scene Stylization | Zicheng Zhang et.al. | 2303.13232 | null |
2023-03-23 | Semantic Ray: Learning a Generalizable Semantic Field with Cross-Reprojection Attention | Fangfu Liu et.al. | 2303.13014 | link |
2023-03-22 | NeRF-GAN Distillation for Efficient 3D-Aware Generation with Convolutions | Mohamad Shahbazi et.al. | 2303.12865 | link |
2023-03-22 | SHERF: Generalizable Human NeRF from a Single Image | Shoukang Hu et.al. | 2303.12791 | link |
2023-03-22 | Instruct-NeRF2NeRF: Editing 3D Scenes with Instructions | Ayaan Haque et.al. | 2303.12789 | null |
2023-03-22 | FeatureNeRF: Learning Generalizable NeRFs by Distilling Foundation Models | Jianglong Ye et.al. | 2303.12786 | link |
2023-03-22 | Balanced Spherical Grid for Egocentric View Synthesis | Changwoon Choi et.al. | 2303.12408 | link |
2023-03-21 | Pre-NeRF 360: Enriching Unbounded Appearances for Neural Radiance Fields | Ahmad AlMughrabi et.al. | 2303.12234 | link |
2023-03-21 | 3D-CLFusion: Fast Text-to-3D Rendering with Contrastive Latent Diffusion | Yu-Jhe Li et.al. | 2303.11938 | null |
2023-03-22 | ExtremeNeRF: Few-shot Neural Radiance Fields Under Unconstrained Illumination | SeokYeong Lee et.al. | 2303.11728 | null |
2023-03-20 | DehazeNeRF: Multiple Image Haze Removal and 3D Shape Reconstruction using Neural Radiance Fields | Wei-Ting Chen et.al. | 2303.11364 | null |
2023-03-20 | ContraNeRF: Generalizable Neural Radiance Fields for Synthetic-to-real Novel View Synthesis via Contrastive Learning | Hao Yang et.al. | 2303.11052 | null |
2023-03-19 | SKED: Sketch-guided Text-based 3D Editing | Aryan Mikaeili et.al. | 2303.10735 | null |
2023-03-19 | NeRF-LOAM: Neural Implicit Representation for Large-Scale Incremental LiDAR Odometry and Mapping | Junyuan Deng et.al. | 2303.10709 | link |
2023-03-18 | 3D Data Augmentation for Driving Scenes on Camera | Wenwen Tong et.al. | 2303.10340 | null |
2023-03-17 | $α$ Surf: Implicit Surface Reconstruction for Semi-Transparent and Thin Objects with Decoupled Geometry and Opacity | Tianhao Wu et.al. | 2303.10083 | null |
2023-03-17 | Single-view Neural Radiance Fields with Depth Teacher | Yurui Chen et.al. | 2303.09952 | null |
2023-03-21 | PartNeRF: Generating Part-Aware Editable 3D Shapes without 3D Supervision | Konstantinos Tertikas et.al. | 2303.09554 | null |
2023-03-16 | LERF: Language Embedded Radiance Fields | Justin Kerr et.al. | 2303.09553 | null |
2023-03-16 | NeRFMeshing: Distilling Neural Radiance Fields into Geometrically-Accurate 3D Meshes | Marie-Julie Rakotosaona et.al. | 2303.09431 | null |
2023-03-17 | NeRFtrinsic Four: An End-To-End Trainable NeRF Jointly Optimizing Diverse Intrinsic and Extrinsic Camera Parameters | Hannah Schieber et.al. | 2303.09412 | link |
2023-03-16 | Reliable Image Dehazing by NeRF | Zheyan Jin et.al. | 2303.09153 | null |
2023-03-15 | Mesh Strikes Back: Fast and Efficient Human Reconstruction from RGB videos | Rohit Jena et.al. | 2303.08808 | null |
2023-03-15 | Re-ReND: Real-time Rendering of NeRFs across Devices | Sara Rojas et.al. | 2303.08717 | link |
2023-03-15 | RefiNeRF: Modelling dynamic neural radiance fields with inconsistent or missing camera parameters | Shuja Khalid et.al. | 2303.08695 | null |
2023-03-15 | Harnessing Low-Frequency Neural Fields for Few-Shot View Synthesis | Liangchen Song et.al. | 2303.08370 | link |
2023-03-14 | MELON: NeRF with Unposed Images Using Equivalence Class Estimation | Axel Levy et.al. | 2303.08096 | null |
2023-03-16 | Let 2D Diffusion Model Know 3D-Consistency for Robust Text-to-3D Generation | Junyoung Seo et.al. | 2303.07937 | link |
2023-03-16 | NEF: Neural Edge Fields for 3D Parametric Curve Reconstruction from Multi-view Images | Yunfan Ye et.al. | 2303.07653 | link |
2023-03-14 | Frequency-Modulated Point Cloud Rendering with Easy Editing | Yi Zhang et.al. | 2303.07596 | link |
2023-03-13 | FreeNeRF: Improving Few-shot Neural Rendering with Free Frequency Regularization | Jiawei Yang et.al. | 2303.07418 | link |
2023-03-13 | NeRFLiX: High-Quality Neural View Synthesis by Learning a Degradation-Driven Inter-viewpoint MiXer | Kun Zhou et.al. | 2303.06919 | link |
2023-03-11 | Just Flip: Flipped Observation Generation and Optimization for Neural Radiance Fields to Cover Unobserved View | Minjae Lee et.al. | 2303.06335 | link |
2023-03-10 | NeRFlame: FLAME-based conditioning of NeRF for 3D face rendering | Wojciech Zając et.al. | 2303.06226 | link |
2023-03-10 | You Only Train Once: Multi-Identity Free-Viewpoint Neural Human Rendering from Monocular Videos | Jaehyeok Kim et.al. | 2303.05835 | null |
2023-03-10 | Aleth-NeRF: Low-light Condition View Synthesis with Concealing Fields | Ziteng Cui et.al. | 2303.05807 | null |
2023-03-10 | Self-NeRF: A Self-Training Pipeline for Few-Shot Neural Radiance Fields | Jiayang Bai et.al. | 2303.05775 | null |
2023-03-14 | Hardware Acceleration of Neural Graphics | Muhammad Husnain Mubarik et.al. | 2303.05735 | null |
2023-03-10 | MovingParts: Motion-based 3D Part Discovery in Dynamic Radiance Field | Kaizhi Yang et.al. | 2303.05703 | null |
2023-03-09 | PAC-NeRF: Physics Augmented Continuum Neural Radiance Fields for Geometry-Agnostic System Identification | Xuan Li et.al. | 2303.05512 | null |
2023-03-08 | FastSurf: Fast Neural RGB-D Surface Reconstruction using Per-Frame Intrinsic Refinement and TSDF Fusion Prior Learning | Seunghwan Lee et.al. | 2303.04508 | link |
2023-03-08 | DroNeRF: Real-time Multi-agent Drone Pose Optimization for Computing Neural Radiance Fields | Dipam Patel et.al. | 2303.04322 | null |
2023-03-07 | NEPHELE: A Neural Platform for Highly Realistic Cloud Radiance Rendering | Haimin Luo et.al. | 2303.04086 | null |
2023-03-05 | Semantic-aware Occlusion Filtering Neural Radiance Fields in the Wild | Jaewon Lee et.al. | 2303.03966 | null |
2023-03-07 | Multiscale Tensor Decomposition and Rendering Equation Encoding for View Synthesis | Kang Han et.al. | 2303.03808 | link |
2023-03-10 | Nerflets: Local Radiance Fields for Efficient Structure-Aware 3D Scene Representation from 2D Supervision | Xiaoshuai Zhang et.al. | 2303.03361 | null |
2023-03-07 | Efficient Large-scale Scene Representation with a Hybrid of High-resolution Grid and Plane Features | Yuqi Zhang et.al. | 2303.03003 | link |
2023-03-03 | Delicate Textured Mesh Recovery from NeRF via Adaptive Surface Refinement | Jiaxiang Tang et.al. | 2303.02091 | link |
2023-03-03 | Multi-Plane Neural Radiance Fields for Novel View Synthesis | Youssef Abdelkareem et.al. | 2303.01736 | null |
2023-03-01 | S-NeRF: Neural Radiance Fields for Street Views | Ziyang Xie et.al. | 2303.00749 | null |
2023-02-28 | IntrinsicNGP: Intrinsic Coordinate based Hash Encoding for Human NeRF | Bo Peng et.al. | 2302.14683 | null |
2023-02-27 | BaLi-RF: Bandlimited Radiance Fields for Dynamic Scene Modeling | Sameera Ramasinghe et.al. | 2302.13543 | null |
2023-02-26 | Efficient physics-informed neural networks using hash encoding | Xinquan Huang et.al. | 2302.13397 | null |
2023-02-24 | CATNIPS: Collision Avoidance Through Neural Implicit Probabilistic Scenes | Timothy Chen et.al. | 2302.12931 | link |
2023-02-24 | Learning Neural Volumetric Representations of Dynamic Humans in Minutes | Chen Geng et.al. | 2302.12237 | link |
2023-02-23 | DiffusioNeRF: Regularizing Neural Radiance Fields with Denoising Diffusion Models | Jamie Wynn et.al. | 2302.12231 | link |
2023-02-20 | NerfDiff: Single-image View Synthesis with NeRF-guided Distillation from 3D-aware Diffusion | Jiatao Gu et.al. | 2302.10109 | null |
2023-02-19 | LC-NeRF: Local Controllable Face Generation in Neural Randiance Field | Wenyang Zhou et.al. | 2302.09486 | null |
2023-02-17 | MixNeRF: Modeling a Ray with Mixture Density for Novel View Synthesis from Sparse Inputs | Seunghyeon Seo et.al. | 2302.08788 | link |
2023-02-14 | VQ3D: Learning a 3D-Aware Generative Model on ImageNet | Kyle Sargent et.al. | 2302.06833 | null |
2023-02-13 | 3D-aware Blending with Generative NeRFs | Hyunsu Kim et.al. | 2302.06608 | link |
2023-02-11 | 3D Colored Shape Reconstruction from a Single RGB Image through Diffusion | Bo Li et.al. | 2302.05573 | null |
2023-02-08 | Nerfstudio: A Modular Framework for Neural Radiance Field Development | Matthew Tancik et.al. | 2302.04264 | null |
2023-02-07 | AV-NeRF: Learning Neural Fields for Real-World Audio-Visual Scene Synthesis | Susan Liang et.al. | 2302.02088 | null |
2023-02-03 | Semantic 3D-aware Portrait Synthesis and Manipulation Based on Compositional Neural Radiance Field | Tianxiang Ma et.al. | 2302.01579 | link |
2023-02-03 | Robust Camera Pose Refinement for Multi-Resolution Hash Encoding | Hwan Heo et.al. | 2302.01571 | null |
2023-02-03 | INV: Towards Streaming Incremental Neural Videos | Shengze Wang et.al. | 2302.01532 | null |
2023-02-02 | Factor Fields: A Unified Framework for Neural Fields and Beyond | Anpei Chen et.al. | 2302.01226 | null |
2023-02-02 | RobustNeRF: Ignoring Distractors with Robust Losses | Sara Sabour et.al. | 2302.00833 | null |
2023-01-31 | GeneFace: Generalized and High-Fidelity Audio-Driven 3D Talking Face Synthesis | Zhenhui Ye et.al. | 2301.13430 | null |
2023-01-30 | Equivariant Architectures for Learning in Deep Weight Spaces | Aviv Navon et.al. | 2301.12780 | link |
2023-01-27 | HyperNeRFGAN: Hypernetwork approach to 3D NeRF GAN | Adam Kania et.al. | 2301.11631 | link |
2023-01-27 | A Comparison of Tiny-nerf versus Spatial Representations for 3d Reconstruction | Saulo Abraham Gante et.al. | 2301.11522 | null |
2023-01-27 | SNeRL: Semantic-aware Neural Radiance Fields for Reinforcement Learning | Dongseok Shim et.al. | 2301.11520 | null |
2023-01-26 | Text-To-4D Dynamic Scene Generation | Uriel Singer et.al. | 2301.11280 | null |
2023-01-26 | GeCoNeRF: Few-shot Neural Radiance Fields via Geometric Consistency | Minseop Kwak et.al. | 2301.10941 | link |
2023-01-23 | HexPlane: A Fast Representation for Dynamic Scenes | Ang Cao et.al. | 2301.09632 | link |
2023-01-22 | 3D Reconstruction of Non-cooperative Resident Space Objects using Instant NGP-accelerated NeRF and D-NeRF | Trupti Mahendrakar et.al. | 2301.09060 | null |
2023-01-18 | NeRF in the Palm of Your Hand: Corrective Augmentation for Robotics via Novel-View Synthesis | Allan Zhou et.al. | 2301.08556 | null |
2023-01-19 | RecolorNeRF: Layer Decomposed Radiance Field for Efficient Color Editing of 3D Scenes | Bingchen Gong et.al. | 2301.07958 | null |
2023-01-18 | Behind the Scenes: Density Fields for Single View Reconstruction | Felix Wimbauer et.al. | 2301.07668 | link |
2023-01-17 | A Large-Scale Outdoor Multi-modal Dataset and Benchmark for Novel View Synthesis and Implicit Scene Reconstruction | Chongshan Lu et.al. | 2301.06782 | null |
2023-01-13 | Laser: Latent Set Representations for 3D Generative Modeling | Pol Moreno et.al. | 2301.05747 | null |
2023-01-10 | Benchmarking Robustness in Neural Radiance Fields | Chen Wang et.al. | 2301.04075 | null |
2023-01-08 | Towards Open World NeRF-Based SLAM | Daniil Lisus et.al. | 2301.03102 | null |
2023-01-10 | Traditional Readability Formulas Compared for English | Bruce W. Lee et.al. | 2301.02975 | null |
2023-01-09 | Class-Continuous Conditional Generative Neural Radiance Field | Jiwook Kim et.al. | 2301.00950 | link |
2023-01-11 | Detachable Novel Views Synthesis of Dynamic Scenes Using Distribution-Driven Neural Radiance Fields | Boyu Zhang et.al. | 2301.00411 | link |
2022-12-26 | MonoNeRF: Learning a Generalizable Dynamic Radiance Field from Monocular Videos | Fengrui Tian et.al. | 2212.13056 | link |
2022-12-25 | PaletteNeRF: Palette-based Color Editing for NeRFs | Qiling Wu et.al. | 2212.12871 | null |
2022-12-22 | Removing Objects From Neural Radiance Fields | Silvan Weder et.al. | 2212.11966 | null |
2022-12-21 | Incremental Learning for Neural Radiance Field with Uncertainty-Filtered Knowledge Distillation | Mengqi Guo et.al. | 2212.10950 | link |
2022-12-21 | PaletteNeRF: Palette-based Appearance Editing of Neural Radiance Fields | Zhengfei Kuang et.al. | 2212.10699 | null |
2022-12-20 | Correspondence Distillation from NeRF-based GAN | Yushi Lan et.al. | 2212.09735 | null |
2022-12-19 | StyleTRF: Stylizing Tensorial Radiance Fields | Rahul Goel et.al. | 2212.09330 | null |
2022-12-18 | SPARF: Large-Scale Learning of 3D Sparse Radiance Fields from Few Input Images | Abdullah Hamdi et.al. | 2212.09100 | link |
2022-12-18 | Masked Wavelet Representation for Compact Neural Radiance Fields | Daniel Rho et.al. | 2212.09069 | link |
2022-12-15 | SteerNeRF: Accelerating NeRF Rendering via Smooth Viewpoint Trajectory | Sicheng Li et.al. | 2212.08476 | null |
2022-12-16 | MEIL-NeRF: Memory-Efficient Incremental Learning of Neural Radiance Fields | Jaeyoung Chung et.al. | 2212.08328 | null |
2022-12-15 | NeRF-Art: Text-Driven Neural Radiance Fields Stylization | Can Wang et.al. | 2212.08070 | link |
2022-12-15 | Real-Time Neural Light Field on Mobile Devices | Junli Cao et.al. | 2212.08057 | link |
2022-12-14 | NoPe-NeRF: Optimising Neural Radiance Field with No Pose Prior | Wenjing Bian et.al. | 2212.07388 | link |
2022-12-08 | GazeNeRF: 3D-Aware Gaze Redirection with Neural Radiance Fields | Alessandro Ruzzi et.al. | 2212.04823 | link |
2022-12-09 | 4K-NeRF: High Fidelity Neural Radiance Fields at Ultra High Resolutions | Zhongshu Wang et.al. | 2212.04701 | link |
2022-12-07 | EditableNeRF: Editing Topologically Varying Neural Radiance Fields by Key Points | Chengwei Zheng et.al. | 2212.04247 | null |
2022-12-08 | NeRFEditor: Differentiable Style Decomposition for Full 3D Scene Editing | Chunyi Sun et.al. | 2212.03848 | null |
2022-12-07 | Non-uniform Sampling Strategies for NeRF on 360{\textdegree} images | Takashi Otonari et.al. | 2212.03635 | null |
2022-12-07 | SSDNeRF: Semantic Soft Decomposition of Neural Radiance Fields | Siddhant Ranade et.al. | 2212.03406 | null |
2022-12-06 | NeRDi: Single-View NeRF Synthesis with Language-Guided Diffusion as General Image Priors | Congyue Deng et.al. | 2212.03267 | null |
2022-12-05 | SceneRF: Self-Supervised Monocular 3D Scene Reconstruction with Radiance Fields | Anh-Quan Cao et.al. | 2212.02501 | link |
2022-12-05 | Canonical Fields: Self-Supervised Learning of Pose-Canonicalized Neural Fields | Rohith Agaram et.al. | 2212.02493 | link |
2022-12-06 | D-TensoRF: Tensorial Radiance Fields for Dynamic Scenes | Hankyu Jang et.al. | 2212.02375 | null |
2022-12-07 | GARF:Geometry-Aware Generalized Neural Radiance Field | Yue Shi et.al. | 2212.02280 | null |
2022-12-05 | INGeo: Accelerating Instant Neural Scene Reconstruction with Noisy Geometry Priors | Chaojian Li et.al. | 2212.01959 | null |
2022-12-03 | MaRF: Representing Mars as Neural Radiance Fields | Lorenzo Giusti et.al. | 2212.01672 | link |
2022-12-03 | StegaNeRF: Embedding Invisible Information within Neural Radiance Fields | Chenxin Li et.al. | 2212.01602 | null |
2022-12-02 | RT-NeRF: Real-Time On-Device Neural Radiance Fields Towards Immersive AR/VR Rendering | Chaojian Li et.al. | 2212.01120 | null |
2022-12-02 | 3D-TOGO: Towards Text-Guided Cross-Category 3D Object Generation | Zutao Jiang et.al. | 2212.01103 | null |
2022-12-02 | QFF: Quantized Fourier Features for Neural Field Representations | Jae Yong Lee et.al. | 2212.00914 | null |
2022-12-01 | ViewNeRF: Unsupervised Viewpoint Estimation Using Category-Level Neural Radiance Fields | Octave Mariotti et.al. | 2212.00436 | null |
2022-11-30 | NeRFInvertor: High Fidelity NeRF-GAN Inversion for Single-shot Real Image Animation | Yu Yin et.al. | 2211.17235 | null |
2022-11-29 | NeuralLift-360: Lifting An In-the-wild 2D Photo to A 3D Object with 360° Views | Dejia Xu et.al. | 2211.16431 | link |
2022-11-29 | Compressing Volumetric Radiance Fields to 1 MB | Lingzhi Li et.al. | 2211.16386 | link |
2022-11-28 | In-Hand 3D Object Scanning from an RGB Sequence | Shreyas Hampali et.al. | 2211.16193 | null |
2022-11-30 | One is All: Bridging the Gap Between Neural Radiance Fields Architectures with Progressive Volume Distillation | Shuangkang Fang et.al. | 2211.15977 | link |
2022-11-28 | High-fidelity Facial Avatar Reconstruction from Monocular Video with Generative Priors | Yunpeng Bai et.al. | 2211.15064 | null |
2022-11-27 | SuNeRF: Validation of a 3D Global Reconstruction of the Solar Corona Using Simulated EUV Images | Kyriaki-Margarita Bintsi et.al. | 2211.14879 | null |
2022-11-27 | 3D Scene Creation and Rendering via Rough Meshes: A Lighting Transfer Avenue | Yujie Li et.al. | 2211.14823 | null |
2022-11-27 | Sampling Neural Radiance Fields for Refractive Objects | Jen-I Pan et.al. | 2211.14799 | link |
2022-11-25 | 3DDesigner: Towards Photorealistic 3D Object Generation and Editing with Text-guided Diffusion Models | Gang Li et.al. | 2211.14108 | null |
2022-11-25 | ShadowNeuS: Neural SDF Reconstruction by Shadow Ray Supervision | Jingwang Ling et.al. | 2211.14086 | link |
2022-11-25 | Dynamic Neural Portraits | Michail Christos Doukas et.al. | 2211.13994 | null |
2022-11-25 | Unsupervised Continual Semantic Adaptation through Neural Rendering | Zhizheng Liu et.al. | 2211.13969 | link |
2022-11-25 | TPA-Net: Generate A Dataset for Text to Physics-based Animation | Yuxing Qiu et.al. | 2211.13887 | null |
2022-11-24 | ScanNeRF: a Scalable Benchmark for Neural Radiance Fields | Luca De Luigi et.al. | 2211.13762 | null |
2022-11-24 | Immersive Neural Graphics Primitives | Ke Li et.al. | 2211.13494 | link |
2022-11-23 | CGOF++: Controllable 3D Face Synthesis with Conditional Generative Occupancy Fields | Keqiang Sun et.al. | 2211.13251 | null |
2022-11-26 | ClimateNeRF: Physically-based Neural Rendering for Extreme Climate Synthesis | Yuan Li et.al. | 2211.13226 | null |
2022-11-23 | ManVatar : Fast 3D Head Avatar Reconstruction Using Motion-Aware Neural Voxels | Yuelang Xu et.al. | 2211.13206 | null |
2022-11-23 | BAD-NeRF: Bundle Adjusted Deblur Neural Radiance Fields | Peng Wang et.al. | 2211.12853 | link |
2022-11-23 | PANeRF: Pseudo-view Augmentation for Improved Neural Radiance Fields Based on Few-shot Inputs | Young Chun Ahn et.al. | 2211.12758 | null |
2022-11-23 | ActiveRMAP: Radiance Field for Active Mapping And Planning | Huangying Zhan et.al. | 2211.12656 | null |
2022-11-22 | Zero NeRF: Registration with Zero Overlap | Casey Peat et.al. | 2211.12544 | null |
2022-11-22 | Depth-Supervised NeRF for Multi-View RGB-D Operating Room Images | Beerend G. A. Gerats et.al. | 2211.12436 | null |
2022-11-22 | Real-time Neural Radiance Talking Portrait Synthesis via Audio-spatial Decomposition | Jiaxiang Tang et.al. | 2211.12368 | null |
2022-11-22 | Exact-NeRF: An Exploration of a Precise Volumetric Parameterization for Neural Radiance Fields | Brian K. S. Isaac-Medina et.al. | 2211.12285 | link |
2022-11-22 | SPIn-NeRF: Multiview Segmentation and Perceptual Inpainting with Neural Radiance Fields | Ashkan Mirzaei et.al. | 2211.12254 | null |
2022-11-22 | Deblurred Neural Radiance Field with Physical Scene Priors | Dogyoon Lee et.al. | 2211.12046 | link |
2022-11-22 | ONeRF: Unsupervised 3D Object Segmentation from Multiple Views | Shengnan Liang et.al. | 2211.12038 | null |
2022-11-21 | Towards Live 3D Reconstruction from Wearable Video: An Evaluation of V-SLAM, NeRF, and Videogrammetry Techniques | David Ramirez et.al. | 2211.11836 | null |
2022-11-21 | SPARF: Neural Radiance Fields from Sparse and Noisy Poses | Prune Truong et.al. | 2211.11738 | link |
2022-11-21 | ESLAM: Efficient Dense SLAM System Based on Hybrid Representation of Signed Distance Fields | Mohammad Mahdi Johari et.al. | 2211.11704 | null |
2022-11-21 | Shape, Pose, and Appearance from a Single Image via Bootstrapped Radiance Field Inversion | Dario Pavllo et.al. | 2211.11674 | link |
2022-11-18 | Magic3D: High-Resolution Text-to-3D Content Creation | Chen-Hsuan Lin et.al. | 2211.10440 | null |
2022-11-17 | AligNeRF: High-Fidelity Neural Radiance Fields via Alignment-Aware Training | Yifan Jiang et.al. | 2211.09682 | null |
2022-11-16 | CoNFies: Controllable Neural Face Avatars | Heng Yu et.al. | 2211.08610 | null |
2022-11-14 | Latent-NeRF for Shape-Guided Generation of 3D Shapes and Textures | Gal Metzer et.al. | 2211.07600 | link |
2022-11-12 | 3D-Aware Encoding for Style-based Neural Radiance Fields | Yu-Jhe Li et.al. | 2211.06583 | null |
2022-11-11 | ParticleNeRF: A Particle-Based Encoding for Online Neural Radiance Fields in Dynamic Scenes | Jad Abou-Chakra et.al. | 2211.04041 | null |
2022-11-07 | Common Pets in 3D: Dynamic New-View Synthesis of Real-Life Deformable Categories | Samarth Sinha et.al. | 2211.03889 | null |
2022-11-03 | nerf2nerf: Pairwise Registration of Neural Radiance Fields | Lily Goli et.al. | 2211.01600 | null |
2022-10-27 | ProbNeRF: Uncertainty-Aware Inference of 3D Shapes from 2D Images | Matthew D. Hoffman et.al. | 2210.17415 | null |
2022-10-27 | Boosting Point Clouds Rendering via Radiance Mapping | Xiaoyang Huang et.al. | 2210.15107 | link |
2022-10-24 | Learning Neural Radiance Fields from Multi-View Geometry | Marco Orsingher et.al. | 2210.13041 | null |
2022-10-23 | Compressing Explicit Voxel Grid Representations: fast NeRFs become also small | Chenxi Lola Deng et.al. | 2210.12782 | null |
2022-11-06 | Joint Rigid Motion Correction and Sparse-View CT via Self-Calibrating Neural Field | Qing Wu et.al. | 2210.12731 | null |
2022-10-21 | An Exploration of Neural Radiance Field Scene Reconstruction: Synthetic, Real-world and Dynamic Scenes | Benedict Quartey et.al. | 2210.12268 | null |
2022-11-06 | Neural Fields for Robotic Object Manipulation from a Single Image | Valts Blukis et.al. | 2210.12126 | null |
2022-10-21 | HDHumans: A Hybrid Approach for High-fidelity Digital Humans | Marc Habermann et.al. | 2210.12003 | null |
2022-10-21 | RGB-Only Reconstruction of Tabletop Scenes for Collision-Free Manipulator Control | Zhenggang Tang et.al. | 2210.11668 | null |
2022-10-21 | Coordinates Are NOT Lonely – Codebook Prior Helps Implicit Neural 3D Representations | Fukun Yin et.al. | 2210.11170 | link |
2022-10-18 | Parallel Inversion of Neural Radiance Fields for Robust Pose Estimation | Yunzhi Lin et.al. | 2210.10108 | link |
2022-10-18 | ARAH: Animatable Volume Rendering of Articulated Human SDFs | Shaofei Wang et.al. | 2210.10036 | null |
2022-10-20 | Differentiable Physics Simulation of Dynamics-Augmented Neural Objects | Simon Le Cleac’h et.al. | 2210.09420 | null |
2022-10-15 | SPIDR: SDF-based Neural Point Fields for Illumination and Deformation | Ruofan Liang et.al. | 2210.08398 | null |
2022-10-15 | IBL-NeRF: Image-Based Lighting Formulation of Neural Radiance Fields | Changwoon Choi et.al. | 2210.08202 | link |
2022-10-17 | 3D GAN Inversion with Pose Optimization | Jaehoon Ko et.al. | 2210.07301 | link |
2022-10-13 | Multiplane NeRF-Supervised Disentanglement of Depth and Camera Pose from Videos | Yang Fu et.al. | 2210.07181 | null |
2022-10-12 | GraspNeRF: Multiview-based 6-DoF Grasp Detection for Transparent and Specular Objects Using Generalizable NeRF | Qiyu Dai et.al. | 2210.06575 | link |
2022-10-12 | Reconstructing Personalized Semantic Facial NeRF Models From Monocular Video | Xuan Gao et.al. | 2210.06108 | link |
2022-10-11 | X-NeRF: Explicit Neural Radiance Field for Multi-Scene 360 $^{\circ}$ Insufficient RGB-D Views | Haoyi Zhu et.al. | 2210.05135 | link |
2022-10-10 | NeRF2Real: Sim2real Transfer of Vision-guided Bipedal Motion Skills using Neural Radiance Fields | Arunkumar Byravan et.al. | 2210.04932 | null |
2022-10-10 | EVA3D: Compositional 3D Human Generation from 2D Image Collections | Fangzhou Hong et.al. | 2210.04888 | link |
2022-10-13 | NerfAcc: A General NeRF Acceleration Toolbox | Ruilong Li et.al. | 2210.04847 | link |
2022-10-10 | SiNeRF: Sinusoidal Neural Radiance Fields for Joint Pose Estimation and Scene Reconstruction | Yitong Xia et.al. | 2210.04553 | link |
2022-10-09 | Robustifying the Multi-Scale Representation of Neural Radiance Fields | Nishant Jain et.al. | 2210.04233 | null |
2022-10-09 | Estimating Neural Reflectance Field from Radiance Field using Tree Structures | Xiu Li et.al. | 2210.04217 | null |
2022-10-09 | Data augmentation for NeRF: a geometric consistent solution based on view morphing | Matteo Bortolon et.al. | 2210.04214 | link |
2022-10-09 | Towards Efficient Neural Scene Graphs by Learning Consistency Fields | Yeji Song et.al. | 2210.04127 | null |
2022-10-08 | ViewFool: Evaluating the Robustness of Visual Recognition to Adversarial Viewpoints | Yinpeng Dong et.al. | 2210.03895 | link |
2022-10-04 | SelfNeRF: Fast Training NeRF for Human from Monocular Self-rotating Video | Bo Peng et.al. | 2210.01651 | null |
2022-10-03 | NARF22: Neural Articulated Radiance Fields for Configuration-Aware Rendering | Stanley Lewis et.al. | 2210.01166 | null |
2022-10-02 | IntrinsicNeRF: Learning Intrinsic Neural Radiance Fields for Editable Novel View Synthesis | Weicai Ye et.al. | 2210.00647 | link |
2022-10-02 | Unsupervised Multi-View Object Segmentation Using Radiance Field Propagation | Xinhang Liu et.al. | 2210.00489 | null |
2022-10-01 | NeRF: Neural Radiance Field in 3D Vision, A Comprehensive Review | Kyle Gao et.al. | 2210.00379 | null |
2022-10-01 | Structure-Aware NeRF without Posed Camera via Epipolar Constraint | Shu Chen et.al. | 2210.00183 | link |
2022-09-30 | Improving 3D-aware Image Synthesis with A Geometry-aware Discriminator | Zifan Shi et.al. | 2209.15637 | null |
2022-09-30 | Understanding Pure CLIP Guidance for Voxel Grid NeRF Models | Han-Hung Lee et.al. | 2209.15172 | null |
2022-09-29 | DreamFusion: Text-to-3D using 2D Diffusion | Ben Poole et.al. | 2209.14988 | null |
2022-09-29 | SymmNeRF: Learning to Explore Symmetry Prior for Single-View View Synthesis | Xingyi Li et.al. | 2209.14819 | link |
2022-10-03 | 360FusionNeRF: Panoramic Neural Radiance Fields with Joint Guidance | Shreyas Kulkarni et.al. | 2209.14265 | link |
2022-09-27 | OmniNeRF: Hybriding Omnidirectional Distance and Radiance fields for Neural Surface Reconstruction | Jiaming Shen et.al. | 2209.13433 | null |
2022-09-27 | Orbeez-SLAM: A Real-time Monocular Visual SLAM with ORB Features and NeRF-realized Mapping | Chi-Ming Chung et.al. | 2209.13274 | link |
2022-09-27 | WaterNeRF: Neural Radiance Fields for Underwater Scenes | Advaith Venkatramanan Sethuraman et.al. | 2209.13091 | null |
2022-09-26 | Baking in the Feature: Accelerating Volumetric Segmentation by Rendering Feature Maps | Kenneth Blomqvist et.al. | 2209.12744 | null |
2022-09-25 | Enforcing safety for vision-based controllers via Control Barrier Functions and Neural Radiance Fields | Mukun Tong et.al. | 2209.12266 | null |
2022-09-24 | NeRF-Loc: Transformer-Based Object Localization Within Neural Radiance Fields | Jiankai Sun et.al. | 2209.12068 | null |
2022-09-19 | Loc-NeRF: Monte Carlo Localization using Neural Radiance Fields | Dominic Maggio et.al. | 2209.09050 | link |
2022-09-23 | NeRF-SOS: Any-View Self-supervised Object Segmentation on Complex Scenes | Zhiwen Fan et.al. | 2209.08776 | link |
2022-09-19 | Density-aware NeRF Ensembles: Quantifying Predictive Uncertainty in Neural Radiance Fields | Niko Sünderhauf et.al. | 2209.08718 | null |
2022-09-18 | ActiveNeRF: Learning where to See with Uncertainty Estimation | Xuran Pan et.al. | 2209.08546 | link |
2022-09-18 | LATITUDE: Robotic Global Localization with Truncated Dynamic Low-pass Filter in City-scale NeRF | Zhenxin Zhu et.al. | 2209.08498 | link |
2022-09-16 | iDF-SLAM: End-to-End RGB-D SLAM with Neural Implicit Mapping and Deep Feature Tracking | Yuhang Ming et.al. | 2209.07919 | null |
2022-09-12 | StructNeRF: Neural Radiance Fields for Indoor Scenes with Structural Hints | Zheng Chen et.al. | 2209.05277 | null |
2022-09-09 | Generative Deformable Radiance Fields for Disentangled Image Synthesis of Topology-Varying Objects | Ziyu Wang et.al. | 2209.04183 | null |
2022-09-08 | im2nerf: Image to Neural Radiance Field in the Wild | Lu Mi et.al. | 2209.04061 | null |
2022-09-08 | PixTrack: Precise 6DoF Object Pose Tracking using NeRF Templates and Feature-metric Alignment | Prajwal Chidananda et.al. | 2209.03910 | link |
2022-09-07 | Neural Feature Fusion Fields: 3D Distillation of Self-Supervised 2D Image Representations | Vadim Tschernezki et.al. | 2209.03494 | null |
2022-08-29 | Volume Rendering Digest (for NeRF) | Andrea Tagliasacchi et.al. | 2209.02417 | null |
2022-09-06 | CLONeR: Camera-Lidar Fusion for Occupancy Grid-aided Neural Representations | Alexandra Carlson et.al. | 2209.01194 | null |
2022-09-01 | On Quantizing Implicit Neural Representations | Cameron Gordon et.al. | 2209.01019 | null |
2022-08-31 | Dual-Space NeRF: Learning Animatable Avatars and Scene Lighting in Separate Spaces | Yihao Zhi et.al. | 2208.14851 | link |
2022-08-30 | A Portable Multiscopic Camera for Novel View and Time Synthesis in Dynamic Scenes | Tianjia Zhang et.al. | 2208.14433 | null |
2022-08-24 | PeRFception: Perception using Radiance Fields | Yoonwoo Jeong et.al. | 2208.11537 | link |
2022-08-24 | E-NeRF: Neural Radiance Fields from a Moving Event Camera | Simon Klenk et.al. | 2208.11300 | link |
2022-08-18 | Neural Capture of Animatable 3D Human from Monocular Video | Gusi Te et.al. | 2208.08728 | null |
2022-08-16 | Casual Indoor HDR Radiance Capture from Omnidirectional Images | Pulkit Gera et.al. | 2208.07903 | null |
2022-08-15 | DM-NeRF: 3D Scene Geometry Decomposition and Manipulation from 2D Images | Bing Wang et.al. | 2208.07227 | link |
2022-08-11 | RelPose: Predicting Probabilistic Relative Rotation for Single Objects in the Wild | Jason Y. Zhang et.al. | 2208.05963 | null |
2022-08-11 | FDNeRF: Few-shot Dynamic Neural Radiance Fields for Face Reconstruction and Expression Editing | Jingbo Zhang et.al. | 2208.05751 | link |
2022-08-04 | 360Roam: Real-Time Indoor Roaming Using Geometry-Aware ${360^\circ}$ Radiance Fields | Huajian Huang et.al. | 2208.02705 | null |
2022-08-02 | T4DT: Tensorizing Time for Learning Temporal 3D Visual Data | Mikhail Usvyatsov et.al. | 2208.01421 | link |
2022-08-01 | DoF-NeRF: Depth-of-Field Meets Neural Radiance Fields | Zijin Wu et.al. | 2208.00945 | link |
2022-08-06 | MobileNeRF: Exploiting the Polygon Rasterization Pipeline for Efficient Neural Field Rendering on Mobile Architectures | Zhiqin Chen et.al. | 2208.00277 | link |
2022-07-30 | Distilled Low Rank Neural Radiance Field with Quantization for Light Field Compression | Jinglei Shi et.al. | 2208.00164 | null |
2022-08-01 | End-to-end View Synthesis via NeRF Attention | Zelin Zhao et.al. | 2207.14741 | null |
2022-07-29 | Neural Density-Distance Fields | Itsuki Ueda et.al. | 2207.14455 | link |
2022-07-27 | Is Attention All NeRF Needs? | Mukund Varma T et.al. | 2207.13298 | null |