Visual Localization

Publish Date Title Authors PDF Code
2024-11-29 SpaRC: Sparse Radar-Camera Fusion for 3D Object Detection Philipp Wolters et.al. 2411.19860v1 null
2024-11-29 SAT-HMR: Real-Time Multi-Person 3D Mesh Estimation via Scale-Adaptive Tokens Chi Su et.al. 2411.19824v1 null
2024-11-29 DeSplat: Decomposed Gaussian Splatting for Distractor-Free Rendering Yihao Wang et.al. 2411.19756v1 null
2024-11-29 MonoPP: Metric-Scaled Self-Supervised Monocular Depth Estimation by Planar-Parallax Geometry in Automotive Applications Gasser Elazab et.al. 2411.19717v1 null
2024-11-29 In-Vehicle Edge System for Real-Time Dashcam Video Analysis Seyul Lee et.al. 2411.19558v1 null
2024-11-29 MCUCoder: Adaptive Bitrate Learned Video Compression for IoT Devices Ali Hojjat et.al. 2411.19442v1 link
2024-11-28 Trajectory Attention for Fine-grained Video Motion Control Zeqi Xiao et.al. 2411.19324v1 null
2024-11-28 On-chip Hyperspectral Image Segmentation with Fully Convolutional Networks for Scene Understanding in Autonomous Driving Jon Gutiérrez-Zaballa et.al. 2411.19274v1 null
2024-11-28 Video Depth without Video Models Bingxin Ke et.al. 2411.19189v1 null
2024-11-28 HOT3D: Hand and Object Tracking in 3D from Egocentric Multi-View Videos Prithviraj Banerjee et.al. 2411.19167v1 null
2024-11-28 Lost & Found: Updating Dynamic 3D Scene Graphs from Egocentric Observations Tjark Behrens et.al. 2411.19162v1 null
2024-11-28 Bayesian Deconvolution of Astronomical Images with Diffusion Models: Quantifying Prior-Driven Features in Reconstructions Alessio Spagnoletti et.al. 2411.19158v1 link
2024-11-28 Co-Learning: Towards Semi-Supervised Object Detection with Road-side Cameras Jicheng Yuan et.al. 2411.19143v1 null
2024-11-28 On Moving Object Segmentation from Monocular Video with Transformers Christian Homeyer et.al. 2411.19141v1 null
2024-11-28 360Recon: An Accurate Reconstruction Method Based on Depth Fusion from 360 Images Zhongmiao Yan et.al. 2411.19102v1 null
2024-11-28 GelSight FlexiRay: Breaking Planar Limits by Harnessing Large Deformations for Flexible, Full-Coverage Multimodal Sensing Yanzhe Wang et.al. 2411.18979v1 null
2024-11-28 CrossTracker: Robust Multi-modal 3D Multi-Object Tracking via Cross Correction Lipeng Gu et.al. 2411.18850v1 null
2024-11-27 Enhancing Compositional Text-to-Image Generation with Reliable Random Seeds Shuangqi Li et.al. 2411.18810v2 null
2024-11-27 CAT4D: Create Anything in 4D with Multi-View Video Diffusion Models Rundi Wu et.al. 2411.18613v1 null
2024-11-27 AC3D: Analyzing and Improving 3D Camera Control in Video Diffusion Transformers Sherwin Bahmani et.al. 2411.18673v2 null
2024-11-27 Structured light with a million light planes per second Dhawal Sirikonda et.al. 2411.18597v1 null
2024-11-27 A comparison of extended object tracking with multi-modal sensors in indoor environment Jiangtao Shuai et.al. 2411.18476v1 null
2024-11-27 Deep learning-based spatio-temporal fusion for high-fidelity ultra-high-speed x-ray radiography Songyuan Tang et.al. 2411.18441v1 link
2024-11-27 Helvipad: A Real-World Dataset for Omnidirectional Stereo Depth Estimation Mehdi Zayene et.al. 2411.18335v1 link
2024-11-27 Neural Surface Priors for Editable Gaussian Splatting Jakub Szymkowiak et.al. 2411.18311v1 link
2024-11-27 Genetic algorithm as a tool for detection setup optimisation: SiFi-CC case study Jonas Kasper et.al. 2411.18239v1 null
2024-11-27 ORB-SLAM3AB: Augmenting ORB-SLAM3 to Counteract Bumps with Optical Flow Inter-frame Matching Yangrui Dong et.al. 2411.18174v1 null
2024-11-27 Towards Cross-device and Training-free Robotic Grasping in 3D Open World Weiguang Zhao et.al. 2411.18133v1 null
2024-11-27 When Large Vision-Language Models Meet Person Re-Identification Qizao Wang et.al. 2411.18111v1 null
2024-11-27 SmileSplat: Generalizable Gaussian Splats for Unconstrained Sparse Images Yanyan Li et.al. 2411.18072v1 null