Skip to content

3D Object Detection

3D Object Detection

Publish Date Title Authors PDF Code
2025-07-03 Point3R: Streaming 3D Reconstruction with Explicit Spatial Pointer Memory Yuqi Wu et.al. 2507.02863v1 null
2025-07-03 LiteReality: Graphics-Ready 3D Scene Reconstruction from RGB-D Scans Zhening Huang et.al. 2507.02861v1 null
2025-07-03 RefTok: Reference-Based Tokenization for Video Generation Xiang Fan et.al. 2507.02862v1 null
2025-07-03 Bootstrapping Grounded Chain-of-Thought in Multimodal LLMs for Data-Efficient Model Adaptation Jiaer Xia et.al. 2507.02859v1 null
2025-07-03 Answer Matching Outperforms Multiple Choice for Language Model Evaluation Nikhil Chandak et.al. 2507.02856v1 null
2025-07-03 AnyI2V: Animating Any Conditional Image with Motion Control Ziye Li et.al. 2507.02857v1 null
2025-07-03 MvHo-IB: Multi-View Higher-Order Information Bottleneck for Brain Disorder Diagnosis Kunyu Zhang et.al. 2507.02847v1 null
2025-07-03 Revealing a transitional epoch of large-scale cosmic anisotropy in the quasar distribution Amit Mondal et.al. 2507.02835v1 null
2025-07-03 Trace Formulas for Deformed W-Algebras Fabrizio Nieri et.al. 2507.02831v1 null
2025-07-03 USAD: An Unsupervised Data Augmentation Spatio-Temporal Attention Diffusion Network Ying Yu et.al. 2507.02827v1 null
2025-07-03 Measurement as Bricolage: Examining How Data Scientists Construct Target Variables for Predictive Modeling Tasks Luke Guerdan et.al. 2507.02819v1 null
2025-07-03 Helicons in tilted-Weyl semimetals Shiv Kumar Ram et.al. 2507.02816v1 null
2025-07-03 Towards Perception-Informed Latent HRTF Representations You Zhang et.al. 2507.02815v1 null
2025-07-03 LangScene-X: Reconstruct Generalizable 3D Language-Embedded Scenes with TriMap Video Diffusion Fangfu Liu et.al. 2507.02813v1 null
2025-07-03 HyperGaussians: High-Dimensional Gaussian Splatting for High-Fidelity Animatable Face Avatars Gent Serifi et.al. 2507.02803v1 null
2025-07-03 No time to train! Training-Free Reference-Based Instance Segmentation Miguel Espinosa et.al. 2507.02798v1 null
2025-07-03 A Highly Carbon-Rich Dayside and Disequilibrium Chemistry in the Ultra-Hot Jupiter WASP-19b Suman Saha et.al. 2507.02797v1 null
2025-07-03 From Long Videos to Engaging Clips: A Human-Inspired Video Editing Framework with Multimodal Narrative Understanding Xiangfeng Wang et.al. 2507.02790v1 null
2025-07-03 New components of Hilbert schemes of points and 2-step ideals Franco Giovenzana et.al. 2507.02789v1 null
2025-07-03 From Pixels to Damage Severity: Estimating Earthquake Impacts Using Semantic Segmentation of Social Media Images Danrong Zhang et.al. 2507.02781v1 null
2025-07-03 Advanced techniques of searching for flares of ultra-high-energy photons from point sources Jaroslaw Stasielak et.al. 2507.02777v1 null
2025-07-03 Connected k-Median with Disjoint and Non-disjoint Clusters Jan Eube et.al. 2507.02774v1 null
2025-07-03 KERAP: A Knowledge-Enhanced Reasoning Approach for Accurate Zero-shot Diagnosis Prediction Using Multi-agent LLMs Yuzhang Xie et.al. 2507.02773v1 null
2025-07-03 A Leptonic Interpretation of the UHE Gamma-ray Emission from V4641 Sgr Su-Yu Wan et.al. 2507.02763v1 null
2025-07-03 Discovery and Preliminary Characterization of a Third Interstellar Object: 3I/ATLAS Darryl Z. Seligman et.al. 2507.02757v1 null
2025-07-03 Multi-agent Auditory Scene Analysis Caleb Rascon et.al. 2507.02755v1 null
2025-07-03 Partial Weakly-Supervised Oriented Object Detection Mingxin Liu et.al. 2507.02751v1 null
2025-07-03 On the origin of the X-ray emission surrounding PSR B0656+14 in the eROSITA Cal-PV data Alena Khokhriakova et.al. 2507.02750v1 null
2025-07-03 DexVLG: Dexterous Vision-Language-Grasp Model at Scale Jiawei He et.al. 2507.02747v1 null
2025-07-03 $\textit{Twisted echoes of an odd quartet}$: Scalar-induced gravitational waves as a probe of primordial parity-violation H. V. Ragavendra et.al. 2507.02733v1 null