2025-02-20 |
Benchmarking Multimodal RAG through a Chart-based Document Question-Answering Generation Framework |
Yuming Yang et.al. |
2502.14864v1 |
null |
2025-02-20 |
Bridging Text and Vision: A Multi-View Text-Vision Registration Approach for Cross-Modal Place Recognition |
Tianyi Shang et.al. |
2502.14195v1 |
link |
2025-02-19 |
A General Framework for Augmenting Lossy Compressors with Topological Guarantees |
Nathaniel Gorski et.al. |
2502.14022v1 |
null |
2025-02-19 |
2.5D U-Net with Depth Reduction for 3D CryoET Object Identification |
Yusuke Uchida et.al. |
2502.13484v1 |
null |
2025-02-19 |
Task-agnostic Prompt Compression with Context-aware Sentence Embedding and Reward-guided Task Descriptor |
Barys Liskavets et.al. |
2502.13374v1 |
null |
2025-02-18 |
Score-Based Diffusion Policy Compatible with Reinforcement Learning via Optimal Transport |
Mingyang Sun et.al. |
2502.12631v1 |
null |
2025-02-17 |
Implicit Geometric Descriptor-Enabled ANN Framework for a Unified Structure-Property Relationship in Architected Nanofibrous Materials |
Bhanugoban Maheswaran et.al. |
2502.12311v1 |
null |
2025-02-17 |
ILIAS: Instance-Level Image retrieval At Scale |
Giorgos Kordopatis-Zilos et.al. |
2502.11748v1 |
null |
2025-02-17 |
FUNCTO: Function-Centric One-Shot Imitation Learning for Tool Manipulation |
Chao Tang et.al. |
2502.11744v1 |
null |
2025-02-17 |
Range and Bird's Eye View Fused Cross-Modal Visual Place Recognition |
Jianyi Peng et.al. |
2502.11742v1 |
null |
2025-02-17 |
SurgPose: a Dataset for Articulated Robotic Surgical Tool Pose Estimation and Tracking |
Zijian Wu et.al. |
2502.11534v1 |
null |
2025-02-17 |
Beyond Cortisol! Physiological Indicators of Welfare for Dogs: Deficits, Misunderstandings and Opportunities |
ML Cobb et.al. |
2502.11384v1 |
null |
2025-02-16 |
FeaKM: Robust Collaborative Perception under Noisy Pose Conditions |
Jiuwu Hao et.al. |
2502.11003v1 |
link |
2025-02-15 |
Implicit Neural Representations of Molecular Vector-Valued Functions |
Jirka Lhotka et.al. |
2502.10848v1 |
link |
2025-02-13 |
Can this Model Also Recognize Dogs? Zero-Shot Model Search from Weights |
Jonathan Kahana et.al. |
2502.09619v1 |
null |
2025-02-13 |
Metamorphic Testing for Pose Estimation Systems |
Matias Duran et.al. |
2502.09460v1 |
null |
2025-02-13 |
The quest for new materials: the network theory and machine learning perspectives |
Jacopo Moi et.al. |
2502.09250v1 |
null |
2025-02-12 |
Are Expressions for Music Emotions the Same Across Cultures? |
Elif Celen et.al. |
2502.08744v1 |
null |
2025-02-12 |
A Real-to-Sim-to-Real Approach to Robotic Manipulation with VLM-Generated Iterative Keypoint Rewards |
Shivansh Patel et.al. |
2502.08643v2 |
null |
2025-02-12 |
A comprehensive approach to incorporating intermolecular dispersion into the openCOSMO-RS model. Part 2: Atomic polarizabilities |
Daria Grigorash et.al. |
2502.08520v1 |
null |
2025-02-12 |
On Euler-Sombor Energy of Graphs |
Sopan Bansode et.al. |
2502.08304v1 |
null |
2025-02-11 |
Efficient Sparsification of Simplicial Complexes via Local Densities of States |
Anton Savostianov et.al. |
2502.07558v1 |
null |
2025-02-11 |
Contextual Gesture: Co-Speech Gesture Video Generation through Context-aware Gesture Representation |
Pinxin Liu et.al. |
2502.07239v1 |
null |
2025-02-11 |
Mesh2SSM++: A Probabilistic Framework for Unsupervised Learning of Statistical Shape Model of Anatomies from Surface Meshes |
Krithika Iyer et.al. |
2502.07145v1 |
null |
2025-02-10 |
Biomechanical Reconstruction with Confidence Intervals from Multiview Markerless Motion Capture |
R. James Cotton et.al. |
2502.06486v1 |
null |
2025-02-10 |
High-Intensity Helical Flow: A Double-Edged Sword in Coronary Artery Haemodynamics |
Chi Shen et.al. |
2502.06161v1 |
null |
2025-02-10 |
An Appearance Defect Detection Method for Cigarettes Based on C-CenterNet |
Hongyu Liu et.al. |
2502.06119v1 |
null |
2025-02-07 |
RobotMover: Learning to Move Large Objects by Imitating the Dynamic Chain |
Tianyu Li et.al. |
2502.05271v1 |
null |
2025-02-07 |
HumanDiT: Pose-Guided Diffusion Transformer for Long-form Human Motion Video Generation |
Qijun Gan et.al. |
2502.04847v2 |
null |
2025-02-07 |
Building Rome with Convex Optimization |
Haoyu Han et.al. |
2502.04640v2 |
null |