Skip to content

Keypoint Detection

Keypoint Detection

Publish Date Title Authors PDF Code
2025-02-20 Benchmarking Multimodal RAG through a Chart-based Document Question-Answering Generation Framework Yuming Yang et.al. 2502.14864v1 null
2025-02-20 Bridging Text and Vision: A Multi-View Text-Vision Registration Approach for Cross-Modal Place Recognition Tianyi Shang et.al. 2502.14195v1 link
2025-02-19 A General Framework for Augmenting Lossy Compressors with Topological Guarantees Nathaniel Gorski et.al. 2502.14022v1 null
2025-02-19 2.5D U-Net with Depth Reduction for 3D CryoET Object Identification Yusuke Uchida et.al. 2502.13484v1 null
2025-02-19 Task-agnostic Prompt Compression with Context-aware Sentence Embedding and Reward-guided Task Descriptor Barys Liskavets et.al. 2502.13374v1 null
2025-02-18 Score-Based Diffusion Policy Compatible with Reinforcement Learning via Optimal Transport Mingyang Sun et.al. 2502.12631v1 null
2025-02-17 Implicit Geometric Descriptor-Enabled ANN Framework for a Unified Structure-Property Relationship in Architected Nanofibrous Materials Bhanugoban Maheswaran et.al. 2502.12311v1 null
2025-02-17 ILIAS: Instance-Level Image retrieval At Scale Giorgos Kordopatis-Zilos et.al. 2502.11748v1 null
2025-02-17 FUNCTO: Function-Centric One-Shot Imitation Learning for Tool Manipulation Chao Tang et.al. 2502.11744v1 null
2025-02-17 Range and Bird's Eye View Fused Cross-Modal Visual Place Recognition Jianyi Peng et.al. 2502.11742v1 null
2025-02-17 SurgPose: a Dataset for Articulated Robotic Surgical Tool Pose Estimation and Tracking Zijian Wu et.al. 2502.11534v1 null
2025-02-17 Beyond Cortisol! Physiological Indicators of Welfare for Dogs: Deficits, Misunderstandings and Opportunities ML Cobb et.al. 2502.11384v1 null
2025-02-16 FeaKM: Robust Collaborative Perception under Noisy Pose Conditions Jiuwu Hao et.al. 2502.11003v1 link
2025-02-15 Implicit Neural Representations of Molecular Vector-Valued Functions Jirka Lhotka et.al. 2502.10848v1 link
2025-02-13 Can this Model Also Recognize Dogs? Zero-Shot Model Search from Weights Jonathan Kahana et.al. 2502.09619v1 null
2025-02-13 Metamorphic Testing for Pose Estimation Systems Matias Duran et.al. 2502.09460v1 null
2025-02-13 The quest for new materials: the network theory and machine learning perspectives Jacopo Moi et.al. 2502.09250v1 null
2025-02-12 Are Expressions for Music Emotions the Same Across Cultures? Elif Celen et.al. 2502.08744v1 null
2025-02-12 A Real-to-Sim-to-Real Approach to Robotic Manipulation with VLM-Generated Iterative Keypoint Rewards Shivansh Patel et.al. 2502.08643v2 null
2025-02-12 A comprehensive approach to incorporating intermolecular dispersion into the openCOSMO-RS model. Part 2: Atomic polarizabilities Daria Grigorash et.al. 2502.08520v1 null
2025-02-12 On Euler-Sombor Energy of Graphs Sopan Bansode et.al. 2502.08304v1 null
2025-02-11 Efficient Sparsification of Simplicial Complexes via Local Densities of States Anton Savostianov et.al. 2502.07558v1 null
2025-02-11 Contextual Gesture: Co-Speech Gesture Video Generation through Context-aware Gesture Representation Pinxin Liu et.al. 2502.07239v1 null
2025-02-11 Mesh2SSM++: A Probabilistic Framework for Unsupervised Learning of Statistical Shape Model of Anatomies from Surface Meshes Krithika Iyer et.al. 2502.07145v1 null
2025-02-10 Biomechanical Reconstruction with Confidence Intervals from Multiview Markerless Motion Capture R. James Cotton et.al. 2502.06486v1 null
2025-02-10 High-Intensity Helical Flow: A Double-Edged Sword in Coronary Artery Haemodynamics Chi Shen et.al. 2502.06161v1 null
2025-02-10 An Appearance Defect Detection Method for Cigarettes Based on C-CenterNet Hongyu Liu et.al. 2502.06119v1 null
2025-02-07 RobotMover: Learning to Move Large Objects by Imitating the Dynamic Chain Tianyu Li et.al. 2502.05271v1 null
2025-02-07 HumanDiT: Pose-Guided Diffusion Transformer for Long-form Human Motion Video Generation Qijun Gan et.al. 2502.04847v2 null
2025-02-07 Building Rome with Convex Optimization Haoyu Han et.al. 2502.04640v2 null