Skip to content

Vision Transformer

Vision Transformer

Publish Date Title Authors PDF Code
2025-04-24 LiDPM: Rethinking Point Diffusion for Lidar Scene Completion Tetiana Martyniuk et.al. 2504.17791v1 null
2025-04-24 Dynamic Camera Poses and Where to Find Them Chris Rockwell et.al. 2504.17788v1 null
2025-04-24 Token-Shuffle: Towards High-Resolution Image Generation with Autoregressive Models Xu Ma et.al. 2504.17789v1 null
2025-04-24 The Fourth Monocular Depth Estimation Challenge Anton Obukhov et.al. 2504.17787v1 null
2025-04-24 The Sparse Frontier: Sparse Attention Trade-offs in Transformer LLMs Piotr Nawrot et.al. 2504.17768v1 null
2025-04-24 Step1X-Edit: A Practical Framework for General Image Editing Shiyu Liu et.al. 2504.17761v1 null
2025-04-24 Embedding Empirical Distributions for Computing Optimal Transport Maps Mingchen Jiang et.al. 2504.17740v1 null
2025-04-24 EgoCHARM: Resource-Efficient Hierarchical Activity Recognition using an Egocentric IMU Sensor Akhil Padmanabha et.al. 2504.17735v1 null
2025-04-24 DPMambaIR:All-in-One Image Restoration via Degradation-Aware Prompt State Space Model Zhanwen Liu et.al. 2504.17732v1 null
2025-04-24 CasualHDRSplat: Robust High Dynamic Range 3D Gaussian Splatting from Casually Captured Videos Shucheng Gong et.al. 2504.17728v1 null
2025-04-24 Towards Robust LLMs: an Adversarial Robustness Measurement Framework Natan Levy et.al. 2504.17723v1 null
2025-04-24 Early Detection of Multidrug Resistance Using Multivariate Time Series Analysis and Interpretable Patient-Similarity Representations Óscar Escudero-Arnanz et.al. 2504.17717v1 null
2025-04-24 Generative Fields: Uncovering Hierarchical Feature Control for StyleGAN via Inverted Receptive Fields Zhuo He et.al. 2504.17712v1 null
2025-04-24 Plasma State Monitoring and Disruption Characterization using Multimodal VAEs Yoeri Poels et.al. 2504.17710v1 null
2025-04-24 Inverse problem in the LaMET framework Hervé Dutrieux et.al. 2504.17706v1 null
2025-04-24 Federated Learning: A Survey on Privacy-Preserving Collaborative Intelligence Edward Collins et.al. 2504.17703v1 null
2025-04-24 Real-space superconducting properties in the atomically-thin limit: Ab initio approach and its application to Josephson junctions Jonas Bekaert et.al. 2504.17702v1 null
2025-04-24 Hierarchical and Multimodal Data for Daily Activity Understanding Ghazal Kaviani et.al. 2504.17696v1 null
2025-04-24 PICO: Reconstructing 3D People In Contact with Objects Alpár Cseke et.al. 2504.17695v1 null
2025-04-24 BIM-Constrained Optimization for Accurate Localization and Deviation Correction in Construction Monitoring Asier Bikandi et.al. 2504.17693v1 null
2025-04-24 The Hubble Image Similarity Project Richard L. White et.al. 2504.17688v1 null
2025-04-24 DTECM: Digital Twin Enabled Channel Measurement and Modeling in Terahertz Urban Macrocell Yuanbo Li et.al. 2504.17673v1 null
2025-04-24 Data-Driven Calibration of Prediction Sets in Large Vision-Language Models Based on Inductive Conformal Prediction Yuanchang Ye et.al. 2504.17671v1 null
2025-04-24 DiMeR: Disentangled Mesh Reconstruction Model Lutao Jiang et.al. 2504.17670v1 null
2025-04-24 Towards a HIPAA Compliant Agentic AI System in Healthcare Subash Neupane et.al. 2504.17669v1 null
2025-04-24 Seamless Data Migration between Database Schemas with DAMI-Framework: An Empirical Study on Developer Experience Delfina Ramos-Vidal et.al. 2504.17662v1 null
2025-04-24 Aerial Image Classification in Scarce and Unconstrained Environments via Conformal Prediction Farhad Pourkamali-Anaraki et.al. 2504.17655v1 null
2025-04-24 CLIPSE -- a minimalistic CLIP-based image search engine for research Steve Göring et.al. 2504.17643v1 link
2025-04-24 A Guide to Structureless Visual Localization Vojtech Panek et.al. 2504.17636v1 null
2025-04-24 Geometry-induced asymmetric level coupling Alhun Aydin et.al. 2504.17630v1 null