Vision Transformer

Publish Date	Title	Authors	PDF	Code
2025-04-24	LiDPM: Rethinking Point Diffusion for Lidar Scene Completion	Tetiana Martyniuk et.al.	2504.17791v1	null
2025-04-24	Dynamic Camera Poses and Where to Find Them	Chris Rockwell et.al.	2504.17788v1	null
2025-04-24	Token-Shuffle: Towards High-Resolution Image Generation with Autoregressive Models	Xu Ma et.al.	2504.17789v1	null
2025-04-24	The Fourth Monocular Depth Estimation Challenge	Anton Obukhov et.al.	2504.17787v1	null
2025-04-24	The Sparse Frontier: Sparse Attention Trade-offs in Transformer LLMs	Piotr Nawrot et.al.	2504.17768v1	null
2025-04-24	Step1X-Edit: A Practical Framework for General Image Editing	Shiyu Liu et.al.	2504.17761v1	null
2025-04-24	Embedding Empirical Distributions for Computing Optimal Transport Maps	Mingchen Jiang et.al.	2504.17740v1	null
2025-04-24	EgoCHARM: Resource-Efficient Hierarchical Activity Recognition using an Egocentric IMU Sensor	Akhil Padmanabha et.al.	2504.17735v1	null
2025-04-24	DPMambaIR: All-in-One Image Restoration via Degradation-Aware Prompt State Space Model	Zhanwen Liu et.al.	2504.17732v1	null
2025-04-24	CasualHDRSplat: Robust High Dynamic Range 3D Gaussian Splatting from Casually Captured Videos	Shucheng Gong et.al.	2504.17728v1	null
2025-04-24	Towards Robust LLMs: an Adversarial Robustness Measurement Framework	Natan Levy et.al.	2504.17723v1	null
2025-04-24	Early Detection of Multidrug Resistance Using Multivariate Time Series Analysis and Interpretable Patient-Similarity Representations	Óscar Escudero-Arnanz et.al.	2504.17717v1	null
2025-04-24	Generative Fields: Uncovering Hierarchical Feature Control for StyleGAN via Inverted Receptive Fields	Zhuo He et.al.	2504.17712v1	null
2025-04-24	Plasma State Monitoring and Disruption Characterization using Multimodal VAEs	Yoeri Poels et.al.	2504.17710v1	null
2025-04-24	Inverse problem in the LaMET framework	Hervé Dutrieux et.al.	2504.17706v1	null
2025-04-24	Federated Learning: A Survey on Privacy-Preserving Collaborative Intelligence	Edward Collins et.al.	2504.17703v1	null
2025-04-24	Real-space superconducting properties in the atomically-thin limit: Ab initio approach and its application to Josephson junctions	Jonas Bekaert et.al.	2504.17702v1	null
2025-04-24	Hierarchical and Multimodal Data for Daily Activity Understanding	Ghazal Kaviani et.al.	2504.17696v1	null
2025-04-24	PICO: Reconstructing 3D People In Contact with Objects	Alpár Cseke et.al.	2504.17695v1	null
2025-04-24	BIM-Constrained Optimization for Accurate Localization and Deviation Correction in Construction Monitoring	Asier Bikandi et.al.	2504.17693v1	null
2025-04-24	The Hubble Image Similarity Project	Richard L. White et.al.	2504.17688v1	null
2025-04-24	DTECM: Digital Twin Enabled Channel Measurement and Modeling in Terahertz Urban Macrocell	Yuanbo Li et.al.	2504.17673v1	null
2025-04-24	Data-Driven Calibration of Prediction Sets in Large Vision-Language Models Based on Inductive Conformal Prediction	Yuanchang Ye et.al.	2504.17671v1	null
2025-04-24	DiMeR: Disentangled Mesh Reconstruction Model	Lutao Jiang et.al.	2504.17670v1	null
2025-04-24	Towards a HIPAA Compliant Agentic AI System in Healthcare	Subash Neupane et.al.	2504.17669v1	null
2025-04-24	Seamless Data Migration between Database Schemas with DAMI-Framework: An Empirical Study on Developer Experience	Delfina Ramos-Vidal et.al.	2504.17662v1	null
2025-04-24	Aerial Image Classification in Scarce and Unconstrained Environments via Conformal Prediction	Farhad Pourkamali-Anaraki et.al.	2504.17655v1	null
2025-04-24	CLIPSE -- a minimalistic CLIP-based image search engine for research	Steve Göring et.al.	2504.17643v1	link
2025-04-24	A Guide to Structureless Visual Localization	Vojtech Panek et.al.	2504.17636v1	null
2025-04-24	Geometry-induced asymmetric level coupling	Alhun Aydin et.al.	2504.17630v1	null