Image Caption

Publish Date	Title	Authors	PDF	Code
2025-04-24	Token-Shuffle: Towards High-Resolution Image Generation with Autoregressive Models	Xu Ma et.al.	2504.17789v1	null
2025-04-24	Nanoscale infrared and microwave imaging of stacking faults in multilayer graphene	Ludwig Holleis et.al.	2504.17783v1	null
2025-04-24	Step1X-Edit: A Practical Framework for General Image Editing	Shiyu Liu et.al.	2504.17761v1	null
2025-04-24	Disaggregated Deep Learning via In-Physics Computing at Radio Frequency	Zhihui Gao et.al.	2504.17752v1	null
2025-04-24	Revisiting Reset Mechanisms in Spiking Neural Networks for Sequential Modeling: Specialized Discretization for Binary Activated RNN	Enqi Zhang et.al.	2504.17751v1	null
2025-04-24	DPMambaIR:All-in-One Image Restoration via Degradation-Aware Prompt State Space Model	Zhanwen Liu et.al.	2504.17732v1	null
2025-04-24	CasualHDRSplat: Robust High Dynamic Range 3D Gaussian Splatting from Casually Captured Videos	Shucheng Gong et.al.	2504.17728v1	null
2025-04-24	Optical to infrared mapping of vapor-to-liquid phase change dynamics using generative machine learning	Siavash Khodakarami et.al.	2504.17726v1	null
2025-04-24	Conformal Segmentation in Industrial Surface Defect Detection with Statistical Guarantees	Cheng Shen et.al.	2504.17721v1	null
2025-04-24	Generative Fields: Uncovering Hierarchical Feature Control for StyleGAN via Inverted Receptive Fields	Zhuo He et.al.	2504.17712v1	null
2025-04-24	Quadratic Interest Network for Multimodal Click-Through Rate Prediction	Honghao Li et.al.	2504.17699v1	null
2025-04-24	Self-Supervised Noise Adaptive MRI Denoising via Repetition to Repetition (Rep2Rep) Learning	Nikola Janjušević et.al.	2504.17698v1	null
2025-04-24	PICO: Reconstructing 3D People In Contact with Objects	Alpár Cseke et.al.	2504.17695v1	null
2025-04-24	The Hubble Image Similarity Project	Richard L. White et.al.	2504.17688v1	null
2025-04-24	DiMeR: Disentangled Mesh Reconstruction Model	Lutao Jiang et.al.	2504.17670v1	null
2025-04-24	Spectral Irradiance Variability in Lyman-Alpha Emission During Solar Flares	Luke Majury et.al.	2504.17667v1	null
2025-04-24	The Malicious Technical Ecosystem: Exposing Limitations in Technical Governance of AI-Generated Non-Consensual Intimate Images of Adults	Michelle L. Ding et.al.	2504.17663v1	null
2025-04-24	Aerial Image Classification in Scarce and Unconstrained Environments via Conformal Prediction	Farhad Pourkamali-Anaraki et.al.	2504.17655v1	null
2025-04-24	CLIPSE -- a minimalistic CLIP-based image search engine for research	Steve Göring et.al.	2504.17643v1	link
2025-04-24	A Guide to Structureless Visual Localization	Vojtech Panek et.al.	2504.17636v1	null
2025-04-24	Quantifying jet-interstellar medium interactions in Cyg X-1: Insights from dual-frequency bow shock detection with MeerKAT	P. Atri et.al.	2504.17635v1	null
2025-04-24	Beyond Labels: Zero-Shot Diabetic Foot Ulcer Wound Segmentation with Self-attention Diffusion Models and the Potential for Text-Guided Customization	Abderrachid Hamrani et.al.	2504.17628v1	null
2025-04-24	Improving Open-World Object Localization by Discovering Background	Ashish Singh et.al.	2504.17626v1	null
2025-04-24	Likelihood-Free Variational Autoencoders	Chen Xu et.al.	2504.17622v1	null
2025-04-24	Enhancing CNNs robustness to occlusions with bioinspired filters for border completion	Catarina P. Coutinho et.al.	2504.17619v1	null
2025-04-24	STCL:Curriculum learning Strategies for deep learning image steganography models	Fengchun Liu et.al.	2504.17609v1	null
2025-04-24	Time-resolved dynamics of GaN waveguide polaritons	Loïc Méchin et.al.	2504.17607v1	null
2025-04-24	Tamper-evident Image using JPEG Fixed Points	Zhaofeng Si et.al.	2504.17594v1	null
2025-04-24	Occlusion-Aware Self-Supervised Monocular Depth Estimation for Weak-Texture Endoscopic Images	Zebo Huang et.al.	2504.17582v1	null
2025-04-24	A compact laser-plasma source for high-repetition-rate bi-modal X-ray and electron imaging	Angana Mondal et.al.	2504.17560v1	null