Skip to content

VQA

VQA

Publish Date Title Authors PDF Code
2025-01-16 Distilling Multi-modal Large Language Models for Autonomous Driving Deepti Hegde et.al. 2501.09757v1 null
2025-01-16 SynthLight: Portrait Relighting with Diffusion Model by Learning to Re-render Synthetic Faces Sumit Chaturvedi et.al. 2501.09756v1 null
2025-01-16 Learnings from Scaling Visual Tokenizers for Reconstruction and Generation Philippe Hansen-Estruch et.al. 2501.09755v1 null
2025-01-16 Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps Nanye Ma et.al. 2501.09732v1 null
2025-01-16 Predictions as Surrogates: Revisiting Surrogate Outcomes in the Age of AI Wenlong Ji et.al. 2501.09731v1 null
2025-01-16 A Simple Aerial Detection Baseline of Multimodal Language Models Qingyun Li et.al. 2501.09720v1 link
2025-01-16 CyberMentor: AI Powered Learning Tool Platform to Address Diverse Student Needs in Cybersecurity Education Tianyu Wang et.al. 2501.09709v1 null
2025-01-16 Faber-Krahn inequality for the heat content on quantum graphs via random walk expansion Patrizio Bifulco et.al. 2501.09693v1 null
2025-01-16 Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models Fengli Xu et.al. 2501.09686v1 null
2025-01-16 Reward-Guided Controlled Generation for Inference-Time Alignment in Diffusion Models: Tutorial and Review Masatoshi Uehara et.al. 2501.09685v1 null
2025-01-16 Data mining the functional architecture of the brain's circuitry Adam S. Charles et.al. 2501.09684v1 null
2025-01-16 Statistical inference for interacting innovation processes and related general results Giacomo Aletti et.al. 2501.09648v1 null
2025-01-16 Chromatic Purity in Hermitian K-Theory at $p=2$ Jordan Levin et.al. 2501.09633v1 null
2025-01-16 Empowering Large Language Models in Wireless Communication: A Novel Dataset and Fine-Tuning Framework Yushen Lin et.al. 2501.09631v1 null
2025-01-16 Beyond Reward Hacking: Causal Rewards for Large Language Model Alignment Chaoqi Wang et.al. 2501.09620v1 null
2025-01-16 Managed-Retention Memory: A New Class of Memory for the AI Era Sergey Legtchenko et.al. 2501.09605v1 null
2025-01-16 Atleus: Accelerating Transformers on the Edge Enabled by 3D Heterogeneous Manycore Architectures Pratyush Dhingra et.al. 2501.09588v1 null
2025-01-16 A simple model of a sequence-reading diffusion: non-self-averaging and self-averaging properties Silvio Kalaj et.al. 2501.09583v1 null
2025-01-16 AdaFV: Accelerating VLMs with Self-Adaptive Cross-Modality Attention Mixture Jiayi Han et.al. 2501.09532v1 null
2025-01-16 Confidence Estimation for Error Detection in Text-to-SQL Systems Oleg Somov et.al. 2501.09527v1 null
2025-01-16 Augmenting a Large Language Model with a Combination of Text and Visual Data for Conversational Visualization of Global Geospatial Data Omar Mena et.al. 2501.09521v1 null
2025-01-16 Recovering latent linkage structures and spillover effects with structural breaks in panel data models Ryo Okui et.al. 2501.09517v1 null
2025-01-16 PIER: A Novel Metric for Evaluating What Matters in Code-Switching Enes Yavuz Ugan et.al. 2501.09512v1 null
2025-01-16 Resolving the $χ_{\rm eff}$-$q$ correlation among Coalescing Binary Black Holes and Evidence for AGN-driven Hierarchical Mergers Yin-Jie Li et.al. 2501.09495v1 null
2025-01-16 On the robustness of exoplanet atmospheric detections: insights from extensive simulations A. Sánchez-López et.al. 2501.09494v1 null
2025-01-16 Semiparametrics via parametrics and contiguity Adam Lee et.al. 2501.09483v1 null
2025-01-16 Control and its applications in additive combinatorics Thomas F. Bloom et.al. 2501.09470v1 null
2025-01-16 RE-POSE: Synergizing Reinforcement Learning-Based Partitioning and Offloading for Edge Object Detection Jianrui Shi et.al. 2501.09465v1 null
2025-01-16 Pruning for Sparse Diffusion Models based on Gradient Flow Ben Wan et.al. 2501.09464v1 null
2025-01-16 Double Visual Defense: Adversarial Pre-training and Instruction Tuning for Improving Vision-Language Model Robustness Zeyu Wang et.al. 2501.09446v1 null