2025-01-16 |
Distilling Multi-modal Large Language Models for Autonomous Driving |
Deepti Hegde et.al. |
2501.09757v1 |
null |
2025-01-16 |
SynthLight: Portrait Relighting with Diffusion Model by Learning to Re-render Synthetic Faces |
Sumit Chaturvedi et.al. |
2501.09756v1 |
null |
2025-01-16 |
Learnings from Scaling Visual Tokenizers for Reconstruction and Generation |
Philippe Hansen-Estruch et.al. |
2501.09755v1 |
null |
2025-01-16 |
Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps |
Nanye Ma et.al. |
2501.09732v1 |
null |
2025-01-16 |
Predictions as Surrogates: Revisiting Surrogate Outcomes in the Age of AI |
Wenlong Ji et.al. |
2501.09731v1 |
null |
2025-01-16 |
A Simple Aerial Detection Baseline of Multimodal Language Models |
Qingyun Li et.al. |
2501.09720v1 |
link |
2025-01-16 |
CyberMentor: AI Powered Learning Tool Platform to Address Diverse Student Needs in Cybersecurity Education |
Tianyu Wang et.al. |
2501.09709v1 |
null |
2025-01-16 |
Faber-Krahn inequality for the heat content on quantum graphs via random walk expansion |
Patrizio Bifulco et.al. |
2501.09693v1 |
null |
2025-01-16 |
Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models |
Fengli Xu et.al. |
2501.09686v1 |
null |
2025-01-16 |
Reward-Guided Controlled Generation for Inference-Time Alignment in Diffusion Models: Tutorial and Review |
Masatoshi Uehara et.al. |
2501.09685v1 |
null |
2025-01-16 |
Data mining the functional architecture of the brain's circuitry |
Adam S. Charles et.al. |
2501.09684v1 |
null |
2025-01-16 |
Statistical inference for interacting innovation processes and related general results |
Giacomo Aletti et.al. |
2501.09648v1 |
null |
2025-01-16 |
Chromatic Purity in Hermitian K-Theory at $p=2$ |
Jordan Levin et.al. |
2501.09633v1 |
null |
2025-01-16 |
Empowering Large Language Models in Wireless Communication: A Novel Dataset and Fine-Tuning Framework |
Yushen Lin et.al. |
2501.09631v1 |
null |
2025-01-16 |
Beyond Reward Hacking: Causal Rewards for Large Language Model Alignment |
Chaoqi Wang et.al. |
2501.09620v1 |
null |
2025-01-16 |
Managed-Retention Memory: A New Class of Memory for the AI Era |
Sergey Legtchenko et.al. |
2501.09605v1 |
null |
2025-01-16 |
Atleus: Accelerating Transformers on the Edge Enabled by 3D Heterogeneous Manycore Architectures |
Pratyush Dhingra et.al. |
2501.09588v1 |
null |
2025-01-16 |
A simple model of a sequence-reading diffusion: non-self-averaging and self-averaging properties |
Silvio Kalaj et.al. |
2501.09583v1 |
null |
2025-01-16 |
AdaFV: Accelerating VLMs with Self-Adaptive Cross-Modality Attention Mixture |
Jiayi Han et.al. |
2501.09532v1 |
null |
2025-01-16 |
Confidence Estimation for Error Detection in Text-to-SQL Systems |
Oleg Somov et.al. |
2501.09527v1 |
null |
2025-01-16 |
Augmenting a Large Language Model with a Combination of Text and Visual Data for Conversational Visualization of Global Geospatial Data |
Omar Mena et.al. |
2501.09521v1 |
null |
2025-01-16 |
Recovering latent linkage structures and spillover effects with structural breaks in panel data models |
Ryo Okui et.al. |
2501.09517v1 |
null |
2025-01-16 |
PIER: A Novel Metric for Evaluating What Matters in Code-Switching |
Enes Yavuz Ugan et.al. |
2501.09512v1 |
null |
2025-01-16 |
Resolving the $χ_{\rm eff}$-$q$ correlation among Coalescing Binary Black Holes and Evidence for AGN-driven Hierarchical Mergers |
Yin-Jie Li et.al. |
2501.09495v1 |
null |
2025-01-16 |
On the robustness of exoplanet atmospheric detections: insights from extensive simulations |
A. Sánchez-López et.al. |
2501.09494v1 |
null |
2025-01-16 |
Semiparametrics via parametrics and contiguity |
Adam Lee et.al. |
2501.09483v1 |
null |
2025-01-16 |
Control and its applications in additive combinatorics |
Thomas F. Bloom et.al. |
2501.09470v1 |
null |
2025-01-16 |
RE-POSE: Synergizing Reinforcement Learning-Based Partitioning and Offloading for Edge Object Detection |
Jianrui Shi et.al. |
2501.09465v1 |
null |
2025-01-16 |
Pruning for Sparse Diffusion Models based on Gradient Flow |
Ben Wan et.al. |
2501.09464v1 |
null |
2025-01-16 |
Double Visual Defense: Adversarial Pre-training and Instruction Tuning for Improving Vision-Language Model Robustness |
Zeyu Wang et.al. |
2501.09446v1 |
null |