2025-02-20 | LServe: Efficient Long-sequence LLM Serving with Unified Sparse Attention | Shang Yang et al. | 2502.14866v1 | null
2025-02-20 | Time Travel: A Comprehensive Benchmark to Evaluate LMMs on Historical and Cultural Artifacts | Sara Ghaboura et al. | 2502.14865v1 | null
2025-02-20 | Benchmarking Multimodal RAG through a Chart-based Document Question-Answering Generation Framework | Yuming Yang et al. | 2502.14864v1 | null
2025-02-20 | The Fourier coefficients of the holomorphic multiplicative chaos in the limit of large frequency | Joseph Najnudel et al. | 2502.14863v1 | null
2025-02-20 | FR-Spec: Accelerating Large-Vocabulary Language Models via Frequency-Ranked Speculative Sampling | Weilin Zhao et al. | 2502.14856v1 | null
2025-02-20 | Scaling Text-Rich Image Understanding via Code-Guided Synthetic Multimodal Data Generation | Yue Yang et al. | 2502.14846v1 | null
2025-02-20 | Dynamic Concepts Personalization from Single Videos | Rameen Abdal et al. | 2502.14844v1 | null
2025-02-20 | The law of thin processes: a law of large numbers for point processes | Matthew Aldridge et al. | 2502.14839v1 | null
2025-02-20 | Revealing and Mitigating Over-Attention in Knowledge Editing | Pinzheng Wang et al. | 2502.14838v1 | null
2025-02-20 | LongWriter-V: Enabling Ultra-Long and High-Fidelity Generation in Vision-Language Models | Shangqing Tu et al. | 2502.14834v1 | null
2025-02-20 | Improving the Diffusability of Autoencoders | Ivan Skorokhodov et al. | 2502.14831v1 | null
2025-02-20 | Exploring Advanced Techniques for Visual Question Answering: A Comprehensive Comparison | Aiswarya Baby et al. | 2502.14827v1 | null
2025-02-20 | Turning on the Light: Polymorphism-Induced Photoluminescence in Cysteine Crystals | Debarshi Banerjee et al. | 2502.14826v1 | null
2025-02-20 | MadVoro: Parallel Construction of Voronoi Diagrams in Distributed Memory Systems | Maor Mizrachi et al. | 2502.14825v1 | null
2025-02-20 | FetalCLIP: A Visual-Language Foundation Model for Fetal Ultrasound Image Analysis | Fadillah Maani et al. | 2502.14807v1 | null
2025-02-20 | Passive Demultiplexed Two-photon State Generation from a Quantum Dot | Yusuf Karli et al. | 2502.14806v1 | null
2025-02-20 | Noise Mitigation in Single Microwave Photon Counting by Cascaded Quantum Measurements | Alexandre S. May et al. | 2502.14804v1 | null
2025-02-20 | A Survey on Text-Driven 360-Degree Panorama Generation | Hai Wang et al. | 2502.14799v1 | null
2025-02-20 | Humanoid-VLA: Towards Universal Humanoid Control with Visual Integration | Pengxiang Ding et al. | 2502.14795v1 | null
2025-02-20 | An Adversarial Analysis of Thompson Sampling for Full-information Online Learning: from Finite to Infinite Action Spaces | Alexander Terenin et al. | 2502.14790v1 | null
2025-02-20 | Micro Blossom: Accelerated Minimum-Weight Perfect Matching Decoding for Quantum Error Correction | Yue Wu et al. | 2502.14787v1 | null
2025-02-20 | SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features | Michael Tschannen et al. | 2502.14786v1 | null
2025-02-20 | Real-Time Device Reach Forecasting Using HLL and MinHash Data Sketches | Chandrashekar Muniyappa et al. | 2502.14785v1 | null
2025-02-20 | Online Resource Management for the Uplink of Wideband Hybrid Beamforming System | Yuan Quan et al. | 2502.14784v1 | null
2025-02-20 | Tracking and Assigning Jobs to a Markov Machine | Subhankar Banerjee et al. | 2502.14783v1 | null
2025-02-20 | A Neural Operator-Based Emulator for Regional Shallow Water Dynamics | Peter Rivera-Casillas et al. | 2502.14782v1 | null
2025-02-20 | ReVision: A Dataset and Baseline VLM for Privacy-Preserving Task-Oriented Visual Instruction Rewriting | Abhijit Mishra et al. | 2502.14780v1 | null
2025-02-20 | DC-ControlNet: Decoupling Inter- and Intra-Element Conditions in Image Generation with Diffusion Models | Hongji Yang et al. | 2502.14779v1 | null
2025-02-20 | Harnessing PDF Data for Improving Japanese Large Multimodal Models | Jeonghun Baek et al. | 2502.14778v1 | null
2025-02-20 | SurveyX: Academic Survey Automation via Large Language Models | Xun Liang et al. | 2502.14776v1 | null