2025-02-20 |
LServe: Efficient Long-sequence LLM Serving with Unified Sparse Attention |
Shang Yang et.al. |
2502.14866v1 |
null |
2025-02-20 |
Time Travel: A Comprehensive Benchmark to Evaluate LMMs on Historical and Cultural Artifacts |
Sara Ghaboura et.al. |
2502.14865v1 |
null |
2025-02-20 |
Benchmarking Multimodal RAG through a Chart-based Document Question-Answering Generation Framework |
Yuming Yang et.al. |
2502.14864v1 |
null |
2025-02-20 |
Interpretable Text Embeddings and Text Similarity Explanation: A Primer |
Juri Opitz et.al. |
2502.14862v1 |
null |
2025-02-20 |
Aligning LLMs to Ask Good Questions A Case Study in Clinical Reasoning |
Shuyue Stella Li et.al. |
2502.14860v1 |
null |
2025-02-20 |
The $p$-adic Galois Cohomology of Valuation Fields |
Tongmu He et.al. |
2502.14858v1 |
null |
2025-02-20 |
FR-Spec: Accelerating Large-Vocabulary Language Models via Frequency-Ranked Speculative Sampling |
Weilin Zhao et.al. |
2502.14856v1 |
null |
2025-02-20 |
Prompt-to-Leaderboard |
Evan Frick et.al. |
2502.14855v1 |
null |
2025-02-20 |
CLIPPER: Compression enables long-context synthetic data generation |
Chau Minh Pham et.al. |
2502.14854v1 |
null |
2025-02-20 |
Fast Generation of Weak Lensing Maps in Modified Gravity with COLA |
Sophie Hoyland et.al. |
2502.14851v1 |
null |
2025-02-20 |
Primordial full bispectra from the general bounce cosmology |
Shingo Akama et.al. |
2502.14850v1 |
null |
2025-02-20 |
GATE: Graph-based Adaptive Tool Evolution Across Diverse Tasks |
Jianwen Luo et.al. |
2502.14848v1 |
null |
2025-02-20 |
Scaling Text-Rich Image Understanding via Code-Guided Synthetic Multimodal Data Generation |
Yue Yang et.al. |
2502.14846v1 |
null |
2025-02-20 |
Update on the isospin breaking corrections to the HVP with C-periodic boundary conditions |
Anian Altherr et.al. |
2502.14845v1 |
null |
2025-02-20 |
Dynamic Concepts Personalization from Single Videos |
Rameen Abdal et.al. |
2502.14844v1 |
null |
2025-02-20 |
Slave-spin approach to the Anderson-Josephson quantum dot |
Andriani Keliri et.al. |
2502.14843v1 |
null |
2025-02-20 |
Revealing and Mitigating Over-Attention in Knowledge Editing |
Pinzheng Wang et.al. |
2502.14838v1 |
null |
2025-02-20 |
Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-based LLMs |
Tao Ji et.al. |
2502.14837v1 |
null |
2025-02-20 |
Adaptive Syndrome Extraction |
Noah Berthusen et.al. |
2502.14835v1 |
null |
2025-02-20 |
LongWriter-V: Enabling Ultra-Long and High-Fidelity Generation in Vision-Language Models |
Shangqing Tu et.al. |
2502.14834v1 |
null |
2025-02-20 |
Improving the Diffusability of Autoencoders |
Ivan Skorokhodov et.al. |
2502.14831v1 |
null |
2025-02-20 |
Middle-Layer Representation Alignment for Cross-Lingual Transfer in Fine-Tuned LLMs |
Danni Liu et.al. |
2502.14830v1 |
null |
2025-02-20 |
Measuring Faithfulness of Chains of Thought by Unlearning Reasoning Steps |
Martin Tutek et.al. |
2502.14829v1 |
null |
2025-02-20 |
Exploring Advanced Techniques for Visual Question Answering: A Comprehensive Comparison |
Aiswarya Baby et.al. |
2502.14827v1 |
null |
2025-02-20 |
MadVoro: Parallel Construction of Voronoi Diagrams in Distributed Memory Systems |
Maor Mizrachi et.al. |
2502.14825v1 |
null |
2025-02-20 |
Identification of soft modes in amorphous Al${2}$O$$ via first-principles |
Alexander C. Tyner et.al. |
2502.14823v1 |
null |
2025-02-20 |
A Survey of Model Architectures in Information Retrieval |
Zhichao Xu et.al. |
2502.14822v1 |
null |
2025-02-20 |
Meshless Shape Optimization using Neural Networks and Partial Differential Equations on Graphs |
Eloi Martinet et.al. |
2502.14821v1 |
null |
2025-02-20 |
eC-Tab2Text: Aspect-Based Text Generation from e-Commerce Product Tables |
Luis Antonio GutiƩrrez Guanilo et.al. |
2502.14820v1 |
null |
2025-02-20 |
Learning from Reward-Free Offline Data: A Case for Planning with Latent Dynamics Models |
Vlad Sobal et.al. |
2502.14819v1 |
null |