2024-11-29 |
T2Vid: Translating Long Text into Multi-Image is the Catalyst for Video-LLMs |
Shukang Yin et.al. |
2411.19951v2 |
link |
2024-11-29 |
Perception Test 2024: Challenge Summary and a Novel Hour-Long VideoQA Benchmark |
Joseph Heyward et.al. |
2411.19941v1 |
null |
2024-11-29 |
Transfer Learning for High-dimensional Quantile Regression with Distribution Shift |
Ruiqi Bai et.al. |
2411.19933v1 |
null |
2024-11-29 |
Linearization (in)stabilities and crossed products |
Julian De Vuyst et.al. |
2411.19931v1 |
null |
2024-11-29 |
Sparse Partitions of Graphs with Bounded Clique Number |
António Girão et.al. |
2411.19915v1 |
null |
2024-11-29 |
Another look at inference after prediction |
Jessica Gronsbell et.al. |
2411.19908v1 |
link |
2024-11-29 |
Classical and Quantum Algorithms for the Deterministic L-system Inductive Inference Problem |
Ali Lotfi et.al. |
2411.19906v1 |
link |
2024-11-29 |
LUMIA: Linear probing for Unimodal and MultiModal Membership Inference Attacks leveraging internal LLM states |
Luis Ibanez-Lissen et.al. |
2411.19876v2 |
null |
2024-11-29 |
Reverse Thinking Makes LLMs Stronger Reasoners |
Justin Chih-Yao Chen et.al. |
2411.19865v1 |
null |
2024-11-29 |
Kurepa trees, continuous images, and perfect set properties |
Chris Lambie-Hanson et.al. |
2411.19839v1 |
null |
2024-11-29 |
Classical transport in a maximally chaotic chain |
William Alderson et.al. |
2411.19828v1 |
null |
2024-11-29 |
SAT-HMR: Real-Time Multi-Person 3D Mesh Estimation via Scale-Adaptive Tokens |
Chi Su et.al. |
2411.19824v1 |
null |
2024-11-29 |
GradAlign for Training-free Model Performance Inference |
Yuxuan Li et.al. |
2411.19819v1 |
null |
2024-11-29 |
The universality of the law of the wall: A long-lasting controversial debate |
Stefan Heinz et.al. |
2411.19805v1 |
null |
2024-11-29 |
Adjusting auxiliary variables under approximate neighborhood interference |
Xin Lu et.al. |
2411.19789v1 |
null |
2024-11-29 |
PerLA: Perceptive 3D Language Assistant |
Guofeng Mei et.al. |
2411.19774v1 |
null |
2024-11-29 |
Evidence-Based Threat Modeling for ICS |
Can Ozkan et.al. |
2411.19759v1 |
null |
2024-11-29 |
The role of ammonia in the distribution of volatiles in the primordial hydrosphere of Europa |
Alizée Amsler Moulanier et.al. |
2411.19743v1 |
null |
2024-11-29 |
A Statistical Study of Lopsided Galaxies using Random Forest |
Valentina Fontirroig et.al. |
2411.19723v1 |
null |
2024-11-29 |
JetFormer: An Autoregressive Generative Model of Raw Images and Text |
Michael Tschannen et.al. |
2411.19722v1 |
null |
2024-11-29 |
Know Your RAG: Dataset Taxonomy and Generation Strategies for Evaluating RAG Systems |
Rafael Teixeira de Lima et.al. |
2411.19710v1 |
null |
2024-11-29 |
Explaining the Impact of Training on Vision Models via Activation Clustering |
Ahcène Boubekki et.al. |
2411.19700v1 |
null |
2024-11-29 |
SURE-VQA: Systematic Understanding of Robustness Evaluation in Medical VQA Tasks |
Kim-Celine Kahl et.al. |
2411.19688v1 |
link |
2024-11-29 |
Privacy-Preserving Orthogonal Aggregation for Guaranteeing Gender Fairness in Federated Recommendation |
Siqing Zhang et.al. |
2411.19678v1 |
null |
2024-11-29 |
Large Nernst effect in Te-based van der Waals materials |
M. Behnami et.al. |
2411.19660v1 |
null |
2024-11-29 |
RMIO: A Model-Based MARL Framework for Scenarios with Observation Loss in Some Agents |
Shi Zifeng et.al. |
2411.19639v1 |
null |
2024-11-29 |
Accelerating Multimodal Large Language Models via Dynamic Visual-Token Exit and the Empirical Findings |
Qiong Wu et.al. |
2411.19628v1 |
link |
2024-11-29 |
GREAT: Geometry-Intention Collaborative Inference for Open-Vocabulary 3D Object Affordance Grounding |
Yawen Shao et.al. |
2411.19626v1 |
null |
2024-11-29 |
Materials Learning Algorithms (MALA): Scalable Machine Learning for Electronic Structure Calculations in Large-Scale Atomistic Simulations |
Attila Cangi et.al. |
2411.19617v1 |
null |
2024-11-29 |
Canonical correlation analysis of stochastic trends via functional approximation |
Massimo Franchi et.al. |
2411.19572v1 |
null |