2024-11-29 | T2Vid: Translating Long Text into Multi-Image is the Catalyst for Video-LLMs | Shukang Yin et al. | 2411.19951v2 | link
2024-11-29 | AlphaTablets: A Generic Plane Representation for 3D Planar Reconstruction from Monocular Videos | Yuze He et al. | 2411.19950v1 | null
2024-11-29 | DELT: A Simple Diversity-driven EarlyLate Training for Dataset Distillation | Zhiqiang Shen et al. | 2411.19946v1 | link
2024-11-29 | VLSBench: Unveiling Visual Leakage in Multimodal Safety | Xuhao Hu et al. | 2411.19939v1 | null
2024-11-29 | On Domain-Specific Post-Training for Multimodal Large Language Models | Daixuan Cheng et al. | 2411.19930v1 | null
2024-11-29 | The scalar angular Teukolsky equation and its solution for the Taub-NUT spacetime | Felix Willenborg et al. | 2411.19919v1 | null
2024-11-29 | Interacting Dark Sector (ETHOS $n=0$): Cosmological Constraints from SPT Cluster Abundance with DES and HST Weak Lensing Data | Asmaa Mazoun et al. | 2411.19911v1 | null
2024-11-29 | Classical and Quantum Algorithms for the Deterministic L-system Inductive Inference Problem | Ali Lotfi et al. | 2411.19906v1 | link
2024-11-29 | Noncommutative Model Selection for Data Clustering and Dimension Reduction Using Relative von Neumann Entropy | Araceli Guzmán-Tristán et al. | 2411.19902v1 | null
2024-11-29 | Enhanced anomaly detection in well log data through the application of ensemble GANs | Abdulrahman Al-Fakih et al. | 2411.19875v1 | link
2024-11-29 | DeMo: Decoupled Momentum Optimization | Bowen Peng et al. | 2411.19870v1 | link
2024-11-29 | RI-(S)MOM to $\overline{\rm MS}$ conversion for $B_K$ at two-loop order | Martin Gorbahn et al. | 2411.19861v1 | null
2024-11-29 | SpaRC: Sparse Radar-Camera Fusion for 3D Object Detection | Philipp Wolters et al. | 2411.19860v1 | null
2024-11-29 | Towards Class-wise Robustness Analysis | Tejaswini Medi et al. | 2411.19853v1 | null
2024-11-29 | Neuroplasticity and Psychedelics: a comprehensive examination of classic and non-classic compounds in pre and clinical models | Claudio Agnorelli et al. | 2411.19840v1 | null
2024-11-29 | Kurepa trees, continuous images, and perfect set properties | Chris Lambie-Hanson et al. | 2411.19839v1 | null
2024-11-29 | Feedback-driven object detection and iterative model improvement | Sönke Tenckhoff et al. | 2411.19835v1 | link
2024-11-29 | SAT-HMR: Real-Time Multi-Person 3D Mesh Estimation via Scale-Adaptive Tokens | Chi Su et al. | 2411.19824v1 | null
2024-11-29 | Gaussian multi-target filtering with target dynamics driven by a stochastic differential equation | Ángel F. García-Fernández et al. | 2411.19814v1 | null
2024-11-29 | Tuning the Legacy Survey of Space and Time (LSST) Observing Strategy for Solar System Science: Incremental Templates in Year 1 | James E. Robinson et al. | 2411.19796v1 | null
2024-11-29 | Noro: A Noise-Robust One-shot Voice Conversion System with Hidden Speaker Representation Capabilities | Haorui He et al. | 2411.19770v1 | null
2024-11-29 | Riemannian Denoising Score Matching for Molecular Structure Optimization with Accurate Energy | Jeheon Woo et al. | 2411.19769v1 | null
2024-11-29 | LaVIDE: A Language-Vision Discriminator for Detecting Changes in Satellite Image with Map References | Shuguo Jiang et al. | 2411.19758v1 | null
2024-11-29 | A Comprehensive Content Verification System for ensuring Digital Integrity in the Age of Deep Fakes | RaviKanth Kaja et al. | 2411.19750v1 | null
2024-11-29 | Real-Time Anomaly Detection in Video Streams | Fabien Poirier et al. | 2411.19731v1 | null
2024-11-29 | JetFormer: An Autoregressive Generative Model of Raw Images and Text | Michael Tschannen et al. | 2411.19722v1 | null
2024-11-29 | MonoPP: Metric-Scaled Self-Supervised Monocular Depth Estimation by Planar-Parallax Geometry in Automotive Applications | Gasser Elazab et al. | 2411.19717v1 | null
2024-11-29 | Explaining the Impact of Training on Vision Models via Activation Clustering | Ahcène Boubekki et al. | 2411.19700v1 | null
2024-11-29 | Gated-Attention Feature-Fusion Based Framework for Poverty Prediction | Muhammad Umer Ramzan et al. | 2411.19690v1 | null
2024-11-29 | SURE-VQA: Systematic Understanding of Robustness Evaluation in Medical VQA Tasks | Kim-Celine Kahl et al. | 2411.19688v1 | link