2024-11-29 | T2Vid: Translating Long Text into Multi-Image is the Catalyst for Video-LLMs | Shukang Yin et al. | 2411.19951v2 | link
2024-11-29 | AlphaTablets: A Generic Plane Representation for 3D Planar Reconstruction from Monocular Videos | Yuze He et al. | 2411.19950v1 | null
2024-11-29 | DELT: A Simple Diversity-driven EarlyLate Training for Dataset Distillation | Zhiqiang Shen et al. | 2411.19946v1 | link
2024-11-29 | VLSBench: Unveiling Visual Leakage in Multimodal Safety | Xuhao Hu et al. | 2411.19939v1 | null
2024-11-29 | On Domain-Specific Post-Training for Multimodal Large Language Models | Daixuan Cheng et al. | 2411.19930v1 | null
2024-11-29 | The scalar angular Teukolsky equation and its solution for the Taub-NUT spacetime | Felix Willenborg et al. | 2411.19919v1 | null
2024-11-29 | Classical and Quantum Algorithms for the Deterministic L-system Inductive Inference Problem | Ali Lotfi et al. | 2411.19906v1 | link
2024-11-29 | Noncommutative Model Selection for Data Clustering and Dimension Reduction Using Relative von Neumann Entropy | Araceli Guzmán-Tristán et al. | 2411.19902v1 | null
2024-11-29 | Enhanced anomaly detection in well log data through the application of ensemble GANs | Abdulrahman Al-Fakih et al. | 2411.19875v1 | link
2024-11-29 | SpaRC: Sparse Radar-Camera Fusion for 3D Object Detection | Philipp Wolters et al. | 2411.19860v1 | null
2024-11-29 | Towards Class-wise Robustness Analysis | Tejaswini Medi et al. | 2411.19853v1 | null
2024-11-29 | Neuroplasticity and Psychedelics: a comprehensive examination of classic and non-classic compounds in pre and clinical models | Claudio Agnorelli et al. | 2411.19840v1 | null
2024-11-29 | Kurepa trees, continuous images, and perfect set properties | Chris Lambie-Hanson et al. | 2411.19839v1 | null
2024-11-29 | Feedback-driven object detection and iterative model improvement | Sönke Tenckhoff et al. | 2411.19835v1 | link
2024-11-29 | SAT-HMR: Real-Time Multi-Person 3D Mesh Estimation via Scale-Adaptive Tokens | Chi Su et al. | 2411.19824v1 | null
2024-11-29 | Tuning the Legacy Survey of Space and Time (LSST) Observing Strategy for Solar System Science: Incremental Templates in Year 1 | James E. Robinson et al. | 2411.19796v1 | null
2024-11-29 | MoTe: Learning Motion-Text Diffusion Model for Multiple Generation Tasks | Yiming Wu et al. | 2411.19786v1 | null
2024-11-29 | PerLA: Perceptive 3D Language Assistant | Guofeng Mei et al. | 2411.19774v1 | null
2024-11-29 | LongVALE: Vision-Audio-Language-Event Benchmark Towards Time-Aware Omni-Modal Perception of Long Videos | Tiantian Geng et al. | 2411.19772v1 | null
2024-11-29 | LaVIDE: A Language-Vision Discriminator for Detecting Changes in Satellite Image with Map References | Shuguo Jiang et al. | 2411.19758v1 | null
2024-11-29 | A Comprehensive Content Verification System for ensuring Digital Integrity in the Age of Deep Fakes | RaviKanth Kaja et al. | 2411.19750v1 | null
2024-11-29 | Real-Time Anomaly Detection in Video Streams | Fabien Poirier et al. | 2411.19731v1 | null
2024-11-29 | JetFormer: An Autoregressive Generative Model of Raw Images and Text | Michael Tschannen et al. | 2411.19722v1 | null
2024-11-29 | MonoPP: Metric-Scaled Self-Supervised Monocular Depth Estimation by Planar-Parallax Geometry in Automotive Applications | Gasser Elazab et al. | 2411.19717v1 | null
2024-11-29 | Explaining the Impact of Training on Vision Models via Activation Clustering | Ahcène Boubekki et al. | 2411.19700v1 | null
2024-11-29 | Gated-Attention Feature-Fusion Based Framework for Poverty Prediction | Muhammad Umer Ramzan et al. | 2411.19690v1 | null
2024-11-29 | SURE-VQA: Systematic Understanding of Robustness Evaluation in Medical VQA Tasks | Kim-Celine Kahl et al. | 2411.19688v1 | link
2024-11-29 | MIRI Deep Imaging Survey (MIDIS) of the Hubble Ultra Deep Field | Göran Östlin et al. | 2411.19686v1 | null
2024-11-29 | Multimodal Whole Slide Foundation Model for Pathology | Tong Ding et al. | 2411.19666v1 | link
2024-11-29 | V-band Optoelectronic Oscillator for Earth Observation Applications | Dimitrios Kastritsis et al. | 2411.19663v1 | null