2025-01-16 |
Distilling Multi-modal Large Language Models for Autonomous Driving |
Deepti Hegde et.al. |
2501.09757v1 |
null |
2025-01-16 |
Enhancing Lexicon-Based Text Embeddings with Large Language Models |
Yibin Lei et.al. |
2501.09749v1 |
null |
2025-01-16 |
Cueless EEG imagined speech for subject identification: dataset and benchmarks |
Ali Derakhshesh et.al. |
2501.09700v1 |
link |
2025-01-16 |
Fine-Grained Image-Text Correspondence with Cost Aggregation for Open-Vocabulary Part Segmentation |
Jiho Choi et.al. |
2501.09688v1 |
null |
2025-01-16 |
Analyzing Continuous Semantic Shifts with Diachronic Word Similarity Matrices |
Hajime Kiyama et.al. |
2501.09538v1 |
link |
2025-01-16 |
HydraMix: Multi-Image Feature Mixing for Small Data Image Classification |
Christoph Reinders et.al. |
2501.09504v1 |
null |
2025-01-16 |
VanGogh: A Unified Multimodal Diffusion-based Framework for Video Colorization |
Zixun Fang et.al. |
2501.09499v1 |
null |
2025-01-16 |
The Devil is in the Details: Simple Remedies for Image-to-LiDAR Representation Learning |
Wonjun Jo et.al. |
2501.09485v1 |
null |
2025-01-16 |
Justification logics with counterfactual and relevant conditionals |
Meghdad Ghari et.al. |
2501.09471v1 |
null |
2025-01-16 |
Scaling up self-supervised learning for improved surgical foundation models |
Tim J. M. Jaspers et.al. |
2501.09436v1 |
link |
2025-01-16 |
AugRefer: Advancing 3D Visual Grounding via Cross-Modal Augmentation and Spatial Relation-based Referring |
Xinyi Wang et.al. |
2501.09428v1 |
null |
2025-01-16 |
Joint Transmission and Deblurring: A Semantic Communication Approach Using Events |
Pujing Yang et.al. |
2501.09396v1 |
null |
2025-01-16 |
SVIA: A Street View Image Anonymization Framework for Self-Driving Applications |
Dongyu Liu et.al. |
2501.09393v1 |
null |
2025-01-16 |
Contract-Inspired Contest Theory for Controllable Image Generation in Mobile Edge Metaverse |
Guangyuan Liu et.al. |
2501.09391v1 |
null |
2025-01-16 |
Adaptive Contextual Caching for Mobile Edge Large Language Model Service |
Guangyuan Liu et.al. |
2501.09383v1 |
null |
2025-01-16 |
Image Segmentation with transformers: An Overview, Challenges and Future |
Deepjyoti Chetia et.al. |
2501.09372v1 |
null |
2025-01-16 |
PICE: A Semantic-Driven Progressive Inference System for LLM Serving in Cloud-Edge Networks |
Huiyou Zhan et.al. |
2501.09367v1 |
null |
2025-01-16 |
ChartInsighter: An Approach for Mitigating Hallucination in Time-series Chart Summary Generation with A Benchmark Dataset |
Fen Wang et.al. |
2501.09349v1 |
link |
2025-01-16 |
Algorithm for Semantic Network Generation from Texts of Low Resource Languages Such as Kiswahili |
Barack Wamkaya Wanjawa et.al. |
2501.09326v1 |
null |
2025-01-16 |
Shape-Based Single Object Classification Using Ensemble Method Classifiers |
Nur Shazwani Kamarudin et.al. |
2501.09311v1 |
null |
2025-01-16 |
Finding the Trigger: Causal Abductive Reasoning on Video Events |
Thao Minh Le et.al. |
2501.09304v1 |
null |
2025-01-16 |
LAVCap: LLM-based Audio-Visual Captioning using Optimal Transport |
Kyeongha Rho et.al. |
2501.09291v1 |
link |
2025-01-16 |
Interoceptive Robots for Convergent Shared Control in Collaborative Construction Work |
Xiaoshan Zhou et.al. |
2501.09290v1 |
null |
2025-01-16 |
Text Semantics to Flexible Design: A Residential Layout Generation Method Based on Stable Diffusion Model |
Zijin Qiu et.al. |
2501.09279v1 |
null |
2025-01-16 |
Text-guided Synthetic Geometric Augmentation for Zero-shot 3D Understanding |
Kohei Torimi et.al. |
2501.09278v1 |
null |
2025-01-16 |
Knowledge Distillation for Image Restoration : Simultaneous Learning from Degraded and Clean Images |
Yongheng Zhang et.al. |
2501.09268v1 |
null |
2025-01-16 |
Leveraging Scale-aware Representations for improved Concept-Representation Alignment in ViTs |
Sanchit Sinha et.al. |
2501.09221v1 |
null |
2025-01-16 |
A Simple Graph Contrastive Learning Framework for Short Text Classification |
Yonghao Liu et.al. |
2501.09219v1 |
link |
2025-01-16 |
Interpretable Droplet Digital PCR Assay for Trustworthy Molecular Diagnostics |
Yuanyuan Wei et.al. |
2501.09218v1 |
null |
2025-01-16 |
Boosting Short Text Classification with Multi-Source Information Exploration and Dual-Level Contrastive Learning |
Yonghao Liu et.al. |
2501.09214v1 |
link |