Mitigating Object and Action Hallucinations in Multimodal LLMs via Self-Augmented Contrastive Alignment

Kai-Po Chang, Wei-Yuan Cheng, Chi-Pin Huang, Fu-en Yang, Yu-Chiang Frank Wang

Publication: IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2026

Last updated on Dec 5, 2025