跳到主要导航 跳到搜索 跳到主要内容

From Predictions to Analyses: Rationale-Augmented Fake News Detection with Large Vision-Language Models

  • Xiaofan Zheng
  • , Zinan Zeng
  • , Heng Wang
  • , Yuyang Bai
  • , Yuhan Liu
  • , Minnan Luo
  • Xi'an Jiaotong University

科研成果: 书/报告/会议事项章节会议稿件同行评审

14 引用 (Scopus)

摘要

The rapid development of social media has led to a surge of eye-catching fake news on the Internet, with multimodal news comprising both images and text being particularly prevalent. To address the challenges of Multimodal Fake News Detection (MFND), numerous supervised task-specific Multimodal Small Language Models (MSLMs) have been developed. However, these models lack the breadth of knowledge and the depth of language understanding, which results in unsatisfactory adaptability, generalization, and explainability performance. To address these issues, we attempt to introduce Large Vision-Language Models (LVLMs), aiming to leverage the common sense understanding and logical reasoning abilities of LVLMs for the MFND task. We observed that LVLMs can generate reasonable analyses of news content from specific angles. However, when it comes to synthesizing these analyses for final judgment, their performance declines significantly, failing to meet the accuracy benchmarks set by existing MSLMs detection models. This reflects the need for a more effective way for LVLMs, which have not undergone task-specific training, to utilize their knowledge and capabilities. Based on these findings, we propose the Explainable Adaptive Rationale-Augmented Multimodal (EARAM) framework, which adaptively uses MSLMs to extract useful rationales from the multi-perspective analyses of LVLMs. After making judgments based on these rationales, EARAM then assists LVLMs in generating more reliable explanations. Extensive experiments demonstrate that our model not only achieves state-of-the-art results on widely used datasets but also significantly outperforms other models in terms of generalization and explainability.

源语言英语
主期刊名WWW 2025 - Proceedings of the ACM Web Conference
出版商Association for Computing Machinery, Inc
5364-5375
页数12
ISBN(电子版)9798400712746
DOI
出版状态已出版 - 28 4月 2025
活动34th ACM Web Conference, WWW 2025 - Sydney, 澳大利亚
期限: 28 4月 20252 5月 2025

出版系列

姓名WWW 2025 - Proceedings of the ACM Web Conference

会议

会议34th ACM Web Conference, WWW 2025
国家/地区澳大利亚
Sydney
时期28/04/252/05/25

学术指纹

探究 'From Predictions to Analyses: Rationale-Augmented Fake News Detection with Large Vision-Language Models' 的科研主题。它们共同构成独一无二的指纹。

引用此