A Multimodal Misinformation Detector for COVID-19 Short Videos on TikTok

被引:37
作者
Shang, Lanyu [1 ]
Kou, Ziyi [1 ]
Zhang, Yang [2 ]
Wang, Dong [1 ]
机构
[1] Univ Illinois, Sch Informat Sci, Champaign, IL 61820 USA
[2] Univ Notre Dame, Dept Comp Sci & Engn, Notre Dame, IN 46556 USA
来源
2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA) | 2021年
基金
美国国家科学基金会;
关键词
D O I
10.1109/BigData52589.2021.9671928
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper studies an emerging and important problem of identifying misleading COVID-19 short videos where the misleading content is jointly expressed in the visual, audio, and textual content of videos. Existing solutions for misleading video detection mainly focus on the authenticity of videos or audios against AI algorithms (e.g., deepfake) or video manipulation, and are insufficient to address our problem where most videos are user-generated and intentionally edited. Two critical challenges exist in solving our problem: i) how to effectively extract information from the distractive and manipulated visual content in TikTok videos? ii) How to efficiently aggregate heterogeneous information across different modalities in short videos? To address the above challenges, we develop TikTec, a multimodal misinformation detection framework that explicitly exploits the captions to accurately capture the key information from the distractive video content, and effectively learns the composed misinformation that is jointly conveyed by the visual and audio content. We evaluate TikTec on a real-world COVID-19 video dataset collected from TikTok. Evaluation results show that TikTec achieves significant performance gains compared to state-of-the-art baselines in accurately detecting misleading COVID-19 short videos.
引用
收藏
页码:899 / 908
页数:10
相关论文
共 39 条
[1]  
Adams S. H., 2021, J ADOLESCENT HLTH, P2021
[2]  
Afchar D, 2018, IEEE INT WORKS INFOR
[3]   Deep Audio-Visual Speech Recognition [J].
Afouras, Triantafyllos ;
Chung, Joon Son ;
Senior, Andrew ;
Vinyals, Oriol ;
Zisserman, Andrew .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (12) :8717-8727
[4]  
Alam Firoj, 2021, ARXIV210312541
[5]   Deepfake Video Detection through Optical Flow based CNN [J].
Amerini, Irene ;
Galteri, Leonardo ;
Caldelli, Roberto ;
Del Bimbo, Alberto .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, :1205-1207
[6]   Fake news and COVID-19: modelling the predictors of fake news sharing among social media users [J].
Apuke, Oberiri Destiny ;
Omar, Bahiyah .
TELEMATICS AND INFORMATICS, 2021, 56
[7]  
Bang Yejin, 2021, ARXIV210103841
[8]  
Bidokhti A, 2015, 2015 INTERNATIONAL SYMPOSIUM ON ARTIFICIAL INTELLIGENCE AND SIGNAL PROCESSING (AISP), P13, DOI 10.1109/AISP.2015.7123529
[9]  
Brennen J.S., 2020, Factsheet, V7, P1
[10]   Multi-modal Transformer for Video Retrieval [J].
Gabeur, Valentin ;
Sun, Chen ;
Alahari, Karteek ;
Schmid, Cordelia .
COMPUTER VISION - ECCV 2020, PT IV, 2020, 12349 :214-229