Identifying Cantonese rumors with discriminative feature integration in online social networks

被引:8
作者
Chen, Xinyu [1 ]
Wang, Haizhou [1 ]
Ke, Liang [1 ]
Lu, Zhipeng [1 ]
Su, Hanjian [1 ]
Chen, Xingshu [1 ]
机构
[1] Sichuan Univ, Sch Cyber Sci & Engn, Chengdu 610207, Peoples R China
基金
中国国家自然科学基金;
关键词
Cantonese rumor detection; Social network; Feature fusion; Deep learning; FAKE NEWS; IDENTIFICATION; PROPAGATION; CLASSIFICATION; TEXT;
D O I
10.1016/j.eswa.2022.119347
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
To reduce the negative impacts of rumors on the real world, rumor detection on social networks has practical significance. Currently, the research on Chinese rumor detection is relatively comprehensive, but Cantonese rumors are much less investigated. As a main dialect of Chinese, Cantonese has more than 60 million speakers globally, but there are some great challenges in the Cantonese rumor detection. Firstly, there is no available benchmark dataset of Cantonese rumors. Secondly, it is a significant challenge to learn the unique linguistic characteristics of Cantonese. Thirdly, traditional rumor detection approaches cannot be directly applied to Cantonese rumors. Therefore, we propose a novel framework for Cantonese rumor detection using deep neural networks with feature fusion. To the best of our knowledge, it is the first study conducted on Cantonese rumor detection on social networks. Specifically, we build a Cantonese rumor dataset and a multi-domain Cantonese corpus. Next, a total of 27 statistical features are extracted and seven of them are newly proposed. Then, a novel deep learning model called BLA is designed to identify Cantonese rumors, which generates text and Jyutping embeddings using a further pre-trained BERT model and a CNN model. Moreover, the BLA model integrates the statistical and semantic features to implement the classification of Cantonese rumors. Experiments demonstrate that the BLA model achieves a remarkable Cantonese rumor detection performance with an F1 Score of 0.9225.
引用
收藏
页数:16
相关论文
共 71 条
[1]   Detecting breaking news rumors of emerging topics in social media [J].
Alkhodair, Sarah A. ;
Ding, Steven H. H. ;
Fung, Benjamin C. M. ;
Liu, Junqiang .
INFORMATION PROCESSING & MANAGEMENT, 2020, 57 (02)
[2]   Rumor detection in Arabic tweets using semi-supervised and unsupervised expectation-maximization [J].
Alzanin, Samah M. ;
Azmi, Aqil M. .
KNOWLEDGE-BASED SYSTEMS, 2019, 185
[3]  
Bauer RS, 2016, Global Chinese, V2, P115, DOI [10.1515/glochi-2016-0007, 10.1515/glochi-2016-0007, DOI 10.1515/GLOCHI-2016-0007, https://doi.org/10.1515/glochi-2016-0007]
[4]  
Bauer RS, 2018, Global Chinese, V4, P103, DOI [10.1515/glochi-2018-0006, 10.1515/glochi-2018-0006, DOI 10.1515/GLOCHI-2018-0006]
[5]  
Bian T, 2020, AAAI CONF ARTIF INTE, V34, P549
[6]   A survey on fake news and rumour detection techniques [J].
Bondielli, Alessandro ;
Marcelloni, Francesco .
INFORMATION SCIENCES, 2019, 497 :38-55
[7]  
Cai GY, 2014, 2014 PROCEEDINGS OF THE IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING (ASONAM 2014), P912, DOI 10.1109/ASONAM.2014.6921694
[8]  
Carletta J, 1996, COMPUT LINGUIST, V22, P249
[9]  
Castillo C., 2011, Proceedings of the 20th international conference on World wide web, WWW'11, (New York, NY, USA), P675, DOI DOI 10.1145/1963405.1963500
[10]   Statistical Features-Based Real-Time Detection of Drifted Twitter Spam [J].
Chen, Chao ;
Wang, Yu ;
Zhang, Jun ;
Xiang, Yang ;
Zhou, Wanlei ;
Min, Geyong .
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2017, 12 (04) :914-925