A Novel Stacking Approach for Accurate Detection of Fake News

被引：84

作者：

Jiang, Tao ^{[1
]}

Li, Jian Ping ^{[1
]}

Ul Haq, Amin ^{[1
]}

Saboor, Abdus ^{[1
]}

Ali, Amjad ^{[2
]}

机构：

[1] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Chengdu 611731, Peoples R China

[2] Univ Swat, Dept Comp Sci & Software Technol, Mingora 19200, Pakistan

来源：

IEEE ACCESS | 2021年 / 9卷

基金：

中国国家自然科学基金; 国家高技术研究发展计划(863计划);

关键词：

Support vector machines; Machine learning; Social networking (online); Deep learning; Feature extraction; Stacking; Neural networks; Deception detection; deep learning; fake news; machine learning; McNemar’ s test; performance evaluation; stacking;

D O I：

10.1109/ACCESS.2021.3056079

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

With the increasing popularity of social media, people has changed the way they access news. News online has become the major source of information for people. However, much information appearing on the Internet is dubious and even intended to mislead. Some fake news are so similar to the real ones that it is difficult for human to identify them. Therefore, automated fake news detection tools like machine learning and deep learning models have become an essential requirement. In this paper, we evaluated the performance of five machine learning models and three deep learning models on two fake and real news datasets of different size with hold out cross validation. We also used term frequency, term frequency-inverse document frequency and embedding techniques to obtain text representation for machine learning and deep learning models respectively. To evaluate models' performance, we used accuracy, precision, recall and F1-score as the evaluation metrics and a corrected version of McNemar's test to determine if models' performance is significantly different. Then, we proposed our novel stacking model which achieved testing accuracy of 99.94% and 96.05 % respectively on the ISOT dataset and KDnugget dataset. Furthermore, the performance of our proposed method is high as compared to baseline methods. Thus, we highly recommend it for fake news detection.

引用

页码：22626 / 22639

页数：14

共 45 条

[1] MIMETIC: Mobile encrypted traffic classification using multimodal deep learning [J].

Aceto, Giuseppe ;

Ciuonzo, Domenico ;

Montieri, Antonio ;

Pescape, Antonio .

COMPUTER NETWORKS, 2019, 165

[2] Detecting opinion spams and fake news using text classification [J].

Ahmed, Hadeer ;

Traore, Issa ;

Saad, Sherif .

SECURITY AND PRIVACY, 2018, 1 (01)

[3] Fake News Identification on Twitter with Hybrid CNN and RNN Models [J].

Ajao, Oluwaseun ;

Bhowmik, Deepayan ;

Zargari, Shahrzad .

SMSOCIETY'18: PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON SOCIAL MEDIA AND SOCIETY, 2018, :226-230

[4] "Bend the truth": Benchmark dataset for fake news detection in Urdu language and its evaluation [J].

Amjad, Maaz ;

Sidorov, Grigori ;

Zhila, Alisa ;

Gomez-Adorno, Helena ;

Voronkov, Ilia ;

Gelbukh, Alexander .

JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2020, 39 (02) :2457-2469

[5]

[Anonymous], 2019, ARXIV 1902 06673

[6]

[Anonymous], 2014, EMNLP, DOI DOI 10.3115/V1

[7]

[Anonymous], 2011, Acm T. Intel. Syst. Tec., DOI [DOI 10.1145/1961189.1961199, 10.1145/1961189.1961199]

[8] Fake news detection in multiple platforms and languages [J].

Arruda Faustini, Pedro Henrique ;

Covoes, Thiago Ferreira .

EXPERT SYSTEMS WITH APPLICATIONS, 2020, 158

[9] A support vector machine classifier with rough set-based feature selection for breast cancer diagnosis [J].

Chen, Hui-Ling ;

Yang, Bo ;

Liu, Jie ;

Liu, Da-You .

EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (07) :9014-9022

[10]

Cristianini N., 2000, An Introduction to Support Vector Machines and Other Kernel-basedLearning Methods, DOI [DOI 10.1017/CBO9780511801389, 10.1017/CBO9780511801389]

← 1 2 3 4 5 →