Complex Network and Source Inspired COVID-19 Fake News Classification on Twitter

被引:30
作者
Qureshi, Khubaib Ahmed [1 ]
Malick, Rauf Ahmed Shams [2 ]
Sabih, Muhammad [3 ]
Cherifi, Hocine [4 ]
机构
[1] DHA Suffa Univ, Dept Comp Sci, Karachi 75500, Pakistan
[2] Natl Univ Comp & Emerging Sci, Dept Comp Sci, Karachi 75300, Pakistan
[3] DHA Suffa Univ, Dept Elect Engn, Karachi 75500, Pakistan
[4] Univ Burgundy, Lab Informat Bourgogne, LIB, F-21078 Dijon, France
关键词
Social networking (online); Feature extraction; COVID-19; Complex networks; Biological system modeling; Knowledge based systems; Blogs; Context based model; complex network measures; dis; misinformation; hybrid model; machine learning; pandemics; source based; social media; social network analysis; FALSE NEWS; INFORMATION;
D O I
10.1109/ACCESS.2021.3119404
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In COVID-19 related infodemic, social media becomes a medium for wrongdoers to spread rumors, fake news, hoaxes, conspiracies, astroturf memes, clickbait, satire, smear campaigns, and other forms of deception. It puts a tremendous strain on society by damaging reputation, public trust, freedom of expression, journalism, justice, truth, and democracy. Therefore, it is of paramount importance to detect and contain unreliable information. Multiple techniques have been proposed to detect fake news propagation in tweets based on tweets content, propagation on the network of users, and the profile of the news generators. Generating human-like content allows deceiving content-based methods. Network-based methods rely on the complete graph to detect fake news, resulting in late detection. User profile-based techniques are effective for bots or fake accounts detection. However, they are not suited to detect fake news from original accounts. To deal with the shortcomings in existing methods, we introduce a source-based method focusing on the news propagators' community, including posters and re-tweeters to detect such contents. Propagators are connected using follower-following relations. A feature set combining the connectivity patterns of news propagators with their profile features is used in a machine learning framework to perform binary classification of tweets. Complex network measures and user profile features are also examined separately. We perform an extensive comparative analysis of the proposed methodology on a real-world COVID-19 dataset, exploiting various machine learning and deep learning models at the community and node levels. Results show that hybrid features perform better than network features and user features alone. Further optimization demonstrates that Ensemble's boosting model CATBoost and deep learning model RNN are the most effective, with an AUC score of 98%. Furthermore, preliminary results show that the proposed solution can also handle fake news in the political and entertainment domain using a small training set.
引用
收藏
页码:139636 / 139656
页数:21
相关论文
共 82 条
[1]   An Epidemic Model for News Spreading on Twitter [J].
Abdullah, Saeed ;
Wu, Xindong .
2011 23RD IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2011), 2011, :163-169
[2]  
Ahmed S., 2019, P AAAI C ART INT, V12, P8
[3]   Fake News Identification on Twitter with Hybrid CNN and RNN Models [J].
Ajao, Oluwaseun ;
Bhowmik, Deepayan ;
Zargari, Shahrzad .
SMSOCIETY'18: PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON SOCIAL MEDIA AND SOCIETY, 2018, :226-230
[4]   Lies Kill, Facts Save: Detecting COVID-19 Misinformation in Twitter [J].
Al-Rakhami, Mabrook S. ;
Al-Amri, Atif M. .
IEEE ACCESS, 2020, 8 :155961-155970
[5]  
Al-Zaman M.S., 2021, JOURNAL MEDIA, V2, P100, DOI [10.3390/journalmedia2010007, DOI 10.3390/JOURNALMEDIA2010007]
[6]   COVID-19 Public Sentiment Insights: A Text Mining Approach to the Gulf Countries [J].
Albahli, Approach Saleh ;
Algsham, Ahmad ;
Aeraj, Shamsulhaq ;
Alsaeed, Muath ;
Alrashed, Muath ;
Rauf, Hafiz Tayyab ;
Arif, Muhammad ;
Mohammed, Mazin Abed .
CMC-COMPUTERS MATERIALS & CONTINUA, 2021, 67 (02) :1613-1627
[7]   Social Media and Fake News in the 2016 Election [J].
Allcott, Hunt ;
Gentzkow, Matthew .
JOURNAL OF ECONOMIC PERSPECTIVES, 2017, 31 (02) :211-235
[8]  
Amador J., 2017, ARXIV171205999
[9]  
[Anonymous], 2014, PROC ICONF
[10]  
[Anonymous], 2012, P SIGCHI C HUM FACT, DOI DOI 10.1145/2207676.2208672