Analysis of Text Feature Extractors using Deep Learning on Fake News

Cited by: 17
Authors
Ahmed, Bilal [1]
Ali, Gulsher [1]
Hussain, Arif [1]
Buriro, Abdul Baseer [1]
Ahmed, Junaid [1]
Affiliation
[1] Sukkur IBA Univ, Dept Elect Engn, Sukkur, Pakistan
Keywords
fake news; natural language processing; feature extractors; deep learning
DOI
10.48084/etasr.4069
Chinese Library Classification
T [Industrial Technology]
Discipline classification code
08
Abstract
Social media and easy internet access have allowed the instant sharing of news, ideas, and information on a global scale. However, the same rapid spread and instant access also let rumors and fake news propagate easily and quickly. To monitor and limit the spread of fake news in the digital community, fake news detection using Natural Language Processing (NLP) has attracted significant attention. In NLP, different text feature extractors and word embeddings are used to process text data. The aim of this paper is to analyze the performance of a neural-network-based fake news detection model using three feature extractors: the TF-IDF vectorizer, GloVe embeddings, and BERT embeddings. For the evaluation, multiple metrics, namely accuracy, precision, recall, F1, AUC ROC, and AUC PR, were computed for each feature extractor. The output of each transformation technique was fed to the deep learning model. BERT embeddings delivered the best performance. TF-IDF performed far better than GloVe and was competitive with BERT at some stages.
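As an illustration of the simplest of the three feature extractors named in the abstract, the following is a minimal sketch of TF-IDF vectorization in plain Python. It is not the authors' implementation: the function name `tfidf_vectors`, the whitespace tokenizer, the smoothed IDF formula, the L2 normalization, and the three toy headlines are all assumptions chosen for illustration, and the record does not state the settings used in the paper.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """Return (vocabulary, vectors) for a list of documents.

    Assumptions (not taken from the paper): lowercase whitespace
    tokenization, smoothed IDF = ln((1 + n) / (1 + df)) + 1, and
    L2 normalization of each document vector.
    """
    tokenized = [doc.lower().split() for doc in docs]
    vocab = sorted({tok for toks in tokenized for tok in toks})
    n_docs = len(tokenized)

    # Document frequency: number of documents containing each token.
    df = Counter(tok for toks in tokenized for tok in set(toks))
    idf = {tok: math.log((1 + n_docs) / (1 + df[tok])) + 1.0 for tok in vocab}

    vectors = []
    for toks in tokenized:
        counts = Counter(toks)
        raw = [counts[tok] * idf[tok] for tok in vocab]
        norm = math.sqrt(sum(v * v for v in raw)) or 1.0  # guard empty docs
        vectors.append([v / norm for v in raw])
    return vocab, vectors


# Hypothetical toy headlines, invented for this example.
docs = [
    "breaking news shocking cure found",
    "government publishes annual budget report",
    "shocking report reveals secret cure",
]
vocab, vectors = tfidf_vectors(docs)
```

Each row of `vectors` is a fixed-length numeric representation of one document and could serve as input to a classifier. Tokens that appear in fewer documents receive a higher weight than tokens shared across documents, which is the property that makes TF-IDF useful as a baseline feature extractor.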
Pages: 7001-7005
Page count: 5