Fake News Detection Using Deep Learning and Natural Language Processing

被引:2
作者
Matheven, Anand [1 ]
Venkata, Burra [1 ]
Kumar, Durga [1 ]
机构
[1] Xiamen Univ Malaysia, Sch Comp & Data Sci, Jalan Sunsuria, Sepang 43900, Selangor, Malaysia
来源
2022 9TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING & MACHINE INTELLIGENCE, ISCMI | 2022年
关键词
fake news; deep Learning; natural language processing;
D O I
10.1109/ISCMI56532.2022.10068440
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The rise of social media has brought the rise of fake news and this fake news comes with negative consequences. With fake news being such a huge issue, efforts should be made to identify any forms of fake news however it is not so simple. Manually identifying fake news can be extremely subjective as determining the accuracy of the information in a story is complex and difficult to perform, even for experts. On the other hand, an automated solution would require a good understanding of NLP which is also complex and may have difficulties producing an accurate output. Therefore, the main problem focused on this project is the viability of developing a system that can effectively and accurately detect and identify fake news. Finding a solution would be a significant benefit to the media industry, particularly the social media industry as this is where a large proportion of fake news is published and spread. In order to find a solution to this problem, this project proposed the development of a fake news identification system using deep learning and natural language processing. The system was developed using a Word2vec model combined with a Long Short-Term Memory model in order to showcase the compatibility of the two models in a whole system. This system was trained and tested using two different dataset collections that each consisted of one real news dataset and one fake news dataset. Furthermore, three independent variables were chosen which were the number of training cycles, data diversity and vector size to analyze the relationship between these variables and the accuracy levels of the system. It was found that these three variables did have a significant effect on the accuracy of the system. From this, the system was then trained and tested with the optimal variables and was able to achieve the minimum expected accuracy level of 90%. The achieving of this accuracy levels confirms the compatibility of the LSTM and Word2vec model and their capability to be synergized into a single system that is able to identify fake news with a high level of accuracy.
引用
收藏
页码:11 / 14
页数:4
相关论文
共 7 条
[1]  
Berduygina O. N., 2019, Media Watch, V10, P122
[2]   Sentiment Analysis Using Word2vec And Long Short-Term Memory (LSTM) For Indonesian Hotel Reviews [J].
Muhammad, Putra Fissabil ;
Kusumaningrum, Retno ;
Wibowo, Adi .
5TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND COMPUTATIONAL INTELLIGENCE 2020, 2021, 179 :728-735
[3]  
Naeem S.B., 2020, WILEY PUBLIC HLTH EM, DOI 10.1111%2Fhir.12320
[4]  
Rodr¡guez AI, 2019, Arxiv, DOI arXiv:1910.03496
[5]  
Rong X, 2016, Arxiv, DOI [arXiv:1411.2738, 10.48550/arXiv.1411.2738]
[6]  
Thota A., 2018, SMU Data Sci Rev, V1, P10
[7]   A review on the long short-term memory model [J].
Van Houdt, Greg ;
Mosquera, Carlos ;
Napoles, Gonzalo .
ARTIFICIAL INTELLIGENCE REVIEW, 2020, 53 (08) :5929-5955