Recurrent Neural Networks With TF-IDF Embedding Technique for Detection and Classification in Tweets of Dengue Disease

被引:31
作者
Amin, Samina [1 ]
Uddin, M. Irfan [1 ]
Hassan, Saima [1 ]
Khan, Atif [2 ]
Nasser, Nidal [3 ]
Alharbi, Abdullah [4 ]
Alyami, Hashem [5 ]
机构
[1] Kohat Univ Sci & Technol, Inst Comp, Kohat 26000, Pakistan
[2] Islamia Coll Peshawar, Dept Comp Sci, Peshawar 25120, Pakistan
[3] Alfaisal Univ, Coll Engn, Riyadh 11533, Saudi Arabia
[4] Taif Univ, Dept Informat Technol, Coll Comp & Informat Technol, At Taif 21944, Saudi Arabia
[5] Taif Univ, Dept Comp Sci, Coll Comp & Informat Technol, At Taif 21944, Saudi Arabia
关键词
Deep learning; disease classification; machine learning; RNN; text processing; SOCIAL MEDIA;
D O I
10.1109/ACCESS.2020.3009058
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the increased usage of Web 2.0 and data-affluent tools such as social media platforms and web blog services, the challenge of extracting public sentiment and disseminating personal health information has become more common than ever in the last decade. This paper proposes a novel model for Dengue disease detection based on social media posts alone. The model does not access any personal information of people or any medical record. The model extracts the presence of patients infected with Dengue disease based on tweets only and decides whether it is a general discussion about the disease, and no one is actually infected, or people are actually infected with that disease. The identification of people infected with Dengue is determined by clinical tests, but the propose technique is used for automatic surveillance and identification of regions where the spread is happening at an alarming rate and guide healthcare professional to take necessary actions to control the spread. This paper uses different machine/deep learning approaches to utilize tweets data for automatic and efficient disease detection. Experimental results demonstrate that the proposed model is able to achieve 92% accuracy compared to the current state-of-the-art techniques in this domain.
引用
收藏
页码:131522 / 131533
页数:12
相关论文
共 67 条
[1]   Using online social networks to track a pandemic: A systematic review [J].
Al-garadi, Mohammed Ali ;
Khan, Muhammad Sadiq ;
Varathan, Kasturi Dewi ;
Mujtaba, Ghulam ;
Al-Kabsi, Abdelkodose M. .
JOURNAL OF BIOMEDICAL INFORMATICS, 2016, 62 :1-11
[2]   Preliminary Flu Outbreak Prediction Using Twitter Posts Classification and Linear Regression With Historical Centers for Disease Control and Prevention Reports: Prediction Framework Study [J].
Alessa, Ali ;
Faezipour, Miad .
JMIR PUBLIC HEALTH AND SURVEILLANCE, 2019, 5 (02) :97-120
[3]   A review of influenza detection and prediction through social networking sites [J].
Alessa, Ali ;
Faezipour, Miad .
THEORETICAL BIOLOGY AND MEDICAL MODELLING, 2018, 15
[4]   Can We Predict a Riot? Disruptive Event Detection Using Twitter [J].
Alsaedi, Nasser ;
Burnap, Pete ;
Rana, Omer .
ACM TRANSACTIONS ON INTERNET TECHNOLOGY, 2017, 17 (02)
[5]   SCANCPECLENS: A Framework for Automatic Lexicon Generation and Sentiment Analysis of Micro Blogging Data on China Pakistan Economic Corridor [J].
Amina, Bibi ;
Azim, Tayyaba .
IEEE ACCESS, 2019, 7 :133876-133887
[6]   Trends in dengue research in the Philippines: A systematic review [J].
An Agrupis, Kristal ;
Ylade, Michelle ;
Aldaba, Josephine ;
Lena Lopez, Anna ;
Deen, Jacqueline .
PLOS NEGLECTED TROPICAL DISEASES, 2019, 13 (04)
[7]  
Artstein R., 2017, Handbook of linguistic annotation, DOI 10.1007/978-94-024-0881-211
[8]  
Ayukekbong James Ayukepi, 2017, World J Virol, V6, P9, DOI 10.5501/wjv.v6.i1.9
[9]  
Baruah DowerahTrisha., 2012, INT J SCI RES PUBLIC, V2, P1
[10]   LEARNING LONG-TERM DEPENDENCIES WITH GRADIENT DESCENT IS DIFFICULT [J].
BENGIO, Y ;
SIMARD, P ;
FRASCONI, P .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1994, 5 (02) :157-166