Neural attention with character embeddings for hay fever detection from twitter

Cited by: 52
Authors
Du, Jiahua [1]
Michalska, Sandra [1]
Subramani, Sudha [1]
Wang, Hua [1]
Zhang, Yanchun [1]
Affiliations
[1] Victoria Univ, Inst Sustainable Ind & Liveable Cities, Melbourne, Vic, Australia
Keywords
Pollen allergy; Hay fever; Twitter; Deep learning; Classification; Agreement
DOI
10.1007/s13755-019-0084-2
Chinese Library Classification number
R-058
Abstract
The paper aims to leverage highly unstructured user-generated content for pollen allergy surveillance using neural networks with character embeddings and an attention mechanism. Currently, there is no accurate representation of hay fever prevalence, particularly in real-time scenarios. Social media serves as an alternative source of knowledge about the condition, which is valuable for allergy sufferers, general practitioners, and policy makers. Despite this tremendous potential, conventional natural language processing methods prove limited when exposed to the challenging nature of user-generated content. As a result, detecting actual hay fever instances among the many false positives and correctly identifying non-technical expressions as pollen allergy symptoms pose major problems. We propose a deep architecture enhanced with character embeddings and neural attention to improve the classification of hay fever-related content from Twitter data. The improvement in prediction is due to the character-level semantics introduced, which effectively address the out-of-vocabulary problem in our dataset, where the out-of-vocabulary rate is approximately 9%. Overall, the study is a step towards improved real-time pollen allergy surveillance from social media with state-of-the-art technology.
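The out-of-vocabulary problem mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the toy vocabulary, the example tweet, and the helper names `oov_rate` and `char_ids` are all assumptions made for illustration. The sketch measures how many tokens a word-level vocabulary misses, then shows that a character-level view still yields a usable input sequence for a misspelled token.

```python
def oov_rate(tokens, vocabulary):
    """Return the fraction of tokens missing from a word-level vocabulary."""
    if not tokens:
        return 0.0
    missing = sum(1 for tok in tokens if tok not in vocabulary)
    return missing / len(tokens)


def char_ids(token, char_index, unknown_id=0):
    """Map a token to a sequence of character ids (hypothetical helper).

    A character embedding layer would look these ids up, so even a token
    absent from the word vocabulary still receives a representation.
    """
    return [char_index.get(ch, unknown_id) for ch in token]


# Toy word vocabulary and tweet, both invented for this example.
vocabulary = {"my", "hay", "fever", "is", "bad", "today"}
tokens = "my hayfeverrr is sooo bad today".split()

# Character inventory: lowercase letters, with id 0 reserved for unknowns.
char_index = {ch: i for i, ch in enumerate("abcdefghijklmnopqrstuvwxyz", start=1)}

rate = oov_rate(tokens, vocabulary)       # "hayfeverrr" and "sooo" are OOV
ids = char_ids("hayfeverrr", char_index)  # every character is still known

print(f"OOV rate: {rate:.2f}")
print("Character ids:", ids)
```

In this toy case two of the six tokens are out of vocabulary at the word level, yet every character of the misspelled token maps to a known id, which is the property a character-embedding model relies on.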
Pages: 7