Topic Modelling for Identification of Vaccine Reactions in Twitter

被引:12
作者
Habibabadi, Sedigheh Khademi [1 ]
Haghighi, Pari Delir [1 ]
机构
[1] Monash Univ, Informat Technol, Melbourne, Vic, Australia
来源
PROCEEDINGS OF THE AUSTRALASIAN COMPUTER SCIENCE WEEK MULTICONFERENCE (ACSW 2019) | 2019年
关键词
Topic modelling; Vaccine safety surveillance; Social media; Twitter; TWEETS; TEXT;
D O I
10.1145/3290688.3290735
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Detection of vaccine safety signals depends on various established reporting systems, where there is inevitably a lag between an adverse reaction to a vaccine and the reporting of it, and subsequent processing of reports. Therefore, it is desirable to try and detect safety signals earlier, ideally close to real-time. Extensive use of social media has provided a platform for sharing and seeking health-related information, and the immediacy of social media conversations mean that they are an ideal candidate for early detection of vaccine safety signals. The objective of this study is to evaluate topic models for identifying user posts on Twitter that most likely contain vaccine safety signals. This is an initial step in the overall research to determine if reliable vaccine safety signals can be detected in social media streams. The techniques used were focused on identifying the model design and number of topics that best revealed documents that contained vaccine safety signals, to assist with dimension reduction and subsequent labelling of the text data. The study compared Gensim LDA, MALLET, and jLDADMM DMM models to determine the most effective model for detecting vaccine safety signals, assisted by an evaluation process that used an adjusted F-Scoring technique over a labelled subset of the documents.
引用
收藏
页数:10
相关论文
共 37 条
[11]  
[Anonymous], PYLDAVIS PYTHON LIB
[12]  
[Anonymous], 2010, P LREC 2010 WORKSHOP
[13]   Epidemiological study of severe febrile reactions in young children in Western Australia caused by a 2010 trivalent inactivated influenza vaccine [J].
Armstrong, P. K. ;
Dowse, G. K. ;
Effler, P. V. ;
Carcione, D. ;
Blyth, C. C. ;
Richmond, P. C. ;
Geelhoed, G. C. ;
Mascaro, F. ;
Scully, M. ;
Weeramanthri, T. S. .
BMJ OPEN, 2011, 1 (01)
[14]   A general framework to expand short text for topic modeling [J].
Bicalho, Paulo ;
Pita, Marcelo ;
Pedrosa, Gabriel ;
Lacerda, Anisio ;
Pappa, Gisele L. .
INFORMATION SCIENCES, 2017, 393 :66-81
[15]   Probabilistic Topic Models [J].
Blei, David ;
Carin, Lawrence ;
Dunson, David .
IEEE SIGNAL PROCESSING MAGAZINE, 2010, 27 (06) :55-65
[16]   Latent Dirichlet allocation [J].
Blei, DM ;
Ng, AY ;
Jordan, MI .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) :993-1022
[17]   Active surveillance for adverse events following immunization [J].
Crawford, Nigel W. ;
Clothier, Hazel ;
Hodgson, Kate ;
Selvaraj, Gowri ;
Easton, Mee Lee ;
Buttery, Jim P. .
EXPERT REVIEW OF VACCINES, 2014, 13 (02) :265-276
[18]   Improving the safety of vaccine delivery [J].
Evans, Huw P. ;
Cooper, Alison ;
Williams, Huw ;
Carson-Stevens, Andrew .
HUMAN VACCINES & IMMUNOTHERAPEUTICS, 2016, 12 (05) :1280-+
[19]   What are we 'tweeting' about obesity? Mapping tweets with topic modeling and Geographic Information System [J].
Ghosh, Debarchana ;
Guha, Rajarshi .
CARTOGRAPHY AND GEOGRAPHIC INFORMATION SCIENCE, 2013, 40 (02) :90-102
[20]  
Honnibal M., 2017, spaCy 2: Natural language understanding with Bloom embeddings, convolutional neural networks and incremental parsing", V7, P411