A natural language processing pipeline to advance the use of Twitter data for digital epidemiology of adverse pregnancy outcomes

被引:12
作者
Klein A.Z. [1 ]
Cai H. [1 ]
Weissenbacher D. [1 ]
Levine L.D. [2 ]
Gonzalez-Hernandez G. [1 ]
机构
[1] Department of Biostatistics, Epidemiology, and Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA
[2] Maternal and Child Health Research Center, Department of Obstetrics and Gynecology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA
基金
美国国家卫生研究院;
关键词
Data mining; Epidemiology; Machine learning; Natural language processing; Pregnancy; Social media;
D O I
10.1016/j.yjbinx.2020.100076
中图分类号
学科分类号
摘要
Background: In the United States, 17% of pregnancies end in fetal loss: miscarriage or stillbirth. Preterm birth affects 10% of live births in the United States and is the leading cause of neonatal death globally. Preterm births with low birthweight are the second leading cause of infant mortality in the United States. Despite their prevalence, the causes of miscarriage, stillbirth, and preterm birth are largely unknown. Objective: The primary objectives of this study are to (1) assess whether women report miscarriage, stillbirth, and preterm birth, among others, on Twitter, and (2) develop natural language processing (NLP) methods to automatically identify users from which to select cases for large-scale observational studies. Methods: We handcrafted regular expressions to retrieve tweets that mention an adverse pregnancy outcome, from a database containing more than 400 million publicly available tweets posted by more than 100,000 users who have announced their pregnancy on Twitter. Two annotators independently annotated 8109 (one random tweet per user) of the 22,912 retrieved tweets, distinguishing those reporting that the user has personally experienced the outcome (“outcome” tweets) from those that merely mention the outcome (“non-outcome” tweets). Inter-annotator agreement was κ = 0.90 (Cohen's kappa). We used the annotated tweets to train and evaluate feature-engineered and deep learning-based classifiers. We further annotated 7512 (of the 8109) tweets to develop a generalizable, rule-based module designed to filter out reported speech—that is, posts containing what was said by others—prior to automatic classification. We performed an extrinsic evaluation assessing whether the reported speech filter could improve the detection of women reporting adverse pregnancy outcomes on Twitter. Results: The tweets annotated as “outcome” include 1632 women reporting miscarriage, 119 stillbirth, 749 preterm birth or premature labor, 217 low birthweight, 558 NICU admission, and 458 fetal/infant loss in general. A deep neural network, BERT-based classifier achieved the highest overall F1-score (0.88) for automatically detecting “outcome” tweets (precision = 0.87, recall = 0.89), with an F1-score of at least 0.82 and a precision of at least 0.84 for each of the adverse pregnancy outcomes. Our reported speech filter significantly (P < 0.05) improved the accuracy of Logistic Regression (from 78.0% to 80.8%) and majority voting-based ensemble (from 81.1% to 82.9%) classifiers. Although the filter did not improve the F1-score of the BERT-based classifier, it did improve precision—a trade-off of recall that may be acceptable for automated case selection of more prevalent outcomes. Without the filter, reported speech is one of the main sources of errors for the BERT-based classifier. Conclusion: This study demonstrates that (1) women do report their adverse pregnancy outcomes on Twitter, (2) our NLP pipeline can automatically identify users from which to select cases for large-scale observational studies, and (3) our reported speech filter would reduce the cost of annotating health-related social media data and can significantly improve the overall performance of feature-based classifiers. © 2020 The Authors
引用
收藏
相关论文
共 38 条
[1]  
Gregory E.C.W., MacDorman M.F., Martin J.A., Trends in fetal and perinatal mortality in the United States, 2006–2012, NCHS Data Brief, 169, pp. 1-8, (2014)
[2]  
Gregory E.C.W., Drake P., Martin J.A., Lack of change in perinatal mortality in the United States, 2014–2016, NCHS Data Brief, 316, pp. 1-8, (2018)
[3]  
MacDorman M.F., Matthews T.J., Mohangoo A.D., Zeitlin J., International comparisons of infant mortality and related factors: United States and Europe, 2010, Natl. Vital Stat. Rep., 63, 5, pp. 1-6, (2014)
[4]  
Ventura S.J., Curtin S.C., Abma J.C., Estimated pregnancy rates and rates of pregnancy outcomes for the United States, 1990–2008, Natl. Vital Stat. Rep., 60, 7, pp. 1-21, (2012)
[5]  
MacDorman M.F., Gregory E.C.W., Fetal and perinatal mortality: United States, 2013, Natl. Vital Stat. Rep., 64, 8, pp. 1-24, (2015)
[6]  
Ammon Avalos L., Galindo C., Li D.K., A systematic review to calculate background miscarriage rates using life table analysis, Birth Defects Res. A Clin. Mol. Teratol., 94, 6, pp. 417-423, (2012)
[7]  
Ferre C., Callaghan W., Olson C., Sharma A., Barfield W., Effects of maternal age and age-specific preterm birth rates on overall preterm birth rates—United States, 2007 and 2014, MMWR Morb. Mortal Wkly. Rep., 65, 43, pp. 1181-1184, (2016)
[8]  
Liu L., Johnson H.L., Cousens S., Perin J., Scott S., Lawn J.E., Rudan I., Campbell H., Cibulskis R., Li M., Mathers C., Black R.E., Global, regional, and national causes of child mortality: an updated systematic analysis for 2010 with time trends since 2000, Lancet, 379, 9832, pp. 2151-2161, (2012)
[9]  
Xu J., Murphy S.L., Kochanek K., Bastian B., Arias E., Deaths: final data for 2016, Natl. Vital Stat. Rep., 67, 5, pp. 1-76, (2018)
[10]  
Regan L., Rai R., Epidemiology and the medical causes of miscarriage, Baillieres Best Pract, Res. Clin. Obstet. Gynaecol., 14, 5, pp. 839-854, (2000)