A systematic review of natural language processing and text mining of symptoms from electronic patient-authored text data

被引:116
作者
Dreisbach, Caitlin [1 ,2 ]
Koleck, Theresa A. [3 ]
Bourne, Philip E. [2 ,4 ]
Bakken, Suzanne [3 ,5 ,6 ]
机构
[1] Univ Virginia, Sch Nursing, Charlottesville, VA 22903 USA
[2] Univ Virginia, Data Sci Inst, Charlottesville, VA USA
[3] Columbia Univ, Sch Nursing, New York, NY USA
[4] Univ Virginia, Dept Biomed Engn, Charlottesville, VA USA
[5] Columbia Univ, Dept Biomed Informat, New York, NY USA
[6] Columbia Univ, Data Sci Inst, New York, NY USA
关键词
Natural language processing; Signs and symptoms; Electronic patient-authored text; Review; ADVERSE DRUG-REACTIONS; SOCIAL MEDIA; PHARMACOVIGILANCE; SCIENCE; CANCER; SURVEILLANCE; INFORMATION; MANAGEMENT; BENEFITS; DISEASES;
D O I
10.1016/j.ijmedinf.2019.02.008
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Objective: In this systematic review, we aim to synthesize the literature on the use of natural language processing (NLP) and text mining as they apply to symptom extraction and processing in electronic patient-authored text (ePAT). Materials and methods: A comprehensive literature search of 1964 articles from PubMed and EMBASE was narrowed to 21 eligible articles. Data related to purpose, text source, number of users and/or posts, evaluation metrics, and quality indicators were recorded. Results: Pain (n= 18) and fatigue and sleep disturbance (n= 18) were the most frequently evaluated symptom clinical content categories. Studies accessed ePAT from sources such as Twitter and online community forums or patient portals focused on diseases, including diabetes, cancer, and depression. Fifteen studies used NLP as a primary methodology. Studies reported evaluation metrics including the precision, recall, and F-measure for symptom-specific research questions. Discussion: NLP and text mining have been used to extract and analyze patient-authored symptom data in a wide variety of online communities. Though there are computational challenges with accessing ePAT, the depth of information provided directly from patients offers new horizons for precision medicine, characterization of sub-clinical symptoms, and the creation of personal health libraries as outlined by the National Library of Medicine. Conclusion: Future research should consider the needs of patients expressed through ePAT and its relevance to symptom science. Understanding the role that ePAT plays in health communication and real-time assessment of symptoms, through the use of NLP and text mining, is critical to a patient-centered health system.
引用
收藏
页码:37 / 46
页数:10
相关论文
共 51 条
  • [1] Alvaro Nestor, 2017, JMIR Public Health Surveill, V3, pe24, DOI 10.2196/publichealth.6396
  • [2] [Anonymous], 2017, PAR17159 NIH
  • [3] Towards linking patients and clinical information: detecting UMLS concepts in e-mail
    Brennan, PF
    Aronson, AR
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2003, 36 (4-5) : 334 - 341
  • [4] Natural language processing in mental health applications using non-clinical texts
    Calvo, Rafael A.
    Milne, David N.
    Hussain, M. Sazzad
    Christensen, Helen
    [J]. NATURAL LANGUAGE ENGINEERING, 2017, 23 (05) : 649 - 685
  • [5] Automatable algorithms to identify nonmedical opioid use using electronic data: a systematic review
    Canan, Chelsea
    Polinski, Jennifer M.
    Alexander, G. Caleb
    Kowal, Mary K.
    Brennan, Troyen A.
    Shrank, William H.
    [J]. JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2017, 24 (06) : 1204 - 1210
  • [6] National Institutes of Health Symptom Science Model sheds light on patient symptoms
    Cashion, Ann K.
    Gill, Jessica
    Hawes, Rebecca
    Henderson, Wendy A.
    Saligan, Leorey
    [J]. NURSING OUTLOOK, 2016, 64 (05) : 499 - 506
  • [7] Deep learning for pharmacovigilance: recurrent neural network architectures for labeling adverse drug reactions in Twitter posts
    Cocos, Anne
    Fiks, Alexander G.
    Masino, Aaron J.
    [J]. JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2017, 24 (04) : 813 - 821
  • [8] A comparison of rule-based and machine learning approaches for classifying patient portal messages
    Cronin, Robert M.
    Fabbri, Daniel
    Denny, Joshua C.
    Rosenbloom, S. Trent
    Jackson, Gretchen Purcell
    [J]. INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2017, 105 : 110 - 120
  • [9] Social media for arthritis-related comparative effectiveness and safety research and the impact of direct-to-consumer advertising
    Curtis, Jeffrey R.
    Chen, Lang
    Higginbotham, Phillip
    Nowell, W. Benjamin
    Gal-Levy, Ronit
    Willig, James
    Safford, Monika
    Coe, Joseph
    O'Hara, Kaitlin
    Sa'adon, Roee
    [J]. ARTHRITIS RESEARCH & THERAPY, 2017, 19
  • [10] Advancing the science of symptom management
    Dodd, M
    Janson, S
    Facione, N
    Faucett, J
    Froelicher, ES
    Humphreys, J
    Lee, K
    Miaskowski, C
    Puntillo, K
    Rankin, S
    Taylor, D
    [J]. JOURNAL OF ADVANCED NURSING, 2001, 33 (05) : 668 - 676