Using clinical Natural Language Processing for health outcomes research: Overview and actionable suggestions for future advances

被引:137
作者
Velupillai, Sumithra [1 ,2 ]
Suominen, Hanna [3 ,4 ]
Liakata, Maria [5 ]
Roberts, Angus [1 ]
Shah, Anoop D. [6 ,7 ]
Morley, Katherine [1 ,8 ]
Osborn, David [9 ,10 ]
Hayes, Joseph [9 ,10 ]
Stewart, Robert [1 ,11 ]
Downs, Johnny [1 ,11 ]
Chapman, Wendy [12 ]
Dutta, Rina [1 ,11 ]
机构
[1] Kings Coll London, Inst Psychiat Psychol & Neurosci, London, England
[2] KTH, Sch Elect Engn & Comp Sci, Stockholm, Sweden
[3] Australian Natl Univ, CSIRO Data61, Univ Canberra, Coll Engn & Comp Sci, Canberra, ACT, Australia
[4] Univ Turku, Turku, Finland
[5] Univ Warwick, Alan Turing Inst, Dept Comp Sci, Coventry, W Midlands, England
[6] UCL, Inst Hlth Informat, London, England
[7] Univ Coll London NHS Fdn Trust, London, England
[8] Univ Melbourne, Melbourne Sch Populat & Global Hlth, Melbourne, Vic, Australia
[9] UCL, Div Psychiat, London, England
[10] Camden & Islington NHS Fdn Trust, London, England
[11] South London & Maudsley NHS Fdn Trust, London, England
[12] Univ Utah, Dept Biomed Informat, Salt Lake City, UT 84112 USA
基金
英国惠康基金; 英国医学研究理事会; 英国工程与自然科学研究理事会; 瑞典研究理事会;
关键词
Natural Language Processing; Information extraction; Text analytics; Evaluation; Clinical informatics; Mental Health Informatics; Epidemiology; Public Health; OF-THE-ART; SPEECH RECOGNITION; INFORMATION; PREDICTION; RECORDS; TEXT; DOCUMENTATION;
D O I
10.1016/j.jbi.2018.10.005
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The importance of incorporating Natural Language Processing (NLP) methods in clinical informatics research has been increasingly recognized over the past years, and has led to transformative advances. Typically, clinical NLP systems are developed and evaluated on word, sentence, or document level annotations that model specific attributes and features, such as document content (e.g., patient status, or report type), document section types (e.g., current medications, past medical history, or discharge summary), named entities and concepts (e.g., diagnoses, symptoms, or treatments) or semantic attributes (e.g., negation, severity, or temporality). From a clinical perspective, on the other hand, research studies are typically modelled and evaluated on a patient-or population-level, such as predicting how a patient group might respond to specific treatments or patient monitoring over time. While some NLP tasks consider predictions at the individual or group user level, these tasks still constitute a minority. Owing to the discrepancy between scientific objectives of each field, and because of differences in methodological evaluation priorities, there is no clear alignment between these evaluation approaches. Here we provide a broad summary and outline of the challenging issues involved in defining appropriate intrinsic and extrinsic evaluation methods for NLP research that is to be used for clinical outcomes research, and vice versa. A particular focus is placed on mental health research, an area still relatively understudied by the clinical NLP research community, but where NLP methods are of notable relevance. Recent advances in clinical NLP method development have been significant, but we propose more emphasis needs to be placed on rigorous evaluation for the field to advance further. To enable this, we provide actionable suggestions, including a minimal protocol that could be used when reporting clinical NLP method development and its evaluation.
引用
收藏
页码:11 / 19
页数:9
相关论文
共 91 条
[1]  
Althoff T., ABS160504462 CORR
[2]   Visualising Conversation Structure across Time: Insights into Effective Doctor-Patient Consultations [J].
Angus, Daniel ;
Watson, Bernadette ;
Smith, Andrew ;
Gallois, Cindy ;
Wiles, Janet .
PLOS ONE, 2012, 7 (06)
[3]  
[Anonymous], 2015, P 2 WORKSH COMP LING, DOI [DOI 10.3115/V1/W15-1204, 10.3115/v1/W15-1204]
[4]  
[Anonymous], 2017, Long Papers, DOI DOI 10.18653/V1/E17-1015
[5]  
[Anonymous], 2014, P NTCIR 11 C
[6]  
[Anonymous], 2016, P 3 WORKSHOP COMPUTA, DOI DOI 10.18653/V1/W16-0312
[7]   Use of electronic health records in identifying drug and alcohol misuse among psychiatric in-patients [J].
Bell, James ;
Kilic, Cise ;
Prabakaran, Reena ;
Wang, Yuan Yuan ;
Wilson, Robin ;
Broadbent, Matthew ;
Kumar, Anil ;
Curtis, Vivienne .
PSYCHIATRIC BULLETIN, 2013, 37 (01) :15-20
[8]   The REporting of studies Conducted using Observational Routinely-collected health Data (RECORD) Statement [J].
Benchimol, Eric I. ;
Smeeth, Liam ;
Guttmann, Astrid ;
Harron, Katie ;
Moher, David ;
Petersen, Irene ;
Sorensen, Henrik T. ;
von Elm, Erik ;
Langan, Sinead M. .
PLOS MEDICINE, 2015, 12 (10)
[9]  
Bethard S., 2016, P 10 INT WORKSH SEV, P1052
[10]  
Bethard Steven., 2015, P 9 INT WORKSHOP SEM, P806, DOI DOI 10.18653/V1/S15-2136