Trustworthy Data and AI Environments for Clinical Prediction: Application to Crisis-Risk in People With Depression

被引:8
作者
Msosa, Yamiko Joseph [1 ]
Grauslys, Arturas [2 ]
Zhou, Yifan [2 ]
Wang, Tao [1 ]
Buchan, Iain [3 ]
Langan, Paul [4 ]
Foster, Steven [5 ]
Walker, Michael [4 ]
Pearson, Michael [3 ]
Folarin, Amos [1 ]
Roberts, Angus [1 ]
Maskell, Simon [2 ]
Dobson, Richard [1 ]
Kullu, Cecil [5 ]
Kehoe, Dennis [4 ]
机构
[1] Kings Coll London, Inst Psychiat Psychol & Neurosci, Dept Biostat & Hlth Informat, London WC2R 2LS, England
[2] Univ Liverpool, Dept Elect Engn & Elect, Liverpool L69 3BX, Merseyside, England
[3] Univ Liverpool, Inst Populat Hlth, Liverpool L69 3BX, Merseyside, England
[4] AIMES, Liverpool L7 9NJ, Merseyside, England
[5] Mersey Care NHS Fdn Trust, Prescot L34 1PJ, England
关键词
Artificial intelligence; depression; digital health; mental health crisis; natural language processing; HEALTH; CARE; OUTCOMES;
D O I
10.1109/JBHI.2023.3312011
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Depression is a common mental health condition that often occurs in association with other chronic illnesses, and varies considerably in severity. Electronic Health Records (EHRs) contain rich information about a patient's medical history and can be used to train, test and maintain predictive models to support and improve patient care. This work evaluated the feasibility of implementing an environment for predicting mental health crisis among people living with depression based on both structured and unstructured EHRs. A large EHR from a mental health provider, Mersey Care, was pseudonymised and ingested into the Natural Language Processing (NLP) platform CogStack, allowing text content in binary clinical notes to be extracted. All unstructured clinical notes and summaries were semantically annotated by MedCAT and BioYODIE NLP services. Cases of crisis in patients with depression were then identified. Random forest models, gradient boosting trees, and Long Short-Term Memory (LSTM) networks, with varying feature arrangement, were trained to predict the occurrence of crisis. The results showed that all the prediction models can use a combination of structured and unstructured EHR information to predict crisis in patients with depression with good and useful accuracy. The LSTM network that was trained on a modified dataset with only 1000 most-important features from the random forest model with temporality showed the best performance with a mean AUC of 0.901 and a standard deviation of 0.006 using a training dataset and a mean AUC of 0.810 and 0.01 using a hold-out test dataset. Comparing the results from the technical evaluation with the views of psychiatrists shows that there are now opportunities to refine and integrate such prediction models into pragmatic point-of-care clinical decision support tools for supporting mental healthcare delivery.
引用
收藏
页码:5588 / 5598
页数:11
相关论文
共 55 条
[1]  
Abadi M, 2016, PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P265
[2]  
Alderwick H, 2019, BMJ-BRIT MED J, V364, DOI [10.38192/12.1.4, 10.1136/bmj.l84]
[3]   Big Data for Health [J].
Andreu-Perez, Javier ;
Poon, Carmen C. Y. ;
Merrifield, Robert D. ;
Wong, Stephen T. C. ;
Yang, Guang-Zhong .
IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2015, 19 (04) :1193-1208
[4]   An overview of MetaMap: historical perspective and recent advances [J].
Aronson, Alan R. ;
Lang, Francois-Michel .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2010, 17 (03) :229-236
[5]   Why does the NHS struggle to adopt eHealth innovations? A review of macro, meso and micro factors [J].
Asthana, Sheena ;
Jones, Ray ;
Sheaff, Rod .
BMC HEALTH SERVICES RESEARCH, 2019, 19 (01)
[6]   The Unified Medical Language System (UMLS): integrating biomedical terminology [J].
Bodenreider, O .
NUCLEIC ACIDS RESEARCH, 2004, 32 :D267-D270
[7]   Characteristics, comorbidities, and correlates of atypical depression: evidence from the UK Biobank Mental Health Survey [J].
Brailean, Anamaria ;
Curtis, Jessica ;
Davis, Katrina ;
Dregan, Alexandru ;
Hotopf, Matthew .
PSYCHOLOGICAL MEDICINE, 2020, 50 (07) :1129-1138
[8]  
Breiman L, 2001, MACH LEARN, V45, P5, DOI [10.1186/s12859-018-2419-4, 10.3322/caac.21834]
[9]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[10]  
de Lusignan Simon, 2014, Inform Prim Care, V21, P61, DOI 10.14236/jhi.v21i2.68