Generation and evaluation of artificial mental health records for Natural Language Processing

被引:0
|
作者
Julia Ive
Natalia Viani
Joyce Kam
Lucia Yin
Somain Verma
Stephen Puntis
Rudolf N. Cardinal
Angus Roberts
Robert Stewart
Sumithra Velupillai
机构
[1] Imperial College London,Department of Computing
[2] King’s College London,IoPPN
[3] University of Oxford,Department of Psychiatry
[4] Warneford Hospital,Department of Psychiatry
[5] University of Cambridge,Cambridge Biomedical Campus
[6] Cambridgeshire and Peterborough NHS Foundation Trust,undefined
[7] South London and Maudsley NHS Foundation Trust,undefined
来源
npj Digital Medicine | / 3卷
关键词
D O I
暂无
中图分类号
学科分类号
摘要
A serious obstacle to the development of Natural Language Processing (NLP) methods in the clinical domain is the accessibility of textual data. The mental health domain is particularly challenging, partly because clinical documentation relies heavily on free text that is difficult to de-identify completely. This problem could be tackled by using artificial medical data. In this work, we present an approach to generate artificial clinical documents. We apply this approach to discharge summaries from a large mental healthcare provider and discharge summaries from an intensive care unit. We perform an extensive intrinsic evaluation where we (1) apply several measures of text preservation; (2) measure how much the model memorises training data; and (3) estimate clinical validity of the generated text based on a human evaluation task. Furthermore, we perform an extrinsic evaluation by studying the impact of using artificial text in a downstream NLP text classification task. We found that using this artificial data as training data can lead to classification results that are comparable to the original results. Additionally, using only a small amount of information from the original data to condition the generation of the artificial data is successful, which holds promise for reducing the risk of these artificial data retaining rare information from the original data. This is an important finding for our long-term goal of being able to generate artificial clinical data that can be released to the wider research community and accelerate advances in developing computational methods that use healthcare data.
引用
收藏
相关论文
共 50 条
  • [41] Cohort design and natural language processing to reduce bias in electronic health records research
    Shaan Khurshid
    Christopher Reeder
    Lia X. Harrington
    Pulkit Singh
    Gopal Sarma
    Samuel F. Friedman
    Paolo Di Achille
    Nathaniel Diamant
    Jonathan W. Cunningham
    Ashby C. Turner
    Emily S. Lau
    Julian S. Haimovich
    Mostafa A. Al-Alusi
    Xin Wang
    Marcus D. R. Klarqvist
    Jeffrey M. Ashburner
    Christian Diedrich
    Mercedeh Ghadessi
    Johanna Mielke
    Hanna M. Eilken
    Alice McElhinney
    Andrea Derix
    Steven J. Atlas
    Patrick T. Ellinor
    Anthony A. Philippakis
    Christopher D. Anderson
    Jennifer E. Ho
    Puneet Batra
    Steven A. Lubitz
    npj Digital Medicine, 5
  • [42] Maternal mental health monitoring in an online community: a natural language processing approach
    Zhu, Zhen
    BEHAVIOUR & INFORMATION TECHNOLOGY, 2024,
  • [43] Natural language processing for mental health interventions: a systematic review and research framework
    Malgaroli, Matteo
    Hull, Thomas D.
    Zech, James M.
    Althoff, Tim
    TRANSLATIONAL PSYCHIATRY, 2023, 13 (01)
  • [44] Global Research on Pandemics or Epidemics and Mental Health: A Natural Language Processing Study
    Ye, Xin
    Wang, Xinfeng
    Lin, Hugo
    JOURNAL OF EPIDEMIOLOGY AND GLOBAL HEALTH, 2024, 14 (03) : 1268 - 1280
  • [45] Scalable Mental Health Analysis in the Clinical Whitespace via Natural Language Processing
    Coppersmith, Glen
    Hilland, Casey
    Frieder, Ophir
    Leary, Ryan
    2017 IEEE EMBS INTERNATIONAL CONFERENCE ON BIOMEDICAL & HEALTH INFORMATICS (BHI), 2017, : 393 - 396
  • [46] Natural language processing for mental health interventions: a systematic review and research framework
    Matteo Malgaroli
    Thomas D. Hull
    James M. Zech
    Tim Althoff
    Translational Psychiatry, 13
  • [47] Natural language processing of multi-hospital electronic health records for public health surveillance of suicidality
    Romain Bey
    Ariel Cohen
    Vincent Trebossen
    Basile Dura
    Pierre-Alexis Geoffroy
    Charline Jean
    Benjamin Landman
    Thomas Petit-Jean
    Gilles Chatellier
    Kankoe Sallah
    Xavier Tannier
    Aurelie Bourmaud
    Richard Delorme
    npj Mental Health Research, 3 (1):
  • [48] Natural Language Processing and Electronic Medical Records Reply
    Murff, Harvey J.
    FitzHenry, Fern
    Speroff, Theodore
    JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 2011, 306 (21): : 2325 - 2326
  • [49] Artificial Intelligence Algorithms and Natural Language Processing for the Recognition of Syncope Patients on Emergency Department Medical Records
    Dipaola, Franca
    Gatti, Mauro
    Pacetti, Veronica
    Bottaccioli, Anna Giulia
    Shiffer, Dana
    Minonzio, Maura
    Mene, Roberto
    Levra, Alessandro Giaj
    Solbiati, Monica
    Costantino, Giorgio
    Anastasio, Marco
    Sini, Elena
    Barbic, Franca
    Brunetta, Enrico
    Furlan, Raffaello
    JOURNAL OF CLINICAL MEDICINE, 2019, 8 (10)
  • [50] Natural language processing identification of documented mental health symptoms associated with risk of mental health disorders in patients with cancer
    Friesner, Isabel D.
    Mohindra, Somya
    Boreta, Lauren
    Chen, William Cheng
    Braunstein, Steve E.
    Rabow, Michael W.
    Hong, Julian C.
    JOURNAL OF CLINICAL ONCOLOGY, 2023, 41 (16)