Generation and evaluation of artificial mental health records for Natural Language Processing

Cited by: 0

Authors:

Julia Ive

Natalia Viani

Joyce Kam

Lucia Yin

Somain Verma

Stephen Puntis

Rudolf N. Cardinal

Angus Roberts

Robert Stewart

Sumithra Velupillai

Affiliations:

[1] Imperial College London, Department of Computing

[2] King’s College London, IoPPN

[3] University of Oxford, Department of Psychiatry

[4] Warneford Hospital, Department of Psychiatry

[5] University of Cambridge, Cambridge Biomedical Campus

[6] Cambridgeshire and Peterborough NHS Foundation Trust

[7] South London and Maudsley NHS Foundation Trust

Source:

npj Digital Medicine, Volume 3

Abstract:

A serious obstacle to the development of Natural Language Processing (NLP) methods in the clinical domain is the accessibility of textual data. The mental health domain is particularly challenging, partly because clinical documentation relies heavily on free text that is difficult to de-identify completely. This problem could be tackled by using artificial medical data. In this work, we present an approach to generate artificial clinical documents. We apply this approach to discharge summaries from a large mental healthcare provider and discharge summaries from an intensive care unit. We perform an extensive intrinsic evaluation where we (1) apply several measures of text preservation; (2) measure how much the model memorises training data; and (3) estimate clinical validity of the generated text based on a human evaluation task. Furthermore, we perform an extrinsic evaluation by studying the impact of using artificial text in a downstream NLP text classification task. We found that using this artificial data as training data can lead to classification results that are comparable to the original results. Additionally, using only a small amount of information from the original data to condition the generation of the artificial data is successful, which holds promise for reducing the risk of these artificial data retaining rare information from the original data. This is an important finding for our long-term goal of being able to generate artificial clinical data that can be released to the wider research community and accelerate advances in developing computational methods that use healthcare data.
