Understanding psychiatric illness through natural language processing (UNDERPIN): Rationale, design, and methodology

被引:3
作者
Kishimoto, Taishiro [1 ,2 ]
Nakamura, Hironobu [3 ]
Kano, Yoshinobu [4 ]
Eguchi, Yoko [1 ]
Kitazawa, Momoko [1 ]
Liang, Kuo-ching [1 ]
Kudo, Koki [1 ,5 ]
Sento, Ayako [1 ]
Takamiya, Akihiro [1 ]
Horigome, Toshiro [1 ]
Yamasaki, Toshihiko [6 ]
Sunami, Yuki [7 ]
Kikuchi, Toshiaki [1 ]
Nakajima, Kazuki [1 ]
Tomita, Masayuki [8 ]
Bun, Shogyoku [1 ,9 ]
Momota, Yuki [1 ]
Sawada, Kyosuke [1 ]
Murakami, Junichi [10 ]
Takahashi, Hidehiko [3 ]
Mimura, Masaru [1 ]
机构
[1] Keio Univ, Dept Neuropsychiat, Sch Med, Tokyo, Japan
[2] Keio Univ, Hills Joint Res Lab Future Prevent Med & Wellness, Sch Med, Tokyo, Japan
[3] Tokyo Med & Dent Univ, Grad Sch Med & Dent Sci, Dept Psychiat & Behav Sci, Tokyo, Japan
[4] Shizuoka Univ, Fac Informat, Shizuoka, Japan
[5] St Marianna Univ, Dept Neuropsychiat, Sch Med Hosp, Kawasaki, Japan
[6] Univ Tokyo, Grad Sch Informat Sci & Technol, Dept Informat & Commun Engn, Comp Vis & Media Lab,Yamasaki Lab, Tokyo, Japan
[7] Keio Univ, Sch Med, Tokyo, Japan
[8] Oizumi Hosp, Dept Psychiat, Tokyo, Japan
[9] Koutokukai Sato Hosp, Dept Psychiat, Yamagata, Japan
[10] Biwako Hosp, Dept Psychiat, Otsu, Japan
来源
FRONTIERS IN PSYCHIATRY | 2022年 / 13卷
基金
日本科学技术振兴机构;
关键词
language; psychiatric disorders; biomarker; machine learning; natural language processing (computer science); neurocognitive disorders; QUALITY-OF-LIFE; BIPOLAR DISORDER; PSYCHOSIS; SCHIZOPHRENIA; BIOMARKERS; IDENTIFICATION; DIAGNOSIS; NETWORK; PEOPLE;
D O I
10.3389/fpsyt.2022.954703
中图分类号
R749 [精神病学];
学科分类号
100205 ;
摘要
Introduction: Psychiatric disorders are diagnosed through observations of psychiatrists according to diagnostic criteria such as the DSM-5. Such observations, however, are mainly based on each psychiatrist's level of experience and often lack objectivity, potentially leading to disagreements among psychiatrists. In contrast, specific linguistic features can be observed in some psychiatric disorders, such as a loosening of associations in schizophrenia. Some studies explored biomarkers, but biomarkers have yet to be used in clinical practice. Aim: The purposes of this study are to create a large dataset of Japanese speech data labeled with detailed information on psychiatric disorders and neurocognitive disorders to quantify the linguistic features of those disorders using natural language processing and, finally, to develop objective and easy-to-use biomarkers for diagnosing and assessing the severity of them. Methods: This study will have a multi-center prospective design. The DSM-5 or ICD-11 criteria for major depressive disorder, bipolar disorder, schizophrenia, and anxiety disorder and for major and minor neurocognitive disorders will be regarded as the inclusion criteria for the psychiatric disorder samples. For the healthy subjects, the absence of a history of psychiatric disorders will be confirmed using the Mini-International Neuropsychiatric Interview (M.I.N.I.). The absence of current cognitive decline will be confirmed using the Mini-Mental State Examination (MMSE). A psychiatrist or psychologist will conduct 30-to-60-min interviews with each participant; these interviews will include free conversation, picture-description task, and story-telling task, all of which will be recorded using a microphone headset. In addition, the severity of disorders will be assessed using clinical rating scales. Data will be collected from each participant at least twice during the study period and up to a maximum of five times at an interval of at least one month. Discussion: This study is unique in its large sample size and the novelty of its method, and has potential for applications in many fields. We have some challenges regarding inter-rater reliability and the linguistic peculiarities of Japanese. As of September 2022, we have collected a total of > 1000 records from > 400 participants. To the best of our knowledge, this data sample is one of the largest in this field.
引用
收藏
页数:9
相关论文
共 50 条
  • [41] Investigating the impact of structured reporting on the linguistic standardization of radiology reports through natural language processing over a 10-year period
    Vosshenrich, Jan
    Nesic, Ivan
    Boll, Daniel T. T.
    Heye, Tobias
    EUROPEAN RADIOLOGY, 2023, 33 (11) : 7496 - 7506
  • [42] Virtual learning communities (VLCs) rethinking: influence on behavior modification-bullying detection through machine learning and natural language processing
    Nikiforos, Stefanos
    Tzanavarisl, Spyros
    Kermanidis, Katia-Lida
    JOURNAL OF COMPUTERS IN EDUCATION, 2020, 7 (04) : 531 - 551
  • [43] Investigating the impact of structured reporting on the linguistic standardization of radiology reports through natural language processing over a 10-year period
    Jan Vosshenrich
    Ivan Nesic
    Daniel T. Boll
    Tobias Heye
    European Radiology, 2023, 33 : 7496 - 7506
  • [44] Analyzing credit risk model problems through natural language processing-based clustering and machine learning: insights from validation reports
    Lis, Szymon
    Kubkowski, Mariusz
    Borkowska, Olimpia
    Serwa, Dobromil
    Kurpanik, Jaroslaw
    JOURNAL OF RISK MODEL VALIDATION, 2024, 18 (02): : 59 - 86
  • [45] A New Natural Language Processing-Inspired Methodology (Detection, Initial Characterization, and Semantic Characterization) to Investigate Temporal Shifts (Drifts) in Health Care Data: Quantitative Study
    Paiva, Bruno
    Goncalves, Marcos Andre
    Dutra da Rocha, Leonardo Chaves
    Marcolino, Milena Soriano
    Barbosa Lana, Fernanda Cristina
    Souza-Silva, Maira Viana Rego
    Almeida, Jussara M.
    Pereira, Polianna Delfino
    Valiense de Andrade, Claudio Moises
    dos Reis Gomes, Angelica Gomides
    Pires Ferreira, Maria Angelica
    Bartolazzi, Frederico
    Sacioto, Manuela Furtado
    Boscato, Ana Paula
    Guimaraes-Junior, Milton Henriques
    dos Reis, Priscilla Pereira
    Costa, Felicio Roberto
    Jorge, Alzira de Oliveira
    Coelho, Laryssa Reis
    Carneiro, Marcelo
    Souza Sales, Thais Lorenna
    Araujo, Silvia Ferreira
    Silveira, Daniel Vitorio
    Ruschel, Karen Brasil
    Veloso Santos, Fernanda Caldeira
    de Almeida Cenci, Evelin Paola
    Monteiro Menezes, Luanna Silva
    Anschau, Fernando
    Camargos Bicalho, Maria Aparecida
    Fernandes Manenti, Euler Roberto
    Finger, Renan Goulart
    Ponce, Daniela
    de Aguiar, Filipe Carrilho
    Marques, Luiza Margoto
    de Castro, Luis Cesar
    Vietta, Giovanna Grunewald
    de Godoy, Mariana Frizzo
    Vilaca, Mariana do Nascimento
    Morais, Vivian Costa
    JMIR MEDICAL INFORMATICS, 2024, 12
  • [46] Natural language processing to extract symptoms of severe mental illness from clinical text: the Clinical Record Interactive Search Comprehensive Data Extraction (CRIS-CODE) project
    Jackson, Richard G.
    Patel, Rashmi
    Jayatilleke, Nishamali
    Kolliakou, Anna
    Ball, Michael
    Gorrell, Genevieve
    Roberts, Angus
    Dobson, Richard J.
    Stewart, Robert
    BMJ OPEN, 2017, 7 (01):
  • [47] An exploratory database study of factors influencing the continuation of brexpiprazole treatment (prescription) in patients with schizophrenia using information from psychiatric electronic medical records processed with natural language processing
    Iyo, Masaomi
    Akiyoshi, Hisashi
    Sekine, Daisuke
    Shibasaki, Yoshiyuki
    Mamiya, Noriyuki
    SCHIZOPHRENIA RESEARCH, 2023, 255 : 122 - 131
  • [48] Identifying Patient Populations in Texts Describing Drug Approvals Through Deep Learning-Based Information Extraction: Development of a Natural Language Processing Algorithm
    Gendrin, Aline
    Souliotis, Leonidas
    Loudon-Griffiths, James
    Aggarwal, Ravisha
    Amoako, Daniel
    Desouza, Gregory
    Dimitrievska, Sashka
    Metcalfe, Paul
    Louvet, Emilie
    Sahni, Harpreet
    JMIR FORMATIVE RESEARCH, 2023, 7
  • [49] Motor signs in Alzheimer's disease and vascular dementia: Detection through natural language processing, co-morbid features and relationship to adverse outcomes
    Al-Harrasi, Ahmed M.
    Iqbal, Ehtesham
    Tsamakis, Konstantinos
    Lasek, Judista
    Gadelrab, Romayne
    Soysal, Pinar
    Kohlhoff, Enno
    Tsiptsios, Dimitrios
    Rizos, Emmanouil
    Perera, Gayan
    Aarsland, Dag
    Stewart, Robert
    Mueller, Christoph
    EXPERIMENTAL GERONTOLOGY, 2021, 146
  • [50] Integrating Natural Language Processing and Interpretive Thematic Analyses to Gain Human-Centered Design Insights on HIV Mobile Health: Proof-of-Concept Analysis
    Skeen, Simone J.
    Jones, Stephen Scott
    Cruse, Carolyn Marie
    Horvath, Keith J.
    JMIR HUMAN FACTORS, 2022, 9 (03):