Understanding psychiatric illness through natural language processing (UNDERPIN): Rationale, design, and methodology

被引:3
作者
Kishimoto, Taishiro [1 ,2 ]
Nakamura, Hironobu [3 ]
Kano, Yoshinobu [4 ]
Eguchi, Yoko [1 ]
Kitazawa, Momoko [1 ]
Liang, Kuo-ching [1 ]
Kudo, Koki [1 ,5 ]
Sento, Ayako [1 ]
Takamiya, Akihiro [1 ]
Horigome, Toshiro [1 ]
Yamasaki, Toshihiko [6 ]
Sunami, Yuki [7 ]
Kikuchi, Toshiaki [1 ]
Nakajima, Kazuki [1 ]
Tomita, Masayuki [8 ]
Bun, Shogyoku [1 ,9 ]
Momota, Yuki [1 ]
Sawada, Kyosuke [1 ]
Murakami, Junichi [10 ]
Takahashi, Hidehiko [3 ]
Mimura, Masaru [1 ]
机构
[1] Keio Univ, Dept Neuropsychiat, Sch Med, Tokyo, Japan
[2] Keio Univ, Hills Joint Res Lab Future Prevent Med & Wellness, Sch Med, Tokyo, Japan
[3] Tokyo Med & Dent Univ, Grad Sch Med & Dent Sci, Dept Psychiat & Behav Sci, Tokyo, Japan
[4] Shizuoka Univ, Fac Informat, Shizuoka, Japan
[5] St Marianna Univ, Dept Neuropsychiat, Sch Med Hosp, Kawasaki, Japan
[6] Univ Tokyo, Grad Sch Informat Sci & Technol, Dept Informat & Commun Engn, Comp Vis & Media Lab,Yamasaki Lab, Tokyo, Japan
[7] Keio Univ, Sch Med, Tokyo, Japan
[8] Oizumi Hosp, Dept Psychiat, Tokyo, Japan
[9] Koutokukai Sato Hosp, Dept Psychiat, Yamagata, Japan
[10] Biwako Hosp, Dept Psychiat, Otsu, Japan
来源
FRONTIERS IN PSYCHIATRY | 2022年 / 13卷
基金
日本科学技术振兴机构;
关键词
language; psychiatric disorders; biomarker; machine learning; natural language processing (computer science); neurocognitive disorders; QUALITY-OF-LIFE; BIPOLAR DISORDER; PSYCHOSIS; SCHIZOPHRENIA; BIOMARKERS; IDENTIFICATION; DIAGNOSIS; NETWORK; PEOPLE;
D O I
10.3389/fpsyt.2022.954703
中图分类号
R749 [精神病学];
学科分类号
100205 ;
摘要
Introduction: Psychiatric disorders are diagnosed through observations of psychiatrists according to diagnostic criteria such as the DSM-5. Such observations, however, are mainly based on each psychiatrist's level of experience and often lack objectivity, potentially leading to disagreements among psychiatrists. In contrast, specific linguistic features can be observed in some psychiatric disorders, such as a loosening of associations in schizophrenia. Some studies explored biomarkers, but biomarkers have yet to be used in clinical practice. Aim: The purposes of this study are to create a large dataset of Japanese speech data labeled with detailed information on psychiatric disorders and neurocognitive disorders to quantify the linguistic features of those disorders using natural language processing and, finally, to develop objective and easy-to-use biomarkers for diagnosing and assessing the severity of them. Methods: This study will have a multi-center prospective design. The DSM-5 or ICD-11 criteria for major depressive disorder, bipolar disorder, schizophrenia, and anxiety disorder and for major and minor neurocognitive disorders will be regarded as the inclusion criteria for the psychiatric disorder samples. For the healthy subjects, the absence of a history of psychiatric disorders will be confirmed using the Mini-International Neuropsychiatric Interview (M.I.N.I.). The absence of current cognitive decline will be confirmed using the Mini-Mental State Examination (MMSE). A psychiatrist or psychologist will conduct 30-to-60-min interviews with each participant; these interviews will include free conversation, picture-description task, and story-telling task, all of which will be recorded using a microphone headset. In addition, the severity of disorders will be assessed using clinical rating scales. Data will be collected from each participant at least twice during the study period and up to a maximum of five times at an interval of at least one month. Discussion: This study is unique in its large sample size and the novelty of its method, and has potential for applications in many fields. We have some challenges regarding inter-rater reliability and the linguistic peculiarities of Japanese. As of September 2022, we have collected a total of > 1000 records from > 400 participants. To the best of our knowledge, this data sample is one of the largest in this field.
引用
收藏
页数:9
相关论文
共 50 条
  • [21] Automatic generation of conclusions from neuroradiology MRI reports through natural language processing
    Pilar López-Úbeda
    Teodoro Martín-Noguerol
    Jorge Escartín
    Antonio Luna
    Neuroradiology, 2024, 66 : 477 - 485
  • [22] Identifying individual expectations in service recovery through natural language processing and machine learning
    Liu, Yijiang
    Wan, Yinghong
    Su, Xiao
    EXPERT SYSTEMS WITH APPLICATIONS, 2019, 131 : 288 - 298
  • [23] Natural Language Processing of Clinical Notes in Biome Identifies Relationship Between Psychiatric Phenotype and Polygenic Risk Scores
    Charney, Alexendar
    BIOLOGICAL PSYCHIATRY, 2020, 87 (09) : S69 - S69
  • [24] A Natural Language Processing Approach to Understanding Context in the Extraction and GeoCoding of Historical Floods, Storms, and Adaptation Measures
    Lai, Kelvin
    Porter, Jeremy R.
    Amodeo, Mike
    Miller, David
    Marston, Michael
    Armal, Saman
    INFORMATION PROCESSING & MANAGEMENT, 2022, 59 (01)
  • [25] Natural Language Processing to Assess Documentation of Features of Critical Illness in Discharge Documents of Acute Respiratory Distress Syndrome Survivors
    Weissman, Gary E.
    Harhay, Michael O.
    Lugo, Ricardo M.
    Fuchs, Barry D.
    Halpern, Scott D.
    Mikkelsen, Mark E.
    ANNALS OF THE AMERICAN THORACIC SOCIETY, 2016, 13 (09) : 1538 - 1545
  • [26] Tourism Management Through Natural Language Processing and Sentiment Analysis. A Case Study of the Main Natural Areas of Extremadura, Spain
    Sanchez-Rivero, Marcelino
    Murillo-Gonzalez, Luis
    Rodriguez-Rangel, Maria Cristina
    TOURISM, 2025, 73 (01): : 169 - 185
  • [27] Maritime piracy and armed robbery analysis in the Straits of Malacca and Singapore through the utilization of natural language processing
    Fahreza, Muhammad I.
    Hirata, Enna
    MARITIME POLICY & MANAGEMENT, 2024,
  • [28] Automatic SDG budget tagging: Building public financial management capacity through natural language processing
    Guariso, Daniele
    Guerrero, Omar A.
    Castaneda, Gonzalo
    DATA & POLICY, 2023, 5
  • [29] Teaching Natural Language Processing through Big Data Text Summarization with Problem-Based Learning
    Li L.
    Geissinger J.
    Ingram W.A.
    Fox E.A.
    Data and Information Management, 2020, 4 (01) : 18 - 43
  • [30] Methodological Issues in Predicting Pediatric Epilepsy Surgery Candidates Through Natural Language Processing and Machine Learning
    Cohen, Kevin Bretonnel
    Glass, Benjamin
    Greiner, Hansel M.
    Holland-Bouley, Katherine
    Standridge, Shannon
    Arya, Ravindra
    Faist, Robert
    Morita, Diego
    Mangano, Francesco
    Connolly, Brian
    Glauser, Tracy
    Pestian, John
    BIOMEDICAL INFORMATICS INSIGHTS, 2016, 8 : 11 - 18