Natural language processing methods are sensitive to sub-clinical linguistic differences in schizophrenia spectrum disorders

Cited by: 77
Authors
Tang, Sunny X. [1, 2, 3]
Kriz, Reno [4]
Cho, Sunghye [3]
Park, Suh Jung [2]
Harowitz, Jenna [2]
Gur, Raquel E. [2]
Bhati, Mahendra T. [2, 5]
Wolf, Daniel H. [2]
Sedoc, Joao [6]
Liberman, Mark Y. [3, 7]
Affiliations
[1] Zucker Hillside Hosp, Dept Psychiat, 75-59 263rd St, Glen Oaks, NY 11004 USA
[2] Univ Penn, Dept Psychiat, 3400 Spruce St,Gates Bldg, Philadelphia, PA 19104 USA
[3] Linguist Data Consortium, 3600 Market St,Suite 810, Philadelphia, PA 19104 USA
[4] Univ Penn, Dept Comp Sci, 3330 Walnut St,Levine Hall, Philadelphia, PA 19104 USA
[5] Stanford Univ, Dept Psychiat & Neurosurg, 401 Quarry Rd, Stanford, CA 94305 USA
[6] NYU, Dept Technol Operat & Stat, Kaufman Management Ctr, 44 West Fourth St, New York, NY USA
[7] Univ Penn, Dept Linguist, 3401-C Walnut St,Suite 300,C Wing, Philadelphia, PA 19104 USA
Source
NPJ SCHIZOPHRENIA | 2021 / Vol. 7 / Issue 01
Keywords
SPEECH; COMMUNICATION; THOUGHT; INDIVIDUALS; NARRATIVES; PSYCHOSIS;
DOI
10.1038/s41537-021-00154-3
Chinese Library Classification (CLC) number
R749 [Psychiatry];
Subject classification code
100205 ;
Abstract
Computerized natural language processing (NLP) allows for objective and sensitive detection of speech disturbance, a hallmark of schizophrenia spectrum disorders (SSD). We explored several methods for characterizing speech changes in SSD (n = 20) compared to healthy control (HC) participants (n = 11) and approached linguistic phenotyping on three levels: individual words, parts-of-speech (POS), and sentence-level coherence. NLP features were compared with a clinical gold standard, the Scale for the Assessment of Thought, Language and Communication (TLC). We utilized Bidirectional Encoder Representations from Transformers (BERT), a state-of-the-art embedding algorithm incorporating bidirectional context. Through the POS approach, we found that SSD used more pronouns but fewer adverbs, adjectives, and determiners (e.g., "the," "a"). Analysis of individual word usage was notable for more frequent use of first-person singular pronouns among individuals with SSD and first-person plural pronouns among HC. There was a striking increase in incomplete words among SSD. Sentence-level analysis using BERT reflected increased tangentiality among SSD with greater sentence embedding distances. The SSD sample had low speech disturbance on average and there was no difference in group means for TLC scores. However, NLP measures of language disturbance appear to be sensitive to these subclinical differences and showed greater ability to discriminate between HC and SSD than a model based on clinical ratings alone. These intriguing exploratory results from a small sample prompt further inquiry into NLP methods for characterizing language disturbance in SSD and suggest that NLP measures may yield clinically relevant and informative biomarkers.
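Illustrative sketch (not part of the original record): the abstract reports "greater sentence embedding distances" as a marker of tangentiality. One plausible way to quantify this, assuming each sentence has already been mapped to a fixed-length vector (for example by a BERT-based encoder), is to average the cosine distance between consecutive sentence vectors. The function names and the toy three-dimensional vectors below are hypothetical, and the article's actual distance measure and comparison scheme may differ.

```python
import math


def cosine_distance(u, v):
    """Return 1 - cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        raise ValueError("cosine distance is undefined for a zero vector")
    return 1.0 - dot / (norm_u * norm_v)


def mean_adjacent_distance(embeddings):
    """Average cosine distance between each sentence vector and the next one.

    `embeddings` is a list of sentence vectors in spoken order. Obtaining
    these vectors from a real encoder is outside the scope of this sketch.
    """
    if len(embeddings) < 2:
        raise ValueError("need at least two sentence embeddings")
    distances = [
        cosine_distance(a, b) for a, b in zip(embeddings, embeddings[1:])
    ]
    return sum(distances) / len(distances)


# Toy vectors standing in for sentence embeddings (assumed, not real data).
on_topic = [[1.0, 0.0, 0.0], [0.9, 0.1, 0.0], [0.8, 0.2, 0.0]]
drifting = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]

if __name__ == "__main__":
    print("on-topic mean distance:", mean_adjacent_distance(on_topic))
    print("drifting mean distance:", mean_adjacent_distance(drifting))
```

Under this toy setup, the sequence whose vectors change direction at every step yields a larger mean distance than the sequence that stays close to its starting direction, which is the intuition behind reading larger embedding distances as greater tangentiality.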
Pages: 8
Related papers
51 total
[1]   THOUGHT, LANGUAGE, AND COMMUNICATION DISORDERS .1. CLINICAL-ASSESSMENT, DEFINITION OF TERMS, AND EVALUATION OF THEIR RELIABILITY [J].
ANDREASEN, NC .
ARCHIVES OF GENERAL PSYCHIATRY, 1979, 36 (12) :1315-1321
[2]   THOUGHT, LANGUAGE, AND COMMUNICATION IN SCHIZOPHRENIA - DIAGNOSIS AND PROGNOSIS [J].
ANDREASEN, NC ;
GROVE, WM .
SCHIZOPHRENIA BULLETIN, 1986, 12 (03) :348-359
[3]   SCALE FOR THE ASSESSMENT OF THOUGHT, LANGUAGE, AND COMMUNICATION (TLC) [J].
ANDREASEN, NC .
SCHIZOPHRENIA BULLETIN, 1986, 12 (03) :473-482
[4]  
[Anonymous], 1994, AM PSYCHIATR ASSOC
[5]   Transcriber: Development and use of a tool for assisting speech corpora production [J].
Barras, C ;
Geoffrois, E ;
Wu, ZB ;
Liberman, M .
SPEECH COMMUNICATION, 2001, 33 (1-2) :5-22
[6]   Automated analysis of free speech predicts psychosis onset in high-risk youths [J].
Bedi G. ;
Carrillo F. ;
Cecchi G.A. ;
Slezak D.F. ;
Sigman M. ;
Mota N.B. ;
Ribeiro S. ;
Javitt D.C. ;
Copelli M. ;
Corcoran C.M. .
npj Schizophrenia, 1 (1)
[7]   Schizophrenia and second language acquisition [J].
Bersudsky, Y ;
Fine, J ;
Gorjaltsan, I ;
Chen, O ;
Walters, J .
PROGRESS IN NEURO-PSYCHOPHARMACOLOGY & BIOLOGICAL PSYCHIATRY, 2005, 29 (04) :535-542
[8]   Detecting relapse in youth with psychotic disorders utilizing patient-generated and patient-contributed digital data from Facebook [J].
Birnbaum, M. L. ;
Ernala, S. K. ;
Rizvi, A. F. ;
Arenare, E. ;
Van Meter, A. R. ;
De Choudhury, M. ;
Kane, J. M. .
NPJ SCHIZOPHRENIA, 2019, 5 (1)
[9]  
Bleuler E., 1950, Dementia praecox or the group of schizophrenias
[10]   Differential lexical correlates of social cognition and metacognition in schizophrenia; a study of spontaneously-generated life narratives [J].
Buck, Benjamin ;
Minor, Kyle S. ;
Lysaker, Paul H. .
COMPREHENSIVE PSYCHIATRY, 2015, 58 :138-145