Automated classification of primary progressive aphasia subtypes from narrative speech transcripts

被引:161
作者
Fraser, Kathleen C. [1 ]
Meltzer, Jed A. [2 ]
Graham, Naida L. [3 ,4 ]
Leonard, Carol [5 ]
Hirst, Graeme [1 ]
Black, Sandra E. [6 ,7 ]
Rochon, Elizabeth [3 ,4 ]
机构
[1] Univ Toronto, Dept Comp Sci, Toronto, ON M5S 3A6, Canada
[2] Baycrest Ctr Geriatr Care, Rotman Res Inst, Toronto, ON, Canada
[3] Univ Toronto, Dept Speech Language Pathol, Toronto, ON M5S 3A6, Canada
[4] Toronto Rehabil Inst, Toronto, ON, Canada
[5] Univ Ottawa, Sch Rehabil Sci, Ottawa, ON, Canada
[6] Sunnybrook Hlth Sci Ctr, LC Campbell Cognit Neurol Res Unit, Toronto, ON M4N 3M5, Canada
[7] Univ Toronto, Dept Med Neurol, Toronto, ON M5S 3A6, Canada
基金
加拿大自然科学与工程研究理事会; 加拿大健康研究院;
关键词
Semantic dementia; Progressive nonfluent aphasia; Narrative speech; Natural language processing; Machine learning; MILD COGNITIVE IMPAIRMENT; SEMANTIC DEMENTIA; LANGUAGE; REGRESSION; DETERIORATION; IMAGEABILITY; ACQUISITION; FAMILIARITY; COMPLEXITY; FREQUENCY;
D O I
10.1016/j.cortex.2012.12.006
中图分类号
B84 [心理学]; C [社会科学总论]; Q98 [人类学];
学科分类号
03 ; 0303 ; 030303 ; 04 ; 0402 ;
摘要
In the early stages of neurodegenerative disorders, individuals may exhibit a decline in language abilities that is difficult to quantify with standardized tests. Careful analysis of connected speech can provide valuable information about a patient's language capacities. To date, this type of analysis has been limited by its time-consuming nature. In this study, we present a method for evaluating and classifying connected speech in primary progressive aphasia using computational techniques. Syntactic and semantic features were automatically extracted from transcriptions of narrative speech for three groups: semantic dementia (SD), progressive nonfluent aphasia (PNFA), and healthy controls. Features that varied significantly between the groups were used to train machine learning classifiers, which were then tested on held-out data. We achieved accuracies well above baseline on the three binary classification tasks. An analysis of the influential features showed that in contrast with controls, both patient groups tended to use words which were higher in frequency (especially nouns for SD, and verbs for PNFA). The SD patients also tended to use words (especially nouns) that were higher in familiarity, and they produced fewer nouns, but more demonstratives and adverbs, than controls. The speech of the PNFA group tended to be slower and incorporate shorter words than controls. The patient groups were distinguished from each other by the SD patients' relatively increased use of words which are high in frequency and/or familiarity. (C) 2012 Elsevier Ltd. All rights reserved.
引用
收藏
页码:43 / 60
页数:18
相关论文
共 67 条
[1]  
[Anonymous], P 41 M ASS COMP LING
[2]  
[Anonymous], CLASSIC CASES NEUROP
[3]  
[Anonymous], PART SPEECH TAGGING
[4]  
[Anonymous], 2008, Introduction to information retrieval
[5]  
[Anonymous], P 2003 C N AM CHAPT
[6]  
[Anonymous], PARTIAL LEAST SQUARE
[7]   Trying to tell a tale - Discourse impairments in progressive aphasia and frontotemporal dementia [J].
Ash, S ;
Moore, P ;
Antani, S ;
McCawley, G ;
Work, M ;
Grossman, M .
NEUROLOGY, 2006, 66 (09) :1405-1413
[8]   Speech errors in progressive non-fluent aphasia [J].
Ash, Sharon ;
McMillan, Corey ;
Gunawardena, Delani ;
Avants, Brian ;
Morgan, Brianna ;
Khan, Alea ;
Moore, Peachie ;
Gee, James ;
Grossman, Murray .
BRAIN AND LANGUAGE, 2010, 113 (01) :13-20
[9]   Non-fluent speech in frontotemporal lobar degeneration [J].
Ash, Sharon ;
Moore, Peachie ;
Vesely, Luisa ;
Gunawardena, Delani ;
McMillan, Corey ;
Anderson, Chivon ;
Avants, Brian ;
Grossman, Murray .
JOURNAL OF NEUROLINGUISTICS, 2009, 22 (04) :370-383
[10]   Non-semantic aspects of language in semantic dementia: As normal as they're said to be? [J].
Benedet, M ;
Patterson, K ;
Gomez-Pastor, I ;
de la Rocha, MLG .
NEUROCASE, 2006, 12 (01) :15-26