Critical analysis of the impact of glottal features in the classification of clinical depression in speech

被引:136
作者
Moore, Elliot, II [1 ]
Clements, Mark A. [1 ]
Peifer, John W. [2 ]
Weisser, Lydia [3 ]
机构
[1] Georgia Inst Technol, Sch Elect & Comp Engn, Savannah, GA 31407 USA
[2] Georgia Inst Technol, Biomed Interact Technol Ctr, Savannah, GA 30308 USA
[3] Med Coll Georgia, Dept Psychiat & Hlth Behav, Augusta, GA 30912 USA
关键词
affect; depression; glottal; prosodics; vocal tract; voice source;
D O I
10.1109/TBME.2007.900562
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
The motivation for this work is in an attempt to rectify the current lack of objective tools for clinical analysis of emotional disorders. This study involves the examination of a large breadth of objectively measurable features for use in discriminating depressed speech. Analysis is based on features related to prosodics, the vocal tract, and parameters extracted directly from the glottal waveform. Discrimination of the depressed speech was based on a feature selection strategy utilizing the following combinations of feature domains: prosodic measures alone, prosodic and vocal tract measures, prosodic and glottal measures, and all three domains. The combination of glottal and prosodic features produced better discrimination overall than the combination of prosodic and vocal tract features. Analysis of discriminating feature sets used in the study reflect a clear indication that glottal descriptors are vital components of vocal affect analysis.
引用
收藏
页码:96 / 107
页数:12
相关论文
共 47 条
  • [1] ALKU P, 1996, P INT C SPOK LANG PR, V3, P1569
  • [2] An amplitude quotient based method to analyze changes in the shape of the glottal pulse in the regulation of vocal intensity
    Alku, Paavo
    Airas, Matti
    Bjorkner, Eva
    Sundberg, Johan
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2006, 120 (02) : 1052 - 1062
  • [3] [Anonymous], P IEEE INT C AC SPEE
  • [4] [Anonymous], 2000, Speech Processing and Synthesis Toolboxes
  • [5] BREZNITZ Z, 1992, J GEN PSYCHOL, V76, P235
  • [6] Conwell Y, 1995, Int Psychogeriatr, V7, P149, DOI 10.1017/S1041610295001943
  • [7] Emotion recognition in human-computer interaction
    Cowie, R
    Douglas-Cowie, E
    Tsapatsoulis, N
    Votsis, G
    Kollias, S
    Fellenz, W
    Taylor, JG
    [J]. IEEE SIGNAL PROCESSING MAGAZINE, 2001, 18 (01) : 32 - 80
  • [8] Cowie R, 1996, ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, P1989, DOI 10.1109/ICSLP.1996.608027
  • [9] ANALYSIS OF THE GLOTTAL EXCITATION OF EMOTIONALLY STYLED AND STRESSED SPEECH
    CUMMINGS, KE
    CLEMENTS, MA
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1995, 98 (01) : 88 - 98
  • [10] Dellaert F, 1996, ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, P1970, DOI 10.1109/ICSLP.1996.608022