Performance Evaluation of HMM-Based Style Classification with a Small Amount of Training Data

被引:0
|
作者
Tachibana, Makoto [1 ]
Kawashima, Keigo [1 ]
Yamagishi, Junichi [1 ]
Kobayashi, Takao [1 ]
机构
[1] Tokyo Inst Technol, Interdisciplinary Grad Sch Sci & Engn, Yokohama, Kanagawa 2268502, Japan
关键词
emotional speech; speaking style; speech emotion recognition; classification; MSD-HMM;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper describes a classification technique for emotional expressions and speaking styles of speech using only a small amount of training data of a target speaker. We model spectral and fundamental frequency (F0) features simultaneously using multi-space probability distribution HMM (MSD-HMM), and adapt a speaker-independent neutral style model to a certain target speaker's style model with a small amount of data using MSD-MLLR which is extended MLLR for MSD-HMM. We perform classification experiments for professional narrators' speech and non-professional speakers' speech and evaluate the performance of proposed technique by comparing with other commonly used classifiers. We show that the proposed technique gives better result than the other classifiers when using a few sentences of target speaker's style data.
引用
收藏
页码:569 / 572
页数:4
相关论文
共 50 条
  • [41] Initialization, training, and context-dependency in HMM-based formant tracking
    Toledano, DT
    Villardebó, JG
    Gómez, LH
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (02): : 511 - 523
  • [42] Speaker and Language Adaptive Training for HMM-Based Polyglot Speech Synthesis
    Zen, Heiga
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 410 - 413
  • [43] A training method of average voice model for HMM-based speech synthesis
    Yamagishi, J
    Tamura, M
    Masuko, T
    Tokuda, K
    Kobayashi, T
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2003, E86A (08) : 1956 - 1963
  • [44] TRAJECTORY TRAINING CONSIDERING GLOBAL VARIANCE FOR HMM-BASED SPEECH SYNTHESIS
    Toda, Tomoki
    Young, Steve
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 4025 - +
  • [45] Unsupervised training of an HMM-based self-organizing unit recognizer with applications to topic classification and keyword discovery
    Siu, Man-Hung
    Gish, Herbert
    Chan, Arthur
    Belfield, William
    Lowe, Steve
    COMPUTER SPEECH AND LANGUAGE, 2014, 28 (01): : 210 - 223
  • [46] HMM-based singing voice synthesis system using pitch-shifted pseudo training data
    Mase, Ayami
    Oura, Keiichiro
    Nankaku, Yoshihiko
    Tokuda, Keiichi
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 845 - 848
  • [47] An HMM-based approach for automatic detection and classification of duplicate bug reports
    Ebrahimi, Neda
    Trabelsi, Abdelaziz
    Islam, Md Shariful
    Hamou-Lhadj, Abdelwahab
    Khanmohammadi, Kobra
    INFORMATION AND SOFTWARE TECHNOLOGY, 2019, 113 : 98 - 109
  • [48] Determining Optimal Signal Features and Parameters for HMM-Based Emotion Classification
    Boeck, Ronald
    Huebner, David
    Wendemuth, Andreas
    MELECON 2010: THE 15TH IEEE MEDITERRANEAN ELECTROTECHNICAL CONFERENCE, 2010, : 1586 - 1590
  • [49] Implementation and Evaluation of an HMM-based Thai Speech Synthesis System
    Chomphan, Suphattharachai
    Kobayashi, Takao
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 173 - 176
  • [50] Synthesis and evaluation of conversational characteristics in HMM-based speech synthesis
    Andersson, Sebastian
    Yamagishi, Junichi
    Clark, Robert A. J.
    SPEECH COMMUNICATION, 2012, 54 (02) : 175 - 188