Performance Evaluation of HMM-Based Style Classification with a Small Amount of Training Data

被引:0
|
作者
Tachibana, Makoto [1 ]
Kawashima, Keigo [1 ]
Yamagishi, Junichi [1 ]
Kobayashi, Takao [1 ]
机构
[1] Tokyo Inst Technol, Interdisciplinary Grad Sch Sci & Engn, Yokohama, Kanagawa 2268502, Japan
关键词
emotional speech; speaking style; speech emotion recognition; classification; MSD-HMM;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper describes a classification technique for emotional expressions and speaking styles of speech using only a small amount of training data of a target speaker. We model spectral and fundamental frequency (F0) features simultaneously using multi-space probability distribution HMM (MSD-HMM), and adapt a speaker-independent neutral style model to a certain target speaker's style model with a small amount of data using MSD-MLLR which is extended MLLR for MSD-HMM. We perform classification experiments for professional narrators' speech and non-professional speakers' speech and evaluate the performance of proposed technique by comparing with other commonly used classifiers. We show that the proposed technique gives better result than the other classifiers when using a few sentences of target speaker's style data.
引用
收藏
页码:569 / 572
页数:4
相关论文
共 50 条
  • [21] Discriminative training for HMM-based offline handwritten character recognition
    Nopsuwanchai, R
    Povey, D
    SEVENTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS I AND II, PROCEEDINGS, 2003, : 114 - 118
  • [22] Minimum generation error training for HMM-based speech synthesis
    Wu, Yi-Jian
    Wang, Ren-Hua
    2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 89 - 92
  • [23] Efficient Model Training for HMM-based Person Identification by Gait
    Aqmar, Muhammad Rasyid
    Shinoda, Koichi
    Furui, Sadaoki
    2012 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2012,
  • [24] HMM-BASED SPEECH SYNTHESIS ADAPTATION USING NOISY DATA: ANALYSIS AND EVALUATION METHODS
    Karhila, Reima
    Remes, Ulpu
    Kurimo, Mikko
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 6930 - 6934
  • [25] Evaluation of the Slovenian HMM-based speech synthesis system
    Vesnicer, B
    Mihelic, F
    TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2004, 3206 : 513 - 520
  • [26] An HMM-based Performance Diagnosis Approach for Hadoop Clusters
    Li, Jiacong
    Wang, Ying
    Yu, Jinke
    Guo, Shaoyong
    2016 18TH ASIA-PACIFIC NETWORK OPERATIONS AND MANAGEMENT SYMPOSIUM (APNOMS), 2016,
  • [27] SPEAKER-INDEPENDENT STYLE CONVERSION FOR HMM-BASED EXPRESSIVE SPEECH SYNTHESIS
    Kanagawa, Hiroki
    Nose, Takashi
    Kobayashi, Takao
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7864 - 7868
  • [28] HMM-based classification of seismic events recorded at Stromboli and Etna Volcanoes
    Gutierrez, L. A.
    Ramirez, J.
    Benitez, C.
    Ibanez, J. M.
    Almendros, J.
    Garcia-Yeguas, A.
    2006 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, VOLS 1-8, 2006, : 2765 - +
  • [29] Speaker and style adaptation using average voice model for style control in HMM-based speech synthesis
    Tachibana, Makoto
    Izawa, Shinsuke
    Nose, Takashi
    Kobayashi, Takao
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4633 - 4636
  • [30] Improvement of HMM-based Action Classification by using State Transition Probability
    Kitamura, Yuka
    Aruga, Haruki
    Hashimoto, Manabu
    TWELFTH INTERNATIONAL CONFERENCE ON QUALITY CONTROL BY ARTIFICIAL VISION, 2015, 9534