Performance Evaluation of HMM-Based Style Classification with a Small Amount of Training Data

被引:0
|
作者
Tachibana, Makoto [1 ]
Kawashima, Keigo [1 ]
Yamagishi, Junichi [1 ]
Kobayashi, Takao [1 ]
机构
[1] Tokyo Inst Technol, Interdisciplinary Grad Sch Sci & Engn, Yokohama, Kanagawa 2268502, Japan
关键词
emotional speech; speaking style; speech emotion recognition; classification; MSD-HMM;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper describes a classification technique for emotional expressions and speaking styles of speech using only a small amount of training data of a target speaker. We model spectral and fundamental frequency (F0) features simultaneously using multi-space probability distribution HMM (MSD-HMM), and adapt a speaker-independent neutral style model to a certain target speaker's style model with a small amount of data using MSD-MLLR which is extended MLLR for MSD-HMM. We perform classification experiments for professional narrators' speech and non-professional speakers' speech and evaluate the performance of proposed technique by comparing with other commonly used classifiers. We show that the proposed technique gives better result than the other classifiers when using a few sentences of target speaker's style data.
引用
收藏
页码:569 / 572
页数:4
相关论文
共 50 条
  • [1] Unsupervised training of an HMM-based Speech Recognizer for Topic Classification
    Gish, Herbert
    Siu, Man-hung
    Chan, Arthur
    Belfield, Bill
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 1895 - 1898
  • [2] SUPERVISED CLASSIFICATION OF ARRAY CGH DATA WITH HMM-BASED FEATURE SELECTION
    Daemen, Anneleen
    Gevaert, Olivier
    Leunen, Karin
    Legius, Eric
    Vergote, Ignace
    De Moor, Bart
    PACIFIC SYMPOSIUM ON BIOCOMPUTING 2009, 2009, : 468 - +
  • [3] Generation of synthetic training data for an HMM-based handwriting recognition system
    Varga, T
    Bunke, H
    SEVENTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS I AND II, PROCEEDINGS, 2003, : 618 - 622
  • [4] A Comparison of Speech Synthesis Systems Based on GPR, HMM, and DNN with a Small Amount of Training Data
    Koriyama, Tomoki
    Kobayashi, Takao
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 3496 - 3500
  • [5] EVALUATION OF HMM-BASED LAUGHTER SYNTHESIS
    Urbain, Jerome
    Cakmak, Huseyin
    Dutoit, Thierry
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7835 - 7839
  • [6] Chinese handwritten legal amount recognition with HMM-based approach
    Chi, Bingyu
    Chen, Youbin
    2013 12TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2013, : 778 - 782
  • [7] Discrete/Continuous Modelling of Speaking Style in HMM-based Speech Synthesis: Design and Evaluation
    Obin, Nicolas
    Lanchantin, Pierre
    Lacheret, Anne
    Rodet, Xavier
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2796 - +
  • [8] HMM-based IMU data processing for arm gesture classification and motion tracking
    Wang, Danping
    Wang, Jina
    Liu, Yang
    Meng, Xianming
    INTERNATIONAL JOURNAL OF MODELLING IDENTIFICATION AND CONTROL, 2023, 42 (01) : 54 - 63
  • [9] A style control technique for HMM-based expressive speech synthesis
    Nose, Takashi
    Yamagishi, Junichi
    Masuko, Takashi
    Kobayashi, Takao
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2007, E90D (09) : 1406 - 1413
  • [10] Normalized training for HMM-based visual speech recognition
    Nankaku, Yoshihiko
    Tokuda, Keiichi
    Kitamura, Tadashi
    Kobayashi, Takao
    ELECTRONICS AND COMMUNICATIONS IN JAPAN PART III-FUNDAMENTAL ELECTRONIC SCIENCE, 2006, 89 (11): : 40 - 50