LSM-based unit pruning for concatenative speech synthesis

Authors
Bellegarda, Jerome R. [1]
Affiliation
[1] Apple Comp Inc, Speech & Language Technol, Cupertino, CA 95014 USA
Source
2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3 | 2007
Keywords
text-to-speech synthesis; unit selection; inventory pruning; outlier removal; unit redundancy management;
DOI
Not available
Chinese Library Classification (CLC) number
O42 [Acoustics];
Discipline classification code
070206; 082403;
Abstract
The level of quality that can be achieved in concatenative text-to-speech synthesis is primarily governed by the inventory of units used in unit selection. This has led to the collection of ever larger corpora in the quest for ever more natural synthetic speech. As operational considerations limit the size of the unit inventory, however, pruning is critical to removing any instances that prove either spurious or superfluous. This paper proposes a novel pruning strategy based on a data-driven feature extraction framework separately optimized for each unit type in the inventory. A single distinctiveness/redundancy measure can then address, in a consistent manner, the (traditionally separate) problems of outliers and redundant units. Experimental results underscore the viability of this approach for both moderate and aggressive inventory pruning.
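The abstract does not spell out the measure itself, so the following is only a minimal sketch of how one score could cover both outlier removal and redundancy management. It assumes that the instances of a single unit type have already been mapped to feature vectors, and it uses nearest-neighbor Euclidean distance as a stand-in for the paper's distinctiveness/redundancy measure. The function name `prune_unit_type` and both thresholds are hypothetical and are not taken from the paper.

```python
import math


def prune_unit_type(vectors, outlier_thresh, redundancy_thresh):
    """Return the indices of the instances kept for one unit type.

    Assumption (not from the paper): nearest-neighbor distance in the
    feature space serves as the single score. A very large value marks
    an outlier; a very small value marks a redundant instance.
    """
    n = len(vectors)

    def nearest_neighbor_distance(i):
        # Distance from instance i to its closest other instance.
        return min(math.dist(vectors[i], vectors[j]) for j in range(n) if j != i)

    kept = []
    for i in range(n):
        # Outlier: far away from every other instance of this unit type.
        if n > 1 and nearest_neighbor_distance(i) > outlier_thresh:
            continue
        # Redundant: nearly identical to an instance that is already kept.
        if any(math.dist(vectors[i], vectors[j]) < redundancy_thresh for j in kept):
            continue
        kept.append(i)
    return kept


# Toy data: instances 0 and 1 are near-duplicates, instance 5 is far from the rest.
toy_vectors = [(0.0, 0.0), (0.01, 0.0), (1.0, 0.0), (0.0, 1.0), (1.0, 1.0), (100.0, 100.0)]
print(prune_unit_type(toy_vectors, outlier_thresh=10.0, redundancy_thresh=0.1))
# [0, 2, 3, 4]
```

In this sketch, lowering `outlier_thresh` or raising `redundancy_thresh` prunes more aggressively, which loosely mirrors the moderate and aggressive pruning regimes mentioned in the abstract.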
Pages: 521-524
Page count: 4