STATE MAPPING FOR CROSS-LANGUAGE SPEAKER ADAPTATION IN TTS

被引:12
|
作者
Chen, Yi-Ning [1 ]
Jiao, Yang [1 ]
Qian, Yao [1 ]
Soong, Frank K. [1 ]
机构
[1] Microsoft Res Asia, Beijing, Peoples R China
来源
2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS | 2009年
关键词
HMM-based TTS; Speaker adaptation; Cross language; Kullback-Leibler divergence;
D O I
10.1109/ICASSP.2009.4960573
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Cross-language speaker adaptation has many interesting applications, e.g. speech-to-speech translation. However, in cross-language speaker adaptation, a common phoneme set, assumed to be used by different speakers of the same language, does not exist any longer. Instead, a nearest neighbor based phoneme mapping from one language to the other has been adopted. In this study, we used our recently proposed sub-phonemic HMM state mapping for cross-language adaptations. The sub-phonemic HMM states, due to their phonetic segment nature, tend to be more sharable across different languages than phonemes. Kullback-Leibler divergence, an information-theoretic measure, is chosen here to measure the similarity between given states in different languages. Experimental results show that new state mapping outperforms the phoneme mapping baseline system in terms of three objective measures: log spectral distance, F0 adaptation error and F0 correlations. In comparing with intra-language adaptation, the cross-language result of the new algorithm is also fairly decent.
引用
收藏
页码:4273 / 4276
页数:4
相关论文
共 50 条
  • [41] Cross-language activation of phonology in young bilingual readers
    Jared, Debra
    Cormier, Pierre
    Levy, Betty Ann
    Wade-Woolley, Lesly
    READING AND WRITING, 2012, 25 (06) : 1327 - 1343
  • [42] A cross-language personalized recommendation model in digital libraries
    Lai, Yuangen
    Zeng, Jianxun
    ELECTRONIC LIBRARY, 2013, 31 (03) : 264 - 277
  • [43] Cross-language predictors of consonant-vowel syllables
    Ember, M
    Ember, CR
    AMERICAN ANTHROPOLOGIST, 1999, 101 (04) : 730 - 742
  • [44] High-Performance Cross-Language Interoperability in a Multi-language Runtime
    Grimmer, Matthias
    Seaton, Chris
    Schatz, Roland
    Wurthinger, Thomas
    Moessenboeck, Hanspeter
    ACM SIGPLAN NOTICES, 2016, 51 (02) : 78 - 90
  • [45] A cross-language study of verbal and visuospatial working memory span
    Chen, Zen-Yong
    Cowell, Patricia E.
    Varley, Rosemary
    Wang, Yi-Ching
    JOURNAL OF CLINICAL AND EXPERIMENTAL NEUROPSYCHOLOGY, 2009, 31 (04) : 385 - 391
  • [46] Cross-Language Plagiarism Detection Model Based On Multiple Features
    Liu, Gang
    Dong, Yichao
    Li, Guangxi
    26TH IEEE SYMPOSIUM ON COMPUTERS AND COMMUNICATIONS (IEEE ISCC 2021), 2021,
  • [47] Self-Supervised Cross-Language Scene Text Editing
    Yang, Fuxiang
    Su, Tonghua
    Zhou, Xiang
    Di, Donglin
    Wang, Zhongjie
    Li, Songze
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 4546 - 4554
  • [48] Cross-language Sentiment Classification Based on Support Vector Machine
    Ma, Hongxia
    Zhang, Yangsen
    Du, Zhenlei
    2015 11TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION (ICNC), 2015, : 507 - 513
  • [49] Clustering synonymous English and Chinese keywords for cross-language queries
    Chen, Rung-Ching
    Huang, Chung-Yi
    Huang, Yu-Len
    PROCEEDINGS OF 2007 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2007, : 1875 - +
  • [50] A Cross-Language Name Binding Recognition and Discrimination Approach for Identifiers
    Ju, Yue
    Tang, Yixuan
    Lan, Jinpeng
    Mi, Xiangbo
    Zhang, Jingxuan
    2023 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE ANALYSIS, EVOLUTION AND REENGINEERING, SANER, 2023, : 948 - 955