Embedded Unit Selection Text-to-Speech Synthesis for Mobile Devices

被引:15
|
作者
Karabetsos, Sotiris [1 ]
Tsiakoulis, Pirros [1 ]
Chalamandaris, Aimilios [1 ]
Raptis, Spyros [1 ]
机构
[1] Inst Language & Speech Proc RC Athena, Dept Voice & Sound Technol, GR-15125 Athens, Greece
关键词
Embedded Speech Synthesis; Unit Selection; Text-to-Speech; Mobile Devices; Mobile Phones;
D O I
10.1109/TCE.2009.5174430
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Nowadays, unit selection based text-to-speech technology is the mainstream approach for near natural speech,synthesis systems. However, this is achieved at the expense of raised requirements in terms of computational resources. This work describes design and implementation approaches for the efficient integration of this technology in computational environments with limited resources, such as mobile devices, with no considerable speech quality degradation. In particular, the issues of database reduction, acoustic inventory compression and runtime computational load minimization are mainly addressed in this paper. Both objective and subjective assessments confirm the effectiveness of these approaches in terms of constructing a general purpose embedded unit selection TTS system and reducing the computational requirements while maintaining high speech quality(1).
引用
收藏
页码:613 / 621
页数:9
相关论文
共 50 条
  • [31] A Framework for Mixed-language Text-to-speech Synthesis
    Malcangi, Mario
    Grew, Philip
    PROCEEDINGS OF THE 8TH WSEAS INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE, MAN-MACHINE SYSTEMS AND CYBERNETICS (CIMMACS '09), 2009, : 151 - +
  • [32] On building phonetically and prosodically rich speech corpus for text-to-speech synthesis
    Matousek, Jindrich
    Romportl, Jan
    PROCEEDINGS OF THE SECOND IASTED INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE, 2006, : 442 - +
  • [33] Schwa-deletion in Hindi text-to-speech synthesis
    Narasimhan B.
    Sproat R.
    Kiraz G.
    International Journal of Speech Technology, 2004, 7 (4) : 319 - 333
  • [34] A statistical method for database reduction for embedded unit selection speech synthesis
    Tsiakoulis, Pirros
    Chalamandaris, Aimilios
    Karabetsos, Sotiris
    Raptis, Spyros
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4601 - 4604
  • [35] An efficient unit-selection method for embedded concatenative speech synthesis
    Gros, Jerneja Zganec
    Zganec, Mario
    INFORMACIJE MIDEM-JOURNAL OF MICROELECTRONICS ELECTRONIC COMPONENTS AND MATERIALS, 2007, 37 (03): : 158 - 164
  • [36] EXAMPLAR-BASED SPEECH WAVEFORM GENERATION FOR TEXT-TO-SPEECH
    Valentini-Botinhao, Cassia
    Watts, Oliver
    Espic, Felipe
    King, Simon
    2018 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2018), 2018, : 332 - 338
  • [37] Emotional Intelligence in Text-To-Speech Synthesis in Pali Language Using Fuzzy Logic
    Mache, Suhas
    Dabhade, Siddharth
    JOURNAL OF ADVANCED APPLIED SCIENTIFIC RESEARCH, 2024, 6 (03): : 179 - 192
  • [38] Enhancing the Quality of Nepali Text-to-Speech Systems
    Ghimire, Rupak Raj
    Bal, Bal Krishna
    CREATIVITY IN INTELLIGENT TECHNOLOGIES AND DATA SCIENCE, (CIT&DS), 2017, 754 : 187 - 197
  • [39] Design of a Yoruba Language Speech Corpus for the Purposes of Text-to-Speech (TTS) Synthesis
    Dagba, Theophile K.
    Aoga, John O. R.
    Fanou, Codjo C.
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2016, PT I, 2016, 9621 : 161 - 169
  • [40] ARM based implementation of Text-To-Speech (TTS) for real time Embedded System
    Rawoof, Abdul
    Kulesh
    Ray, Kailash Chandra
    2014 FIFTH INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING (ICSIP 2014), 2014, : 192 - 196