Generating fundamental frequency contours for speech synthesis in Yoruba

被引：0

作者：

van Niekerk, Daniel R. ^{[1
]}

Barnard, Etienne ^{[2
]}

机构：

[1] North West Univ, Ctr Text Technol, Potchefstroom, South Africa

[2] North West Univ, Multilingual Speech Technol, Vanderbijlpark, South Africa

来源：

14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5 | 2013年

关键词：

speech synthesis; text-to-speech; fundamental frequency; tone language; under-resourced; Yoruba; HTS; TONE;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We present methods for modelling and synthesising fundamental frequency (F-0) contours suitable for application in text to-speech (TTS) synthesis of Yoffiba (an African tone language). These methods are discussed and compared with a baseline approach using the HMM-based speech synthesis system HTS. Evaluation is done by comparing ten-fold cross validation squared errors on a small corpus of four speakers. We show that the proposed methods are relatively effective at modelling and generating F-0 contours in this context, achieving lower error rates than the baseline. These results suggest that our methods will be useful for the generation of improved synthesis of tone in African languages, which has been a challenge to date.

引用

页码：1026 / 1030

页数：5

共 50 条

[1] Modeling of Fundamental Frequency Contours for HMM-based Speech Synthesis Representation of fundamental frequency contours for statistical speech synthesis
Hirose, Keikichi
PROCEEDINGS OF 2016 IEEE 13TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP 2016), 2016, : 171 - 176
[2] ANALYSIS OF FUNDAMENTAL FREQUENCY CONTOURS IN SPEECH
LEVITT, H
RABINER, LR
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1971, 49 (02): : 569 - &
[3] A dynamical system model for generating fundamental frequency for speech synthesis
Ross, KN
Ostendorf, M
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1999, 7 (03): : 295 - 309
[4] CHARACTERIZATION OF FUNDAMENTAL-FREQUENCY CONTOURS OF SPEECH
MAEDA, S
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1974, 56 : S33 - S33
[5] The role of fundamental frequency contours in the perception of speech against interfering speech
Binns, Christine
Culling, John F.
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2007, 122 (03): : 1765 - 1776
[6] Generation of Fundamental Frequency Contours for Thai Speech Synthesis using Tone Nucleus Model
Krityakien, Oraphan
Hirose, Keikichi
Minematsu, Nobuaki
14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 1036 - 1040
[7] The Effect of Fundamental Frequency on the Intelligibility of Speech With Flattened Intonation Contours
Watson, Peter J.
Schlauch, Robert S.
AMERICAN JOURNAL OF SPEECH-LANGUAGE PATHOLOGY, 2008, 17 (04) : 348 - 355
[8] Improved Automatic Extraction of Generation Process Model Commands and Its use for Generating Fundamental Frequency Contours for Training HMM-based Speech Synthesis
Hashimoto, Hiroya
Hirose, Keikichi
Minematsu, Nobuaki
13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 458 - 461
[9] Quantitative and structural modeling of voice fundamental frequency contours of speech in Mandarin
Ni, Jinfu
Hirose, Keikichi
SPEECH COMMUNICATION, 2006, 48 (08) : 989 - 1008
[10] Use of Generation Process Model for Synthesizing Fundamental Frequency Contours in HMM-based Speech Synthesis
Hirose, Keikichi
Hashimoto, Hiroya
Ikeshima, Jun
Minematsu, Nobuaki
PROCEEDINGS OF 2012 IEEE 11TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP) VOLS 1-3, 2012, : 575 - +

← 1 2 3 4 5 →