Speech modeling and processing by low-dimensional dynamic glottal models

被引：0

作者：

Drioli, Carlo ^{[1
]}

Calanca, Andrea ^{[1
]}

机构：

[1] Univ Udine, Dept Math & Comp Sci, I-33100 Udine, Italy

来源：

13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3 | 2012年

关键词：

speech synthesis; glottal modeling; speech coding; physical modeling;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We discuss the use of low-dimensional physical models of the voice source for speech coding and processing applications. A class of waveform-adaptive dynamic glottal models and parameter tracking procedures are illustrated. The model and analysis procedures are assessed by addressing signal transformations on recorded speech, achievable by fitting the model to the data, and then acting on the physically-oriented parameters of the voice source. The class of models proposed provides in principle a tool for both the estimation of glottal source signals, and the encoding of the speech signal for transformation purposes. The application of this model to time stretching and to frequency control (pitch shifting) is also illustrated. The experiments show that copy synthesis is perceptually almost indistinguishable form the target, and that time stretching and "pitch extrapolation" effects can be obtained by simple control strategies.

引用

页码：1606 / 1609

页数：4

共 8 条

[1] ARX-LF-BASED SOURCE-FILTER METHODS FOR VOICE MODIFICATION AND TRANSFORMATION [J].

Agiomyrgiannakis, Yannis ;

Rosec, Olivier .

2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, :3589-3592

[2] A flow waveform-matched low-dimensional glottal model based on physical knowledge [J].

Drioli, C .

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2005, 117 (05) :3184-3195

[3]

Drioli C, 2003, P STOCKH MUS AC C SM, P377

[4]

Drioli C., 2003, P 3 INT WORKSH MOD A

[5] Robust glottal source estimation based on joint source-filter model optimization [J].

Fu, Q ;

Murphy, P .

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (02) :492-501

[6]

Jinachitra P, 2007, INT CONF ACOUST SPEE, P281

[7] HMM-Based Speech Synthesis Utilizing Glottal Inverse Filtering [J].

Raitio, Tuomo ;

Suni, Antti ;

Yamagishi, Junichi ;

Pulakka, Hannu ;

Nurminen, Jani ;

Vainio, Martti ;

Alku, Paavo .

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (01) :153-165

[8]

Schroeter J., 1991, Advances in speech signal processing, P231

← 1 →