AM-FM Estimation for Speech Based on a Time-Varying Sinusoidal Model

Cited by: 0

Authors:

Pantazis, Yannis

Rosec, Olivier

Stylianou, Yannis

Source:

INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5 | 2009

Keywords:

Sinusoidal modeling; AM-FM demodulation; Speech analysis; Speech reconstruction;

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

In this paper we present a method based on a time-varying sinusoidal model for robust and accurate estimation of amplitude and frequency modulations (AM-FM) in speech. The suggested approach has two main steps. First, speech is represented by a sinusoidal model with time-varying amplitudes. Specifically, the model uses a first-order time polynomial with complex coefficients to capture the instantaneous amplitude and frequency (phase) components. Next, the model parameters are updated using the previously estimated instantaneous phase information. The result is an iterative scheme for AM-FM decomposition of speech, which was validated on synthetic AM-FM signals and tested on the reconstruction of voiced speech signals, with the signal-to-error reconstruction ratio (SERR) as the measure. Compared to the standard sinusoidal representation, the suggested approach was found to improve the corresponding SERR by 47%, resulting in over 30 dB of SERR.
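
As a rough illustration of the first step described in the abstract, the Python sketch below fits a sinusoidal model whose components carry a first-order time polynomial with complex coefficients, and compares it with a stationary fit. It is not the authors' implementation: the function names, the complex-valued synthetic test signal, the frequency values, and the energy-ratio definition of SERR are all assumptions made for this example, and the iterative phase-based update is not shown.

```python
import numpy as np

def fit_sinusoids(x, t, freqs_hz, order=1):
    """Least-squares fit of x(t) ~ sum_k (a_k + t*b_k) * exp(j*2*pi*f_k*t).

    order=0 drops the t*b_k term (stationary amplitudes).
    Hypothetical helper, written for illustration only.
    """
    E = np.exp(2j * np.pi * np.outer(t, freqs_hz))      # shape (N, K)
    B = E if order == 0 else np.hstack([E, t[:, None] * E])
    coef, *_ = np.linalg.lstsq(B, x, rcond=None)
    K = len(freqs_hz)
    a = coef[:K]
    b = coef[K:] if order == 1 else np.zeros(K, dtype=complex)
    return a, b, B @ coef

def serr_db(x, x_hat):
    """Signal-to-error reconstruction ratio in dB (assumed energy-ratio form)."""
    err = x - x_hat
    return 10.0 * np.log10(np.sum(np.abs(x) ** 2) / np.sum(np.abs(err) ** 2))

# Synthetic complex AM-FM-like frame: 20 ms at 16 kHz, centered on t = 0.
fs = 16000.0
t = np.arange(-160, 161) / fs
x = (1.0 + 20.0 * t) * np.exp(2j * np.pi * 103.0 * t)   # true frequency 103 Hz

# Analyze with a slightly wrong frequency (100 Hz) to mimic estimation error.
a0, _, x_stat = fit_sinusoids(x, t, [100.0], order=0)
a1, b1, x_tv = fit_sinusoids(x, t, [100.0], order=1)

serr_stat = serr_db(x, x_stat)
serr_tv = serr_db(x, x_tv)
print(f"stationary fit: {serr_stat:.1f} dB, first-order fit: {serr_tv:.1f} dB")
```

Because the first-order basis contains the stationary basis, its least-squares residual energy cannot be larger, so the first-order fit never scores a lower SERR on the same frame under this definition.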

Pages: 112-115

Page count: 4
