AM-FM Estimation for Speech Based on a Time-Varying Sinusoidal Model

Citations: 0
Authors
Pantazis, Yannis
Rosec, Olivier
Stylianou, Yannis
Affiliations
Source
INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5 | 2009
Keywords
Sinusoidal modeling; AM-FM demodulation; Speech analysis; Speech reconstruction;
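DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract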
In this paper we present a method based on a time-varying sinusoidal model for robust and accurate estimation of amplitude and frequency modulations (AM-FM) in speech. The suggested approach has two main steps. First, speech is represented by a sinusoidal model with time-varying amplitudes. Specifically, the model uses a first-order time polynomial with complex coefficients to capture the instantaneous amplitude and frequency (phase) components. Next, the model parameters are updated using the previously estimated instantaneous phase information. This yields an iterative scheme for the AM-FM decomposition of speech. The scheme was validated on synthetic AM-FM signals and tested on the reconstruction of voiced speech signals, with the signal-to-error reconstruction ratio (SERR) as the measure. Compared to the standard sinusoidal representation, the suggested approach was found to improve the corresponding SERR by 47%, resulting in over 30 dB of SERR.
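The first step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes the component frequencies are already known, fits each component with a complex amplitude plus a complex slope (a first-order time polynomial) by unweighted least squares, and omits the iterative phase-based update entirely. The function names `fit_sinusoids` and `serr_db`, the test signal, and the SERR formula (20 log10 of the ratio of the signal's standard deviation to that of the reconstruction error) are assumptions made for illustration.

```python
import numpy as np

def fit_sinusoids(x, t, freqs, time_varying=True):
    """Least-squares fit of a real signal with complex exponentials.

    time_varying=False: constant complex amplitude per component.
    time_varying=True:  amplitude a_k + t * b_k per component
                        (first-order time polynomial, complex coefficients).
    """
    # Use +f and -f so that a real-valued signal can be represented.
    f = np.concatenate([freqs, -np.asarray(freqs)])
    E = np.exp(2j * np.pi * np.outer(t, f))
    basis = np.hstack([E, t[:, None] * E]) if time_varying else E
    coeffs, *_ = np.linalg.lstsq(basis, x.astype(complex), rcond=None)
    return (basis @ coeffs).real

def serr_db(x, x_hat):
    """Signal-to-error reconstruction ratio in dB (assumed definition)."""
    return 20 * np.log10(np.std(x) / np.std(x - x_hat))

# Hypothetical test signal: a 200 Hz cosine whose amplitude varies
# linearly over a 30 ms analysis window centred at t = 0.
fs = 16000
t = np.arange(-240, 241) / fs
x = (1.0 + 20.0 * t) * np.cos(2 * np.pi * 200.0 * t)

serr_static = serr_db(x, fit_sinusoids(x, t, [200.0], time_varying=False))
serr_tv = serr_db(x, fit_sinusoids(x, t, [200.0], time_varying=True))
print(f"stationary amplitudes:   {serr_static:.1f} dB")
print(f"time-varying amplitudes: {serr_tv:.1f} dB")
```

Because the test signal lies exactly in the span of the time-varying basis, the second fit reconstructs it to numerical precision, while the constant-amplitude fit cannot follow the amplitude ramp. Real speech would not fit exactly, and the reported figures in the abstract come from the authors' full iterative method, not from this sketch.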
Pages: 112-115
Page count: 4