Time-varying sinusoidal demodulation for non-stationary modeling of speech

被引:1
|
作者
Sharma, Neeraj Kumar [1 ]
Sreenivas, Thippur V. [1 ]
机构
[1] Indian Inst Sci, Dept Elect Commun Engn, Bangalore 560012, Karnataka, India
关键词
Speech modeling; Sinusoidal modeling; Speech analysis; Speech synthesis; Harmonic demodulation; Subband modeling; INSTANTANEOUS-FREQUENCY; SIGNAL DECOMPOSITION; ENVELOPE; REPRESENTATIONS;
D O I
10.1016/j.specom.2018.10.008
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Speech signals contain a fairly rich time-evolving spectral content. Accurate analysis of this time-evolving spectrum is an open challenge in signal processing. Towards this, we visit time-varying sinusoidal modeling of speech and propose an alternate model estimation approach. The estimation operates on the whole signal without any short-time analysis. The approach proceeds by extracting the fundamental frequency sinusoid (FFS) from speech signal. The instantaneous amplitude (IA) of the FFS is used for voiced/unvoiced stream segregation. The voiced stream is then demodulated using a variant of in-phase and quadrature-phase demodulation carried at harmonics of the FFS. The result is a non-parametric time-varying sinusoidal representation, specifically, an additive mixture of quasi-harmonic sinusoids for voiced stream and a wideband mono-component sinusoid for unvoiced stream. The representation is evaluated for analysis-synthesis, and the bandwidth of IA and IF signals are found to be crucial in preserving the quality. Also, the obtained IA and IF signals are found to be carriers of perceived speech attributes, such as speaker characteristics and intelligibility. On comparing the proposed modeling framework with the existing approaches, which operate on short-time segments, improvement is found in simplicity of implementation, objective-scores, and computation time. The listening test scores suggest that the quality preserves naturalness but does not yet beat the state-of-the-art short-time analysis methods. In summary, the proposed representation lends itself for high resolution temporal analysis of non-stationary speech signals, and also allows quality preserving modification and synthesis.
引用
收藏
页码:77 / 91
页数:15
相关论文
共 50 条
  • [1] Time-varying modeling of a non-stationary signal
    Al-Shoshan, AI
    10TH INTERNATIONAL CONFERENCE ON COMPUTER APPLICATIONS IN INDUSTRY AND ENGINEERING, 1997, : 179 - 182
  • [2] Time-varying bispectral analysis of non-stationary signals
    Akan, A
    Artan, RBÜ
    SEVENTH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND ITS APPLICATIONS, VOL 1, PROCEEDINGS, 2003, : 569 - 572
  • [3] Non-stationary structural model with time-varying demand elasticities
    Kim, Kun Ho
    Zhou, Zhou
    Wu, Wei Biao
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2010, 140 (12) : 3809 - 3819
  • [4] Determination of time-varying means for non-stationary typhoon winds
    Xie, Bo
    Luo, Xiaoqun
    Zhang, Qilin
    Ding, Jiemin
    Fu, Shenghui
    JOURNAL OF BUILDING ENGINEERING, 2025, 99
  • [5] Derivation of time-varying mean for non-stationary downburst winds
    Su, Yanwen
    Huang, Guoqing
    Xu, You-lin
    JOURNAL OF WIND ENGINEERING AND INDUSTRIAL AERODYNAMICS, 2015, 141 : 39 - 48
  • [6] Time-varying power spectra and coherences of non-stationary typhoon winds
    Huang, Zifeng
    Xu, You-Lin
    Tao, Tianyou
    Zhan, Sheng
    Journal of Wind Engineering and Industrial Aerodynamics, 2020, 198
  • [7] Optimal channel equalization for time-varying channels with non-stationary noises
    Badran, EF
    Gu, GX
    PROCEEDINGS OF THE 46TH IEEE INTERNATIONAL MIDWEST SYMPOSIUM ON CIRCUITS & SYSTEMS, VOLS 1-3, 2003, : 576 - 579
  • [8] Time-varying power spectra and coherences of non-stationary typhoon winds
    Huang, Zifeng
    Xu, You-Lin
    Tao, Tianyou
    Zhan, Sheng
    JOURNAL OF WIND ENGINEERING AND INDUSTRIAL AERODYNAMICS, 2020, 198
  • [9] Direct estimation of multiple time-varying frequencies of non-stationary signals
    Samanta, Anik Kumar
    Routray, Aurobinda
    Khare, Swanand R.
    Naha, Arunava
    SIGNAL PROCESSING, 2020, 169
  • [10] Adaptive synchrosqueezing transform with a time-varying parameter for non-stationary signal separation
    Li, Lin
    Cai, Haiyan
    Jiang, Qingtang
    APPLIED AND COMPUTATIONAL HARMONIC ANALYSIS, 2020, 49 (03) : 1075 - 1106