Time-varying sinusoidal demodulation for non-stationary modeling of speech

被引:1
作者
Sharma, Neeraj Kumar [1 ]
Sreenivas, Thippur V. [1 ]
机构
[1] Indian Inst Sci, Dept Elect Commun Engn, Bangalore 560012, Karnataka, India
关键词
Speech modeling; Sinusoidal modeling; Speech analysis; Speech synthesis; Harmonic demodulation; Subband modeling; INSTANTANEOUS-FREQUENCY; SIGNAL DECOMPOSITION; ENVELOPE; REPRESENTATIONS;
D O I
10.1016/j.specom.2018.10.008
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Speech signals contain a fairly rich time-evolving spectral content. Accurate analysis of this time-evolving spectrum is an open challenge in signal processing. Towards this, we visit time-varying sinusoidal modeling of speech and propose an alternate model estimation approach. The estimation operates on the whole signal without any short-time analysis. The approach proceeds by extracting the fundamental frequency sinusoid (FFS) from speech signal. The instantaneous amplitude (IA) of the FFS is used for voiced/unvoiced stream segregation. The voiced stream is then demodulated using a variant of in-phase and quadrature-phase demodulation carried at harmonics of the FFS. The result is a non-parametric time-varying sinusoidal representation, specifically, an additive mixture of quasi-harmonic sinusoids for voiced stream and a wideband mono-component sinusoid for unvoiced stream. The representation is evaluated for analysis-synthesis, and the bandwidth of IA and IF signals are found to be crucial in preserving the quality. Also, the obtained IA and IF signals are found to be carriers of perceived speech attributes, such as speaker characteristics and intelligibility. On comparing the proposed modeling framework with the existing approaches, which operate on short-time segments, improvement is found in simplicity of implementation, objective-scores, and computation time. The listening test scores suggest that the quality preserves naturalness but does not yet beat the state-of-the-art short-time analysis methods. In summary, the proposed representation lends itself for high resolution temporal analysis of non-stationary speech signals, and also allows quality preserving modification and synthesis.
引用
收藏
页码:77 / 91
页数:15
相关论文
共 41 条
[21]   A Comparison Study between Non-parameterized and Parameterized Time frequency Representation for Non-stationary Signals [J].
Yang, Y. ;
Qiu, X. L. ;
Peng, Z. K. .
2017 INTERNATIONAL CONFERENCE ON SENSING, DIAGNOSTICS, PROGNOSTICS, AND CONTROL (SDPC), 2017, :577-584
[22]   A fast approach of implementing the Fourier decomposition method for nonlinear and non-stationary time series analysis [J].
Kizilkaya, Aydin ;
Elbi, Mehmet Dogan .
SIGNAL PROCESSING, 2023, 206
[23]   A time-variant filtering approach for non-stationary random signals based on the fractional convolution [J].
Chinchilla, Lenin ;
Sierra, Daniel A. ;
Torres, Rafael .
SIGNAL PROCESSING, 2016, 119 :92-101
[24]   Instantaneous Frequency Estimation of Multi-Component Non- Stationary Signals using Fourier Bessel series and Time-Varying Auto Regressive Model [J].
Reddy, G. Ravi Shankar ;
Rao, Rameshwar .
INTERNATIONAL JOURNAL OF ELECTRONICS AND TELECOMMUNICATIONS, 2015, 61 (04) :365-376
[25]   An active noise control method of non-stationary noise under time-variant secondary path [J].
Chen, Bin ;
Guo, Rongrong ;
Yu, Shuyue ;
Yu, Yang .
MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2021, 149
[26]   A Tutorial Review on Time-Frequency Analysis of Non-Stationary Vibration Signals with Nonlinear Dynamics Applications [J].
Varanis, Marcus ;
Silva, Anderson L. ;
Balthazar, Jose M. ;
Pederiva, Robson .
BRAZILIAN JOURNAL OF PHYSICS, 2021, 51 (03) :859-877
[27]   Empirical Fourier decomposition: An accurate signal decomposition method for nonlinear and non-stationary time series analysis [J].
Zhou, Wei ;
Feng, Zhongren ;
Xu, Y. F. ;
Wang, Xiongjiang ;
Lv, Hao .
MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2022, 163
[28]   Modeling the time-varying microstructure of simulated sleep EEG spindles using time-frequency analysis methods [J].
Xanthopoulos, P. ;
Golemati, S. ;
Sakkalis, V. ;
Ktonas, P. Y. ;
Zervakis, M. ;
Soldatos, C. R. .
2006 28TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-15, 2006, :1571-+
[29]   Refining the time-frequency characteristic of non-stationary signal for improving time-frequency representation under variable speeds [J].
Liu, Yi ;
Xiang, Hang ;
Jiang, Zhansi ;
Xiang, Jiawei .
SCIENTIFIC REPORTS, 2023, 13 (01)
[30]   Recognition of signals with time-varying spectrum using time-frequency transformation with non-uniform sampling [J].
Swiercz, Ewa .
2018 22ND INTERNATIONAL MICROWAVE AND RADAR CONFERENCE (MIKON 2018), 2018, :140-144