Time-varying sinusoidal demodulation for non-stationary modeling of speech

Cited by: 1

Authors:

Sharma, Neeraj Kumar [1]

Sreenivas, Thippur V. [1]

Affiliations:

[1] Indian Inst Sci, Dept Elect Commun Engn, Bangalore 560012, Karnataka, India

Source:

SPEECH COMMUNICATION | 2018, Vol. 105

Keywords:

Speech modeling; Sinusoidal modeling; Speech analysis; Speech synthesis; Harmonic demodulation; Subband modeling; Instantaneous frequency; Signal decomposition; Envelope; Representations

DOI:

10.1016/j.specom.2018.10.008

Chinese Library Classification (CLC) number:

O42 [Acoustics]

Subject classification codes:

070206; 082403

Abstract:

Speech signals contain a fairly rich time-evolving spectral content. Accurate analysis of this time-evolving spectrum is an open challenge in signal processing. Towards this, we revisit time-varying sinusoidal modeling of speech and propose an alternative model estimation approach. The estimation operates on the whole signal without any short-time analysis. The approach proceeds by extracting the fundamental frequency sinusoid (FFS) from the speech signal. The instantaneous amplitude (IA) of the FFS is used for voiced/unvoiced stream segregation. The voiced stream is then demodulated using a variant of in-phase and quadrature-phase demodulation carried out at harmonics of the FFS. The result is a non-parametric time-varying sinusoidal representation, specifically, an additive mixture of quasi-harmonic sinusoids for the voiced stream and a wideband mono-component sinusoid for the unvoiced stream. The representation is evaluated for analysis-synthesis, and the bandwidths of the IA and instantaneous frequency (IF) signals are found to be crucial in preserving the quality. Also, the obtained IA and IF signals are found to be carriers of perceived speech attributes, such as speaker characteristics and intelligibility. On comparing the proposed modeling framework with the existing approaches, which operate on short-time segments, improvement is found in simplicity of implementation, objective scores, and computation time. The listening test scores suggest that the synthesized speech preserves naturalness but does not yet beat the state-of-the-art short-time analysis methods. In summary, the proposed representation lends itself to high-resolution temporal analysis of non-stationary speech signals, and also allows quality-preserving modification and synthesis.
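
As a rough illustration of the demodulation step the abstract describes, the Python sketch below applies plain in-phase/quadrature (I/Q) demodulation at harmonics of a fundamental-frequency phase track and then resynthesizes the signal from the estimated IA and phase signals. It is not the authors' estimator, only a generic instance of the idea under stated assumptions: the FFS phase is taken as known from the construction of a synthetic signal rather than extracted from speech, voiced/unvoiced segregation is omitted, and the function names, the windowed-sinc low-pass filter, its 40 Hz cutoff, and its 401 taps are all hypothetical choices made for this example.

```python
import numpy as np


def lowpass_fir(cutoff_hz, fs, num_taps=401):
    """Windowed-sinc (Hamming) low-pass FIR filter with unit DC gain."""
    n = np.arange(num_taps) - (num_taps - 1) / 2
    h = np.sinc(2.0 * cutoff_hz / fs * n) * np.hamming(num_taps)
    return h / h.sum()


def iq_demodulate(x, ffs_phase, harmonic, h):
    """Estimate IA and phase deviation of one harmonic by I/Q demodulation."""
    # Shift the k-th harmonic to baseband using a carrier at k times the FFS phase.
    baseband = x * np.exp(-1j * harmonic * ffs_phase)
    # Low-pass both branches: real part is in-phase, imaginary part is quadrature.
    z = np.convolve(baseband, h, mode="same")
    return 2.0 * np.abs(z), np.angle(z)


# Synthetic "voiced" signal: three harmonics of a slowly varying fundamental.
fs = 8000
t = np.arange(int(0.5 * fs)) / fs
f0 = 120.0 + 20.0 * np.sin(2 * np.pi * 2.0 * t)   # Hz, assumed test contour
ffs_phase = 2 * np.pi * np.cumsum(f0) / fs        # assumed known here
true_ia = {
    1: np.ones_like(t),
    2: 0.5 * (1 + 0.3 * np.sin(2 * np.pi * 3.0 * t)),
    3: 0.25 * np.ones_like(t),
}
x = sum(a * np.cos(k * ffs_phase) for k, a in true_ia.items())

# Demodulate each harmonic and resynthesize from its IA and phase.
h = lowpass_fir(cutoff_hz=40.0, fs=fs)
est_ia = {}
x_hat = np.zeros_like(x)
for k in true_ia:
    ia, phase_dev = iq_demodulate(x, ffs_phase, k, h)
    est_ia[k] = ia
    x_hat += ia * np.cos(k * ffs_phase + phase_dev)

# Compare away from the edges, where the zero-padded filter is unreliable.
interior = slice(400, -400)
rel_err = np.sqrt(np.mean((x - x_hat)[interior] ** 2) / np.mean(x[interior] ** 2))
print(f"relative RMS resynthesis error: {rel_err:.4f}")
```

The low-pass cutoff in this sketch plays the role of the IA/IF bandwidth mentioned in the abstract: it must be wide enough to pass the modulation of each harmonic yet narrow enough to reject the neighbouring harmonics, which sit roughly one fundamental frequency away after the shift to baseband.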

Pages: 77-91

Page count: 15
