Spectral-envelope-group-delay models for transients

被引：0

作者：

Shenoy, Ravi R. ^{[1
,2
]}

Seelamantula, Chandra Sekhar ^{[1
]}

机构：

[1] Indian Inst Sci, Dept Elect Engn, Bangalore 560012, Karnataka, India

[2] Nokia India Pvt Ltd, Nokia Res Ctr, Bangalore 560103, Karnataka, India

来源：

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA | 2013年 / 133卷 / 05期

关键词：

INSTANTANEOUS FREQUENCY; SIGNAL; AMPLITUDE; MODULATIONS; SEPARATION;

D O I：

10.1121/1.4798580

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Transient signals such as plosives in speech or Castanets in audio do not have a specific modulation or periodic structure in time domain. However, in the spectral domain they exhibit a prominent modulation structure, which is a direct consequence of their narrow time localization. Based on this observation, a spectral-domain AM-FM model for transients is proposed. The spectral AM-FM model is built starting from real spectral zero-crossings. The AM and FM correspond to the spectral envelope (SE) and group delay (GD), respectively. Taking into account the modulation structure and spectral continuity, a local polynomial regression technique is proposed to estimate the GD function from the real spectral zeros. The SE is estimated based on the phase function computed from the estimated GD. Since the GD estimation is parametric, the degree of smoothness can be controlled directly. Simulation results based on synthetic transient signals generated using a beta density function are presented to analyze the noise-robustness of the SEGD model. Three specific applications are considered: (1) SEGD based modeling of Castanet sounds; (2) appropriateness of the model for transient compression; and (3) determining glottal closure instants in speech using a short-time SEGD model of the linear prediction residue. (C) 2013 Acoustical Society of America. [http://dx.doi.org/10.1121/1.4798580]

引用

页码：2788 / 2802

页数：15

共 23 条

[1] Spectral and Temporal Envelope Cues for Human and Automatic Speech Recognition in Noise
Hu, Guangxin
Determan, Sarah C.
Dong, Yue
Beeve, Alec T.
Collins, Joshua E.
Gai, Yan
JARO-JOURNAL OF THE ASSOCIATION FOR RESEARCH IN OTOLARYNGOLOGY, 2020, 21 (01): : 73 - 87
[2] Importance of temporal-envelope speech cues in different spectral regions
Ardoint, Marine
Agus, Trevor
Sheft, Stanley
Lorenzi, Christian
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2011, 130 (02) : EL115 - EL121
[3] Group Delay Based Methods for Speaker Segregation and its Application in Multimedia Information Retrieval
Nathwani, Karan
Pandit, Pranav
Hegde, Rajesh M.
IEEE TRANSACTIONS ON MULTIMEDIA, 2013, 15 (06) : 1326 - 1339
[4] Instantaneous frequency and group delay of a filtered signal
Cohen, L
JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2000, 337 (04): : 329 - 346
[5] Sparse optimization for nonlinear group delay mode estimation
Liang, Hao
Ding, Xinghao
Jakobsson, Andreas
Tu, Xiaotong
Huang, Yue
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2022, 152 (04) : 2187 - 2203
[6] Methodology of elementary negative group delay active topologies identification
Ravelo, Blaise
IET CIRCUITS DEVICES & SYSTEMS, 2013, 7 (03) : 105 - 113
[7] An instantaneous frequency and group delay based feature for classifying EEG signals
Khan, Nabeel Ali
Ali, Sadiq
Choi, Kwonhue
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2021, 67 (67)
[8] Fault diagnosis of reciprocating compressor using Teager-Kaiser energy operator and envelope spectral feature extraction
Hou, Chin-Che
Pan, Min-Chun
ADVANCES IN MECHANICAL ENGINEERING, 2024, 16 (03)
[9] X-band negative group-delay lossy stub line
Ravelo, Blaise
IET MICROWAVES ANTENNAS & PROPAGATION, 2018, 12 (01) : 137 - 143
[10] Arbitrary Prescribed Wideband Flat Group Delay Circuits Using Coupled Lines
Chaudhary, Girdhari
Jeong, Yongchae
IEEE TRANSACTIONS ON MICROWAVE THEORY AND TECHNIQUES, 2018, 66 (04) : 1885 - 1894

← 1 2 3 →