Dimensionality Reduction of Modulation Frequency Features for Speech Discrimination

被引：0

作者：

Markaki, Maria ^{[1
]}

Stylianou, Yannis ^{[1
]}

机构：

[1] Univ Crete, Dept Comp Sci, Khania, Greece

来源：

INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5 | 2008年

关键词：

modulation spectrum; multilinear algebra; feature selection; mutual information; speech discrimination;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We describe a dimensionality reduction method for modulation spectral features, which keeps the time-varying information of interest to the classification task. Due to the varying degrees of redundancy and discriminative power of the acoustic and modulation frequency subspaces, we first employ a generalization of SVD to tensors (Higher Order SVD) to reduce dimensions. Projection of modulation spectral features on the principal axes with the higher energy in each subspace results in a compact feature set. We further estimate the relevance of these projections to speech discrimination based on mutual information to the target class. Reconstruction of modulation spectrograms from the "best" 22 features back to the initial dimensions, shows that modulation spectral features close to syllable and phoneme rates as well as pitch values of speakers are preserved.

引用

页码：646 / 649

页数：4

共 11 条

[1]

[Anonymous], 2005, Estimating mutual information and multi-information in large networks

[2]

[Anonymous], MODULATION TOOLBOX

[3] Joint acoustic and modulation frequency [J].