Dimensionality Reduction of Modulation Frequency Features for Speech Discrimination

被引:0
|
作者
Markaki, Maria [1 ]
Stylianou, Yannis [1 ]
机构
[1] Univ Crete, Dept Comp Sci, Khania, Greece
关键词
modulation spectrum; multilinear algebra; feature selection; mutual information; speech discrimination;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We describe a dimensionality reduction method for modulation spectral features, which keeps the time-varying information of interest to the classification task. Due to the varying degrees of redundancy and discriminative power of the acoustic and modulation frequency subspaces, we first employ a generalization of SVD to tensors (Higher Order SVD) to reduce dimensions. Projection of modulation spectral features on the principal axes with the higher energy in each subspace results in a compact feature set. We further estimate the relevance of these projections to speech discrimination based on mutual information to the target class. Reconstruction of modulation spectrograms from the "best" 22 features back to the initial dimensions, shows that modulation spectral features close to syllable and phoneme rates as well as pitch values of speakers are preserved.
引用
收藏
页码:646 / 649
页数:4
相关论文
共 50 条
  • [1] Discrimination of speech from nonspeeech in broadcast news based on modulation frequency features
    Markaki, Maria
    Stylianou, Yannis
    SPEECH COMMUNICATION, 2011, 53 (05) : 726 - 735
  • [2] Dimensionality Reduction for Speech Emotion Features by Multiscale Kernels
    Xu, Xinzhou
    Deng, Jun
    Zheng, Wenming
    Zhao, Li
    Schuller, Bjoern
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 1532 - 1536
  • [3] Dimensionality Reduction of Speech Features using Nonlinear Principal Components Analysis
    Zahorian, Stephen A.
    Singh, Tara
    Hu, Hongbing
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 281 - +
  • [4] Modulation frequency features for phoneme recognition in noisy speech
    Ganapathy, Sriram
    Thomas, Samuel
    Hermansky, Hynek
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2009, 125 (01): : EL8 - EL12
  • [5] Features Dimensionality Reduction andMulti Dimensional Voice Processing Program to Parkinson Disease Discrimination
    Meghraoui, D.
    Boudraa, B.
    Meksen, T. Merazi
    Boudraa, M.
    2016 4TH INTERNATIONAL CONFERENCE ON CONTROL ENGINEERING & INFORMATION TECHNOLOGY (CEIT), 2016,
  • [6] Improved Frequency Modulation Features for Multichannel Distant Speech Recognition
    Rodomagoulakis, Isidoros
    Maragos, Petros
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2019, 13 (04) : 841 - 849
  • [7] Dimensionality Reduction for Emotional Speech Recognition
    Fewzee, Pouria
    Karray, Fakhri
    PROCEEDINGS OF 2012 ASE/IEEE INTERNATIONAL CONFERENCE ON PRIVACY, SECURITY, RISK AND TRUST AND 2012 ASE/IEEE INTERNATIONAL CONFERENCE ON SOCIAL COMPUTING (SOCIALCOM/PASSAT 2012), 2012, : 532 - 537
  • [8] Time-Frequency Features Extraction for Infant Directed Speech Discrimination
    Mahdhaoui, Ammar
    Chetouani, Mohamed
    Kessous, Loic
    ADVANCES IN NONLINEAR SPEECH PROCESSING, 2010, 5933 : 120 - +
  • [9] FREQUENCY MODULATION OF SPEECH
    SILBIGER, HR
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1964, 36 (10): : 2001 - &
  • [10] A Dimensionality Reduction Framework for Automatic Speech Recognition
    ElMoudden, Ismail
    ElBernoussi, Souad
    Benyacoub, Badreddine
    INNOVATION MANAGEMENT AND SUSTAINABLE ECONOMIC COMPETITIVE ADVANTAGE: FROM REGIONAL DEVELOPMENT TO GLOBAL GROWTH, VOLS I - VI, 2015, 2015, : 2602 - 2608