Feature normalization based on non-extensive statistics for speech recognition

被引:17
|
作者
Pardede, Hilman F. [1 ]
Iwano, Koji [2 ]
Shinoda, Koichi [1 ]
机构
[1] Tokyo Inst Technol, Dept Comp Sci, Grad Sch Informat Sci & Engn, Meguro Ku, Tokyo 1528552, Japan
[2] Tokyo City Univ, Fac Environm & Informat Studies, Tsuzuki Ku, Yokohama, Kanagawa 2248551, Japan
关键词
Robust speech recognition; Normalization; q-Logarithm; Non-extensive statistics; CROSS-TERMS; NOISE; MODEL; ENHANCEMENT; ENVIRONMENT; SPECTRA; ALGEBRA;
D O I
10.1016/j.specom.2013.02.004
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Most compensation methods to improve the robustness of speech recognition systems in noisy environments such as spectral subtraction, CMN, and MVN, rely on the fact that noise and speech spectra are independent. However, the use of limited window in signal processing may introduce a cross-term between them, which deteriorates the speech recognition accuracy. To tackle this problem, we introduce the q-logarithmic (q-log) spectral domain of non-extensive statistics and propose q-log spectral mean normalization (q-LSMN) which is an extension of log spectral mean normalization (LSMN) to this domain. The recognition experiments on a synthesized noisy speech database, the Aurora-2 database, showed that q-LSMN was consistently better than the conventional normalization methods, CMN, LSMN, and MVN. Furthermore, q-LSMN was even more effective when applied to a real noisy environment in the CEN-SREC-2 database. It significantly outperformed ETSI AFE front-end. (C) 2013 Elsevier B.V. All rights reserved.
引用
收藏
页码:587 / 599
页数:13
相关论文
共 50 条
  • [31] Non-extensive value-at-risk estimation during times of crisis
    Hajihasani, Ahmad
    Namaki, Ali
    Asadi, Nazanin
    Tehrani, Reza
    INTERNATIONAL JOURNAL OF MODERN PHYSICS C, 2021, 32 (07):
  • [32] On the Jointly Unsupervised Feature Vector Normalization and Acoustic Model Compensation for Robust Speech Recognition
    Buera, Luis
    Miguel, Antonio
    Lleida, Eduardo
    Saz, Oscar
    Ortega, Alfonso
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 1381 - 1384
  • [33] Bose-Einstein condensation of pions in proton-proton collisions at the Large Hadron Collider using non-extensive Tsallis statistics
    Deb, Suman
    Sahu, Dushmanta
    Sahoo, Raghunath
    Pradhan, Anil Kumar
    EUROPEAN PHYSICAL JOURNAL A, 2021, 57 (06)
  • [34] FEATURE EXTRACTION BASED ON HEARING SYSTEM SIGNAL PROCESSING FOR ROBUST LARGE VOCABULARY SPEECH RECOGNITION
    Li, Qi
    Sun, Xie
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1262 - 1265
  • [35] Normalization of the Speech Modulation Spectra for Robust Speech Recognition
    Xiao, Xiong
    Chng, Eng Siong
    Li, Haizhou
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2008, 16 (08): : 1662 - 1674
  • [36] Tsallis non-extensive statistics and multifractal analysis of the dynamics of a fully-depleted MOSFET nano-device
    Antoniades, I. P.
    Marinos, G.
    Karakatsanis, L. P.
    Pavlos, E. G.
    Stavrinides, S. G.
    Tassis, D.
    Pavlos, G. P.
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2019, 533
  • [37] Hadronization within the non-extensive approach and the evolution of the parameters
    Shen, Keming
    Barnafoldi, Gergely Gabor
    Biro, Tamas Sandor
    EUROPEAN PHYSICAL JOURNAL A, 2019, 55 (08)
  • [38] Non-extensive treatment of surface nucleation on glass particles
    Ferreira Nascimento, Marcio Luis
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2012, 391 (23) : 6077 - 6083
  • [39] Black Hole Thermodynamics and Generalised Non-Extensive Entropy
    Elizalde, Emilio
    Nojiri, Shin'ichi
    Odintsov, Sergei D.
    UNIVERSE, 2025, 11 (02)
  • [40] Hadron gas in the presence of a magnetic field using non-extensive statistics: a transition from diamagnetic to paramagnetic system
    Pradhan, Girija Sankar
    Sahu, Dushmanta
    Deb, Suman
    Sahoo, Raghunath
    JOURNAL OF PHYSICS G-NUCLEAR AND PARTICLE PHYSICS, 2023, 50 (05)