Feature normalization based on non-extensive statistics for speech recognition

被引:17
|
作者
Pardede, Hilman F. [1 ]
Iwano, Koji [2 ]
Shinoda, Koichi [1 ]
机构
[1] Tokyo Inst Technol, Dept Comp Sci, Grad Sch Informat Sci & Engn, Meguro Ku, Tokyo 1528552, Japan
[2] Tokyo City Univ, Fac Environm & Informat Studies, Tsuzuki Ku, Yokohama, Kanagawa 2248551, Japan
关键词
Robust speech recognition; Normalization; q-Logarithm; Non-extensive statistics; CROSS-TERMS; NOISE; MODEL; ENHANCEMENT; ENVIRONMENT; SPECTRA; ALGEBRA;
D O I
10.1016/j.specom.2013.02.004
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Most compensation methods to improve the robustness of speech recognition systems in noisy environments such as spectral subtraction, CMN, and MVN, rely on the fact that noise and speech spectra are independent. However, the use of limited window in signal processing may introduce a cross-term between them, which deteriorates the speech recognition accuracy. To tackle this problem, we introduce the q-logarithmic (q-log) spectral domain of non-extensive statistics and propose q-log spectral mean normalization (q-LSMN) which is an extension of log spectral mean normalization (LSMN) to this domain. The recognition experiments on a synthesized noisy speech database, the Aurora-2 database, showed that q-LSMN was consistently better than the conventional normalization methods, CMN, LSMN, and MVN. Furthermore, q-LSMN was even more effective when applied to a real noisy environment in the CEN-SREC-2 database. It significantly outperformed ETSI AFE front-end. (C) 2013 Elsevier B.V. All rights reserved.
引用
收藏
页码:587 / 599
页数:13
相关论文
共 50 条
  • [1] Spectral Subtraction Based on Non-extensive Statistics for Speech Recognition
    Pardede, Hilman
    Iwano, Koji
    Shinoda, Koichi
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2013, E96D (08): : 1774 - 1782
  • [2] Generalized ensemble theory with non-extensive statistics
    Shen, Ke-Ming
    Zhang, Ben-Wei
    Wang, En-Ke
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2017, 487 : 215 - 224
  • [3] Fractal Structure and Non-Extensive Statistics
    Deppman, Airton
    Frederico, Tobias
    Megias, Eugenio
    Menezes, Debora P.
    ENTROPY, 2018, 20 (09)
  • [4] Non-extensive statistics, relativistic kinetic theory and fluid dynamics
    Biro, T. S.
    Molnar, E.
    EUROPEAN PHYSICAL JOURNAL A, 2012, 48 (11) : 1 - 11
  • [5] The statistics of mesoscopic systems and the physical interpretation of extensive and non-extensive entropies
    Anghel, D., V
    Parvan, A. S.
    JOURNAL OF PHYSICS A-MATHEMATICAL AND THEORETICAL, 2018, 51 (44)
  • [6] Quasicanonical Gibbs distribution and Tsallis non-extensive statistics
    Aringazin, AK
    Mazhitov, MI
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2003, 325 (3-4) : 409 - 425
  • [7] Non-extensive statistics in Au-Au collisions
    Costa, Juliana O.
    Aguiara, Isabelle
    Barauna, Jadna L.
    Megias, Eugenio
    Deppman, Airton
    da Silva, Tiago N.
    Menezes, Debora P.
    PHYSICS LETTERS B, 2024, 854
  • [8] On Real Time Q-Log-based Feature Normalization for Distant Speech Recognition
    Zilvan, Vicky
    Ni'mah, Iftitahu
    Yuliani, Asri R.
    Pardede, Hilman F.
    PROCEEDINGS OF 2016 INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY SYSTEMS AND INNOVATION (ICITSI), 2016,
  • [9] Non-extensive block entropy statistics of Cantor fractal sets
    Provata, A.
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2007, 381 : 148 - 154
  • [10] Nuclear modification factor using Tsallis non-extensive statistics
    Tripathy, Sushanta
    Bhattacharyya, Trambak
    Garg, Prakhar
    Kumar, Prateek
    Sahoo, Raghunath
    Cleymans, Jean
    EUROPEAN PHYSICAL JOURNAL A, 2016, 52 (09)