Non-linear techniques for robust speech recognition

被引:0
|
作者
Ge, Yubo [1 ]
Niu, Jing [1 ]
Ge, Lingnan [2 ]
Shirai, Katsuhiko [2 ]
机构
[1] Tsinghua Univ, Dept Math Sci, Beijing 100084, Peoples R China
[2] Waseda Univ, Sch Sci & Engn, Tokyo 1698555, Japan
关键词
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
An acoustic speech digital signal can be regarded as a random process repeatedly alternating stationary segments with non-stationary ones. However, the current features used in the mostly recognition system are drawn based linear model theory and are hardly to describe non-stationary character. Consequently, some syllables can not be distinguished in speech parameter space with dimensions as high as 50. This paper tries to develop several features to describe non-stationary measure, a trend degree and the character of continuously exchange between stationary pieces and non-stationary pieces with the help of statistical theory and non-linear random model, a type of doubly random time series model. Our experiments have shown the proposed feature to increase the recognition accurate and the ability of adaptation and self-organisation of the system.
引用
收藏
页码:134 / +
页数:3
相关论文
共 50 条
  • [21] Robust face recognition via non-linear correlation filter bank
    Taheri, Motahareh
    IET IMAGE PROCESSING, 2018, 12 (03) : 408 - 415
  • [22] Comparing linear and non-linear transformation of speech
    Mesbahi, Larbi
    Barreaud, Vincent
    Boeffard, Olivier
    PROCEEDINGS OF THE 9TH WSEAS INTERNATIONAL CONFERENCE ON SIGNALS, SPEECH AND IMAGE PROCESSING/9TH WSEAS INTERNATIONAL CONFERENCE ON MULTIMEDIA, INTERNET & VIDEO TECHNOLOGIES, 2009, : 68 - 73
  • [23] Optimal non-linear robust control for non-linear uncertain systems
    Haddad, WM
    Chellaboina, V
    Fausz, JL
    Leonessa, A
    INTERNATIONAL JOURNAL OF CONTROL, 2000, 73 (04) : 329 - 342
  • [24] Signal conditioning techniques for robust speech recognition
    Rahim, MG
    Juang, BH
    Chou, W
    Buhrke, E
    IEEE SIGNAL PROCESSING LETTERS, 1996, 3 (04) : 107 - 109
  • [25] Missing data techniques for robust speech recognition
    Cooke, M
    Morris, A
    Green, P
    1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 863 - 866
  • [26] Signal processing techniques for robust speech recognition
    Asano, Futoshi
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2008, E91D (03): : 393 - 401
  • [27] Non-Linear Predictive Vector Quantization of Feature Vectors for Distributed Speech Recognition
    Enrique Garcia, Jose
    Ortega, Alfonso
    Miguel, Antonio
    Lleida, Eduardo
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2378 - 2381
  • [28] Automatic Emotion Recognition in Compressed Speech Using Acoustic and Non-Linear Features
    Garcia, N.
    Vasquez-Correa, J. C.
    Arias-Londono, J. D.
    Vargas-Bonilla, J. F.
    Orozco-Arroyave, J. R.
    2015 20TH SYMPOSIUM ON SIGNAL PROCESSING, IMAGES AND COMPUTER VISION (STSIVA), 2015,
  • [29] Non-linear speech transition visualization
    Reinhard, K
    Niranjan, M
    FIFTH INTERNATIONAL CONFERENCE ON ARTIFICIAL NEURAL NETWORKS, 1997, (440): : 257 - 261
  • [30] A novel non-linear modifier for adaptive illumination normalization for robust face recognition
    Vishwakarma, Virendra P.
    Dalal, Sahil
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (17-18) : 11503 - 11529