Bionic optimization of MFCC features based on speaker fast recognition

被引:7
|
作者
Lin, Zhaodong [1 ]
Di, Changan [1 ]
Chen, Xiong [1 ]
机构
[1] Nanjing Univ Sci & Technol, Sch Mech Engn, Nanjing, Jiangsu, Peoples R China
关键词
Adaptive endpoint detection; Bionic auditory curve; Improved Mel; Recognition filter; Voice signal;
D O I
10.1016/j.apacoust.2020.107682
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Surrounded by low SNR, how to make the voice faster and better recognize the owner has become a heated research topic. The human auditory system can accurately acquire the characteristics of acoustic events in complex systems or low SNR noise environment, which is of significance in the research of bionic hearing of human ear. The response curve of human ear output is obtained by bionic technology, which is the best response curve for sound enhancement to modify Mel filter. The method of adaptive threshold selection is used to integrate Mel features to realize the reduction and dynamic extraction of low SNR speech features. This method not only can resist the disadvantages of poor robustness and complexity of parameter model, but also obtain dynamic and comprehensive speech information of different speakers in different scenes. Finally, the improved CNN and I-vector system are contributed to reduce the dimension of the data and to verify the recognition, so as to achieve the optimal frequency selective amplification and simplification of the acoustic signal. In the case of SNR-5db, the model is reduced by 15% and the recognition accuracy is improved by 3%. (C) 2020 Elsevier Ltd. All rights reserved.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Speaker gender recognition based on combining the contribution of MFCC and pitch features
    Engineering Lab on Intelligent Perception for Internet of Things, Shenzhen Graduate School, Peking University, Shenzhen 518055, Guangdong, China
    Huazhong Ligong Daxue Xuebao, 2013, SUPPL.I (108-111+120):
  • [2] LDA combination of pitch and MFCC features in speaker recognition
    Harrag, A
    Mohamadi, T
    Serignat, JF
    INDICON 2005 Proceedings, 2005, : 237 - 240
  • [3] The speaker recognition system based on the dynamic MFCC
    Dong, Zhi-Feng
    Wang, Zeng-Fu
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2005, 18 (05): : 596 - 601
  • [4] Speaker recognition system using MFCC features and vector quantization
    Wang, Wei
    Deng, Huiwen
    Yi Qi Yi Biao Xue Bao/Chinese Journal of Scientific Instrument, 2006, 27 (SUPPL.): : 2253 - 2255
  • [5] Speaker Recognition Based on Dynamic MFCC Parameters
    Wang Yutai
    Li Bo
    Jiang Xiaoqing
    Liu Feng
    Wang Lihao
    PROCEEDINGS OF 2009 INTERNATIONAL CONFERENCE ON IMAGE ANALYSIS AND SIGNAL PROCESSING, 2009, : 406 - 409
  • [6] Exploration of Feature Reduction of MFCC Spectral Features in Speaker Recognition
    Kumar, Mohit
    Katti, Sachin
    Das, Pradip K.
    ADVANCED COMPUTING AND COMMUNICATION TECHNOLOGIES, 2016, 452 : 151 - 159
  • [7] Analysis of Throat Microphone Using MFCC Features for Speaker Recognition
    Visalakshi, R.
    Dhanalakshmi, P.
    Palanivel, S.
    COMPUTATIONAL INTELLIGENCE, CYBER SECURITY AND COMPUTATIONAL MODELS, ICC3 2015, 2016, 412 : 35 - 41
  • [8] Linear discriminant analysis F-ratio for optimization of TESPAR & MFCC features for speaker recognition
    DSP Group, Jawaharlal Nehru Technological University, Hyderabad, India
    J. Multimedia, 2007, 6 (34-43):
  • [9] Speaker Recognition Based on MFCC and BP Neural Networks
    Wang, Yi
    Lawlor, Bob
    2017 28TH IRISH SIGNALS AND SYSTEMS CONFERENCE (ISSC), 2017,
  • [10] The Research of Feature Extraction Based on MFCC for Speaker Recognition
    Zhang Wanli
    Li Guoxin
    2013 3RD INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT), 2013, : 1074 - 1077