Bionic optimization of MFCC features based on speaker fast recognition

被引：7

作者：

Lin, Zhaodong ^{[1
]}

Di, Changan ^{[1
]}

Chen, Xiong ^{[1
]}

机构：

[1] Nanjing Univ Sci & Technol, Sch Mech Engn, Nanjing, Jiangsu, Peoples R China

来源：

APPLIED ACOUSTICS | 2021年 / 173卷

关键词：

Adaptive endpoint detection; Bionic auditory curve; Improved Mel; Recognition filter; Voice signal;

D O I：

10.1016/j.apacoust.2020.107682

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Surrounded by low SNR, how to make the voice faster and better recognize the owner has become a heated research topic. The human auditory system can accurately acquire the characteristics of acoustic events in complex systems or low SNR noise environment, which is of significance in the research of bionic hearing of human ear. The response curve of human ear output is obtained by bionic technology, which is the best response curve for sound enhancement to modify Mel filter. The method of adaptive threshold selection is used to integrate Mel features to realize the reduction and dynamic extraction of low SNR speech features. This method not only can resist the disadvantages of poor robustness and complexity of parameter model, but also obtain dynamic and comprehensive speech information of different speakers in different scenes. Finally, the improved CNN and I-vector system are contributed to reduce the dimension of the data and to verify the recognition, so as to achieve the optimal frequency selective amplification and simplification of the acoustic signal. In the case of SNR-5db, the model is reduced by 15% and the recognition accuracy is improved by 3%. (C) 2020 Elsevier Ltd. All rights reserved.

引用

页数：6

共 50 条

[1] Speaker gender recognition based on combining the contribution of MFCC and pitch features
Engineering Lab on Intelligent Perception for Internet of Things, Shenzhen Graduate School, Peking University, Shenzhen 518055, Guangdong, China
Huazhong Ligong Daxue Xuebao, 2013, SUPPL.I (108-111+120):
[2] LDA combination of pitch and MFCC features in speaker recognition
Harrag, A
Mohamadi, T
Serignat, JF
INDICON 2005 Proceedings, 2005, : 237 - 240
[3] The speaker recognition system based on the dynamic MFCC
Dong, Zhi-Feng
Wang, Zeng-Fu
Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2005, 18 (05): : 596 - 601
[4] Speaker recognition system using MFCC features and vector quantization
Wang, Wei
Deng, Huiwen
Yi Qi Yi Biao Xue Bao/Chinese Journal of Scientific Instrument, 2006, 27 (SUPPL.): : 2253 - 2255
[5] Speaker Recognition Based on Dynamic MFCC Parameters
Wang Yutai
Li Bo
Jiang Xiaoqing
Liu Feng
Wang Lihao
PROCEEDINGS OF 2009 INTERNATIONAL CONFERENCE ON IMAGE ANALYSIS AND SIGNAL PROCESSING, 2009, : 406 - 409
[6] Exploration of Feature Reduction of MFCC Spectral Features in Speaker Recognition
Kumar, Mohit
Katti, Sachin
Das, Pradip K.
ADVANCED COMPUTING AND COMMUNICATION TECHNOLOGIES, 2016, 452 : 151 - 159
[7] Analysis of Throat Microphone Using MFCC Features for Speaker Recognition
Visalakshi, R.
Dhanalakshmi, P.
Palanivel, S.
COMPUTATIONAL INTELLIGENCE, CYBER SECURITY AND COMPUTATIONAL MODELS, ICC3 2015, 2016, 412 : 35 - 41
[8] Linear discriminant analysis F-ratio for optimization of TESPAR & MFCC features for speaker recognition
DSP Group, Jawaharlal Nehru Technological University, Hyderabad, India
J. Multimedia, 2007, 6 (34-43):
[9] Speaker Recognition Based on MFCC and BP Neural Networks
Wang, Yi
Lawlor, Bob
2017 28TH IRISH SIGNALS AND SYSTEMS CONFERENCE (ISSC), 2017,
[10] The Research of Feature Extraction Based on MFCC for Speaker Recognition
Zhang Wanli
Li Guoxin
2013 3RD INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT), 2013, : 1074 - 1077

← 1 2 3 4 5 →