Analysis of Noise Reduction Techniques in Speech Recognition

被引:0
作者
Zheng, Bo [1 ]
Hu, Jinsong [1 ]
Zhang, Ge [2 ]
Wu, Yuling [2 ]
Deng, Jianshuang [3 ]
机构
[1] South China Univ Technol, Guangzhou, Peoples R China
[2] Guangzhou Power Supply Co Ltd, Power Dispatching & Controlling Ctr, Syst Operat Dept, Guangzhou, Peoples R China
[3] Guangzhou Kinth Network Technol Co Ltd, Guangzhou, Peoples R China
来源
PROCEEDINGS OF 2020 IEEE 4TH INFORMATION TECHNOLOGY, NETWORKING, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (ITNEC 2020) | 2020年
关键词
speech recognition; noise reduction; speech enhancement; signal processing; SPECTRAL SUBTRACTION;
D O I
10.1109/itnec48623.2020.9084906
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Compared with keyboard input, speech recognition has its strengths it is more in line with the natural communication of people and frees people's hands. In recent years, with the development of artificial intelligence, speech recognition technology has developed rapidly, and the recognition rate is generally in excess of 90%. However, in the case of environmental noise, recognition rates of current speech recognition products have been severely reduced, and most of them cannot work normally. How to enhance these voices with noise and restore the information of the original signal to the greatest extent is an urgent problem. This article summarizes some current mainstream noise reduction algorithms, and compares them by explaining principles of these algorithms. Finally, this article makes a brief prediction of the general development direction of the speech enhancement field in the future.
引用
收藏
页码:928 / 933
页数:6
相关论文
共 16 条
[11]  
Rabiner L., 1993, Fundamentals of Speech Recognition
[12]  
Sim BL, 1998, IEEE T SPEECH AUDI P, V6, P328, DOI 10.1109/89.701361
[13]  
Sovka P., 1995, RES REPORT
[14]   A SHORT-TIME OBJECTIVE INTELLIGIBILITY MEASURE FOR TIME-FREQUENCY WEIGHTED NOISY SPEECH [J].
Taal, Cees H. ;
Hendriks, Richard C. ;
Heusdens, Richard ;
Jensen, Jesper .
2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, :4214-4217
[15]   Speech Enhancement using Spectral Subtraction-type Algorithms: A Comparison and Simulation Study [J].
Upadhyay, Navneet ;
Karmakar, Abhijit .
ELEVENTH INTERNATIONAL CONFERENCE ON COMMUNICATION NETWORKS, ICCN 2015/INDIA ELEVENTH INTERNATIONAL CONFERENCE ON DATA MINING AND WAREHOUSING, ICDMW 2015/NDIA ELEVENTH INTERNATIONAL CONFERENCE ON IMAGE AND SIGNAL PROCESSING, ICISP 2015, 2015, 54 :574-584
[16]   A Regression Approach to Speech Enhancement Based on Deep Neural Networks [J].
Xu, Yong ;
Du, Jun ;
Dai, Li-Rong ;
Lee, Chin-Hui .
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2015, 23 (01) :7-19