Speech Recognition using Deep Learning

Cited by: 1
Authors
Lakkhanawannakun, Phoemporn [1]
Noyunsan, Chaluemwut [1]
Affiliation
[1] Rajamangala Univ Technol Isan, Dept Comp Engn, Fac Engn, Khon Kaen Campus, Khon Kaen 40000, Thailand
Source
2019 34TH INTERNATIONAL TECHNICAL CONFERENCE ON CIRCUITS/SYSTEMS, COMPUTERS AND COMMUNICATIONS (ITC-CSCC 2019) | 2019
Keywords
Speech recognition; Deep learning; Artificial neural networks
DOI
10.1109/itc-cscc.2019.8793338
Chinese Library Classification
TP301 [Theory, methods]
Subject classification code
081202
Abstract
Presently, computers have already replaced a tremendous number of humans in many creative professions. The areas of Artificial Intelligence include Machine Learning, Natural Language Processing, Computer Vision and Robotics. Similarly, speech can be recognized by computers. Large audio or video files that run for many minutes often contain a variety of sounds, and this research aims to pick out the desired sound from such a large file. In this research, deep learning was used to classify speech. A Google corpus was used to train the model, which achieved an accuracy of 66.22%.
Pages: 514-517
Number of pages: 4