Voice recognition based on MFCC, SBC and Spectrograms

Cited by: 4
Authors
Martinez Mascorro, Guillermo Arturo [1]
Aguilar Torres, Gualberto [2]
Affiliations
[1] Inst Politecn Nacl, Ciencias Ingn Microelect, Mexico City, DF, Mexico
[2] Inst Politecn Nacl, Secc Estudios Posgrad & Invest, ESIME Culhuacan, Mexico City, DF, Mexico
Source
INGENIUS-REVISTA DE CIENCIA Y TECNOLOGIA | 2013 / Issue 10
Keywords
Speech recognition with voice changes; Mel Frequency Cepstral Coefficients; Subband-Based Cepstral Parameters; Spectrogram; Support Vector Machine
DOI
10.17163/ings.n10.2013.02
Chinese Library Classification
T [Industrial Technology];
Discipline classification code
08;
Abstract
One of the problems for Automatic Speech Recognition systems is change in a speaker's voice. A person's voice can change voluntarily or involuntarily, and the changes can be natural or artificial; in these cases the system can become confused. This paper proposes a recognition system with parallel identification using three different algorithms: MFCC, SBC and Spectrogram. With a Support Vector Machine as the classifier, each algorithm yields a group of persons with the highest likelihood and, after an evaluation, the final result is obtained. The aim of this paper is to take advantage of the three algorithms.
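The abstract does not say how the three candidate groups are evaluated. As a rough illustration only, the Python sketch below assumes a simple rule: each feature type contributes its top-k speakers, the speaker appearing in the most groups wins, and ties are broken by summed score. The function names, the speaker labels, and the scores are all hypothetical and are not taken from the paper; in practice the scores would come from one SVM per feature type.

```python
from collections import Counter


def top_k(scores, k):
    """Return the k speaker labels with the highest scores, best first."""
    ranked = sorted(scores.items(), key=lambda kv: kv[1], reverse=True)
    return [name for name, _ in ranked[:k]]


def fuse_candidates(scores_by_feature, k=3):
    """Pick one speaker from the per-feature candidate groups.

    Assumed rule (not from the paper): count how many candidate groups
    each speaker appears in; break ties by the speaker's summed score
    across all feature types.
    """
    votes = Counter()
    for scores in scores_by_feature.values():
        votes.update(top_k(scores, k))

    def total(name):
        return sum(s.get(name, 0.0) for s in scores_by_feature.values())

    return max(votes, key=lambda name: (votes[name], total(name)))


# Hypothetical per-speaker likelihoods, one dict per feature type.
example_scores = {
    "mfcc": {"ana": 0.50, "luis": 0.30, "eva": 0.15, "raul": 0.05},
    "sbc": {"ana": 0.20, "luis": 0.45, "eva": 0.05, "raul": 0.30},
    "spectrogram": {"ana": 0.40, "luis": 0.10, "eva": 0.35, "raul": 0.15},
}

winner = fuse_candidates(example_scores, k=2)
print(winner)
```

Here "ana" and "luis" each appear in two of the three candidate groups, so the summed score decides between them. Other evaluation rules, such as weighting each feature type by its validation accuracy, would fit the same structure.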
Pages: 12-20
Page count: 9