ROBUST AND FAST VOWEL RECOGNITION USING OPTIMUM-PATH FOREST

被引：5

作者：

Papa, Joao P. ^{[1
]}

Marana, Aparecido N. ^{[1
]}

Spadotto, Andre A. ^{[2
]}

Guido, Rodrigo C. ^{[2
]}

Falcao, Alexandre X. ^{[3
]}

机构：

[1] Sao Paulo State Univ, Dept Comp Sci, Sao Paulo, Brazil

[2] Univ Fed Sao Paulo, Phys Inst Sao Carlos, Sao Carlos, SP, Brazil

[3] Univ Estadual Campinas, Inst Comp, Campinas, SP, Brazil

来源：

2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING | 2010年

关键词：

Speech recognition; Neural networks; Pattern recognition; Signal classification;

D O I：

10.1109/ICASSP.2010.5495695

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

The applications of Automatic Vowel Recognition (AVR), which is a sub-part of fundamental importance in most of the speech processing systems, vary from automatic interpretation of spoken language to biometrics. State-of-the-art systems for AVR are based on traditional machine learning models such as Artificial Neural Networks (ANNs) and Support Vector Machines (SVMs), however, such classifiers can not deal with efficiency and effectiveness at the same time, existing a gap to be explored when real-time processing is required. In this work, we present an algorithm for AVR based on the Optimum-Path Forest (OPF), which is an emergent pattern recognition technique recently introduced in literature. Adopting a supervised training procedure and using speech tags from two public datasets, we observed that OPF has outperformed ANNs, SVMs, plus other classifiers, in terms of training time and accuracy.

引用

页码：2190 / 2193

页数：4

共 50 条

[31] A Robust Approach for Gender Recognition using Deep Learning
Arora, Shefali
Bhatia, M. P. S.
2018 9TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND NETWORKING TECHNOLOGIES (ICCCNT), 2018,
[32] Robust speech recognition using time boundary detection
Mohajer, K
Hu, ZM
MULTISENSOR, MULTISOURCE INFORMATION FUSION: ARCHITECTURES, ALGORITHMS, AND APPLICATIONS 2003, 2003, 5099 : 335 - 343
[33] Robust speech recognition using probabilistic union models
Ming, J
Jancovic, P
Smith, FJ
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2002, 10 (06): : 403 - 414
[34] Robust speech recognition by using compensated acoustic scores
Sato, S
Onoe, K
Kobayashi, A
Imai, T
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2006, E89D (03): : 915 - 921
[35] Fast Likelihood Computation in Speech Recognition using Matrices
Gajjar, Mrugesh R.
Sreenivas, T. V.
Govindarajan, R.
JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2013, 70 (02): : 219 - 234
[36] Fast Likelihood Computation in Speech Recognition using Matrices
Mrugesh R. Gajjar
T. V. Sreenivas
R. Govindarajan
Journal of Signal Processing Systems, 2013, 70 : 219 - 234
[37] Ultrasonic Sensor Signals and Optimum Path Forest Classifier for the Microstructural Characterization of Thermally-Aged Inconel 625 Alloy
de Albuquerque, Victor Hugo C.
Barbosa, Cleisson V.
Silva, Cleiton C.
Moura, Elineudo P.
Reboucas Filho, Pedro P.
Papa, Joao P.
Tavares, Joao Manuel R. S.
SENSORS, 2015, 15 (06) : 12474 - 12497
[38] Speaker-Independent Vowel Recognition for Malay Children Using Time-Delay Neural Network
Yong, B. F.
Ting, H. N.
5TH KUALA LUMPUR INTERNATIONAL CONFERENCE ON BIOMEDICAL ENGINEERING 2011 (BIOMED 2011), 2011, 35 : 565 - 568
[39] An Efficient Noise-Robust Automatic Speech Recognition System using Artificial Neural Networks
Gupta, Santosh
Bhurchandi, Kishor M.
Keskar, Avinash G.
2016 INTERNATIONAL CONFERENCE ON COMMUNICATION AND SIGNAL PROCESSING (ICCSP), VOL. 1, 2016, : 1873 - 1877
[40] Robust face recognition using generalized neural reflectance model
Siu-Yeung Cho
Tommy W. S. Chow
Neural Computing & Applications, 2006, 15 : 170 - 182

← 1 2 3 4 5 →