ROBUST AND FAST VOWEL RECOGNITION USING OPTIMUM-PATH FOREST

被引：5

作者：

Papa, Joao P. ^{[1
]}

Marana, Aparecido N. ^{[1
]}

Spadotto, Andre A. ^{[2
]}

Guido, Rodrigo C. ^{[2
]}

Falcao, Alexandre X. ^{[3
]}

机构：

[1] Sao Paulo State Univ, Dept Comp Sci, Sao Paulo, Brazil

[2] Univ Fed Sao Paulo, Phys Inst Sao Carlos, Sao Carlos, SP, Brazil

[3] Univ Estadual Campinas, Inst Comp, Campinas, SP, Brazil

来源：

2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING | 2010年

关键词：

Speech recognition; Neural networks; Pattern recognition; Signal classification;

D O I：

10.1109/ICASSP.2010.5495695

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

The applications of Automatic Vowel Recognition (AVR), which is a sub-part of fundamental importance in most of the speech processing systems, vary from automatic interpretation of spoken language to biometrics. State-of-the-art systems for AVR are based on traditional machine learning models such as Artificial Neural Networks (ANNs) and Support Vector Machines (SVMs), however, such classifiers can not deal with efficiency and effectiveness at the same time, existing a gap to be explored when real-time processing is required. In this work, we present an algorithm for AVR based on the Optimum-Path Forest (OPF), which is an emergent pattern recognition technique recently introduced in literature. Adopting a supervised training procedure and using speech tags from two public datasets, we observed that OPF has outperformed ANNs, SVMs, plus other classifiers, in terms of training time and accuracy.

引用

页码：2190 / 2193

页数：4

共 50 条

[21] Fast pattern recognition using a neurocomputer
Masanori Sugisaka
Artificial Life and Robotics, 1997, 1 (2) : 69 - 72
[22] Facial expression recognition of a speaker using vowel judgment and thermal image processing
Yoshitomi, Yasunari
Asada, Taro
Shimada, Kyouhei
Tabuse, Masayoshi
ARTIFICIAL LIFE AND ROBOTICS, 2011, 16 (03) : 318 - 323
[23] Facial Expression Recognition of a Speaker Using Vowel Judgment and Thermal Image Processing
Yoshitomi, Y.
Asada, T.
Shimada, K.
Tabuse, M.
PROCEEDINGS OF THE SIXTEENTH INTERNATIONAL SYMPOSIUM ON ARTIFICIAL LIFE AND ROBOTICS (AROB 16TH '11), 2011, : 225 - 230
[24] A Robust real time Handwritten recognition system using Neural Networks
Padmapriya, K.
Kubendran, Jenitha
RajaMuthu, Kowshika
Kumar, Janani Suresh
INTERNATIONAL JOURNAL OF EARLY CHILDHOOD SPECIAL EDUCATION, 2022, 14 (02) : 4213 - 4218
[25] Robust speech recognition using harmonic features
Goh, Yeh Huann
Raveendran, Paramesran
Jamuar, Sudhanshu Shekhar
IET SIGNAL PROCESSING, 2014, 8 (02) : 167 - 175
[26] Vowel Pronunciation in Indonesian Language Recognition Using The Lips Angle Measurement and Lips Area
Ratnadewi
Wahyudi, Adhi Fajar Sakti
Prasetyaningtyas, Anisa Fardhani
2015 2ND INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY, COMPUTER, AND ELECTRICAL ENGINEERING (ICITACEE), 2015, : 163 - 168
[27] Robust Noisy Speech Recognition Using Deep Neural Support Vector Machines
Amami, Rimah
Ben Ayed, Dorra
DISTRIBUTED COMPUTING AND ARTIFICIAL INTELLIGENCE, 2019, 800 : 300 - 307
[28] A speech recognition system using fast learning algorithm and beta wavelet network
Ejbali, Ridha
Jemai, Olfa
Zaied, Mourad
Ben Amar, Chokri
2015 15TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS (ISDA), 2015, : 14 - 17
[29] Fast and Robust Face Detection Using Evolutionary Pruning
Jang, Jun-Su
Kim, Jong-Hwan
IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2008, 12 (05) : 562 - 571
[30] Robust Speech Recognition using Generalized Distillation Framework
Markov, Konstantin
Matsui, Tomoko
17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 2364 - 2368

← 1 2 3 4 5 →