ROBUST AND FAST VOWEL RECOGNITION USING OPTIMUM-PATH FOREST

被引:5
|
作者
Papa, Joao P. [1 ]
Marana, Aparecido N. [1 ]
Spadotto, Andre A. [2 ]
Guido, Rodrigo C. [2 ]
Falcao, Alexandre X. [3 ]
机构
[1] Sao Paulo State Univ, Dept Comp Sci, Sao Paulo, Brazil
[2] Univ Fed Sao Paulo, Phys Inst Sao Carlos, Sao Carlos, SP, Brazil
[3] Univ Estadual Campinas, Inst Comp, Campinas, SP, Brazil
来源
2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING | 2010年
关键词
Speech recognition; Neural networks; Pattern recognition; Signal classification;
D O I
10.1109/ICASSP.2010.5495695
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The applications of Automatic Vowel Recognition (AVR), which is a sub-part of fundamental importance in most of the speech processing systems, vary from automatic interpretation of spoken language to biometrics. State-of-the-art systems for AVR are based on traditional machine learning models such as Artificial Neural Networks (ANNs) and Support Vector Machines (SVMs), however, such classifiers can not deal with efficiency and effectiveness at the same time, existing a gap to be explored when real-time processing is required. In this work, we present an algorithm for AVR based on the Optimum-Path Forest (OPF), which is an emergent pattern recognition technique recently introduced in literature. Adopting a supervised training procedure and using speech tags from two public datasets, we observed that OPF has outperformed ANNs, SVMs, plus other classifiers, in terms of training time and accuracy.
引用
收藏
页码:2190 / 2193
页数:4
相关论文
共 50 条
  • [21] Fast pattern recognition using a neurocomputer
    Masanori Sugisaka
    Artificial Life and Robotics, 1997, 1 (2) : 69 - 72
  • [22] Facial expression recognition of a speaker using vowel judgment and thermal image processing
    Yoshitomi, Yasunari
    Asada, Taro
    Shimada, Kyouhei
    Tabuse, Masayoshi
    ARTIFICIAL LIFE AND ROBOTICS, 2011, 16 (03) : 318 - 323
  • [23] Facial Expression Recognition of a Speaker Using Vowel Judgment and Thermal Image Processing
    Yoshitomi, Y.
    Asada, T.
    Shimada, K.
    Tabuse, M.
    PROCEEDINGS OF THE SIXTEENTH INTERNATIONAL SYMPOSIUM ON ARTIFICIAL LIFE AND ROBOTICS (AROB 16TH '11), 2011, : 225 - 230
  • [24] A Robust real time Handwritten recognition system using Neural Networks
    Padmapriya, K.
    Kubendran, Jenitha
    RajaMuthu, Kowshika
    Kumar, Janani Suresh
    INTERNATIONAL JOURNAL OF EARLY CHILDHOOD SPECIAL EDUCATION, 2022, 14 (02) : 4213 - 4218
  • [25] Robust speech recognition using harmonic features
    Goh, Yeh Huann
    Raveendran, Paramesran
    Jamuar, Sudhanshu Shekhar
    IET SIGNAL PROCESSING, 2014, 8 (02) : 167 - 175
  • [26] Vowel Pronunciation in Indonesian Language Recognition Using The Lips Angle Measurement and Lips Area
    Ratnadewi
    Wahyudi, Adhi Fajar Sakti
    Prasetyaningtyas, Anisa Fardhani
    2015 2ND INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY, COMPUTER, AND ELECTRICAL ENGINEERING (ICITACEE), 2015, : 163 - 168
  • [27] Robust Noisy Speech Recognition Using Deep Neural Support Vector Machines
    Amami, Rimah
    Ben Ayed, Dorra
    DISTRIBUTED COMPUTING AND ARTIFICIAL INTELLIGENCE, 2019, 800 : 300 - 307
  • [28] A speech recognition system using fast learning algorithm and beta wavelet network
    Ejbali, Ridha
    Jemai, Olfa
    Zaied, Mourad
    Ben Amar, Chokri
    2015 15TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS (ISDA), 2015, : 14 - 17
  • [29] Fast and Robust Face Detection Using Evolutionary Pruning
    Jang, Jun-Su
    Kim, Jong-Hwan
    IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2008, 12 (05) : 562 - 571
  • [30] Robust Speech Recognition using Generalized Distillation Framework
    Markov, Konstantin
    Matsui, Tomoko
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 2364 - 2368