ROBUST AND FAST VOWEL RECOGNITION USING OPTIMUM-PATH FOREST

Cited by: 5
Authors
Papa, Joao P. [1]
Marana, Aparecido N. [1]
Spadotto, Andre A. [2]
Guido, Rodrigo C. [2]
Falcao, Alexandre X. [3]
Affiliations
[1] Sao Paulo State Univ, Dept Comp Sci, Sao Paulo, Brazil
[2] Univ Fed Sao Paulo, Phys Inst Sao Carlos, Sao Carlos, SP, Brazil
[3] Univ Estadual Campinas, Inst Comp, Campinas, SP, Brazil
Source
2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING | 2010
Keywords
Speech recognition; Neural networks; Pattern recognition; Signal classification;
DOI
10.1109/ICASSP.2010.5495695
Chinese Library Classification
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
The applications of Automatic Vowel Recognition (AVR), a fundamental component of most speech processing systems, range from automatic interpretation of spoken language to biometrics. State-of-the-art AVR systems are based on traditional machine learning models such as Artificial Neural Networks (ANNs) and Support Vector Machines (SVMs). However, such classifiers cannot deliver efficiency and effectiveness at the same time, which leaves a gap to be explored when real-time processing is required. In this work, we present an algorithm for AVR based on the Optimum-Path Forest (OPF), an emerging pattern recognition technique recently introduced in the literature. Adopting a supervised training procedure and using speech tags from two public datasets, we observed that OPF outperformed ANNs, SVMs, and other classifiers in terms of training time and accuracy.
Pages: 2190-2193
Number of pages: 4