On the Studies of Syllable Segmentation and Improving MFCCs for Automatic Birdsong Recognition

被引:20
|
作者
Chou, Chih-Hsun [1 ]
Liu, Pang-Hsin [1 ]
Cai, Bingjing [2 ]
机构
[1] Chung Hua Univ, Dept Comp Sci & Informat Engn, 707,Sec 2,WuFu Rd, Hsinchu 30067, Taiwan
[2] Yunnan Univ, Sch Software, Yunnan 650091, Peoples R China
关键词
D O I
10.1109/APSCC.2008.6
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Birdsongs are typically divided into four hierarchical levels: note, syllable, phrase, and song, of which syllable plays an important role in bird species recognition. To improve the recognition rate of birdsongs, in this study an enhanced syllable segmentation method based on R-S endpoint detection method was presented Furthermore, a decision based neural network with suitable reinforcement learning rule was developed as the classifier. The proposed methods combined with the well-known MFCCs feature vector form a birdsong recognition system that was applied to two recognition problems: one is the recognition of a set of arbitrary syllables and the other is the recognition of a section of a birdsong. Experimental results show the performances of the proposed methods.
引用
收藏
页码:745 / +
页数:3
相关论文
共 50 条
  • [21] Template-based automatic recognition of birdsong syllables from continuous recordings
    Anderson, SE
    Dave, AS
    Margoliash, D
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1996, 100 (02): : 1209 - 1219
  • [22] On the Influence of Automatic Segmentation and Clustering in Automatic Speech Recognition
    Lopez-Otero, Paula
    Docio-Fernandez, Laura
    Garcia-Mateo, Carmen
    Cardenal-Lopez, Antonio
    ADVANCES IN SPEECH AND LANGUAGE TECHNOLOGIES FOR IBERIAN LANGUAGES, 2012, 328 : 49 - 58
  • [23] Automatic birdsong recognition based on autoregressive time-delay neural networks
    Selouani, S-. A.
    Kardouchi, M.
    Hervet, E.
    Roy, D.
    2005 ICSC Congress on Computational Intelligence Methods and Applications (CIMA 2005), 2005, : 101 - 106
  • [24] Speaker Independent Automatic Emotion Recognition from Speech: A Comparison of MFCCs and Discrete Wavelet Transforms
    Shah, Firoz A.
    Krishnan, Vimal V. R.
    Sukumar, Raji A.
    Jayakumar, Athulya
    Anto, Babu P.
    2009 INTERNATIONAL CONFERENCE ON ADVANCES IN RECENT TECHNOLOGIES IN COMMUNICATION AND COMPUTING (ARTCOM 2009), 2009, : 528 - 531
  • [25] Automatic Speaker Recognition Dependency on Both the Shape of Auditory Critical Bands and Speaker Discriminative MFCCs
    Jokic, Ivan
    Delic, Vlado
    Jokic, Stevan
    Peric, Zoran
    ADVANCES IN ELECTRICAL AND COMPUTER ENGINEERING, 2015, 15 (04) : 25 - 32
  • [26] Syllable-based automatic Arabic speech recognition in noisy enviroment
    Azmi, Mohamed M.
    Tolba, Hesham
    2008 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING, VOLS 1 AND 2, PROCEEDINGS, 2008, : 1436 - 1441
  • [27] SEGMENTATION FOR THE AUTOMATIC RECOGNITION OF WORD SEQUENCES
    CLASS, F
    MANGOLD, H
    ZELINSKI, R
    FREQUENZ, 1980, 34 (05) : 142 - 148
  • [28] Automatic syllable-based phoneme recognition using ESTER corpus
    Le Blouch, Olivier
    Collen, Patrice
    PROCEEDINGS OF THE 7TH WSEAS INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, COMPUTATIONAL GEOMETRY AND ARTIFICIAL VISION (ISCGAV'-07), 2007, : 77 - +
  • [29] An Automatic Blind Syllable Segmentation Model Based on Bi-directional LSTM
    Jian, Yang
    Peng, Su
    Li Zhenpeng
    2019 IEEE 2ND INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION ENGINEERING TECHNOLOGY (CCET), 2019, : 109 - 113
  • [30] Automatic syllable segmentation algorithm of Chinese speech based on MF-DFA
    He, Shaofang
    Zhao, Huan
    SPEECH COMMUNICATION, 2017, 92 : 42 - 51