On the Studies of Syllable Segmentation and Improving MFCCs for Automatic Birdsong Recognition

被引:20
|
作者
Chou, Chih-Hsun [1 ]
Liu, Pang-Hsin [1 ]
Cai, Bingjing [2 ]
机构
[1] Chung Hua Univ, Dept Comp Sci & Informat Engn, 707,Sec 2,WuFu Rd, Hsinchu 30067, Taiwan
[2] Yunnan Univ, Sch Software, Yunnan 650091, Peoples R China
关键词
D O I
10.1109/APSCC.2008.6
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Birdsongs are typically divided into four hierarchical levels: note, syllable, phrase, and song, of which syllable plays an important role in bird species recognition. To improve the recognition rate of birdsongs, in this study an enhanced syllable segmentation method based on R-S endpoint detection method was presented Furthermore, a decision based neural network with suitable reinforcement learning rule was developed as the classifier. The proposed methods combined with the well-known MFCCs feature vector form a birdsong recognition system that was applied to two recognition problems: one is the recognition of a set of arbitrary syllables and the other is the recognition of a section of a birdsong. Experimental results show the performances of the proposed methods.
引用
收藏
页码:745 / +
页数:3
相关论文
共 50 条
  • [1] Automatic speech segmentation in syllable centric speech recognition system
    Panda S.P.
    Nayak A.K.
    International Journal of Speech Technology, 2016, 19 (1) : 9 - 18
  • [2] Consonant/vowel segmentation for Mandarin syllable recognition
    Lin, MT
    Lee, CK
    Lin, CY
    COMPUTER SPEECH AND LANGUAGE, 1999, 13 (03): : 207 - 222
  • [3] Consonant/vowel segmentation for Mandarin syllable recognition
    Tatung Institute of Technology, Institute of Electrical Engineering, 3rd Section, EE Department, 40 Chungshan North Road, Taipei 10451, Taiwan, Taiwan
    Comput Speech Lang, 3 (207-222):
  • [4] Automatic Species Recognition Based on Improved Birdsong Analysis
    Knapp, Joshua
    Qu, Guangzhi
    Zhang, Feng
    2016 15TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2016), 2016, : 176 - 181
  • [5] Auditory Model Based Optimization of MFCCs Improves Automatic Speech Recognition Performance
    Chatterjee, Saikat
    Koniaris, Christos
    Kleijn, W. Bastiaan
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 2943 - 2946
  • [6] Automatic Recognition of Element Classes and Boundaries in the Birdsong with Variable Sequences
    Koumura, Takuya
    Okanoya, Kazuo
    PLOS ONE, 2016, 11 (07):
  • [7] Role of syllable segmentation processes in peripheral word recognition
    Bernard, Jean-Baptiste
    Calabrese, Aurelie
    Castet, Eric
    VISION RESEARCH, 2014, 105 : 226 - 232
  • [8] Automatic recognition of anuran species based on syllable identification
    Bedoya, Carol
    Isaza, Claudia
    Daza, Juan M.
    Lopez, Jose D.
    ECOLOGICAL INFORMATICS, 2014, 24 : 200 - 209
  • [9] Automatic recognition of syllable-final nasals preceded by/ε/
    Journal of the Acoustical Society of America, 1995, 97 (03):
  • [10] Syllable-based automatic Arabic speech recognition
    Azmi, Mohamed Mostafa
    Tolba, Hesham
    Mahdy, Sherif
    Fashal, Mervat
    PROCEEDINGS OF THE 7TH WSEAS INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, ROBOTICS AND AUTOMATION: ADVANCED TOPICS ON SIGNAL PROCESSING, ROBOTICS AND AUTOMATION, 2008, : 246 - +