On the Studies of Syllable Segmentation and Improving MFCCs for Automatic Birdsong Recognition

被引:20
作者
Chou, Chih-Hsun [1 ]
Liu, Pang-Hsin [1 ]
Cai, Bingjing [2 ]
机构
[1] Chung Hua Univ, Dept Comp Sci & Informat Engn, 707,Sec 2,WuFu Rd, Hsinchu 30067, Taiwan
[2] Yunnan Univ, Sch Software, Yunnan 650091, Peoples R China
来源
2008 IEEE ASIA-PACIFIC SERVICES COMPUTING CONFERENCE, VOLS 1-3, PROCEEDINGS | 2008年
关键词
D O I
10.1109/APSCC.2008.6
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Birdsongs are typically divided into four hierarchical levels: note, syllable, phrase, and song, of which syllable plays an important role in bird species recognition. To improve the recognition rate of birdsongs, in this study an enhanced syllable segmentation method based on R-S endpoint detection method was presented Furthermore, a decision based neural network with suitable reinforcement learning rule was developed as the classifier. The proposed methods combined with the well-known MFCCs feature vector form a birdsong recognition system that was applied to two recognition problems: one is the recognition of a set of arbitrary syllables and the other is the recognition of a section of a birdsong. Experimental results show the performances of the proposed methods.
引用
收藏
页码:745 / +
页数:3
相关论文
共 20 条
  • [1] Template-based automatic recognition of birdsong syllables from continuous recordings
    Anderson, SE
    Dave, AS
    Margoliash, D
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1996, 100 (02) : 1209 - 1219
  • [2] [Anonymous], P ICSC C COMP INT ME
  • [3] Bou-Ghazale SE, 2002, INT CONF ACOUST SPEE, P3808
  • [4] Catchpole CK., 1995, BIRD SONG BIOL THEME
  • [5] FAGERLUND S, EURASIP J ADV SIGNAL, V2007
  • [6] HAIGH JA, 1993, TENCON'93: 1993 IEEE REGION 10 CONFERENCE ON COMPUTER, COMMUNICATION, CONTROL AND POWER ENGINEERING, VOL 3, P321, DOI 10.1109/TENCON.1993.327987
  • [7] HE SN, 2002, IEEE INT C COMM CIRC, V2, P992
  • [8] HE SN, 2002, P IEEE INT C COMM CI, V2, P997
  • [9] KABAYA T, 2001, SONGS CALLS 420 BIRD
  • [10] KITAYAMA K, 2003, P EUR GEN SWITZ, P1237