Automatic recognition of birdsongs using mel-frequency cepstral coefficients and vector quantization

被引:0
作者
Lee, Chang-Hsing [1 ]
Lien, Cheng-Chang [1 ]
Huang, Ren-Zhuang [2 ]
机构
[1] Chung Hua Univ, Dept Comp Sci & Informat Engn, 707,Sec 2,WuFu Rd, Hsinchu, Taiwan
[2] Chung Hua Univ, Dept Comp Sci & Informat Engn, Hsinchu, Taiwan
来源
IMECS 2006: INTERNATIONAL MULTICONFERENCE OF ENGINEERS AND COMPUTER SCIENTISTS | 2006年
关键词
birdsong recognition; linear discriminant analysis; MFCC; vector quantization;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we propose a method to automatically identify birds from the sounds they generate. First, each syllable corresponding to a piece of vocalization is segmented. For each syllable, the averaged MFCCs (AMFCC) over all frames in a syllable are calculated as the vocalization features. Vector quantization is employed to design the codebook consisting of several representative feature vectors to well model the variant characteristics of syllables segmented from the sound of identical bird species. Finally, linear discriminant analysis (LDA) is exploited to increase the classification accuracy at a lower dimensional feature vector space. In our experiments, the average classification accuracy is 87% for the recognition of 420 bird species.
引用
收藏
页码:331 / +
页数:3
相关论文
共 15 条
[1]  
AKROUT N, 1992, SIGNAL PROCESS 6, V3, P1227
[2]   Template-based automatic recognition of birdsong syllables from continuous recordings [J].
Anderson, SE ;
Dave, AS ;
Margoliash, D .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1996, 100 (02) :1209-1219
[3]  
Brenowitz EA, 1997, J NEUROBIOL, V33, P495, DOI 10.1002/(SICI)1097-4695(19971105)33:5<495::AID-NEU1>3.0.CO
[4]  
2-#
[5]  
Duda R. O., 2000, PATTERN CLASSIFICATI
[6]  
Härmä A, 2003, 2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL V, PROCEEDINGS, P545
[7]  
KABAYA T, 2001, SONGS CALLS 420 BIRD
[8]   Automated recognition of bird song elements from continuous recordings using dynamic time warping and hidden Markov models: A comparative study [J].
Kogan, JA ;
Margoliash, D .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1998, 103 (04) :2185-2196
[9]   Birdsong recognition using backpropagation and multivariate statistics [J].
McIlraith, AL ;
Card, HC .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1997, 45 (11) :2740-2748
[10]  
MCILRAITH AL, 1997, P IEEE INT C NEUR NE, V1, P100