Birdsong classification based on multi-feature fusion

被引:19
|
作者
Yan, Na [1 ]
Chen, Aibin [1 ,3 ]
Zhou, Guoxiong [1 ]
Zhang, Zhiqiang [2 ]
Liu, Xiangyong [4 ]
Wang, Jianwu [5 ]
Liu, Zhihua [1 ]
Chen, Wenjie [1 ]
机构
[1] Cent South Univ Forestry & Technol, Coll Comp & Informat Engn, Inst Artificial Intelligence Applicat, Changsha, Peoples R China
[2] Cent South Univ Forestry & Technol, Coll Forestry, Wildlife Conservat & Utilizat Lab, Changsha, Peoples R China
[3] Cent South Univ Forestry & Technol, Coll Life Sci & Technol, Hunan Prov Key Lab Urban Forest Ecol, Changsha, Peoples R China
[4] Hunan Zixing Artificial Intelligence Res Acad, Hunan Zixing, Peoples R China
[5] HuangFengQiao State Owned Forest Farm, Youxian Cty, Hunan, Peoples R China
关键词
Birdsong classification; Acoustic feature; Feature fusion; 3DCNN-LSTM; NEURAL-NETWORKS; RECOGNITION; SOUNDS; MFCC;
D O I
10.1007/s11042-021-11396-9
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The classification of birdsong has very important signification to monitor the bird population in the habitats. Aiming at the birdsong dataset with complex and diverse audio background, this paper attempts to introduce an acoustic feature for voice and music analysis: Chroma. It is spliced and fused with the commonly used birdsong features, Log-Mel Spectrogram (LM) and Mel Frequency Cepstrum Coefficient (MFCC), to enrich the representational capacity of single feature; At the same time, in view of the characteristic that birdsong has continuous and dynamic changes in time, a 3DCNN-LSTM combined model is proposed as a classifier to make the network more sensitive to the birdsong information that changes with time. In this paper, we selected four bird audio data from the Xeno-Canto website to evaluate how LM, MFCC and Chroma were fused to maximize the birdsong audio information. The experimental results show that the LM-MFCC-C feature combination achieves the best result of 97.9% mean average precision (mAP) in the experiment.
引用
收藏
页码:36529 / 36547
页数:19
相关论文
共 50 条
  • [1] Birdsong classification based on multi-feature fusion
    Na Yan
    Aibin Chen
    Guoxiong Zhou
    Zhiqiang Zhang
    Xiangyong Liu
    Jianwu Wang
    Zhihua Liu
    Wenjie Chen
    Multimedia Tools and Applications, 2021, 80 : 36529 - 36547
  • [2] Birdsong classification based on multi feature channel fusion
    Zhihua Liu
    Wenjie Chen
    Aibin Chen
    Guoxiong Zhou
    Jizheng Yi
    Multimedia Tools and Applications, 2022, 81 : 15469 - 15490
  • [3] Birdsong classification based on multi feature channel fusion
    Liu, Zhihua
    Chen, Wenjie
    Chen, Aibin
    Zhou, Guoxiong
    Yi, Jizheng
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (11) : 15469 - 15490
  • [4] EEG Emotion Classification Based on Multi-Feature Fusion
    Liang, Mingjing
    Wang, Lu
    Wen, Xin
    Cao, Rui
    Computer Engineering and Applications, 2024, 59 (05) : 155 - 159
  • [5] Eye State Classification Based on Multi-feature fusion
    Dong, Wenhui
    Qu, Peishu
    CCDC 2009: 21ST CHINESE CONTROL AND DECISION CONFERENCE, VOLS 1-6, PROCEEDINGS, 2009, : 231 - 234
  • [6] Alzheimer's Disease Classification Based on Multi-feature Fusion
    Madusanka, Nuwan
    Choi, Heung-Kook
    So, Jae-Hong
    Choi, Boo-Kyeong
    CURRENT MEDICAL IMAGING REVIEWS, 2019, 15 (02) : 161 - 169
  • [7] BiTCN malware classification method based on multi-feature fusion
    Xuan, Bona
    Li, Jin
    Song, Yafei
    2022 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, COMPUTER VISION AND MACHINE LEARNING (ICICML), 2022, : 359 - 364
  • [8] Target classification with adaptive weights based on multi-feature fusion
    Wang L.
    Zhang Z.
    Su L.
    Nie W.
    1600, Huazhong University of Science and Technology (48): : 38 - 43
  • [9] A GLASS IMAGE CLASSIFICATION METHOD BASED ON MULTI-FEATURE FUSION
    Zhang, Liang
    Wen, Jing
    Xu, Sheng-Zhou
    Xing, Hao-Yang
    Zhu, Yu
    Chen, Heng-Xin
    PROCEEDINGS OF 2016 INTERNATIONAL CONFERENCE ON WAVELET ANALYSIS AND PATTERN RECOGNITION (ICWAPR), 2016, : 7 - 11
  • [10] Unsupervised seismic facies classification based on multi-feature fusion autoencoder
    Wang QianNan
    Wang ZhiGuo
    Yang Yang
    Zhu JianBing
    Gao JingHuai
    CHINESE JOURNAL OF GEOPHYSICS-CHINESE EDITION, 2024, 67 (01): : 370 - 378