Birdsong classification based on multi-feature fusion

被引：19

作者：

Yan, Na ^{[1
]}

Chen, Aibin ^{[1
,3
]}

Zhou, Guoxiong ^{[1
]}

Zhang, Zhiqiang ^{[2
]}

Liu, Xiangyong ^{[4
]}

Wang, Jianwu ^{[5
]}

Liu, Zhihua ^{[1
]}

Chen, Wenjie ^{[1
]}

机构：

[1] Cent South Univ Forestry & Technol, Coll Comp & Informat Engn, Inst Artificial Intelligence Applicat, Changsha, Peoples R China

[2] Cent South Univ Forestry & Technol, Coll Forestry, Wildlife Conservat & Utilizat Lab, Changsha, Peoples R China

[3] Cent South Univ Forestry & Technol, Coll Life Sci & Technol, Hunan Prov Key Lab Urban Forest Ecol, Changsha, Peoples R China

[4] Hunan Zixing Artificial Intelligence Res Acad, Hunan Zixing, Peoples R China

[5] HuangFengQiao State Owned Forest Farm, Youxian Cty, Hunan, Peoples R China

来源：

MULTIMEDIA TOOLS AND APPLICATIONS | 2021年 / 80卷 / 30期

关键词：

Birdsong classification; Acoustic feature; Feature fusion; 3DCNN-LSTM; NEURAL-NETWORKS; RECOGNITION; SOUNDS; MFCC;

D O I：

10.1007/s11042-021-11396-9

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The classification of birdsong has very important signification to monitor the bird population in the habitats. Aiming at the birdsong dataset with complex and diverse audio background, this paper attempts to introduce an acoustic feature for voice and music analysis: Chroma. It is spliced and fused with the commonly used birdsong features, Log-Mel Spectrogram (LM) and Mel Frequency Cepstrum Coefficient (MFCC), to enrich the representational capacity of single feature; At the same time, in view of the characteristic that birdsong has continuous and dynamic changes in time, a 3DCNN-LSTM combined model is proposed as a classifier to make the network more sensitive to the birdsong information that changes with time. In this paper, we selected four bird audio data from the Xeno-Canto website to evaluate how LM, MFCC and Chroma were fused to maximize the birdsong audio information. The experimental results show that the LM-MFCC-C feature combination achieves the best result of 97.9% mean average precision (mAP) in the experiment.

引用

页码：36529 / 36547

页数：19

共 50 条

[1] Birdsong classification based on multi-feature fusion
Na Yan
Aibin Chen
Guoxiong Zhou
Zhiqiang Zhang
Xiangyong Liu
Jianwu Wang
Zhihua Liu
Wenjie Chen
Multimedia Tools and Applications, 2021, 80 : 36529 - 36547
[2] Birdsong classification based on multi feature channel fusion
Zhihua Liu
Wenjie Chen
Aibin Chen
Guoxiong Zhou
Jizheng Yi
Multimedia Tools and Applications, 2022, 81 : 15469 - 15490
[3] Birdsong classification based on multi feature channel fusion
Liu, Zhihua
Chen, Wenjie
Chen, Aibin
Zhou, Guoxiong
Yi, Jizheng
MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (11) : 15469 - 15490
[4] EEG Emotion Classification Based on Multi-Feature Fusion
Liang, Mingjing
Wang, Lu
Wen, Xin
Cao, Rui
Computer Engineering and Applications, 2024, 59 (05) : 155 - 159
[5] Eye State Classification Based on Multi-feature fusion
Dong, Wenhui
Qu, Peishu
CCDC 2009: 21ST CHINESE CONTROL AND DECISION CONFERENCE, VOLS 1-6, PROCEEDINGS, 2009, : 231 - 234
[6] Alzheimer's Disease Classification Based on Multi-feature Fusion
Madusanka, Nuwan
Choi, Heung-Kook
So, Jae-Hong
Choi, Boo-Kyeong
CURRENT MEDICAL IMAGING REVIEWS, 2019, 15 (02) : 161 - 169
[7] BiTCN malware classification method based on multi-feature fusion
Xuan, Bona
Li, Jin
Song, Yafei
2022 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, COMPUTER VISION AND MACHINE LEARNING (ICICML), 2022, : 359 - 364
[8] Target classification with adaptive weights based on multi-feature fusion
Wang L.
Zhang Z.
Su L.
Nie W.
1600, Huazhong University of Science and Technology (48): : 38 - 43
[9] A GLASS IMAGE CLASSIFICATION METHOD BASED ON MULTI-FEATURE FUSION
Zhang, Liang
Wen, Jing
Xu, Sheng-Zhou
Xing, Hao-Yang
Zhu, Yu
Chen, Heng-Xin
PROCEEDINGS OF 2016 INTERNATIONAL CONFERENCE ON WAVELET ANALYSIS AND PATTERN RECOGNITION (ICWAPR), 2016, : 7 - 11
[10] Unsupervised seismic facies classification based on multi-feature fusion autoencoder
Wang QianNan
Wang ZhiGuo
Yang Yang
Zhu JianBing
Gao JingHuai
CHINESE JOURNAL OF GEOPHYSICS-CHINESE EDITION, 2024, 67 (01): : 370 - 378

← 1 2 3 4 5 →