A Novel Approach of Audio Based Feature Optimisation for Bird Classification

被引：3

作者：

Ramashini, Murugaiya ^{[1
]}

Abas, Pg Emeroylariffion ^{[1
]}

De Silva, Liyanage C. ^{[1
]}

机构：

[1] Univ Brunei Darussalam, Fac Integrated Technol, Jalan Tungku Link, BE-1410 Gadong, Brunei

来源：

PERTANIKA JOURNAL OF SCIENCE AND TECHNOLOGY | 2021年 / 29卷 / 04期

关键词：

Artificial neural network (ANN; bird sounds; classification; linear discriminant analysis (LDA); nearest centroid (NC); RECOGNITION;

D O I：

10.47836/pjst.29.4.08

中图分类号：

O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];

学科分类号：

07 ; 0710 ; 09 ;

摘要：

Bird classification using audio data can be beneficial in assisting ornithologists, bird watchers and environmentalists. However, due to the complex environment in the jungles, it is difficult to identify birds by visual inspection. Hence, identification via acoustical means may be a better option in such an environment. This study aims to classify endemic Bornean birds using their sounds. Thirty-five (35) acoustic features have been extracted from the pre-recorded soundtracks of birds. In this paper, a novel approach for selecting an optimum number of features using Linear Discriminant Analysis (LDA) has been proposed to give better classification accuracy. It is found that using a Nearest Centroid (NC) technique with LDA produces the optimum classification results of bird sounds at 96.7% accuracy with reduced computational power. The low computational complexity is an added advantage for handheld portable devices with minimal computing power, which can be used in birdwatching expeditions. Comparison results have been provided with and without LDA using NC and Artificial Neural Network (ANN) classifiers. It has been demonstrated that both classifiers with LDA outperform those without LDA. Maximum accuracies for both NC and ANN with LDA, with NC and the ANN classifiers requiring 7 and 10 LDAs to achieve the optimum accuracy, respectively, are 96.7%. However, ANN accuracy, respectively, However, classifier with LDA is more computationally complex. Hence, this is significant as the simpler NC classifier with LDA, which does not require expensive processing power, may be used on the portable and affordable device for bird classification purposes.

引用

页码：2383 / 2407

页数：25

共 33 条

[1] Stress Recovery during Exposure to Nature Sound and Environmental Noise
Alvarsson, Jesper J.
Wiens, Stefan
Nilsson, Mats E.
[J]. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2010, 7 (03): : 1036 - 1046
[2] Template-based automatic recognition of birdsong syllables from continuous recordings
Anderson, SE
Dave, AS
Margoliash, D
[J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1996, 100 (02) : 1209 - 1219
[3] [Anonymous], 2014, INTRO AUDIO ANAL
[4] Bird sounds classification by combining PNCC and robust Mel-log filter bank features
Badi, Alzahra
Ko, Kyungdeuk
Ko, Hanseok
[J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2019, 38 (01): : 39 - 46
[5] Butler R. W., 2019, International Journal of Tourism Anthropology, V7, P5, DOI 10.1504/IJTA.2019.098097
[6] On the Studies of Syllable Segmentation and Improving MFCCs for Automatic Birdsong Recognition
Chou, Chih-Hsun
Liu, Pang-Hsin
Cai, Bingjing
[J]. 2008 IEEE ASIA-PACIFIC SERVICES COMPUTING CONFERENCE, VOLS 1-3, PROCEEDINGS, 2008, : 745 - +
[7] Elliott D. L, 1993, 938 ISR TR
[8] Automatic Segmentation of Audio Signals for Bird Species Identification
Evangelista, Thiago L. F.
Priolli, Thales M.
Silla, Carlos N., Jr.
Angelico, Bruno A.
Kaestner, Celso A. A.
[J]. 2014 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM), 2014, : 223 - 228
[9] Fagerlund Seppo, 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), P8247, DOI 10.1109/ICASSP.2014.6855209
[10] Bird species recognition using support vector machines
Fagerlund, Seppo
[J]. EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2007, 2007 (1)

← 1 2 3 4 →