Optimizing avian species recognition with MFCC features and deep learning models

被引：0

作者：

Raviteja Kamarajugadda ^{[1
]}

Rahul Battula ^{[1
]}

Chaitanya Reddy Borra ^{[1
]}

Harsha Durga ^{[1
]}

Venkat Bypilla ^{[1
]}

Seelam Srinivasa Reddy ^{[1
]}

Farzana Fathima Khan ^{[1
]}

Shrimannaraya Bhavanam ^{[2
]}

机构：

[1] Lakireddy Bali Reddy College of Engineering,Department of Information Technology

[2] Shrimannaraya Bhavanam,undefined

[3] Oracle Company,undefined

[4] Principal Member Technical Staff,undefined

来源：

International Journal of Information Technology | 2024年 / 16卷 / 7期

关键词：

Sequential model (Convolutional Neural Network); LSTM (recurrent neural network); VGGish; MFCC (Mel-frequency cepstral coefficients); Audio; Bird; Ecosystem; Deep learning; Ornithologist; Extinction;

D O I：

10.1007/s41870-024-02108-1

中图分类号：

学科分类号：

摘要：

The rapid reduction in bird populations and the imminent prospect of avian extinction have profound effects on global ecosystems, putting vital ecological services and processes in jeopardy. Finding endangered bird species is a major problem for scientists, making it more difficult to come up with practical conservation plans. In order to meet this need, we provide an integrated system that combines MFCC-based feature extraction with state-of-the-art deep learning models CNN, LSTM, and VGGish to accurately identify bird species from audio recordings. Our method makes use of each model’s special strengths: VGGish represents extensive audio features, LSTM handles temporal dependencies, and CNN handles spatial hierarchies. Our framework seeks to improve species categorization efficiency and accuracy by utilizing these cutting-edge methods, supporting conservation efforts, and reducing the negative ecological effects of rapid population reduction. We work to provide ornithologists and conservationists with the resources they need to protect biodiversity and maintain the integrity of our ecosystems by continuously collecting data and disseminating information.

引用

页码：4621 / 4626

页数：5

共 50 条

[1] A Comparison of MFCC and LPCC with Deep Learning for Speaker Recognition
Yang, Haiyan
Deng, Yanrong
Zhao, Hua-An
ICBDC 2019: PROCEEDINGS OF 2019 4TH INTERNATIONAL CONFERENCE ON BIG DATA AND COMPUTING, 2019, : 160 - 164
[2] Voice Recognition Based on Adaptive MFCC and Deep Learning
Bae, Hyan-Soo
Lee, Ho-Jin
Lee, Suk-Gyu
PROCEEDINGS OF THE 2016 IEEE 11TH CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA), 2016, : 1542 - 1546
[3] Exploring Deep Features and Transfer Learning for Plant Species Recognition
Feitoza, Marcondes Coelho
da Silva, Wanderson Bezerra
Calumby, Rodrigo Tripodi
PROCEEDINGS OF THE XV BRAZILIAN SYMPOSIUM ON INFORMATION SYSTEMS, SBSI 2019: Complexity on Modern Information Systems, 2019,
[4] Overview of handcrafted features and deep learning models for leaf recognition
Isik, Sahin
Ozkan, Kemal
JOURNAL OF ENGINEERING RESEARCH, 2021, 9 (01):
[5] The impact of MFCC, spectrogram, and Mel-Spectrogram on deep learning models for Amazigh speech recognition system
Meryam Telmem
Naouar Laaidi
Hassan Satori
International Journal of Speech Technology, 2025, 28 (1) : 299 - 312
[6] Recognition of Endemic Bird Species Using Deep Learning Models
Huang, Yo-Ping
Basanta, Haobijam
IEEE ACCESS, 2021, 9 : 102975 - 102984
[7] Speaker identification and localization using shuffled MFCC features and deep learning
Barhoush M.
Hallawa A.
Schmeink A.
International Journal of Speech Technology, 2023, 26 (01) : 185 - 196
[8] Amazigh Spoken Digit Recognition using a Deep Learning Approach based on MFCC
Boulal, Hossam
Hamidi, Mohamed
Abarkan, Mustapha
Barkani, Jamal
INTERNATIONAL JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING SYSTEMS, 2023, 14 (07) : 791 - 798
[9] Prosodic information extraction and classification based on MFCC features and machine learning models
Gill, Sajid Habib
Mahar, Javed Ahmed
Mahar, Shahid Ali
Razzaq, Mirza Abdur
Mehmood, Arif
Choi, Gyu Sang
Ashraf, Imran
MEASUREMENT & CONTROL, 2025,
[10] Optimizing Deep Learning Models for Object Detection
Barburescu, Calin-George
Iuhasz, Gabriel
2020 22ND INTERNATIONAL SYMPOSIUM ON SYMBOLIC AND NUMERIC ALGORITHMS FOR SCIENTIFIC COMPUTING (SYNASC 2020), 2020, : 270 - 277

← 1 2 3 4 5 →