Optimizing avian species recognition with MFCC features and deep learning models

被引:0
|
作者
Raviteja Kamarajugadda [1 ]
Rahul Battula [1 ]
Chaitanya Reddy Borra [1 ]
Harsha Durga [1 ]
Venkat Bypilla [1 ]
Seelam Srinivasa Reddy [1 ]
Farzana Fathima Khan [1 ]
Shrimannaraya Bhavanam [2 ]
机构
[1] Lakireddy Bali Reddy College of Engineering,Department of Information Technology
[2] Shrimannaraya Bhavanam,undefined
[3] Oracle Company,undefined
[4] Principal Member Technical Staff,undefined
关键词
Sequential model (Convolutional Neural Network); LSTM (recurrent neural network); VGGish; MFCC (Mel-frequency cepstral coefficients); Audio; Bird; Ecosystem; Deep learning; Ornithologist; Extinction;
D O I
10.1007/s41870-024-02108-1
中图分类号
学科分类号
摘要
The rapid reduction in bird populations and the imminent prospect of avian extinction have profound effects on global ecosystems, putting vital ecological services and processes in jeopardy. Finding endangered bird species is a major problem for scientists, making it more difficult to come up with practical conservation plans. In order to meet this need, we provide an integrated system that combines MFCC-based feature extraction with state-of-the-art deep learning models CNN, LSTM, and VGGish to accurately identify bird species from audio recordings. Our method makes use of each model’s special strengths: VGGish represents extensive audio features, LSTM handles temporal dependencies, and CNN handles spatial hierarchies. Our framework seeks to improve species categorization efficiency and accuracy by utilizing these cutting-edge methods, supporting conservation efforts, and reducing the negative ecological effects of rapid population reduction. We work to provide ornithologists and conservationists with the resources they need to protect biodiversity and maintain the integrity of our ecosystems by continuously collecting data and disseminating information.
引用
收藏
页码:4621 / 4626
页数:5
相关论文
共 50 条
  • [1] A Comparison of MFCC and LPCC with Deep Learning for Speaker Recognition
    Yang, Haiyan
    Deng, Yanrong
    Zhao, Hua-An
    ICBDC 2019: PROCEEDINGS OF 2019 4TH INTERNATIONAL CONFERENCE ON BIG DATA AND COMPUTING, 2019, : 160 - 164
  • [2] Voice Recognition Based on Adaptive MFCC and Deep Learning
    Bae, Hyan-Soo
    Lee, Ho-Jin
    Lee, Suk-Gyu
    PROCEEDINGS OF THE 2016 IEEE 11TH CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA), 2016, : 1542 - 1546
  • [3] Exploring Deep Features and Transfer Learning for Plant Species Recognition
    Feitoza, Marcondes Coelho
    da Silva, Wanderson Bezerra
    Calumby, Rodrigo Tripodi
    PROCEEDINGS OF THE XV BRAZILIAN SYMPOSIUM ON INFORMATION SYSTEMS, SBSI 2019: Complexity on Modern Information Systems, 2019,
  • [4] Overview of handcrafted features and deep learning models for leaf recognition
    Isik, Sahin
    Ozkan, Kemal
    JOURNAL OF ENGINEERING RESEARCH, 2021, 9 (01):
  • [5] The impact of MFCC, spectrogram, and Mel-Spectrogram on deep learning models for Amazigh speech recognition system
    Meryam Telmem
    Naouar Laaidi
    Hassan Satori
    International Journal of Speech Technology, 2025, 28 (1) : 299 - 312
  • [6] Recognition of Endemic Bird Species Using Deep Learning Models
    Huang, Yo-Ping
    Basanta, Haobijam
    IEEE ACCESS, 2021, 9 : 102975 - 102984
  • [7] Speaker identification and localization using shuffled MFCC features and deep learning
    Barhoush M.
    Hallawa A.
    Schmeink A.
    International Journal of Speech Technology, 2023, 26 (01) : 185 - 196
  • [8] Amazigh Spoken Digit Recognition using a Deep Learning Approach based on MFCC
    Boulal, Hossam
    Hamidi, Mohamed
    Abarkan, Mustapha
    Barkani, Jamal
    INTERNATIONAL JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING SYSTEMS, 2023, 14 (07) : 791 - 798
  • [9] Prosodic information extraction and classification based on MFCC features and machine learning models
    Gill, Sajid Habib
    Mahar, Javed Ahmed
    Mahar, Shahid Ali
    Razzaq, Mirza Abdur
    Mehmood, Arif
    Choi, Gyu Sang
    Ashraf, Imran
    MEASUREMENT & CONTROL, 2025,
  • [10] Optimizing Deep Learning Models for Object Detection
    Barburescu, Calin-George
    Iuhasz, Gabriel
    2020 22ND INTERNATIONAL SYMPOSIUM ON SYMBOLIC AND NUMERIC ALGORITHMS FOR SCIENTIFIC COMPUTING (SYNASC 2020), 2020, : 270 - 277