Bird Call Classification Using DNN-Based Acoustic Modelling

被引:3
|
作者
Rajan, Rajeev [1 ,2 ]
Johnson, Jisna [1 ,2 ]
Kareem, Noumida Abdul [1 ,2 ]
机构
[1] Coll Engn, Dept Elect & Commun Engn, Thiruvananthapuram, Kerala, India
[2] APJ Abdul Kalam Technol Univ, Thiruvananthapuram, Kerala, India
关键词
Hidden Markov model; Gaussian mixture model; Deep neural network; Convolutional neural network;
D O I
10.1007/s00034-021-01896-2
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Bird call recognition using deep neural network-hidden Markov model (DNN-HMM)-based transcription is proposed. The work is an attempt to adapt the human speech recognition framework for bird call classification through transcription approach. Initially, the phone transcriptions are generated using CMU-Sphinx, and lexicons are modified using group delay-based segmentation. Later, bird call transcription is implemented using hybrid DNN-HMM framework through DNN-based acoustic modelling. During the DNN-based acoustic modelling, mel-frequency cepstral coefficient features (MFCCs) are computed and experimented with monophone models, triphone models, followed by linear discriminative analysis and maximum likelihood linear transform. The transcribed phonemes are corrected using context-based rules in the final phase. The proposed approach is evaluated on a dataset that consists of ten species with 563 audio tracks. The hybrid DNN-HMM approach outperforms the convolutional neural network and long short-term memory framework with an accuracy of 94.46%.
引用
收藏
页码:2669 / 2680
页数:12
相关论文
共 50 条
  • [1] Bird Call Classification Using DNN-Based Acoustic Modelling
    Rajeev Rajan
    Jisna Johnson
    Noumida Abdul Kareem
    Circuits, Systems, and Signal Processing, 2022, 41 : 2669 - 2680
  • [2] DNN-based Arabic Printed Characters Classification
    Amrouche, Aissa
    PROGRAM OF THE 2ND INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING AND AUTOMATIC CONTROL, ICEEAC 2024, 2024,
  • [3] DNN-Based Acoustic Modeling for Russian Speech Recognition Using Kaldi
    Kipyatkova, Irina
    Karpov, Alexey
    SPEECH AND COMPUTER, 2016, 9811 : 246 - 253
  • [4] Traffic Reduction in Video Call and Chat using DNN-based Image Reconstruction
    Watanabe, Shota
    Fujihashi, Takuya
    Saruwatari, Shunsuke
    Watanabe, Takashi
    ICC 2019 - 2019 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2019,
  • [5] DNN-based seabed classification using differently weighted MBES multifeatures
    Zhu, Zhengren
    Cui, Xiaodong
    Zhang, Kai
    Ai, Bo
    Shi, Bo
    Yang, Fanlin
    MARINE GEOLOGY, 2021, 438
  • [6] DNN-based Acoustic-to-Articulatory Inversion using Ultrasound Tongue Imaging
    Porras, Dagoberto
    Sepulveda-Sepulveda, Alexander
    Csapo, Tamas Gabor
    2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [7] DNN-Based Speech Synthesis for Arabic: Modelling and Evaluation
    Houidhek, Amal
    Colotte, Vincent
    Mnasri, Zied
    Jouvet, Denis
    STATISTICAL LANGUAGE AND SPEECH PROCESSING, SLSP 2018, 2018, 11171 : 9 - 20
  • [8] Analyzing Decision Polygons of DNN-based Classification Methods
    Kim, Jongyoung
    Woo, Seongyoun
    Lee, Wonjun
    Kim, Donghwan
    Lee, Chulhee
    ICINCO: PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON INFORMATICS IN CONTROL, AUTOMATION AND ROBOTICS, 2020, : 346 - 351
  • [9] DNN-Based PolSAR Image Classification on Noisy Labels
    Ni, Jun
    Xiang, Deliang
    Lin, Zhiyuan
    Lopez-Martinez, Carlos
    Hu, Wei
    Zhang, Fan
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2022, 15 : 3697 - 3713
  • [10] DNN-based Models for Speaker Age and Gender Classification
    Qawaqneh, Zakariya
    Abu Mallouh, Arafat
    Barkana, Buket D.
    PROCEEDINGS OF THE 10TH INTERNATIONAL JOINT CONFERENCE ON BIOMEDICAL ENGINEERING SYSTEMS AND TECHNOLOGIES, VOL 4: BIOSIGNALS, 2017, : 106 - 111