Video-based bird posture recognition using dual feature-rates deep fusion convolutional neural network

被引:4
作者
Lin, Chih-Wei [1 ]
Chen, Zhongsheng [1 ]
Lin, Mengxiang [1 ]
机构
[1] Fujian Agr & Forestry Univ, Coll Comp & Informat Sci, Fuzhou, Peoples R China
关键词
Bird behavior recognition; Convolutional neural network; Behavior rate; Information fusion; BEHAVIOR; VISION; COLOR; FORM;
D O I
10.1016/j.ecolind.2022.109141
中图分类号
X176 [生物多样性保护];
学科分类号
090705 ;
摘要
It is necessary to detect changes in birds' behaviors promptly to realize their health and habitat status. Moreover, promptly provide appropriate medical treatment and environmental remediation. Automating bird behavior recognition can solve this problem and assist in breeding and protecting birds. This paper proposes the transposed non-local module based on a time pyramid network to establish a dual feature-rates deep fusion net (DF2Net) for bird behavior recognition. The time pyramid network uses spatial alignment and time pooling operations to extract features with different rates from features of different depths and then fuse features containing the information. On this basis, the transposed non-local (TNL) module uses features with different rates to calculate the relationship matrix of each time slice and spatial position. Then TNL uses the transpose operation to make the original feature take advantage of the corresponding relationship when it multiplies the original feature by the matrix. The module further deeply integrates the relational information of behaviors at different rates to enhance features of different rates, respectively, which can improve the recognition effect of the model on dynamic behaviors. Our study has three contributions: (1) The TNL module takes features with different rates as inputs and uses a transpose mechanism to get two relationships in different directions that match their corresponding input. (2) Dual feature-rates deep fusion net (DF2-Net) transforms a single rate feature into two rate features and iteratively fuses them to obtain information and behavior relationships at various rates. (3) We collect the unique video dataset of birds' behaviors to fill in the blank of the video dataset and support the study of birds' behaviors. The experiments compare the DF2-Net with the well-known video-based recognition model on the self-collected birds' behavior dataset, containing eight behaviors. DF2-Net has the best classification accuracy and achieves 80.87%, 81.35%, 80.70%, and 81.35% in precision, recall, f1-score, and OA metrics. They are 1.81%, 2.43%, 2.20%, and 2.43% higher than the second-best approach (TPN) with 8 frames, respectively. The experimental results show that DF2-Net outperforms state-of-the-art methods in various frames, in which 16 frames are the most suitable for bird behavior recognition. Moreover, we execute the various ablation experiments to demonstrate the efficiency of the TNL module, the optimal location and internal operations of TNL, and the most suitable parameters of DF2-Net. The ablation experiments demonstrate that the TNL module dramatically improves the recognition accuracy of the dynamic behavior and proves the validity and rationality of the TNL module. Therefore, the proposed model is practical and feasible for automatically recognizing bird behavior.
引用
收藏
页数:14
相关论文
共 46 条
  • [31] What is play fighting and what is it good for?
    Pellis, Sergio M.
    Pellis, Vivien C.
    [J]. LEARNING & BEHAVIOR, 2017, 45 (04) : 355 - 366
  • [32] Machine vision to identify broiler breeder behavior
    Pereira, Danilo F.
    Miyamoto, Bruno C. B.
    Maia, Guilherme D. N.
    Sales, G. Tatiana
    Magalhaes, Marcelo M.
    Gates, Richard S.
    [J]. COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2013, 99 : 194 - 199
  • [33] Automatic counting of birds in a bird deterrence field trial
    Simons, Elizabeth S.
    Hinders, Mark K.
    [J]. ECOLOGY AND EVOLUTION, 2019, 9 (20): : 11878 - 11890
  • [34] Intermittent breeding is associated with breeding group turnover in a cooperatively breeding bird
    Smith, Maria G.
    Riehl, Christina
    [J]. OECOLOGIA, 2020, 192 (04) : 953 - 963
  • [35] Soomro K, 2012, Arxiv, DOI [arXiv:1212.0402, DOI 10.48550/ARXIV.1212.0402]
  • [36] AUDIO-VISUAL RECOGNITION OF GOOSE FLOCKING BEHAVIOR
    Steen, Kim Arild
    Therkildsen, Ole Roland
    Green, Ole
    Karstoft, Henrik
    [J]. INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2013, 27 (07)
  • [37] Tang Yi-ping, 2016, Journal of Chinese Computer Systems, V37, P841
  • [38] NEURAL MECHANISMS OF FORM AND MOTION PROCESSING IN THE PRIMATE VISUAL-SYSTEM
    VANESSEN, DC
    GALLANT, JL
    [J]. NEURON, 1994, 13 (01) : 1 - 10
  • [39] AN RFID-BASED AUTOMATED INDIVIDUAL PERCHING MONITORING SYSTEM FOR GROUP-HOUSED POULTRY
    Wang, K.
    Liu, K.
    Xin, H.
    Chai, L.
    Wang, Y.
    Fei, T.
    Oliveira, J.
    Pan, J.
    Ying, Y.
    [J]. TRANSACTIONS OF THE ASABE, 2019, 62 (03) : 695 - 704
  • [40] 靴隼雕在中国的分布及羽色多态现象
    王威
    杨小农
    关翔宇
    蒋志友
    王惠
    朱磊
    [J]. 动物学杂志, 2021, 56 (02) : 180 - 189