Object Detection Algorithm for Pigs Based on Dual Dilated Layer and Rotary Box Location

被引:0
作者
Geng Y. [1 ,2 ]
Lin Y. [1 ]
Fu Y. [3 ]
Yang S. [4 ]
机构
[1] School of Artificial Intelligence, Hebei University of Technology, Tianjin
[2] Engineering Research Center of Intelligent Rehabilitation Device and Detection Technology, Ministry of Education, Tianjin
[3] Hebei Provincial General Animal Husbandry Station, Shijiazhuang
[4] Tianjin Mojieke Technology Co., Ltd., Tianjin
来源
Nongye Jixie Xuebao/Transactions of the Chinese Society for Agricultural Machinery | 2023年 / 54卷 / 04期
关键词
dilated convolution; Gaussian Wasserstein distance; object detection; pigs; rotary box location;
D O I
10.6041/j.issn.1000-1298.2023.04.033
中图分类号
学科分类号
摘要
At present, the target detection algorithm based on horizontal box is applied to pig objection detection. The adhesion and mutual occlusion in the image of pigs bring great difficulty to individual pig detection. The image of pig has a large ratio of length to width and may rotate at any angle. Object detection algorithm for group pig images based on dual dilated layer and rotary box location network (DR-Net) was proposed. Images of pigs was collected in three pig farms. A dynamic clustering method based on histogram feature and singular value decomposition was used to extract the key frames of pig videos, Laplace operator was used to eliminate images with unclear targets. There were 9 600 images as the data set after data enhancement. The outline of the pig with rotary box was marked. Data set was divided into training set, verification set and test set according to 8: 1: 1. Dual dilated layer used the residual structure and combined two convolution with different dilation factors. The receptive field was increased exponentially with the increase of layers. Stacking dual dilated layers can obtain very large receptive field, it can help the model understand the global information of the image with fewer parameters. Every pig target was located in a rotary box and represented by five parameters. In training, regression loss calculation method based on Gaussian Wasserstein distance was used. The model can get prediction results more accurate. In DR-Net, the features of the input image was extracted by dual dilated layer. The CSP layer containing multi-layer Res2Net module, which was used to feature fusion and feature extraction of different scales. The prediction results were output through head network. The results showed that the precision, recall, mean average precision, MAE and RMSE of DR-Net were 98. 57%, 97. 27%, 96. 94%, 0. 21 and 0. 54, respectively. DR-Net was superior to YOLO v5 and YOLO v5 with rotary box location and pig target recognition accuracy was improved. By analyzing the visualization feature map, DR-Net can accurately locate the target using the head, neck, back or tail feature of pigs under occlusion and adhesion condition. The research can contribute to the construction of intelligent pig farm and provide reference for the subsequent research on pig behavior recognition. © 2023 Chinese Society of Agricultural Machinery. All rights reserved.
引用
收藏
页码:323 / 330
页数:7
相关论文
共 25 条
  • [1] HE K M, GKIOXARJ G, DOLLAR P, Et al., Mask R-CNN, IEEE Transactions on Pattern Analysis and Machine Intelligence, 42, 2, pp. 386-397, (2020)
  • [2] REDMONJ, FARHADI A., Yolov3: an incremental improvement, (2018)
  • [3] LIU W, ANGUELOY D, ERHAN D, Et al., SSD: single shot multiBox detector [C], European Conference on Computer Vision, pp. 21-37, (2016)
  • [4] RENS, HE K, GIRSHICK R, Et al., Faster R-CNN: towards real-time object detection with region proposal networks, Advances in Neural Information Processing Systems, pp. 91-99, (2015)
  • [5] LI Dan, ZHANG Kaifeng, LI Xingjian, Et al., Mounting behavior recognition for pigs based on Mask R-CNN [J], Transactions of the Chinese Society for Agricultural Machinery, 50, pp. 261-266, (2019)
  • [6] HU Yunge, CANG Yan, QIAO Yulong, Design of intelligent pig counting system based on improved instance segmentation algorithm, Transactions of the CSAE, 36, 19, pp. 177-183, (2020)
  • [7] GAO Yun, GUO Jiliang, LI Xuan, Et al., Instance-level segmentation method for group pig images based on deep learning[J], Transactions of the Chinese Society for Agricultural Machinery, 50, 4, pp. 179-187, (2019)
  • [8] MARTIN R, ACHIM K, FELIX A, Et al., Automatically detecting pig position and posture by 2D camera imaging and deep learning[J], Computers and Electronics in Agriculture, 174, (2020)
  • [9] LIU Yan, SUN Longqing, LUO Bing, Et al., Multi-target pigs detection algorithm based on improved CNN[J], Transactions of the Chinese Society for Agricultural Machinery, 50, pp. 283-289, (2019)
  • [10] GAO Yun, LI Jing, YU Mei, Et al., High-density pig counting net based on multi-scale aware, Transactions of the Chinese Society for Agricultural Machinery, 52, 9, pp. 172-178, (2021)