Nested object detection using mask R-CNN: application to bee and varroa detection

被引:0
作者
Kriouile, Yassine [1 ,2 ]
Ancourt, Corinne [1 ]
Wegrzyn-Wolska, Katarzyna [1 ,2 ]
Bougueroua, Lamine [2 ]
机构
[1] Centre de Recherche en Informatique, Mines Paris, PSL University, 35 rue Saint Honoré, Fontainebleau
[2] EFREI Research Lab, Paris-Pantheon-Assas University, Paris
关键词
Bee; Deep learning; Mask R-CNN; Nested objects; Neural network; Object detection; Object segmentation; Varroa;
D O I
10.1007/s00521-024-10393-x
中图分类号
学科分类号
摘要
In this paper, we address an essential problem related to object detection and image processing: detecting objects potentially nested in other ones. This problem exists particularly in the beekeeping sector: detecting varroa parasites on bees. Indeed, beekeepers must ensure the level of infestation of their apiaries by the varroa parasite which settles on the backs of bees. As far as we know, there is no yet a published approach to deal with nested object detection using only one neural network trained on two different datasets. We propose an approach that fills this gap. Therefore, we improve the accuracy and the efficiency of bee and varroa detection task. Our work is based on deep learning, more precisely Mask R-CNN neural network. Instead of segmenting detected objects (bees), we segment internal objects (varroas). We add a branch to Faster R-CNN to segment internal objects. We extract relevant features for internal object segmentation and suggest efficient method for training the neural network on two different datasets. Our experiments are based on a set of images of bee frames, containing annotated bees and varroa mites. Due to differences in occurrence rates, two different sets were created. After carrying out experiments, we ended up with a single neural network capable of detecting two nested objects without decreasing accuracy compared to two separate neural networks. Our approach, compared to traditional separate neural networks, improves varroa detection accuracy by 1.9%, reduces infestation level prediction error by 0.22%, and reduces execution time by 28% and model memory by 23%. In our approach, we extract Res4 (a layer of the ResNet neural network) features for varroa segmentation, which improves detection accuracy by 11% compared to standard FPN extraction. Thus, we suggest a new approach that detects nested objects more accurately than two separate network approaches. © The Author(s) 2024.
引用
收藏
页码:22587 / 22609
页数:22
相关论文
共 41 条
  • [1] Ren S., He K., Girshick R., Sun J., Faster R-CNN: Towards real-time object detection with region proposal networks, Advances in Neural Information Processing Systems, (2015)
  • [2] He K., Gkioxari G., Dollar P., Girshick R., Mask R-CNN, 2017 IEEE international conference on computer vision (ICCV), pp. 2980-2988, (2017)
  • [3] Sipos T., Donko T., Jocsak I., Keszthelyi S., Study of morphological features in pre-imaginal honey bee impaired by varroa destructor by means of computer tomography, Insects, 12, 8, (2021)
  • [4] Liu W., Anguelov D., Erhan D., Szegedy C., Reed S., Fu C.-Y., Berg A.C., SSD: Single shot multibox detector, Lecture Notes in Computer Science, (2016)
  • [5] Redmon J., Divvala S., Girshick R., Farhadi A., You only look once: unified, real-time object detection, IEEE Conference on Computer Vision and Pattern Recognition (CVPR), (2016)
  • [6] Lin T.-Y., Goyal P., Girshick R., He K., Dollar P., Focal loss for dense object detection, IEEE International Conference on Computer Vision (ICCV), (2017)
  • [7] Law H., Deng J., Cornernet: Detecting objects as paired keypoints, European Conference on Computer Vision (ECCV), (2018)
  • [8] Zhou X., Wang D., Krahenbuhl P., Objects as points, (2019)
  • [9] Girshick R., Donahue J., Darrell T., Malik J., Rich feature hierarchies for accurate object detection and semantic segmentation, (2014)
  • [10] Girshick R., Fast R-CNN, 2015 IEEE international conference on computer vision (ICCV), pp. 1440-1448, (2015)