Semantic segmentation of tea geometrid in natural scene images using discriminative pyramid network

被引:15
作者
Hu, Gensheng [1 ]
Li, Suqing [1 ]
Wan, Mingzhu [2 ]
Bao, Wenxia [1 ]
机构
[1] Anhui Univ, Natl Engn Res Ctr Agroecol Big Data Anal & Applic, Hefei, Peoples R China
[2] Fudan Univ, Sch Informat Sci & Technol, Shanghai, Peoples R China
关键词
Semantic segmentation; Tea pest; Discriminative pyramid network; Natural scene image; K-MEANS; LEAVES;
D O I
10.1016/j.asoc.2021.107984
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A discriminative pyramid (DP) network-based method is presented in this paper, which aims to perform semantic segmentation on tea geometrid in natural scene images. The method uses image flipping, translation, mirroring, random zooming, and other techniques for training sample augmentation and adopts local histogram equalization to reduce nonuniform light influence on segmentation. Moreover, this method constructs a DP network to improve segmentation accuracy. The DP network contains two sub networks: pyramid attention and border networks. The pyramid attention network can capture the global context information of targets at different scales, increase the receptive fields to focus on small targets, and solve the problems of large shape change, small size, and difficult detection of tea geometrids. The border network can increase the differences between tea geometrids and tea tree stalks, diseased leaves, and other types of backgrounds with similar appearances by extracting and supervising semantic boundaries to help the network learn additional discriminative features. Experiment results show that the proposed method has outstanding accuracy for semantic segmentation of tea geometrid in natural scene images. (C) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页数:17
相关论文
共 50 条
[1]   A dynamic histogram equalization for image contrast enhancement [J].
Abdullah-Al-Wadud, M. ;
Kabir, Md. Hasanul ;
Dewan, M. Ali Akber ;
Chae, Oksam .
IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2007, 53 (02) :593-600
[2]  
[Anonymous], 2015, ACS SYM SER
[3]   SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation [J].
Badrinarayanan, Vijay ;
Kendall, Alex ;
Cipolla, Roberto .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) :2481-2495
[4]   Synthetic bootstrapping of convolutional neural networks for semantic plant part segmentation [J].
Barth, R. ;
IJsselmuiden, J. ;
Hemming, J. ;
Van Henten, E. J. .
COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2019, 161 :291-304
[5]   The Lovasz-Softmax loss: A tractable surrogate for the optimization of the intersection-over-union measure in neural networks [J].
Berman, Maxim ;
Triki, Amal Rannen ;
Blaschko, Matthew B. .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :4413-4421
[6]   Color index based thresholding method for background and foreground segmentation of plant images [J].
Castillo-Martinez, Miguel A. ;
Gallegos-Funes, Francisco J. ;
Carvajal-Gamez, Blanca E. ;
Urriolagoitia-Sosa, Guillermo ;
Rosales-Silva, Alberto J. .
COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2020, 178
[7]   Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation [J].
Chen, Liang-Chieh ;
Zhu, Yukun ;
Papandreou, George ;
Schroff, Florian ;
Adam, Hartwig .
COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 :833-851
[8]  
Gonzalez R. C., 2017, Digital Image Processing, V4
[9]  
Goodfellow I, 2016, ADAPT COMPUT MACH LE, P1
[10]   Tree Leaves Extraction in Natural Images: Comparative Study of Preprocessing Tools and Segmentation Methods [J].
Grand-Brochier, Manuel ;
Vacavant, Antoine ;
Cerutti, Guillaume ;
Kurtz, Camille ;
Weber, Jonathan ;
Tougne, Laure .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2015, 24 (05) :1549-1560