An Improved Instance Segmentation Method for Complex Elements of Farm UAV Aerial Survey Images

被引:2
作者
Lv, Feixiang [1 ]
Zhang, Taihong [1 ,2 ,3 ]
Zhao, Yunjie [1 ,2 ,3 ]
Yao, Zhixin [1 ,2 ,3 ]
Cao, Xinyu [1 ]
机构
[1] Xinjiang Agr Univ, Sch Comp & Informat Engn, Urumqi 830052, Peoples R China
[2] Minist Educ, Engn Res Ctr Intelligent Agr, Urumqi 830052, Peoples R China
[3] Xinjiang Agr Informatizat Engn Technol Res Ctr, Urumqi 830052, Peoples R China
基金
国家重点研发计划;
关键词
image processing; instance segmentation; SparseInst; attention mechanism; farm aerial survey;
D O I
10.3390/s24185990
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Farm aerial survey layers can assist in unmanned farm operations, such as planning paths and early warnings. To address the inefficiencies and high costs associated with traditional layer construction, this study proposes a high-precision instance segmentation algorithm based on SparseInst. Considering the structural characteristics of farm elements, this study introduces a multi-scale attention module (MSA) that leverages the properties of atrous convolution to expand the sensory field. It enhances spatial and channel feature weights, effectively improving segmentation accuracy for large-scale and complex targets in the farm through three parallel dense connections. A bottom-up aggregation path is added to the feature pyramid fusion network, enhancing the model's ability to perceive complex targets such as mechanized trails in farms. Coordinate attention blocks (CAs) are incorporated into the neck to capture richer contextual semantic information, enhancing farm aerial imagery scene recognition accuracy. To assess the proposed method, we compare it against existing mainstream object segmentation models, including the Mask R-CNN, Cascade-Mask, SOLOv2, and Condinst algorithms. The experimental results show that the improved model proposed in this study can be adapted to segment various complex targets in farms. The accuracy of the improved SparseInst model greatly exceeds that of Mask R-CNN and Cascade-Mask and is 10.8 and 12.8 percentage points better than the average accuracy of SOLOv2 and Condinst, respectively, with the smallest number of model parameters. The results show that the model can be used for real-time segmentation of targets under complex farm conditions.
引用
收藏
页数:18
相关论文
共 42 条
[1]   YOLACT Real-time Instance Segmentation [J].
Bolya, Daniel ;
Zhou, Chong ;
Xiao, Fanyi ;
Lee, Yong Jae .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :9156-9165
[2]   Deep learning techniques to classify agricultural crops through UAV imagery: a review [J].
Bouguettaya, Abdelmalek ;
Zarzour, Hafed ;
Kechida, Ahmed ;
Taberkit, Amine Mohammed .
NEURAL COMPUTING & APPLICATIONS, 2022, 34 (12) :9511-9536
[3]   Tree Crown Delineation Algorithm Based on a Convolutional Neural Network [J].
Braga, Jose R. G. ;
Peripato, Vinicius ;
Dalagnol, Ricardo ;
Ferreira, Matheus P. ;
Tarabalka, Yuliya ;
Aragao, Luiz E. O. C. ;
de Campos Velho, Haroldo E. ;
Shiguemori, Elcio H. ;
Wagner, Fabien H. .
REMOTE SENSING, 2020, 12 (08)
[4]   SeNet: Structured Edge Network for Sea-Land Segmentation [J].
Cheng, Dongcai ;
Meng, Gaofeng ;
Cheng, Guangliang ;
Pan, Chunhong .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2017, 14 (02) :247-251
[5]  
Cheng T., 2022, P IEEE CVF C COMP VI
[6]   Recognition of Plastic Film in Terrain-Fragmented Areas Based on Drone Visible Light Images [J].
Du, Xiaoyi ;
Huang, Denghong ;
Dai, Li ;
Du, Xiandan .
AGRICULTURE-BASEL, 2024, 14 (05)
[7]   Forest Fire Segmentation from Aerial Imagery Data Using an Improved Instance Segmentation Model [J].
Guan, Zhihao ;
Miao, Xinyu ;
Mu, Yunjie ;
Sun, Quan ;
Ye, Qiaolin ;
Gao, Demin .
REMOTE SENSING, 2022, 14 (13)
[8]  
He KM, 2020, IEEE T PATTERN ANAL, V42, P386, DOI [10.1109/TPAMI.2018.2844175, 10.1109/ICCV.2017.322]
[9]   Momentum Contrast for Unsupervised Visual Representation Learning [J].
He, Kaiming ;
Fan, Haoqi ;
Wu, Yuxin ;
Xie, Saining ;
Girshick, Ross .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :9726-9735
[10]   Reducing the dimensionality of data with neural networks [J].
Hinton, G. E. ;
Salakhutdinov, R. R. .
SCIENCE, 2006, 313 (5786) :504-507