An unmanned aerial vehicle identification and tracking system based on weakly supervised semantic segmentation technology

被引:4
作者
Zhou, Tong [1 ]
Guo, Junxing [1 ]
Wu, Qilong [1 ]
Xu, Chuan [1 ]
机构
[1] Nanjing Univ Sci & Technol, Sch Mech Engn, Nanjing 210094, Peoples R China
关键词
Unmanned aerial vehicle; Weakly supervised; Pseudo pixel label; Region-based tracking method; Pan-tilt;
D O I
10.1016/j.phycom.2022.101758
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper presents an unmanned aerial vehicle (UAV) identification and tracking system aimed at monitoring UAVs based on weakly supervised semantic segmentation. A camera is equipped with a pan-tilt to collect images for semantic segmentation network in real time. The GrabCut+ algorithm and annotation boxes are employed to generate the UAV "pseudo pixel labels "for supervised model learning and reduce labelling costs. A new loss function combining the focus loss function and dice loss function is designed to balance positive and negative samples and improve the segmentation effect. The Mixup method is introduced for model training to prevent overfitting and enhance the generalization ability of the model. The semantic segmentation network outputs the prediction results by a fully connected conditional random field to smooth the target image. Furthermore, a region-based tracking method is proposed to solve the hysteresis problem of the pan-tilt control system and improve the system tracking performance. Finally, an experiment based on a dataset is carried out to prove the effectiveness of the segmentation algorithm with 66.3% mIoU. Considering that 10% of the central area of view is specified as the view centre, a UAV falling in the centre of the field accounts for more than 80% of this view area, demonstrating the real-time effectiveness of the designed UAV identification and tracking system. (C) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页数:12
相关论文
共 36 条
[1]  
[Anonymous], Image polygonal annotation with python
[2]  
[Anonymous], LABELIMG IS GRAPHICA
[3]   Deep Cross-Modal Audio-Visual Generation [J].
Chen, Lele ;
Srivastava, Sudhanshu ;
Duan, Zhiyao ;
Xu, Chenliang .
PROCEEDINGS OF THE THEMATIC WORKSHOPS OF ACM MULTIMEDIA 2017 (THEMATIC WORKSHOPS'17), 2017, :349-357
[4]   Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation [J].
Chen, Liang-Chieh ;
Zhu, Yukun ;
Papandreou, George ;
Schroff, Florian ;
Adam, Hartwig .
COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 :833-851
[5]   DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].
Chen, Liang-Chieh ;
Papandreou, George ;
Kokkinos, Iasonas ;
Murphy, Kevin ;
Yuille, Alan L. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848
[6]   The PASCAL Visual Object Classes Challenge: A Retrospective [J].
Everingham, Mark ;
Eslami, S. M. Ali ;
Van Gool, Luc ;
Williams, Christopher K. I. ;
Winn, John ;
Zisserman, Andrew .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2015, 111 (01) :98-136
[7]   IR radiation characteristics and operating range research for a quad-rotor unmanned aircraft vehicle [J].
Gong, Mali ;
Guo, Rui ;
He, Sifeng ;
Wang, Wei .
APPLIED OPTICS, 2016, 55 (31) :8757-8762
[8]   Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1026-1034
[9]  
He KM, 2017, IEEE I CONF COMP VIS, P2980, DOI [10.1109/TPAMI.2018.2844175, 10.1109/ICCV.2017.322]
[10]  
He KM, 2014, LECT NOTES COMPUT SC, V8691, P346, DOI [arXiv:1406.4729, 10.1007/978-3-319-10578-9_23]