Detection and localization for lake floating objects based on CA-faster R-CNN

被引:11
作者
Yi, Zeren [1 ,2 ]
Yao, Dongyi [1 ]
Li, Guojin [1 ]
Ai, Jiaoyan [1 ]
Xie, Wei [2 ]
机构
[1] Guangxi Univ, Sch Elect Engn, Nanning 530004, Peoples R China
[2] South China Univ Technol, Sch Automat Sci & Engn, Guangzhou 510641, Peoples R China
关键词
Detection; Localization; CA network; Faster R-CNN; Floating objects; NETWORKS;
D O I
10.1007/s11042-022-12686-6
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
As a general trend, unmanned ships have been gradually replacing humans and served as the cleaner of lakes. To work properly, those unmanned ships need to detect and localize lake floating objects that need to be collected. Compared to conventional image-based objects, lake floating objects are too small to detect. Meanwhile, because most conventional algorithms depend on bounding-boxes to detect the object, their results - it is hard to detect the accurate location of floating objects. To this end, this paper proposes a detection and localization algorithm based on CA-Faster R-CNN (Class Activation-Faster Regions with Convolutional Neural Network). Specifically, for an image with objects on it, the proposed algorithm detects and classifies objects with Faster R-CNN and localize objects with CA network. The experimental results show that, compared with the Faster R-CNN algorithm, this algorithm can reduce the positioning error without affecting the recognition accuracy, thereby can be used for the detection and localization of floating objects on the water surface. Compared with Faster R-CNN algorithm, the positioning accuracy of CA-Faster R-CNN algorithm is improved by 6.29 pixels. Also, the proposed algorithm remains a great potential for other objects that shared similar challenges with lake floating objects.
引用
收藏
页码:17263 / 17281
页数:19
相关论文
共 43 条
[1]  
Abadi M, 2016, PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P265
[2]  
Agrawal P, 2013, NATURE INSPIRED MOBILE ROBOTICS, P171
[3]   Evaluation of deep neural networks for traffic sign detection systems [J].
Arcos-Garcia, Alvaro ;
Alvarez-Garcia, Juan A. ;
Soria-Morillo, Luis M. .
NEUROCOMPUTING, 2018, 316 :332-344
[4]   Representation Learning: A Review and New Perspectives [J].
Bengio, Yoshua ;
Courville, Aaron ;
Vincent, Pascal .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (08) :1798-1828
[5]  
Dai JF, 2016, ADV NEUR IN, V29
[6]  
Deng L, 2019, ELECT TEST, P133, DOI [10.16520/j.cnki.1000-8519.2019.17.057, DOI 10.16520/J.CNKI.1000-8519.2019.17.057]
[7]   The PASCAL Visual Object Classes Challenge: A Retrospective [J].
Everingham, Mark ;
Eslami, S. M. Ali ;
Van Gool, Luc ;
Williams, Christopher K. I. ;
Winn, John ;
Zisserman, Andrew .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2015, 111 (01) :98-136
[8]  
Fang Jing, 2017, Transactions of Beijing Institute of Technology, V37, P1235, DOI 10.15918/j.tbit1001-0645.2017.12.005
[9]  
Girshick R., 2017, RICH FEATURE HIERARC, DOI [DOI 10.1109/CVPR.2014.81, 10.1109/cvpr.2014.81]
[10]  
He K., 2016, 2016 IEEE C COMP VIS, DOI [DOI 10.1109/CVPR.2016.90, 10.1109/CVPR.2016.90]