Microscale Precision of 6DOF Localization Rectification of Low-End Stereo Vision Using Deep Learning

被引:0
作者
Farag, Ramy [1 ]
Saad, Mohamed S. [2 ]
Emara, H. [2 ]
Bahgat, A. [2 ]
机构
[1] Helwan Univ, Mechatron Dept, Cairo, Egypt
[2] Cairo Univ, Elect Power Dept, Giza, Egypt
来源
2020 IEEE 29TH INTERNATIONAL SYMPOSIUM ON INDUSTRIAL ELECTRONICS (ISIE) | 2020年
关键词
object detection; convolutional neural network; stereo vision; pose estimation; robot arm; deep learning;
D O I
10.1109/isie45063.2020.9152567
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
This paper proposes the deployment of a deep feedforward neural network (DFF) for the estimation of an object's 6D pose with respect to a robot arm. DFF is also deployed as a data rectification stage. The proposed technique is able to achieve microscale precision using customized low-end stereo vision system, given the 3D location of each of the object's four bounding corners. The 4 bounding corners' 3D locations are estimated from the subpixel localization accuracy of each. The paper also proposes a building block for convolutional neural networks for object detection, which is based on utilization of skipping connections that concatenates early feature maps to later layers, instead of adding them like in Residual network (ResNet). The proposed building block shows better results than the building block of ResNet in terms of object detection accuracy. For minimizing the use of memory resources, this paper is recommending the use of batch normalization layer at the beginning of convolutional neural network architectures, empirically, it dispenses the need for data normalization.
引用
收藏
页码:549 / 554
页数:6
相关论文
共 22 条
[1]  
Alahi A, 2012, PROC CVPR IEEE, P510, DOI 10.1109/CVPR.2012.6247715
[2]  
Aldo-ma Aitor, 2011, 2011 IEEE INT C
[3]  
[Anonymous], 2015, 3 INT C LEARN REPR
[4]  
[Anonymous], 2015, ADV NEURAL INF PROCE
[5]  
[Anonymous], 2008, COMPUT VIS IMAGE UND
[6]   SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation [J].
Badrinarayanan, Vijay ;
Kendall, Alex ;
Cipolla, Roberto .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) :2481-2495
[7]  
Chen Y., 2016, P INT C IM PROC COMP, P252, DOI 10.1109/CVPR.2006.95
[8]   The multi-modal interface of Robot-Era multi-robot services tailored for the elderly [J].
Di Nuovo, Alessandro ;
Broz, Frank ;
Wang, Ning ;
Belpaeme, Tony ;
Cangelosi, Angelo ;
Jones, Ray ;
Esposito, Raffaele ;
Cavallo, Filippo ;
Dario, Paolo .
INTELLIGENT SERVICE ROBOTICS, 2018, 11 (01) :109-126
[9]  
Farag RMA, 2018, IEEE IND ELEC, P2736, DOI 10.1109/IECON.2018.8592716
[10]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778