An Evaluation of RetinaNet on Indoor Object Detection for Blind and Visually Impaired Persons Assistance Navigation

Times cited: 57
Authors
Afif, Mouna [1]
Ayachi, Riadh [1]
Said, Yahia [1, 2]
Pissaloux, Edwige [3, 4]
Atri, Mohamed [5]
Affiliations
[1] Univ Monastir, Fac Sci Monastir, Lab Elect & Microelect EME, Monastir, Tunisia
[2] Northern Border Univ, Coll Engn, Elect Engn Dept, Ar Ar, Saudi Arabia
[3] Univ Rouen Normandy, LITIS Lab, Rouen, France
[4] Univ Rouen Normandy, CNRS FR 3638, Rouen, France
[5] King Khalid Univ, Coll Comp Sci, Abha, Saudi Arabia
Keywords
Indoor object recognition; Visually impaired people (VIP); Deep convolutional neural network (DCNN); Deep learning; Indoor object detection and recognition dataset (IODR); RECOGNITION;
DOI
10.1007/s11063-020-10197-9
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Indoor object detection is a computer vision task concerned with detecting specific classes of indoor objects. It has attracted considerable attention in recent years, both because of its importance for indoor navigation assistance for visually impaired people and because of the rapid development of deep convolutional neural networks (deep CNNs). In this paper, we develop a new indoor object detector built on the deep convolutional neural network framework "RetinaNet". We evaluate it with several backbones, such as ResNet, DenseNet, and VGGNet, in order to improve detection performance and processing time. The results are very encouraging, reaching a detection precision of up to 84.61% mean average precision (mAP).
Pages: 2265-2279
Number of pages: 15