An Improved SSD-Like Deep Network-Based Object Detection Method for Indoor Scenes

被引：38

作者：

Ni, Jianjun ^{[1
]}

Shen, Kang ^{[1
]}

Chen, Yan ^{[1
]}

Yang, Simon X. ^{[2
]}

机构：

[1] Hohai Univ, Coll Internet Things Engn, Changzhou 213022, Jiangsu, Peoples R China

[2] Univ Guelph, Sch Engn, Adv Robot & Intelligent Syst ARIS Lab, Guelph, ON N1G 2W1, Canada

来源：

IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT | 2023年 / 72卷

基金：

中国国家自然科学基金;

关键词：

Object detection; Feature extraction; Robots; Deep learning; Task analysis; Lighting; Data mining; Deep network; indoor scene; object detection; ResNet50; network; single-shot multibox detector (SSD) algorithm; RECOGNITION; NAVIGATION;

D O I：

10.1109/TIM.2023.3244819

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

The indoor scene object detection technology is of important research significance, which is one of the popular research topics in the field of scene understanding for indoor robots. In recent years, the solutions based on deep learning have achieved good results in object detection. However, there are still some problems to be further studied in indoor object detection methods, such as lighting problem and occlusion problem caused by the complexity of the indoor environment. Aiming at these problems, an improved object detection method based on deep neural networks is proposed in this article, which uses a framework similar to the single-shot multibox detector (SSD). In the proposed method, an improved ResNet50 network is used to enhance the transmission of information, and the feature expression capability of the feature extraction network is improved. At the same time, a multiscale contextual information extraction (MCIE) module is used to extract the contextual information of the indoor scene, so as to improve the indoor object detection effect. In addition, an improved dual-threshold non-maximum suppression (DT-NMS) algorithm is used to alleviate the occlusion problem in indoor scenes. Finally, the public dataset SUN2012 is further screened for the special application of indoor scene object detection, and the proposed method is tested on this dataset. The experimental results show that the mean average precision (mAP) of the proposed method can reach 54.10%, which is higher than those of the state-of-the-art methods.

引用

页数：15

共 69 条

[1] Indoor objects detection and recognition for an ICT mobility assistance of visually impaired people
Afif, Mouna
Ayachi, Riadh
Pissaloux, Edwige
Said, Yahia
Atri, Mohamed
[J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (41-42) : 31645 - 31662
[2] Design and development of an indoor navigation system using denoising autoencoder based convolutional neural network for visually impaired people
Akilandeswari, J.
Jothi, G.
Naveenkumar, A.
Sabeenian, R. S.
Iyyanar, P.
Paramasivam, M. E.
[J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (03) : 3483 - 3514
[3] Alamri F, 2019, J IEEE I C DEVELOP L, P313, DOI [10.1109/DEVLRN.2019.8850686, 10.1109/devlrn.2019.8850686]
[4] Indoor Scene Recognition for Micro Aerial Vehicles Navigation using Enhanced-GIST Descriptors
Anbarasu, B.
Anitha, G.
[J]. DEFENCE SCIENCE JOURNAL, 2018, 68 (02) : 129 - 137
[5] High level visual scene classification using background knowledge of objects
Benrais, Lamine
Baha, Nadia
[J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (03) : 3663 - 3692
[6] Soft-NMS - Improving Object Detection With One Line of Code
Bodla, Navaneeth
Singh, Bharat
Chellappa, Rama
Davis, Larry S.
[J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 5562 - 5570
[7] A Survey of the Four Pillars for Small Object Detection: Multiscale Representation, Contextual Information, Super-Resolution, and Region Proposal
Chen, Guang
Wang, Haitao
Chen, Kai
Li, Zhijun
Song, Zida
Liu, Yinlong
Chen, Wenkai
Knoll, Alois
[J]. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2022, 52 (02): : 936 - 953
[8] DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs
Chen, Liang-Chieh
Papandreou, George
Kokkinos, Iasonas
Murphy, Kevin
Yuille, Alan L.
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) : 834 - 848
[9] Chen M, 2020, PROCEEDINGS OF 2020 IEEE 9TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS CONFERENCE (DDCLS'20), P772, DOI 10.1109/DDCLS49620.2020.9275060
[10] S3-Net: A Fast Scene Understanding Network by Single-Shot Segmentation for Autonomous Driving
Cheng, Yuan
Yang, Yuchao
Chen, Hai-Bao
Wong, Ngai
Yu, Hao
[J]. ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2021, 12 (05)

← 1 2 3 4 5 6 7 →