An Improved SSD-Like Deep Network-Based Object Detection Method for Indoor Scenes

被引:38
作者
Ni, Jianjun [1 ]
Shen, Kang [1 ]
Chen, Yan [1 ]
Yang, Simon X. [2 ]
机构
[1] Hohai Univ, Coll Internet Things Engn, Changzhou 213022, Jiangsu, Peoples R China
[2] Univ Guelph, Sch Engn, Adv Robot & Intelligent Syst ARIS Lab, Guelph, ON N1G 2W1, Canada
基金
中国国家自然科学基金;
关键词
Object detection; Feature extraction; Robots; Deep learning; Task analysis; Lighting; Data mining; Deep network; indoor scene; object detection; ResNet50; network; single-shot multibox detector (SSD) algorithm; RECOGNITION; NAVIGATION;
D O I
10.1109/TIM.2023.3244819
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The indoor scene object detection technology is of important research significance, which is one of the popular research topics in the field of scene understanding for indoor robots. In recent years, the solutions based on deep learning have achieved good results in object detection. However, there are still some problems to be further studied in indoor object detection methods, such as lighting problem and occlusion problem caused by the complexity of the indoor environment. Aiming at these problems, an improved object detection method based on deep neural networks is proposed in this article, which uses a framework similar to the single-shot multibox detector (SSD). In the proposed method, an improved ResNet50 network is used to enhance the transmission of information, and the feature expression capability of the feature extraction network is improved. At the same time, a multiscale contextual information extraction (MCIE) module is used to extract the contextual information of the indoor scene, so as to improve the indoor object detection effect. In addition, an improved dual-threshold non-maximum suppression (DT-NMS) algorithm is used to alleviate the occlusion problem in indoor scenes. Finally, the public dataset SUN2012 is further screened for the special application of indoor scene object detection, and the proposed method is tested on this dataset. The experimental results show that the mean average precision (mAP) of the proposed method can reach 54.10%, which is higher than those of the state-of-the-art methods.
引用
收藏
页数:15
相关论文
共 69 条
  • [1] Indoor objects detection and recognition for an ICT mobility assistance of visually impaired people
    Afif, Mouna
    Ayachi, Riadh
    Pissaloux, Edwige
    Said, Yahia
    Atri, Mohamed
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (41-42) : 31645 - 31662
  • [2] Design and development of an indoor navigation system using denoising autoencoder based convolutional neural network for visually impaired people
    Akilandeswari, J.
    Jothi, G.
    Naveenkumar, A.
    Sabeenian, R. S.
    Iyyanar, P.
    Paramasivam, M. E.
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (03) : 3483 - 3514
  • [3] Alamri F, 2019, J IEEE I C DEVELOP L, P313, DOI [10.1109/DEVLRN.2019.8850686, 10.1109/devlrn.2019.8850686]
  • [4] Indoor Scene Recognition for Micro Aerial Vehicles Navigation using Enhanced-GIST Descriptors
    Anbarasu, B.
    Anitha, G.
    [J]. DEFENCE SCIENCE JOURNAL, 2018, 68 (02) : 129 - 137
  • [5] High level visual scene classification using background knowledge of objects
    Benrais, Lamine
    Baha, Nadia
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (03) : 3663 - 3692
  • [6] Soft-NMS - Improving Object Detection With One Line of Code
    Bodla, Navaneeth
    Singh, Bharat
    Chellappa, Rama
    Davis, Larry S.
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 5562 - 5570
  • [7] A Survey of the Four Pillars for Small Object Detection: Multiscale Representation, Contextual Information, Super-Resolution, and Region Proposal
    Chen, Guang
    Wang, Haitao
    Chen, Kai
    Li, Zhijun
    Song, Zida
    Liu, Yinlong
    Chen, Wenkai
    Knoll, Alois
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2022, 52 (02): : 936 - 953
  • [8] DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs
    Chen, Liang-Chieh
    Papandreou, George
    Kokkinos, Iasonas
    Murphy, Kevin
    Yuille, Alan L.
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) : 834 - 848
  • [9] Chen M, 2020, PROCEEDINGS OF 2020 IEEE 9TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS CONFERENCE (DDCLS'20), P772, DOI 10.1109/DDCLS49620.2020.9275060
  • [10] S3-Net: A Fast Scene Understanding Network by Single-Shot Segmentation for Autonomous Driving
    Cheng, Yuan
    Yang, Yuchao
    Chen, Hai-Bao
    Wong, Ngai
    Yu, Hao
    [J]. ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2021, 12 (05)