IRDC-Net: Lightweight Semantic Segmentation Network Based on Monocular Camera for Mobile Robot Navigation

被引:12
作者
Dang, Thai-Viet [1 ]
Tran, Dinh-Manh-Cuong [1 ]
Tan, Phan Xuan [2 ]
机构
[1] Hanoi Univ Sci & Technol, Sch Mech Engn, Dept Mechatron, Hanoi 10000, Vietnam
[2] Shibaura Inst Technol, Grad Sch Engn & Sci, Toyosu,Koto Ku, Tokyo 1358548, Japan
关键词
computer vision; mobile robot; navigation; obstacle avoidance; semantic segmentation; MULTISCALE;
D O I
10.3390/s23156907
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Computer vision plays a significant role in mobile robot navigation due to the wealth of information extracted from digital images. Mobile robots localize and move to the intended destination based on the captured images. Due to the complexity of the environment, obstacle avoidance still requires a complex sensor system with a high computational efficiency requirement. This study offers a real-time solution to the problem of extracting corridor scenes from a single image using a lightweight semantic segmentation model integrating with the quantization technique to reduce the numerous training parameters and computational costs. The proposed model consists of an FCN as the decoder and MobilenetV2 as the decoder (with multi-scale fusion). This combination allows us to significantly minimize computation time while achieving high precision. Moreover, in this study, we also propose to use the Balance Cross-Entropy loss function to handle diverse datasets, especially those with class imbalances and to integrate a number of techniques, for example, the Adam optimizer and Gaussian filters, to enhance segmentation performance. The results demonstrate that our model can outperform baselines across different datasets. Moreover, when being applied to practical experiments with a real mobile robot, the proposed model's performance is still consistent, supporting the optimal path planning, allowing the mobile robot to efficiently and effectively avoid the obstacles.
引用
收藏
页数:22
相关论文
共 46 条
[1]   Augmented Reality Meets Computer Vision: Efficient Data Generation for Urban Driving Scenes [J].
Abu Alhaija, Hassan ;
Mustikovela, Siva Karthik ;
Mescheder, Lars ;
Geiger, Andreas ;
Rother, Carsten .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2018, 126 (09) :961-972
[2]   Deep learning algorithm for Gaussian noise removal from images [J].
Aghajarian, Mickael ;
McInroy, John E. ;
Muknahallipatna, Suresh .
JOURNAL OF ELECTRONIC IMAGING, 2020, 29 (04)
[3]  
Agus ME., 2022, JOIV Int J Inform Vis, V6, P660
[4]   Obstacles Avoidance for Mobile Robot Using Type-2 Fuzzy Logic Controller [J].
Al-Mallah, Mohammad ;
Ali, Mohammad ;
Al-Khawaldeh, Mustafa .
ROBOTICS, 2022, 11 (06)
[5]  
Alqaralleh B, 2020, Journal of Theoretical and Applied Information Technology, V98, P714
[6]  
[Anonymous], 2000, Multiple view geometry
[7]   SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation [J].
Badrinarayanan, Vijay ;
Kendall, Alex ;
Cipolla, Roberto .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) :2481-2495
[8]   The Cityscapes Dataset for Semantic Urban Scene Understanding [J].
Cordts, Marius ;
Omran, Mohamed ;
Ramos, Sebastian ;
Rehfeld, Timo ;
Enzweiler, Markus ;
Benenson, Rodrigo ;
Franke, Uwe ;
Roth, Stefan ;
Schiele, Bernt .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :3213-3223
[9]   Two-Stage Framework for Faster Semantic Segmentation [J].
Cruz, Ricardo ;
Teixeira e Silva, Diana ;
Goncalves, Tiago ;
Carneiro, Diogo ;
Cardoso, Jaime S. .
SENSORS, 2023, 23 (06)
[10]   Experimental Analysis of the Behavior of Mirror-like Objects in LiDAR-Based Robot Navigation [J].
Damodaran, Deeptha ;
Mozaffari, Saeed ;
Alirezaee, Shahpour ;
Ahamed, Mohammed Jalal .
APPLIED SCIENCES-BASEL, 2023, 13 (05)