Optimization of Road Detection using Semantic Segmentation and Deep Learning in Self-Driving Cars

被引:0
作者
Hammoud, Mohammed Sameeh [1 ]
Lupin, Sergey [1 ]
机构
[1] National Research University of Electronic Technology, Russia
关键词
Advanced Driver Assistance Systems; Computer vision; Deep Learning; Image segmentation; Mobile Robots; Navigation; Road Detection; Self-Driving Cars;
D O I
10.33166/AETiC.2024.03.004
中图分类号
学科分类号
摘要
Robust and accurate road detection is an essential part of Automatic Driver Assistance Systems (ADAS). Self-driving Cars have the capability to revolutionize the way we travel, making transportation safer, more effective and more available to all. With the ability to navigate roads without human intervention, self-driving cars can reduce the number of accidents caused by human error, eliminate the demand for drivers to be behind the wheel and make it easier for people who can’t drive, such as the elderly or disabled people to get around. In addition, self-driving cars can enhance traffic flow by reducing congestion and optimizing routes, eventually saving time and reducing emissions. As technology continues to advance, self-driving cars are poised to transform the transportation industry and change the way we think about mobility. In this work, a convolutional neural network-based deep learning to achieve road detection based on image segmentation to be applied in self-driving cars. In addition to our proposed network, multiple experiments were conducted to investigate the impact of different deep-earning architectures on performance. A public dataset called the KITTI road dataset is used to train and validate the model. The images were down-sampled from 1224x370 to 256x256. We compared our model’s performance with the performance of popular deep learning architectures such as Unet and LinkNet A transfer learning technique is used while training the models based on network weights trained on the famous dataset ImageNet, including popular architectures such as ResNet, VGG, SeresNet and EfficientNet. The results show that our model achieves an F1-score of 0.9909, outperforming Unet and LinkNet architectures. In the second place, the best results were obtained based on Unet and ResNet50 with an F1-score of 0.9904. © 2024 by the author(s).
引用
收藏
页码:51 / 63
页数:12
相关论文
共 32 条
  • [21] Fremont Teodoro Vincent, Wolf Denis Fernando, Exploiting fully convolutional neural networks for fast road detection, Proceedings of the 2016 IEEE International Conference on Robotics and Automation (ICRA), pp. 3174-3179, (2016)
  • [22] Liang Xiao, Wang Ruili, Dai Bin, Fang Yuqiang, Liu Daxue, Et al., Hybrid conditional random field based camera-LIDAR fusion for road detection, Information Sciences, 432, 2, pp. 543-558, (2018)
  • [23] Ankit Laddha, Kemal Kocamaz Mehmet, Navarro-Serment Luis E, Hebert Martial, Map-supervised road detection, Proceedings of the 2016 IEEE Intelligent Vehicles Symposium (IV), pp. 118-123, (2016)
  • [24] Oliveira Gabriel L, Burgard Wolfram, Brox Thomas, Efficient deep models for monocular road segmentation, Proceedings of the 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp. 4885-4891, (2016)
  • [25] Noa Garnett, Silberstein Shai, Oron Shaul, Fetaya Ethan, Verner Uri, Et al., Real-time category-based and general obstacle detection for autonomous driving, Proceedings of the 2017 IEEE International Conference on Computer Vision Workshops (ICCVW), pp. 198-205, (2017)
  • [26] Zhe Chen, Chen Zijing, Rbnet: A deep neural network for unified road and road boundary detection, Proceedings of the Neural Information Processing: 24th International Conference, ICONIP 2017, pp. 677-687
  • [27] Luca Caltagirone, Bellone Mauro, Svensson Lennart, Wahde Mattias, LIDAR–camera fusion for road detection using fully convolutional neural networks, Robotics and Autonomous Systems, 111, pp. 125-131, (2019)
  • [28] Hengshuang Zhao, Shi Jianping, Qi Xiaojuan, Wang Xiaogang, Jia Jiaya, Pyramid scene parsing network, Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 6230-6239, (2017)
  • [29] Peter Kontschieder, Bulo Samuel Rota, Bischof Horst, Pelillo Marcello, Structured class-labels in random forests for semantic image labelling, Proceedings of the 2011 International Conference on Computer Vision, pp. 2190-2197, (2011)
  • [30] Xiaozhi Chen, Kundu Kaustav, Zhu Yukun, Berneshawi Andrew G, Ma Huimin, Et al., 3D object proposals for accurate object class detection, Advances in Neural Information Processing Systems, 40, 5, pp. 1259-1272, (2018)