Multi-Objective Neural Architecture Search for Efficient and Fast Semantic Segmentation on Edge

被引:4
作者
Dou ZiWen [1 ]
Dong, Ye [1 ]
机构
[1] Harbin Inst Technol, Sch Instrumentat Sci & Engn, Harbin 150001, Peoples R China
来源
IEEE TRANSACTIONS ON INTELLIGENT VEHICLES | 2024年 / 9卷 / 01期
关键词
Semantic segmentation; Hardware; Computer architecture; Real-time systems; Computational modeling; Semantics; Search problems; Neural architecture search; edge computing; real-time semantic segmentation; multi-objective NAS; REAL-TIME SEGMENTATION;
D O I
10.1109/TIV.2023.3332594
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deploying efficient and fast semantic segmentation networks on edge computing platforms in real-world environments is desired and challenging. To address this challenge, we propose RealtimeSeg, one of the first semantic segmentation models to be searched by neural architecture search(NAS), capable of running at real-time speed on edge devices. In our neural architecture search, we incorporate the inference time and FLOPs (floating-point operations) of the target edge devices and the semantic segmentation accuracy as objectives. In this way, we construct a multi-objective neural architecture search. Specifically, the multi-objective NAS's loss function is decomposed into three sub-objective loss functions, which are weighted and summed. We employed knowledge distillation to further enhance the accuracy, latency, and FLOPs of the discovered network architecture during the search process. As a result, we successfully obtained our RealtimeSeg model. Lastly, we utilized NVIDIA TensorRT to accelerate RealtimeSeg and deployed the accelerated RealtimeSeg on the target platform for real-time semantic segmentation. Using a single NVIDIA Titan XP GPU, RealtimeSeg can be obtained within 1.5 days. The experimental results demonstrate that RealtimeSeg achieved an accuracy of 71.7 mIoU(%) while maintaining a frame rate of 25.25 FPS on the NVIDIA Jetson NX, using the input resolution of 1024 x 2048. And the RealtimeSeg has a lower FLOPs value of 1.52 G, which is 17-18x less than SOTA methods. In realistic scenarios, RealtimeSeg has been successfully deployed on edge computing platforms, achieving efficient and fast semantic segmentation results.
引用
收藏
页码:1346 / 1357
页数:12
相关论文
共 52 条
  • [1] Bo D., 2023, P AAAI C ART INT
  • [2] Chen LC, 2017, Arxiv, DOI arXiv:1706.05587
  • [3] RangeSeg: Range-Aware Real Time Segmentation of 3D LiDAR Point Clouds
    Chen, Tzu-Hsuan
    Chang, Tian Sheuan
    [J]. IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2022, 7 (01): : 93 - 101
  • [4] Chen W., 2020, ICLR
  • [5] The Cityscapes Dataset for Semantic Urban Scene Understanding
    Cordts, Marius
    Omran, Mohamed
    Ramos, Sebastian
    Rehfeld, Timo
    Enzweiler, Markus
    Benenson, Rodrigo
    Franke, Uwe
    Roth, Stefan
    Schiele, Bernt
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 3213 - 3223
  • [6] FBNetV3: Joint Architecture-Recipe Search using Predictor Pretraining
    Dai, Xiaoliang
    Wan, Alvin
    Zhang, Peizhao
    Wu, Bichen
    He, Zijian
    Wei, Zhen
    Chen, Kan
    Tian, Yuandong
    Yu, Matthew
    Vajda, Peter
    Gonzalez, Joseph E.
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 16271 - 16280
  • [7] ChamNet: Towards Efficient Network Design through Platform-Aware Model Adaptation
    Dai, Xiaoliang
    Zhang, Peizhao
    Wu, Bichen
    Yin, Hongxu
    Sun, Fei
    Wang, Yanghan
    Dukhan, Marat
    Hu, Yunqing
    Wu, Yiming
    Jia, Yangqing
    Vajda, Peter
    Uyttendaele, Matt
    Jha, Niraj K.
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 11390 - 11399
  • [8] HR-NAS: Searching Efficient High-Resolution Neural Architectures with Lightweight Transformers
    Ding, Mingyu
    Lian, Xiaochen
    Yang, Linjie
    Wang, Peng
    Jin, Xiaojie
    Lu, Zhiwu
    Luo, Ping
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 2981 - 2991
  • [9] Dosovitskiy A., 2021, P ICLR, P1
  • [10] A comprehensive survey of clustering algorithms: State-of-the-art machine learning applications, taxonomy, challenges, and future research prospects
    Ezugwu, Absalom E.
    Ikotun, Abiodun M.
    Oyelade, Olaide O.
    Abualigah, Laith
    Agushaka, Jeffery O.
    Eke, Christopher I.
    Akinyelu, Andronicus A.
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2022, 110