Multi-Objective Neural Architecture Search for Efficient and Fast Semantic Segmentation on Edge

被引：4

作者：

Dou ZiWen ^{[1
]}

Dong, Ye ^{[1
]}

机构：

[1] Harbin Inst Technol, Sch Instrumentat Sci & Engn, Harbin 150001, Peoples R China

来源：

IEEE TRANSACTIONS ON INTELLIGENT VEHICLES | 2024年 / 9卷 / 01期

关键词：

Semantic segmentation; Hardware; Computer architecture; Real-time systems; Computational modeling; Semantics; Search problems; Neural architecture search; edge computing; real-time semantic segmentation; multi-objective NAS; REAL-TIME SEGMENTATION;

D O I：

10.1109/TIV.2023.3332594

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Deploying efficient and fast semantic segmentation networks on edge computing platforms in real-world environments is desired and challenging. To address this challenge, we propose RealtimeSeg, one of the first semantic segmentation models to be searched by neural architecture search(NAS), capable of running at real-time speed on edge devices. In our neural architecture search, we incorporate the inference time and FLOPs (floating-point operations) of the target edge devices and the semantic segmentation accuracy as objectives. In this way, we construct a multi-objective neural architecture search. Specifically, the multi-objective NAS's loss function is decomposed into three sub-objective loss functions, which are weighted and summed. We employed knowledge distillation to further enhance the accuracy, latency, and FLOPs of the discovered network architecture during the search process. As a result, we successfully obtained our RealtimeSeg model. Lastly, we utilized NVIDIA TensorRT to accelerate RealtimeSeg and deployed the accelerated RealtimeSeg on the target platform for real-time semantic segmentation. Using a single NVIDIA Titan XP GPU, RealtimeSeg can be obtained within 1.5 days. The experimental results demonstrate that RealtimeSeg achieved an accuracy of 71.7 mIoU(%) while maintaining a frame rate of 25.25 FPS on the NVIDIA Jetson NX, using the input resolution of 1024 x 2048. And the RealtimeSeg has a lower FLOPs value of 1.52 G, which is 17-18x less than SOTA methods. In realistic scenarios, RealtimeSeg has been successfully deployed on edge computing platforms, achieving efficient and fast semantic segmentation results.

引用

页码：1346 / 1357

页数：12

共 52 条

[51] Enhancing MOEA/D with information feedback models for large-scale many-objective optimization [J].

Zhang, Yin ;

Wang, Gai-Ge ;

Li, Keqin ;

Yeh, Wei-Chang ;

Jian, Muwei ;

Dong, Junyu .

INFORMATION SCIENCES, 2020, 522 :1-16

[52] Scene Parsing through ADE20K Dataset [J].

Zhou, Bolei ;

Zhao, Hang ;

Puig, Xavier ;

Fidler, Sanja ;

Barriuso, Adela ;

Torralba, Antonio .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :5122-5130

← 1 2 3 4 5 6 →