Multi-Objective Neural Architecture Search for Efficient and Fast Semantic Segmentation on Edge

Cited by: 4

Authors:

Dou ZiWen [1]

Dong, Ye [1]

Affiliations:

[1] Harbin Inst Technol, Sch Instrumentat Sci & Engn, Harbin 150001, Peoples R China

Source:

IEEE TRANSACTIONS ON INTELLIGENT VEHICLES | 2024 / Volume 9 / Issue 01

Keywords:

Semantic segmentation; Hardware; Computer architecture; Real-time systems; Computational modeling; Semantics; Search problems; Neural architecture search; edge computing; real-time semantic segmentation; multi-objective NAS; REAL-TIME SEGMENTATION

DOI:

10.1109/TIV.2023.3332594

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

Deploying efficient and fast semantic segmentation networks on edge computing platforms in real-world environments is desirable but challenging. To address this challenge, we propose RealtimeSeg, one of the first semantic segmentation models found by neural architecture search (NAS) that is capable of running at real-time speed on edge devices. In our neural architecture search, we incorporate the inference time and FLOPs (floating-point operations) on the target edge devices, together with the semantic segmentation accuracy, as objectives. In this way, we construct a multi-objective neural architecture search. Specifically, the multi-objective NAS loss function is decomposed into three sub-objective loss functions, which are weighted and summed. We employ knowledge distillation during the search process to further improve the accuracy, latency, and FLOPs of the discovered network architecture. As a result, we obtain our RealtimeSeg model. Lastly, we use NVIDIA TensorRT to accelerate RealtimeSeg and deploy the accelerated RealtimeSeg on the target platform for real-time semantic segmentation. Using a single NVIDIA Titan XP GPU, RealtimeSeg can be obtained within 1.5 days. The experimental results demonstrate that RealtimeSeg achieves an accuracy of 71.7% mIoU while maintaining a frame rate of 25.25 FPS on the NVIDIA Jetson NX at an input resolution of 1024 x 2048. RealtimeSeg also has a low FLOPs count of 1.52 G, which is 17-18x lower than that of SOTA methods. In realistic scenarios, RealtimeSeg has been successfully deployed on edge computing platforms, achieving efficient and fast semantic segmentation results.
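
The weighted sum of three sub-objectives mentioned in the abstract can be illustrated with a minimal Python sketch. The abstract does not give the actual formulation, so everything below is an assumption made for illustration: the names `ObjectiveWeights` and `multi_objective_loss`, the weight values, the normalisation by reference budgets, and the candidate numbers are all hypothetical and are not taken from the paper.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ObjectiveWeights:
    """Hypothetical weights for the three sub-objectives (not from the paper)."""
    accuracy: float = 1.0
    latency: float = 0.1
    flops: float = 0.1


def multi_objective_loss(seg_loss, latency_ms, gflops,
                         weights=ObjectiveWeights(),
                         latency_ref_ms=40.0, gflops_ref=1.0):
    """Weighted sum of an accuracy term, a latency term, and a FLOPs term.

    latency_ref_ms and gflops_ref are assumed reference budgets used only to
    bring the three terms onto comparable scales.
    """
    latency_term = latency_ms / latency_ref_ms
    flops_term = gflops / gflops_ref
    return (weights.accuracy * seg_loss
            + weights.latency * latency_term
            + weights.flops * flops_term)


# Usage: rank made-up candidate architectures by the combined loss.
candidates = {
    "small": {"seg_loss": 0.60, "latency_ms": 20.0, "gflops": 0.8},
    "large": {"seg_loss": 0.45, "latency_ms": 120.0, "gflops": 25.0},
}
best = min(candidates, key=lambda name: multi_objective_loss(**candidates[name]))
```

With these made-up numbers, the larger candidate has the lower segmentation loss but is penalised heavily by the latency and FLOPs terms, so the smaller one is selected. Changing the weights shifts this trade-off between accuracy and efficiency.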


Pages: 1346-1357

Page count: 12
