HARL: Hierarchical Adaptive Reinforcement Learning Based Auto Scheduler for Neural Networks

被引:2
作者
Zhang, Zining [1 ,2 ]
He, Bingsheng [1 ]
Zhang, Zhenjie [3 ]
机构
[1] Natl Univ Singapore, Sch Comp, Singapore, Singapore
[2] NUS, Ctr Trusted Internet & Community, Singapore, Singapore
[3] Neuron Mobil Pte Ltd, Singapore, Singapore
来源
51ST INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, ICPP 2022 | 2022年
关键词
neural network optimization; auto tuner; reinforcement learning;
D O I
10.1145/3545008.3545020
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
To efficiently perform inference with neural networks, the underlying tensor programs require sufficient tuning efforts before being deployed into production environments. Usually, enormous tensor program candidates need to be sufficiently explored to find the one with the best performance. This is necessary to make the neural network products meet the high demand of real-world applications such as natural language processing, auto-driving, etc. Auto-schedulers are being developed to avoid the need for human intervention. However, due to the gigantic search space and lack of intelligent search guidance, current auto-schedulers require hours to days of tuning time to find the best-performing tensor program for the entire neural network. In this paper, we propose HARL, a reinforcement learning (RL) based auto-scheduler specifically designed for efficient tensor program exploration. HARL uses a hierarchical RL architecture in which learning-based decisions are made at all different levels of search granularity. It also automatically adjusts exploration configurations in real-time for faster performance convergence. As a result, HARL improves the tensor operator performance by 22% and the search speed by 4.3x compared to the state-of-the-art auto-scheduler. Inference performance and search speed are also significantly improved on end-to-end neural networks.
引用
收藏
页数:13
相关论文
共 29 条
[21]  
Paszke A, 2019, ADV NEUR IN, V32
[22]   MobileNetV2: Inverted Residuals and Linear Bottlenecks [J].
Sandler, Mark ;
Howard, Andrew ;
Zhu, Menglong ;
Zhmoginov, Andrey ;
Chen, Liang-Chieh .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :4510-4520
[23]  
Schulman J, 2017, Arxiv, DOI arXiv:1707.06347
[24]  
Suriana P, 2017, INT SYM CODE GENER, P281, DOI 10.1109/CGO.2017.7863747
[25]  
Sutton RS, 2000, ADV NEUR IN, V12, P1057
[26]  
van Otterlo M, 2012, ADAPT LEARN OPTIM, V12, P3
[27]  
Vermorel J, 2005, LECT NOTES ARTIF INT, V3720, P437, DOI 10.1007/11564096_42
[28]  
Zheng LM, 2020, PROCEEDINGS OF THE 14TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION (OSDI '20), P863
[29]   FlexTensor: An Automatic Schedule Exploration and Optimization Framework for Tensor Computation on Heterogeneous System [J].
Zheng, Size ;
Liang, Yun ;
Wang, Shuo ;
Chen, Renze ;
Sheng, Kaiwen .
TWENTY-FIFTH INTERNATIONAL CONFERENCE ON ARCHITECTURAL SUPPORT FOR PROGRAMMING LANGUAGES AND OPERATING SYSTEMS (ASPLOS XXV), 2020, :859-873