Energy-Efficient DNN Training Processorson Micro-AI Systems

被引:5
作者
Han, Donghyeon [1 ]
Kang, Sanghoon [1 ]
Kim, Sangyeob [1 ]
Lee, Juhyoung [1 ]
Yoo, Hoi-Jun [1 ]
机构
[1] Korea Adv Inst Sci & Technol, Sch Elect Engn, Daejeon 34141, South Korea
来源
IEEE OPEN JOURNAL OF THE SOLID-STATE CIRCUITS SOCIETY | 2022年 / 2卷
关键词
Training; Program processors; Artificial intelligence; System-on-chip; Solid state circuits; Performance evaluation; Energy efficiency; Backpropagation (BP); backward unlocking (BU); bit-precision optimization; deep neural network (DNN) training; reading transposed weight; sparsity exploitation; FACE RECOGNITION PROCESSOR; LEARNING PROCESSOR; LOW-POWER; ACCELERATOR; TUTORIAL;
D O I
10.1109/OJSSCS.2022.3219034
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Many edge/mobile devices are now able to utilize deep neural networks (DNNs) thanks to the development of mobile DNN accelerators. Mobile DNN accelerators overcame the problems of limited computing resources and battery capacity by realizing energy-efficient inference. However, its passive behavior makes it difficult for DNN to provide active customization for individual users or its service environment. The importance of on-chip training is rising more and more to provide active interaction between DNN processors and ever-changing surroundings or conditions. Despite its advantages, the DNN training has more constraints than the inference such that it was considered impractical to be realized on mobile/edge devices. Recently, there are many trials to realize mobile DNN training, and a number of prior works will be summarized. First, it arranges the new challenges of the DNN accelerator induced by training functionality and discusses new hardware features related to the challenges. Second, it explains algorithm-hardware co-optimization methods and explains why it becomes mainstream in mobile DNN training research. Third, it compares the main differences between the conventional inference accelerators and recent training processors. Finally, the conclusion is made by proposing the future directions of the DNN training processor in micro-AI systems.
引用
收藏
页码:259 / 275
页数:17
相关论文
共 50 条
[21]   Towards Energy-Efficient and Secure Edge AI: A Cross-Layer Framework ICCAD Special Session Paper [J].
Shafique, Muhammad ;
Marchisio, Alberto ;
Putra, Rachmad Vidya Wicaksana ;
Hanif, Muhammad Abdullah .
2021 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER AIDED DESIGN (ICCAD), 2021,
[22]   A Global Energy-Efficient Framework for Edge AI in Industry and Railway [J].
Nicodeme, Claire .
2024 24TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS, ICCAS 2024, 2024, :1299-1304
[23]   A Converting Autoencoder Toward Low-latency and Energy-efficient DNN Inference at the Edge [J].
Mahmud, Hasanul ;
Kang, Peng ;
Desai, Kevin ;
Lama, Palden ;
Prasad, Sushil K. .
2024 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS, IPDPSW 2024, 2024, :592-599
[24]   NeuroBalancer: Balancing System Frequencies With Punctual Laziness for Timely and Energy-Efficient DNN Inferences [J].
Bin, Kyungmin ;
Kim, Seyeon ;
Ha, Sangtae ;
Chong, Song ;
Lee, Kyunghan .
IEEE TRANSACTIONS ON MOBILE COMPUTING, 2025, 24 (05) :4339-4354
[25]   Energy-efficient operational training in a ship bridge simulator [J].
Jensen, Signe ;
Lutzen, Marie ;
Mikkelsen, Lars Lindegaard ;
Rasmussen, Hanna Barbara ;
Pedersen, Poul Vibsig ;
Schamby, Per .
JOURNAL OF CLEANER PRODUCTION, 2018, 171 :175-183
[26]   A High-Performance and Energy-Efficient Photonic Architecture for Multi-DNN Acceleration [J].
Li, Yuan ;
Louri, Ahmed ;
Karanth, Avinash .
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2024, 35 (01) :46-58
[27]   Electron Emission Devices for Energy-Efficient Systems [J].
Nirantar, Shruti ;
Ahmed, Taimur ;
Bhaskaran, Madhu ;
Han, Jin-Woo ;
Walia, Sumeet ;
Sriram, Sharath .
ADVANCED INTELLIGENT SYSTEMS, 2019, 1 (04)
[28]   An Energy-Efficient Architecture for Internet of Things Systems [J].
De Rango, Floriano ;
Barletta, Domenico ;
Imbrogno, Alessandro .
UNMANNED SYSTEMS TECHNOLOGY XVIII, 2016, 9837
[29]   Energy-efficient deadline scheduling for heterogeneous systems [J].
Ma, Yan ;
Gong, Bin ;
Sugihara, Ryo ;
Gupta, Rajesh .
JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2012, 72 (12) :1725-1740
[30]   Tunable Energy-Efficient Approximate Circuits for Self-Powered AI and Autonomous Edge Computing Systems [J].
Garg, Shubham ;
Monga, Kanika ;
Chaturvedi, Nitin ;
Gurunarayanan, S. .
IEEE ACCESS, 2025, 13 :43607-43630