Software cooling approach enables efficient and cost-effective thermal management of multicore systems

被引:0
作者
Zhou, Kaihang [1 ]
Xuan, Yimin [1 ,2 ]
Hu, Dinghua [1 ]
Li, Qiang [1 ]
机构
[1] Nanjing Univ Sci & Technol, MIIT Key Lab Thermal Control Elect Equipment, Nanjing 210094, Peoples R China
[2] Nanjing Univ Aeronaut & Astronaut, Sch Energy & Power Engn, Nanjing 210016, Peoples R China
关键词
Software cooling approach; Ant colony optimization; Long short-term memory; Dynamic Voltage and Frequency Scaling; Resource allocation; TEMPERATURE; PERFORMANCE; DVFS;
D O I
10.1016/j.ijheatmasstransfer.2025.126937
中图分类号
O414.1 [热力学];
学科分类号
摘要
The relentless pursuit of high-performance electronic devices has driven semiconductor technology toward relentless miniaturization and integration. While this advancement enhances computational capabilities, it concurrently reduces chip heat capacities and diminishes thermal inertia. Traditional hardware-based thermal management strategies face inherent limitations, including temporal heat transfer mismatches, physical size constraints, and prohibitive economic costs. To address these challenges, this study proposes a software-driven thermal management approach that achieves cost-effective thermal regulation under constrained hardware package conditions. More importantly, it effectively mitigates temperature rises caused by transient thermal pulse-a capability lacking in traditional hardware cooling. Long short-term memory (LSTM) model, a type of recurrent neural network (RNN) has been successfully integrated into our framework to enable precise temperature prediction. The combination of LSTM and ant colony optimization (ACO) algorithm enables the scheduler to output the best allocation scheme. Results indicate that this approach achieves more than 6 degrees C decrease of mean peak temperature and 8% decrease of percentage of hotspots, while also reducing communication energy by 15% compared to existing software level thermal management technologies. External cooling resources (thermoelectric cooler) are incorporated into the task allocation algorithm for the first time. In the presence of local TEC, our approach performs best thermal performance. The feasibility of this approach under different workloads and platform sizes is also validated. Such software cooling approach provides valuable insights into the field of thermal management for electronic devices.
引用
收藏
页数:15
相关论文
共 54 条
[1]   Exploiting Die-to-Die Thermal Coupling in 3-D IC Placement [J].
Athikulwongse, Krit ;
Ekpanyapong, Mongkol ;
Lim, Sung Kyu .
IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2014, 22 (10) :2145-2155
[2]   Optimization of fins arrangements for the square light emitting diode (LED) cooling through nanofluid-filled microchannel [J].
Ben Hamida, Mohamed Bechir ;
Hatami, Mohammad .
SCIENTIFIC REPORTS, 2021, 11 (01)
[3]  
Bovet D., 2005, Understanding the Linux Kernel, V3rd, P258
[4]  
Chatterjee S, 2012, P IEEE SEMICOND THER, P14, DOI 10.1109/STHERM.2012.6188820
[5]  
Cheng WK, 2013, 2013 IEEE TENCON SPRING CONFERENCE, P95, DOI 10.1109/TENCONSpring.2013.6584424
[6]   Thermal-Constrained Task Allocation for Interconnect Energy Reduction in 3-D Homogeneous MPSoCs [J].
Cheng, Yuanqing ;
Zhang, Lei ;
Han, Yinhe ;
Li, Xiaowei .
IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2013, 21 (02) :239-249
[7]  
Chowdhury I, 2009, NAT NANOTECHNOL, V4, P235, DOI [10.1038/NNANO.2008.417, 10.1038/nnano.2008.417]
[8]   Flexible thermal interface based on self-assembled boron arsenide for high-performance thermal management [J].
Cui, Ying ;
Qin, Zihao ;
Wu, Huan ;
Li, Man ;
Hu, Yongjie .
NATURE COMMUNICATIONS, 2021, 12 (01)
[9]  
Dick RP, 1998, HARDW SOFTW CODES, P97, DOI 10.1109/HSC.1998.666245
[10]   Parametric study for optimizing double-layer microchannel heat sink for solar panel thermal management [J].
Elqady, Hesham, I ;
El-Shazly, A. H. ;
Elkady, M. F. .
SCIENTIFIC REPORTS, 2022, 12 (01)