Software cooling approach enables efficient and cost-effective thermal management of multicore systems

被引:0
作者
Zhou, Kaihang [1 ]
Xuan, Yimin [1 ,2 ]
Hu, Dinghua [1 ]
Li, Qiang [1 ]
机构
[1] Nanjing Univ Sci & Technol, MIIT Key Lab Thermal Control Elect Equipment, Nanjing 210094, Peoples R China
[2] Nanjing Univ Aeronaut & Astronaut, Sch Energy & Power Engn, Nanjing 210016, Peoples R China
关键词
Software cooling approach; Ant colony optimization; Long short-term memory; Dynamic Voltage and Frequency Scaling; Resource allocation; TEMPERATURE; PERFORMANCE; DVFS;
D O I
10.1016/j.ijheatmasstransfer.2025.126937
中图分类号
O414.1 [热力学];
学科分类号
摘要
The relentless pursuit of high-performance electronic devices has driven semiconductor technology toward relentless miniaturization and integration. While this advancement enhances computational capabilities, it concurrently reduces chip heat capacities and diminishes thermal inertia. Traditional hardware-based thermal management strategies face inherent limitations, including temporal heat transfer mismatches, physical size constraints, and prohibitive economic costs. To address these challenges, this study proposes a software-driven thermal management approach that achieves cost-effective thermal regulation under constrained hardware package conditions. More importantly, it effectively mitigates temperature rises caused by transient thermal pulse-a capability lacking in traditional hardware cooling. Long short-term memory (LSTM) model, a type of recurrent neural network (RNN) has been successfully integrated into our framework to enable precise temperature prediction. The combination of LSTM and ant colony optimization (ACO) algorithm enables the scheduler to output the best allocation scheme. Results indicate that this approach achieves more than 6 degrees C decrease of mean peak temperature and 8% decrease of percentage of hotspots, while also reducing communication energy by 15% compared to existing software level thermal management technologies. External cooling resources (thermoelectric cooler) are incorporated into the task allocation algorithm for the first time. In the presence of local TEC, our approach performs best thermal performance. The feasibility of this approach under different workloads and platform sizes is also validated. Such software cooling approach provides valuable insights into the field of thermal management for electronic devices.
引用
收藏
页数:15
相关论文
共 54 条
[51]   Low thermal contact resistance boron nitride nanosheets composites enabled by interfacial arc-like phonon bridge [J].
Zhan, Ke ;
Chen, Yucong ;
Xiong, Zhiyuan ;
Zhang, Yulun ;
Ding, Siyuan ;
Zhen, Fangzheng ;
Liu, Zhenshi ;
Wei, Qiang ;
Liu, Minsu ;
Sun, Bo ;
Cheng, Hui-Ming ;
Qiu, Ling .
NATURE COMMUNICATIONS, 2024, 15 (01)
[52]  
Zhang Yi-wen, 2015, Journal of Chinese Computer Systems, V36, P1907
[53]   Overview of 3-D Architecture Design Opportunities and Techniques [J].
Zhao, Jishen ;
Zou, Qiaosha ;
Xie, Yuan .
IEEE DESIGN & TEST, 2017, 34 (04) :60-68
[54]   3D-STAF:Scalable temperature and leakage aware floorplanning for three-dimensional integrated circuits [J].
Zhou, Pingqiang ;
Ma, Yuchun ;
Li, Zhouyuan ;
Dick, Robert P. ;
Shang, Li ;
Zhou, Hai ;
Hong, Xianlong ;
Zhou, Qiang .
IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER-AIDED DESIGN DIGEST OF TECHNICAL PAPERS, VOLS 1 AND 2, 2007, :590-+