CPU Overclocking: A Performance Assessment of Air, Cold Plates, and Two-Phase Immersion Cooling

被引:41
作者
Ramakrishnan, Bharath [1 ]
Alissa, Husam [1 ]
Manousakis, Ioannis [2 ]
Lankston, Robert [2 ]
Bianchini, Ricardo [3 ]
Kim, Washington [1 ]
Baca, Rich [2 ]
Misra, Pulkit A. [3 ]
Goiri, Inigo [3 ]
Jalili, Majid [3 ]
Raniwala, Ashish [2 ]
Warrier, Brijesh [2 ]
Monroe, Mark [1 ]
Belady, Christian [1 ]
Shaw, Mark [2 ]
Fontoura, Marcus [2 ]
机构
[1] Microsoft CO I, Redmond, WA 98052 USA
[2] Microsoft Azure, Redmond, WA 98052 USA
[3] Microsoft Res, Redmond, WA 98052 USA
来源
IEEE TRANSACTIONS ON COMPONENTS PACKAGING AND MANUFACTURING TECHNOLOGY | 2021年 / 11卷 / 10期
关键词
Clocks; Heating systems; Timing; Cooling; Fluids; Heat transfer; Dielectrics; Air cooling; cold plates; data center thermal management; high-performance computing (HPC); overclocking; two-phase (2P) immersion; THERMAL MANAGEMENT;
D O I
10.1109/TCPMT.2021.3106026
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Computing capacity has always been on the upward climb due to the constant technological improvements in semiconductor manufacturing and packaging industry. This growth in computing capability is usually accompanied by a steep rise in heat flux density associated with the electronic component (CPUs or GPUs, for example). High-performance computing (HPC) data centers often employ several of these high-performance devices for crunching their enormous artificial intelligence (AI) or scientific computing workloads. State-of-the-art air cooling technologies throttle after a certain heat flux level and would require bigger heat sinks driving enormous airflow through it, which may not be desirable from a data center operation standpoint. Hence, a lookout for advanced thermal management techniques is quite imperative. In this article, a commercially available intel overclockable CPU i9-9900k was exercised with high performance workloads while employing and evaluating three different cooling technologies namely air-cooled, cold plates, and two-phase (2P) immersion. The high heat carrying capacity associated with cold plates and 2P immersion techniques outperformed the air-cooled solution by constantly yielding higher CPU clock rates up to 41% utilizing cold plates and 51% employing 2P immersion. The performance of cold plates and 2P immersion was evaluated at different functional points (different coolant operating temperatures and 2P coolant types for example) to understand how the technologies would respond in a legacy or modern data center operation. It was again observed that both 2P immersion technology and cold plates provided the least thermal resistance (as low as 0.247 degrees C/W) path to heat transfer and therefore provided higher computational performance and efficiency compared to air cooling. Further improvement in performance is not limited by cooling but by packaging constraints. Advanced packaging techniques coupled with minimizing the interfacial resistance can reap enhanced performance and energy efficiency gains using these modes of cooling. This article demonstrates the potential increase in virtual machine (VM) performance that can be attained using cold plates or 2P immersion. Authors recommend adoption of liquid cooling in early chip design and architecture phases. Finally, the article provides a generic overclocking methodology that the industry can use to qualify postturbo server class chip performance.
引用
收藏
页码:1703 / 1715
页数:13
相关论文
共 15 条
[1]  
[Anonymous], 2019, ASHRAE WATER COOLED
[2]  
[Anonymous], OVERCLOCK YOUR CPU U
[3]  
[Anonymous], 2016, ASHRAE AIR COOLED SE
[4]  
[Anonymous], 1985, SMALLER FASTER CRAY
[5]  
[Anonymous], 2017, JAPANESE SUPERCOMPUT
[6]  
[Anonymous], 2009, US SUP OAK RIDG JAG
[7]   Gen 3 "Embedded" Cooling: Key Enabler for Energy Efficient Data Centers [J].
Bar-Cohen, Avram .
IEEE TRANSACTIONS ON COMPONENTS PACKAGING AND MANUFACTURING TECHNOLOGY, 2017, 7 (08) :1206-1211
[8]  
Bostanci H, 2020, INTSOC CONF THERMAL, P450, DOI 10.1109/ITherm45881.2020.9190494
[9]   An Overview of the IBM Power 775 Supercomputer Water Cooling System [J].
Ellsworth, Michael J. ;
Goth, Gary F. ;
Zoodsma, Randy J. ;
Arvelo, Amilcar ;
Campbell, Levi A. ;
Anderl, William J. .
JOURNAL OF ELECTRONIC PACKAGING, 2012, 134 (02)
[10]  
Fuller SH, 2011, FUTURE OF COMPUTING PERFORMANCE: GAME OVER OR NEXT LEVEL?, P1