The NVIDIA GB200 Superchip is a powerhouse for AI and high-performance computing, but it generates significant heat. To fully unlock the potential of this superchip, an efficient and reliable cooling solution is essential. Custom liquid cooling plates offer a tailored approach to managing the thermal output of the GB200, ensuring optimal performance and longevity.
Why Custom Liquid Cooling for the GB200?
- Thermal Efficiency: Custom liquid cooling plates are designed to directly target hotspots on the GB200 Superchip, providing superior heat dissipation compared to traditional cooling methods.
- Maximized Performance: By maintaining optimal operating temperatures, these cooling solutions prevent thermal throttling, allowing the GB200 to consistently deliver peak performance.
- Reliability and Longevity: Effective cooling reduces the risk of overheating, thereby extending the lifespan of the expensive GB200 Superchip.
- Energy Efficiency: Compared to air-cooled systems, liquid cooling systems for the GB200 NVL72 can deliver 25 times the performance at the same power level.
- Space Optimization: Liquid cooling enables higher computational density, optimizing data center footprint. Systems can be controlled within a compact 1-2U range, improving space utilization.
Liquid Cooling Technologies for the GB200
Several liquid cooling technologies are available for the NVIDIA GB200, each with its own advantages:
- Direct-to-Chip, Waterless Liquid Cooling: ZutaCore’s HyperCool technology uses waterless cold plates placed directly on the superchip. A single monolithic cold plate can cool up to 2800 watts. This closed-loop system operates at low pressure, efficiently removing heat from the processor.
- Micro-Convection Liquid Cooling: JetCool’s single-phase liquid cooling solutions offer higher performance than microchannel approaches. Their micro-jet impingement cooling and advanced microfluidics are designed for complex heat flux distributions, managing over 1,500W per socket.
- Cold Plates: Cold plates can meet the cooling needs of the CPU and GPU on the GB200 board.
- Liquid-to-Air and Liquid-to-Liquid Cooling: Some vendors offer liquid-to-air or liquid-to-liquid cooling solutions as options for heat dissipation.
Considerations for Custom Liquid Cooling Plates
- Cooling Capacity: Ensure the cooling plate can handle the GB200’s thermal design power (TDP), which can be substantial.
- Material Compatibility: Choose materials compatible with the coolant used to prevent corrosion or other issues.
- Design and Integration: The cold plate should be designed for easy integration with the GB200 and the overall system. For example, ZutaCore’s monolithic cold plate design uses a simple input/output configuration to reduce potential failure points.
- Waterless vs. Water-Based: Consider the pros and cons of waterless versus traditional water-based liquid cooling systems, especially regarding leakage risks.
- Single-Phase vs. Two-Phase Cooling: Evaluate the performance and sustainability of single-phase versus two-phase cooling methods.
Vendors and Solutions
Several companies offer liquid cooling solutions for the NVIDIA GB200:
- ZutaCore: Known for its waterless, direct-to-chip HyperCool technology.
- JetCool: Offers micro-convection liquid cooling solutions with SmartPlate technology.
- Supermicro: Provides end-to-end liquid cooling solutions for GB200-based systems, including CDUs and custom cold plates.
- Boyd: Offers plug-and-play, fully liquid-cooled systems for the GB200 NVL72.
- ASUS: Provides AI POD solutions with options for liquid-to-air or liquid-to-liquid cooling.
- ToneCooling: Delivers high-density thermal performance liquid cooling solutions.
Conclusion
Custom liquid cooling plates are critical components for maximizing the performance and lifespan of the NVIDIA GB200 Superchip GPU. By carefully considering specific needs and selecting the appropriate technology, data centers and high-performance computing environments can fully harness the potential of this powerful processor.