Efficient Intra-Rack Resource Disaggregation for HPC Using Co-Packaged DWDM Photonics

01/09/2023
by   George Michelogiannakis, et al.
0

The diversity of workload requirements and increasing hardware heterogeneity in emerging high performance computing (HPC) systems motivate resource disaggregation. Disaggregation allows compute and memory resources to be allocated individually as required to each workload. However, it is unclear how to realize these gains and cost-effectively meet the stringent bandwidth and latency requirements of HPC applications. To that end, we describe how modern photonic components can be co-designed with modern HPC racks to implement flexible intra-rack resource disaggregation and fully meet the bit error rate (BER) and high escape bandwidth of all chip types in modern HPC racks with negligible power overhead. Our photonic-based disaggregated rack provides an average application speedup of 11 benchmarks compared to a similar system that instead uses modern electronic switches for disaggregation. Using observed resource usage from a production system, we estimate that an iso-performance intra-rack disaggregated HPC system using photonics would require 4x fewer memory modules and 2x fewer NICs than a non-disaggregated baseline.

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset