Publications
A case study and characterization of a many-socket, multi-tier numa hpc platform
Abstract
As the number of processor cores and sockets on HPC compute nodes increase and systems expose more hierarchical non-uniform memory access (NUMA) architectures, efficiently scaling applications within even a single shared memory system is becoming more challenging. It is now common for HPC compute nodes to have two or more sockets and dozens of cores, but future generation systems may contain an order of magnitude more of each. We conduct experiments on a state-of-the-art Intel Xeon Platinum system with 12 processor sockets, totaling 288 cores (576 hardware threads), arranged in a multi-tier NUMA hierarchy. Platforms of this scale and memory hierarchy are uncommon today, providing us a unique opportunity to empirically evaluate-rather than model or simulate-an architecture potentially representative of future HPC compute nodes. We quantify the platform's multi-tier NUMA patterns, then …
- Date
- November 12, 2020
- Authors
- Connor Imes, Steven Hofmeyr, Dong In D Kang, John Paul Walters
- Conference
- 2020 IEEE/ACM 6th Workshop on the LLVM Compiler Infrastructure in HPC (LLVM-HPC) and Workshop on Hierarchical Parallelism for Exascale Computing (HiPar)
- Pages
- 74-84
- Publisher
- IEEE