CHPC Technical Specifications

The CHPC cluster comprises 31 nodes with 1920 CPU cores and 5 GPUs. It is built from Dell PowerEdge R6525 and R7525 nodes and has a theoretical single-precision peak performance of about 170.69 TFLOPS.
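The headline figures can be cross-checked against the per-node tables in this document. The following is a minimal Python sketch, assuming the per-node values listed in the CPU and GPU tables; the grouping and variable names are illustrative and not part of any CHPC tooling.

```python
# Per-group figures copied from the node tables in this document.
# The grouping and variable names are illustrative only.
node_groups = [
    # (group, node count, cores per node, CPU TFLOPS per node)
    ("compute", 22, 64, 2.45),
    ("gpu", 5, 64, 2.45),
    ("highmem", 4, 48, 1.76),
]
gpu_count = 5           # one A100 per GPU node (Gres a100:1)
gpu_tflops_each = 19.5  # "GPU TFLOPS" column

total_nodes = sum(count for _, count, _, _ in node_groups)
total_cores = sum(count * cores for _, count, cores, _ in node_groups)
cpu_tflops = sum(count * tflops for _, count, _, tflops in node_groups)
gpu_tflops = gpu_count * gpu_tflops_each
total_tflops = round(cpu_tflops + gpu_tflops, 2)

print(total_nodes, total_cores, total_tflops)
```

With these inputs the totals come to 31 nodes, 1920 cores and 170.69 TFLOPS, of which 97.5 TFLOPS comes from the GPUs.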

Node CPU Specs

This table summarizes the CPU hardware specification for the nodes.

NodeName            CPUTot  ThreadsPerCore  CoresPerSocket  Sockets  Model name                       CPU MHz   CPU max MHz  Partition  CPU TFLOPS
chpc-compute-12-3   64      1               32              2        AMD EPYC 7532 32-Core Processor  2395.446  2395.446     main       2.45
chpc-compute-12-4   64      1               32              2        AMD EPYC 7532 32-Core Processor  2395.446  2395.446     main       2.45
chpc-compute-12-5   64      1               32              2        AMD EPYC 7532 32-Core Processor  2395.446  2395.446     main       2.45
chpc-compute-12-6   64      1               32              2        AMD EPYC 7532 32-Core Processor  2395.446  2395.446     main       2.45
chpc-compute-12-7   64      1               32              2        AMD EPYC 7532 32-Core Processor  2395.446  2395.446     main       2.45
chpc-compute-12-8   64      1               32              2        AMD EPYC 7532 32-Core Processor  2395.446  2395.446     main       2.45
chpc-compute-12-9   64      1               32              2        AMD EPYC 7532 32-Core Processor  2395.446  2395.446     main       2.45
chpc-compute-12-10  64      1               32              2        AMD EPYC 7532 32-Core Processor  2395.446  2395.446     main       2.45
chpc-compute-12-11  64      1               32              2        AMD EPYC 7532 32-Core Processor  2395.446  2395.446     main       2.45
chpc-compute-12-12  64      1               32              2        AMD EPYC 7532 32-Core Processor  2395.446  2395.446     main       2.45
chpc-compute-12-13  64      1               32              2        AMD EPYC 7532 32-Core Processor  2395.446  2395.446     main       2.45
chpc-compute-12-14  64      1               32              2        AMD EPYC 7532 32-Core Processor  2395.446  2395.446     main       2.45
chpc-compute-12-15  64      1               32              2        AMD EPYC 7532 32-Core Processor  2395.446  2395.446     main       2.45
chpc-compute-12-16  64      1               32              2        AMD EPYC 7532 32-Core Processor  2395.446  2395.446     main       2.45
chpc-compute-12-17  64      1               32              2        AMD EPYC 7532 32-Core Processor  2395.446  2395.446     main       2.45
chpc-compute-12-18  64      1               32              2        AMD EPYC 7532 32-Core Processor  2395.446  2395.446     main       2.45
chpc-compute-12-19  64      1               32              2        AMD EPYC 7532 32-Core Processor  2395.446  2395.446     main       2.45
chpc-compute-12-20  64      1               32              2        AMD EPYC 7532 32-Core Processor  2395.446  2395.446     main       2.45
chpc-compute-12-21  64      1               32              2        AMD EPYC 7532 32-Core Processor  2395.446  2395.446     main       2.45
chpc-compute-12-22  64      1               32              2        AMD EPYC 7532 32-Core Processor  2395.446  2395.446     main       2.45
chpc-compute-12-23  64      1               32              2        AMD EPYC 7532 32-Core Processor  2395.446  2395.446     main       2.45
chpc-compute-12-24  64      1               32              2        AMD EPYC 7532 32-Core Processor  2395.446  2395.446     main       2.45
chpc-gpu-10-0       64      1               32              2        AMD EPYC 7532 32-Core Processor  2395.454  2395.454     gpu        2.45
chpc-gpu-10-1       64      1               32              2        AMD EPYC 7532 32-Core Processor  2395.454  2395.454     gpu        2.45
chpc-gpu-10-2       64      1               32              2        AMD EPYC 7532 32-Core Processor  2395.454  2395.454     gpu        2.45
chpc-gpu-12-0       64      1               32              2        AMD EPYC 7532 32-Core Processor  2395.454  2395.454     gpu        2.45
chpc-gpu-12-1       64      1               32              2        AMD EPYC 7532 32-Core Processor  2395.454  2395.454     gpu        2.45
chpc-highmem-10-0   48      1               24              2        AMD EPYC 7352 24-Core Processor  2295.496  2295.496     highmem    1.76
chpc-highmem-10-1   48      1               24              2        AMD EPYC 7352 24-Core Processor  2295.496  2295.496     highmem    1.76
chpc-highmem-12-0   48      1               24              2        AMD EPYC 7352 24-Core Processor  2295.496  2295.496     highmem    1.76
chpc-highmem-12-1   48      1               24              2        AMD EPYC 7352 24-Core Processor  2295.496  2295.496     highmem    1.76
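
The CPU TFLOPS column is consistent with the usual peak estimate of cores x clock x floating-point operations per cycle. The sketch below assumes 16 operations per core per cycle, a value inferred from the table rather than stated in the source; the function name is illustrative.

```python
def peak_cpu_tflops(cores: int, clock_mhz: float, flops_per_cycle: int = 16) -> float:
    """Theoretical peak in TFLOPS: cores x clock (Hz) x FLOPs per cycle.

    flops_per_cycle=16 is an assumption inferred from the table values.
    """
    flops = cores * clock_mhz * 1e6 * flops_per_cycle
    return flops / 1e12


print(round(peak_cpu_tflops(64, 2395.446), 2))  # EPYC 7532 nodes (CPUTot 64)
print(round(peak_cpu_tflops(48, 2295.496), 2))  # EPYC 7352 nodes (CPUTot 48)
```

Under that assumption the two calls reproduce the 2.45 and 1.76 TFLOPS entries in the table.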

Node GPU Specs

This table summarizes the GPU hardware specification for the GPU nodes.

NodeName       Partition  Gres    GPU TFLOPS  GPU Deep Learning TFLOPS  GPU Memory (GB)  GPU CUDA Cores  GPU Tensor Cores  GPU Half Precision TFLOPS
chpc-gpu-10-0  gpu        a100:1  19.5        156.0                     40.0             6912.0          432.0             312.0
chpc-gpu-10-1  gpu        a100:1  19.5        156.0                     40.0             6912.0          432.0             312.0
chpc-gpu-10-2  gpu        a100:1  19.5        156.0                     40.0             6912.0          432.0             312.0
chpc-gpu-12-0  gpu        a100:1  19.5        156.0                     40.0             6912.0          432.0             312.0
chpc-gpu-12-1  gpu        a100:1  19.5        156.0                     40.0             6912.0          432.0             312.0

Summary Plots

These plots summarize the compute resources of the CHPC cluster.

Node vs. CPU TFLOPS

This plot shows the theoretical CPU compute performance of each node.

Node vs. GPU

This plot shows the theoretical GPU compute performance of each node.

CPU Composition

This plot shows the CPU composition of each node.


QoS Specifications

The “main” QoS allows jobs to run for up to 5 days, with a cap of 1000 jobs or 1000 cores per user. The “long” QoS allows up to 14 days, but only 100 jobs or 100 cores per user. The “highmem” QoS allows up to one week, with 100 jobs, 100 cores, or up to 6000 GB of RAM per user.

The “gpu” QoS gives access to NVIDIA A100 Tensor Core GPUs for up to 2 days, limited to 3 jobs, 3 GPUs, or 192 cores. Each GPU delivers 9.7 TFLOPS of double-precision performance. In machine learning workloads, particularly neural networks with sparsity enabled, each GPU can reach a tensor performance of up to 780 TFLOPS.
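
The day-based limits above map directly onto wall-time requests. The sketch below assumes the cluster is scheduled with Slurm (as the QoS, partition and Gres terminology suggests) and formats each limit in Slurm's days-hours:minutes:seconds form; the helper name and dictionary are hypothetical.

```python
def slurm_time(days: int = 0, hours: int = 0, minutes: int = 0) -> str:
    """Format a wall-time limit as a days-hours:minutes:seconds string."""
    total_minutes = (days * 24 + hours) * 60 + minutes
    d, rem = divmod(total_minutes, 24 * 60)
    h, m = divmod(rem, 60)
    return f"{d}-{h:02d}:{m:02d}:00"


# Maximum run times quoted in this section, in days.
max_days = {"main": 5, "long": 14, "highmem": 7, "gpu": 2}

for qos, days in max_days.items():
    # Prints the QoS name, the limit in hours, and the formatted time string.
    print(qos, days * 24, "hours", slurm_time(days=days))
```

For example, the 5-day limit of the “main” QoS corresponds to 120 hours, or 5-00:00:00.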

QoS (rows) / Partition (columns) Limits on CHPC

Partitions: Main, Highmem, GPU

Main
    120 hours / job
    1600 cores / group
    1000 cores / job
    1000 cores / user
    1000 jobs / user

Long
    336 hours / job
    500 cores / group
    300 cores / job
    300 cores / user
    300 jobs / user

Debug (two partition columns, identical limits)
    15 minutes / job
    128 cores / group
    32 cores / job
    4 jobs / user

Highmem
    168 hours / job
    192 cores / group
    100 cores / user
    100 jobs / user
    8 TB memory / group
    6 TB memory / user

GPU
    48 hours / job
    320 cores / group
    192 cores / job
    5 GPUs / group
    3 GPUs / user
    3 jobs / user
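
A single job request can be checked against the per-job and per-user rows of this table before submission. The sketch below encodes a subset of the limits listed above; the data structure, function name and request fields are hypothetical and do not correspond to any CHPC or scheduler API.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class QosLimit:
    max_hours: float
    max_cores_per_job: Optional[int] = None   # None: no such limit listed
    max_cores_per_user: Optional[int] = None
    max_gpus_per_user: Optional[int] = None


# Subset of the limits from the table above.
LIMITS = {
    "main": QosLimit(120, max_cores_per_job=1000, max_cores_per_user=1000),
    "long": QosLimit(336, max_cores_per_job=300, max_cores_per_user=300),
    "debug": QosLimit(0.25, max_cores_per_job=32),
    "highmem": QosLimit(168, max_cores_per_user=100),
    "gpu": QosLimit(48, max_cores_per_job=192, max_gpus_per_user=3),
}


def check_request(qos: str, hours: float, cores: int, gpus: int = 0) -> List[str]:
    """Return the limits a single job would exceed (empty list if none)."""
    limit = LIMITS[qos]
    problems = []
    if hours > limit.max_hours:
        problems.append(f"{hours} h exceeds {limit.max_hours} h / job")
    if limit.max_cores_per_job is not None and cores > limit.max_cores_per_job:
        problems.append(f"{cores} cores exceeds {limit.max_cores_per_job} cores / job")
    if limit.max_cores_per_user is not None and cores > limit.max_cores_per_user:
        problems.append(f"{cores} cores exceeds {limit.max_cores_per_user} cores / user")
    if limit.max_gpus_per_user is not None and gpus > limit.max_gpus_per_user:
        problems.append(f"{gpus} GPUs exceeds {limit.max_gpus_per_user} GPUs / user")
    return problems


print(check_request("main", hours=100, cores=512))         # within limits
print(check_request("gpu", hours=72, cores=64, gpus=4))    # too long, too many GPUs
```

The check looks at one job in isolation. Per-user and per-group limits also depend on what else is already running, which this sketch does not track.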