List of Slurm Partitions
The compute nodes of Lise in Berlin (blogin.hlrn.de) and Emmy in Göttingen (glogin.hlrn.de) are organized via the following Slurm partitions:
...
Slurm partitions.
Partition name | Nodes | CPU | Main memory (GB) | Max. nodes per job | Max. jobs per user (running / queued) | Wall time limit (hh:mm:ss) | Remark |
---|---|---|---|---|---|---|---|
standard96 | 1204 | Cascade 9242 | 362 | 512 | 16 / 500 | 12:00:00 | default partition |
standard96:test | 32 dedicated +128 on demand | Cascade 9242 | 362 | 16 | 1 / 500 | 01:00:00 | test nodes with higher priority but lower wall time |
large96 | 28 | Cascade 9242 | 747 | 8 | 16 / 500 | 12:00:00 | fat memory nodes |
large96:test | 2 dedicated +2 on demand | Cascade 9242 | 747 | 2 | 1 / 500 | 01:00:00 | fat memory test nodes with higher priority but lower wall time |
large96:shared | 2 dedicated | Cascade 9242 | 747 | 1 | 16 / 500 | 48:00:00 | fat memory nodes for data pre- and post-processing |
huge96 | 2 | Cascade 9242 | 1522 | 1 | 16 / 500 | 24:00:00 | very fat memory nodes for data pre- and post-processing |
See Slurm usage for how to exceed the 12 h wall time limit by using job dependencies.
Fat-Tree Network of Lise
See OPA Fat Tree network of Lise
Emmy (Göttingen)
...
Partition (number holds cores per node)
...
...
Usable memory MB per node
...
CPU, GPU type
...
gcn#
...
...
2 dedicated
+6 on demand
...
747 000
...
1522 000
...
very fat memory nodes for data pre- and postprocessing
...
8 dedicated
+64 on demand
...
181 000
...
764 000
...
2 dedicated
+2 on demand
...
764 000
...
2 dedicated
+6 on demand
...
500 000 MB per node
(40GB HBM per GPU)
...
see GPU Usage
...
Skylake 6148 + 4 NVIDIA V100 32GB,
Zen3 EPYC 7513 + 4 NVIDIA A100 40GB,
and Zen2 EPYC 7662 + 8 NVIDIA A100 80GB
...
764 000 MB (32 GB per GPU)
or 500 000 MB (10 GB or 20 GB HBM per MIG slice)
...
Skylake 6148 + 4 NVIDIA V100 32GB,
Zen3 EPYC 7513 + 4 NVIDIA A100 40GB split into 2g.10gb and 3g.20gb slices
...
150 per GPU (V100)
or 47 per MIG slice (A100)
see GPU Usage
A100 GPUs are split into slices via MIG (3 slices per GPU)
...
764 000 MB (32 GB per GPU)
or 500 000 MB (10 GB or 20 GB HBM per MIG slice)
...
Skylake 6148 + 4 NVIDIA V100 32GB,
Zen3 EPYC 7513 + 4 NVIDIA A100 40GB split into 2g.10gb and 3g.20gb slices
...
150 per GPU (V100)
or 47 per MIG slice (A100)
* 600 for the nodes with 4 GPUs, and 1200 for the nodes with 8 GPUs
Which partition to choose?
If you do not request a partition, your job will be placed in the default partition, which is standard96.
The default partition is suitable for most calculations. The :test partitions are, as the name suggests, intended for shorter and smaller test runs. They have a higher priority and a few dedicated nodes, but are limited in wall time and number of nodes. Shared nodes are suitable for pre- and post-processing. A job running on a shared node is only accounted for its core fraction (cores of job / all cores per node). All non-shared nodes are exclusive to one job at a time, which implies that the full NPL for the node are charged.
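The core-fraction rule for shared nodes can be illustrated with a small calculation. The following is only a sketch: the function name `core_fraction` and the constant `CORES_PER_NODE` are hypothetical, and the value 96 assumes nodes with 2 CPUs of 48 cores each, as listed for the Cascade 9242. It computes the accounted fraction of a node, not the NPL amount, since the charge rates are documented under Accounting.

```python
from fractions import Fraction

# Assumption: 96 cores per node (2 CPUs x 48 cores, Cascade 9242).
CORES_PER_NODE = 96


def core_fraction(job_cores: int, shared: bool,
                  cores_per_node: int = CORES_PER_NODE) -> Fraction:
    """Return the fraction of one node that a single-node job is accounted for."""
    if not 1 <= job_cores <= cores_per_node:
        raise ValueError("job_cores must be between 1 and cores_per_node")
    if shared:
        # Shared node: cores of job / all cores per node.
        return Fraction(job_cores, cores_per_node)
    # Non-shared node: exclusive to the job, so the whole node counts.
    return Fraction(1)


if __name__ == "__main__":
    print(core_fraction(24, shared=True))   # 24 of 96 cores on a shared node
    print(core_fraction(24, shared=False))  # same job on an exclusive node
```

Under these assumptions, a 24-core job on a shared node is accounted for a quarter of the node, while the same job on a non-shared node is accounted for the whole node.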
Details about the CPU/GPU types can be found below.
The network topology is described here.
The available home/local-ssd/work/perm file systems are discussed in Storage under File Systems.
For an overview of all Slurm partitions and the status of their nodes: sinfo -r
For detailed information about a particular node: scontrol show node <nodename>
Charge rates
Charge rates for the Slurm partitions can be found under Accounting.
List of CPUs
...
Short name | Link to manufacturer specifications | Where to find | Units per node | Cores per unit | Clock speed (GHz) |
---|---|---|---|---|---|
Cascade 9242 | Intel Cascade Lake Platinum 9242 (CLX-AP) | Lise and Emmy compute partitions | 2 | 48 | 2.3 |
Cascade 4210 | Intel Cascade Lake Silver 4210 (CLX) | blogin[1-8], glogin[3-8] | 2 | 10 | 2.2 |
Skylake 6148 | Intel Skylake Gold 6148 | Emmy compute partitions | 2 | 20 | 2.4 |
Skylake 4110 | Intel Skylake Silver 4110 | glogin[1-2] | 2 | 8 | 2.1 |
Tesla V100 | NVIDIA Tesla V100 32GB | Emmy grete partitions | 4 | 640/5120* | |
Tesla A100 | NVIDIA Tesla A100 40GB and 80GB | Emmy grete partitions | 4 or 8 | 432/6912* | |
...