Versionen im Vergleich

Schlüssel

  • Diese Zeile wurde hinzugefügt.
  • Diese Zeile wurde entfernt.
  • Formatierung wurde geändert.
Kommentar: Updated grete partitions for move of gpu[01-03]

The compute nodes of Lise in Berlin (blogin.hlrn.de) and Emmy in Göttingen (glogin.hlrn.de) are organized via the following SLURM partitions:

...

Partition (number holds cores per node)

Node nameMax. walltimeNodesMax. nodes
per job

Max jobs (running/ queued)
per user

Usable memory MB per node

CPU

Shared

Charged core-hours per node

Remark
standard96bcn#12:00:001204512

16 / 500

362 000Cascade 924296default partition











standard96:testbcn#1:00:0032 dedicated
+128 on demand
161 / 500362 000Cascade 924296test nodes with higher priority but lower walltime
large96bfn#12:00:0028816 / 500747 000Cascade 9242144fat memory nodes
large96:testbfn#1:00:002 dedicated
+2 on demand
21 / 500747 000Cascade 9242144fat memory test nodes with higher priority but lower walltime
large96:sharedbfn#48:00:002 dedicated116 / 500

747 000

Cascade 9242144fat memory nodes for data pre- and postprocessing
huge96bsn#24:00:002116 / 500

1522 000

Cascade 9242192

very fat memory nodes for data pre- and postprocessing

...

see GPU Usage

Partition (number holds cores per node)

Node name

Max. walltime

NodesMax. nodes
per job
Max. jobs
per user

Usable memory MB per node

CPU, GPU type

SharedNPL per node hourRemark
standard96

gcn#

12:00:00996256unlimited362 000Cascade 924296default partition
standard96:testgcn#1:00:008 dedicated
+128 on demand
16unlimited362 000Cascade 924296test nodes with higher priority but lower walltime
large96gfn#12:00:00122unlimited747 000Cascade 9242144fat memory nodes
large96:testgfn#1:00:002 dedicated
+2 on demand
2unlimited747 000Cascade 9242144fat memory test nodes with higher priority but lower walltime
large96:sharedgfn#48:00:00

2 dedicated

+6 on demand

1unlimited

747 000

Cascade 9242144fat memory nodes for data pre- and postprocessing
huge96gsn#24:00:0021unlimited

1522 000

Cascade 9242192

very fat memory nodes for data pre- and postprocessing












medium40gcn#48:00:00424128unlimited181 000Skylake  614840
medium40:testgcn#1:00:00

8 dedicated

+64 on demand

8unlimited

181 000

Skylake  614840test nodes with higher priority but lower walltime
large40gfn#48:00:00124unlimited

764 000

Skylake  614880fat memory nodes
large40:testgfn#1:00:00

2 dedicated

+2 on demand

2unlimited

764 000

Skylake  614880fat memory test nodes with higher priority but lower walltime
large40:sharedgfn#48:00:00

2 dedicated

+6 on demand

1unlimited764 000Skylake  614880fat memory nodes for data pre- and postprocessinggpuggpu#48:00:0032unlimited764 000 MB per node

(32GB HBM per GPU)












Skylake  6148 + 4 Nvidia V100 32GB375grete
ggpu#48:00:00338unlimited

500 000 MB per node

(40GB HBM per GPU)

Zen3 EPYC 7513 + 4 NVidia A100 40GB
600



see GPU Usage

grete:shared
ggpu#48:00:0035381unlimited500 000 MB and , 764 000 MB, or 1 000 000 MB per node
(32 GB, 40GB, or 80GB HBM per GPU)

Skylake  6148 + 4 Nvidia V100 32GB,

Zen3 EPYC 7513 + 4 NVidia A100 40GB,

and Zen2 EPYC 7662 + 8 NVidia A100 80GB

150 per GPU
grete:interactive
ggpu#48:00:00361unlimited

764 000 MB (32 GB per GPU)

or 500 000 MB (10GB or 20GB HBM per MiG slice)

Skylake  6148 + 4 Nvidia V100 32GB,

Zen3 EPYC 7513 + 4 NVidia A100  40GB splitted in 2g.10gb and 3g.20gb slices

150 per GPU (V100)

or 47 per MiG slice (A100)

see GPU Usage


A100 GPUs are split into slices via MIG (3 slices per GPU)

grete:preemptible
ggpu#48:00:00361unlimited

764 000 MB (32 GB per GPU)

or 500 000 MB (10GB or 20GB HBM per MiG slice)

Skylake  6148 + 4 Nvidia V100 32GB,

Zen3 EPYC 7513 + 4 NVidia

A100

A100  40GB splitted in 2g.10gb and 3g.20gb slices

150 per GPU (V100)

or 47 per MiG slice (A100)

* 600 for the nodes with 4 GPUs, and 1200 for the nodes with 8 GPUs

...

The available home/local-ssd/work/perm storages are discussed in File Storage Systems.

An overview of all partitions and node statuses is provided by: sinfo -r
To see detailed information about a nodes type: scontrol show node <nodename>

...

Short nameLink to manufacturer specificationsWhere to findUnits per node

Cores per unit

Clock speed
[GHz]

Cascade 9242Intel Cascade Lake Platinum 9242 (CLX-AP)Lise and Emmy compute partitions2482.3
Cascade 4210Intel Cascade Lake Silver 4210 (CLX)blogin[1-8], glogin[3-8]2102.2
Skylake  6148Intel Skylake Gold 6148Emmy compute partitions2202.4
Skylake 4110Intel Skylake Silver 4110glogin[1-2]282.1
Tesla V100NVIDIA Tesla V100 32GBEmmy gpu partitiongrete partitions4

640/5120*


Tesla A100NVIDIA Tesla A100 40GB and 80GBEmmy grete partitions4 or 8

432/6912*


...