Paso 1.- Instalación de paquetes:
server# apt install munge slurm-wlm
Paso 2.- Crear el archivo de configuración /etc/slurm-llnl/slurm.conf :
ControlMachine=<YOUR-HOST-NAME>
MpiDefault=none
ProctrackType=proctrack/pgid
ReturnToService=1
SlurmctldPidFile=/var/run/slurm-llnl/slurmctld.pid
SlurmdPidFile=/var/run/slurm-llnl/slurmd.pid
SlurmdSpoolDir=/var/lib/slurm-llnl/slurmd
SlurmUser=slurm
StateSaveLocation=/var/lib/slurm-llnl/slurmctld
SwitchType=switch/none
TaskPlugin=task/none
#
# SCHEDULING
FastSchedule=1
SchedulerType=sched/builtin
#Modificado para enviar trabajos simultáneos en un mismo nodo
SelectType=select/cons_res
SelectTypeParameters=CR_CPU_Memory
#
#
# LOGGING AND ACCOUNTING
AccountingStorageType=accounting_storage/none
ClusterName=<YOUR-HOST-NAME>
JobAcctGatherType=jobacct_gather/none
SlurmctldLogFile=/var/log/slurm-llnl/slurmctld.log
SlurmdLogFile=/var/log/slurm-llnl/slurmd.log
#
# COMPUTE NODES
NodeName=<YOUR-HOST-NAME> CPUs=4 Sockets=1 CoresPerSocket=4 ThreadsPerCore=2 State=UNKNOWN
PartitionName=long Nodes=<YOUR-HOST-NAME> Default=YES MaxTime=INFINITE State=UP
Paso 3.- Iniciar los servicios:
server# systemctl enable slurmctld
server# systemctl start slurmctld
server# systemctl enable slurmd
server# systemctl start slurmd
Paso 4.- Lanzar un trabajo:
$ sbatch -N1 -n1 --mem-per-cpu=50M submith.sh
Comandos útiles:
Estado de la cola en "Drain"
$sudo scontrol update nodename=node state=resume
Referencias: