#
Setup SLURM Clients
The SLURM server was set up on the head node so now the SLURM clients need to be set up on the other nodes. Do not follow these instructions on the head node, only the other nodes.
Swap to a compute node
Install munge:
yum install -y munge munge-libs perl-Switch numactl
Install the SLURM packages:
yum install -y flight-slurm flight-slurm-devel flight-slurm-perlapi flight-slurm-torque flight-slurm-slurmd flight-slurm-example-configs flight-slurm-libpmi
Take the contents of the file
/opt/flight/opt/slurm/etc/slurm.conf
on the head node and copy it to the same location on the current node.Create new directories for SLURM:
mkdir -p /opt/flight/opt/slurm/var/{log,run,spool}
Set the owner of the new directories:
chown -R nobody: /opt/flight/opt/slurm/var/{log,run,spool}
Take the munge key from the file
/etc/munge/munge.key
on the head node and make a munge key in the same location on the current node.Set the owner of the munge key:
chown munge: /etc/munge/munge.key
Lock the munge key so that it cannot be changed again:
chmod 400 /etc/munge/munge.key
Start munge and SLURM:
systemctl start munge systemctl enable munge systemctl start flight-slurmd systemctl enable flight-slurmd
Repeat these steps on every compute node.
#
Testing
If all was successful, then the following should be the case on the compute nodes:
The command
systemctl status munge
shows the service as active with no errors.The command
systemctl status flight-slurmd
shows the service as active with no errors.The file
/opt/flight/opt/slurm/etc/slurm.conf
has all the necessary options set. An example file is given in the instructions for SLURM server setup.The
/opt/flight/opt/slurm/var
directory exists, and contains these three directories:[root@chead1 (mycluster1) ~]# ls /opt/flight/opt/slurm/var/ log run spool
The
/opt/flight/opt/slurm/var
directory has these permissions:[root@cnode01 (mycluster1) ~]# ls -l /opt/flight/opt/slurm/var/ total 0 drwxr-xr-x. 2 nobody nobody 6 Sep 20 14:54 log drwxr-xr-x. 2 nobody nobody 6 Sep 20 14:54 run drwxr-xr-x. 2 nobody nobody 6 Sep 20 14:54 spool
The munge key (
/etc/munge/munge.key
) should be the same on all nodes.The munge key (
/etc/munge/munge.key
) should have these permissions:[root@cnode01 (mycluster1) ~]# ls -l /etc/munge/munge.key -r--------. 1 munge munge 65 Sep 20 14:56 /etc/munge/munge.key