A job is submitted to Slurm by running the sbatch command with a job script as its argument. A simple example of a Slurm job submission script (slurm_test.job):
srun hostname | sort
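A job script is normally a shell script whose leading `#SBATCH` comment lines carry the resource request. The following is a minimal sketch of a complete script, written to disk with a here-document. The job name and the two-node, one-task-per-node layout are assumptions chosen to match a two-node cluster, not values taken from the original script:

```shell
# Write a complete job script to slurm_test.job in the current directory.
# The #SBATCH values below are illustrative assumptions; adjust them for your cluster.
cat > slurm_test.job <<'EOF'
#!/bin/bash
# Name shown in the squeue listing
#SBATCH --job-name=slurm_test
# Request two nodes with one task on each
#SBATCH --nodes=2
#SBATCH --ntasks-per-node=1

# srun starts one copy of hostname per task; sort orders the combined output
srun hostname | sort
EOF
```

The quoted `'EOF'` delimiter keeps the shell from expanding anything inside the script while it is being written.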
First let's check how many nodes we have:
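Node and partition state is reported by `sinfo`. The following is a minimal sketch, assuming the Slurm client tools are on your PATH. The guard and the `node_count` variable are additions for illustration, so the snippet prints a note instead of failing when run off-cluster:

```shell
if command -v sinfo >/dev/null 2>&1; then
    # Per-partition summary of node states
    sinfo
    # -N lists one node per line, -h drops the header, -o '%N' prints only the node name.
    # A node can belong to several partitions, so de-duplicate before counting.
    node_count=$(sinfo -N -h -o '%N' | sort -u | wc -l | tr -d ' ')
    echo "Nodes: $node_count"
else
    node_count=0
    echo "sinfo not found; run this on a Slurm login node"
fi
```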
Now let's run our job (slurm_test.job) on all the available nodes (c1 and c2).
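A sketch of the submission, assuming slurm_test.job sits in the current directory. `--nodes=2` is an assumption based on the two nodes c1 and c2 (options given on the command line override `#SBATCH` directives in the script), and `--parsable` makes sbatch print only the job ID so it can be stored in a shell variable, here called `JOBID`:

```shell
if command -v sbatch >/dev/null 2>&1; then
    # --parsable prints "jobid" (or "jobid;cluster" on multi-cluster setups)
    JOBID=$(sbatch --parsable --nodes=2 slurm_test.job)
    echo "Submitted batch job $JOBID"
else
    JOBID=""
    echo "sbatch not found; run this on a Slurm login node"
fi
```

Without `--parsable`, sbatch prints a line of the form "Submitted batch job" followed by the job ID.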
The job stays in the pending (PD) state until the requested resources become available, and then it starts running. Let's now check the output, which should list the hostnames of the c1 and c2 compute nodes:
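Unless the script sets `--output`, sbatch writes the job's standard output and error to slurm-<jobid>.out in the directory the job was submitted from. A minimal sketch for reading it, where the job ID 2 is a hypothetical placeholder:

```shell
JOBID=2                      # hypothetical job ID; use the one printed by sbatch
OUT="slurm-${JOBID}.out"     # default output file name when --output is not set

if [ -f "$OUT" ]; then
    cat "$OUT"
else
    echo "$OUT not found yet; the job may still be pending or running"
fi
```

For the two-node job described above, the file should contain the names c1 and c2, one per line.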
To check all your running/pending jobs:
squeue -u [USERID]
If you want to check the status of a single job:
scontrol show job [JOBID]
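When only the job state is needed, for example to see whether a job is still pending, `squeue` can print just that field. A sketch under the assumption that the job is still in the queue; the job ID 2 and the `STATE` variable are placeholders chosen here:

```shell
JOBID=2   # hypothetical job ID; use the one printed by sbatch

if command -v squeue >/dev/null 2>&1; then
    # -h drops the header, -j selects the job, -o '%T' prints only the state
    # in long form (for example PENDING or RUNNING)
    STATE=$(squeue -h -j "$JOBID" -o '%T')
    echo "Job $JOBID state: ${STATE:-not in queue}"
else
    STATE=""
    echo "squeue not found; run this on a Slurm login node"
fi
```

Jobs that have already finished drop out of the squeue listing, which is why the sketch falls back to "not in queue".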
To cancel a job:
scancel [JOBID]
To cancel all jobs for a specific user:
scancel -u [USERID]
Hands-on course on scientific computing using High Performance Computing (HPC). The main goal of this course is to introduce you to HPC systems and their software stack: Slurm, PBS, OpenMPI, MPI and CUDA.