Managing Jobs

One tool we have found very useful, and have installed on the SC cluster, is "pestat", made available by Ole@DTU.

It prints one line per node, giving you an overview of the entire cluster or of just a specific partition, node, or user.

Status of each node on the cluster:

pestat -G

Status of each node within a partition:

pestat -p mypartition -G

Status of a specific node:

pestat -n mynode -G

List nodes that have a job owned by a specific user:

pestat -u myuser -G

You can also use standard Slurm commands. To list all jobs in the queue on the cluster, both running and pending, type:

squeue
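
On a busy cluster the full queue can be long, so it may help to restrict the listing to one user's jobs. The following is a minimal sketch, assuming a POSIX shell; my_jobs is a hypothetical helper name and not part of Slurm, while -u and -t are standard squeue options.

```shell
# my_jobs is a hypothetical helper, not a Slurm command.
# Usage: my_jobs [user] [states]
my_jobs() {
    # -u restricts the listing to one user (defaults to the current user).
    # -t filters by job state; it accepts a comma-separated list.
    squeue -u "${1:-$USER}" -t "${2:-PENDING,RUNNING}"
}
```

For example, "my_jobs" lists your own pending and running jobs, and "my_jobs myuser PENDING" lists only the jobs of myuser that are still waiting to start.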

You can view detailed information for a specific job by typing the following, where jobid is the job's ID as shown in the JOBID column of squeue:

scontrol show job jobid
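
The scontrol output is a long list of Key=Value fields. If only one field is of interest, it can be picked out with standard text tools. This is a sketch under the assumption that the job record contains a JobState= field, as it does in current Slurm releases; job_state is a hypothetical helper name.

```shell
# job_state is a hypothetical helper, not a Slurm command.
# Usage: job_state jobid
job_state() {
    # Put each Key=Value pair on its own line, then print only
    # the value that follows "JobState=".
    scontrol show job "$1" | tr ' ' '\n' | sed -n 's/^JobState=//p'
}
```

For a running job, "job_state jobid" would be expected to print RUNNING.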

To cancel a job you started, type the following, using the same jobid as above:

scancel jobid
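
scancel can also select jobs by owner and state rather than by ID, which is handy for clearing out jobs that have not started yet. A minimal sketch follows; cancel_pending is a hypothetical helper name, and -u and --state are standard scancel options. Regular users can only cancel their own jobs.

```shell
# cancel_pending is a hypothetical helper, not a Slurm command.
# Usage: cancel_pending [user]
cancel_pending() {
    # Cancel only jobs that are still waiting in the queue;
    # jobs that are already running are left untouched.
    scancel -u "${1:-$USER}" --state=PENDING
}
```

Running "cancel_pending" with no argument acts on the current user's pending jobs.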