Basic Usage: CPU Based Resources With Slurm
== Introduction ==

Slurm [https://slurm.schedmd.com/documentation.html] is a workload manager for clusters, offering both batch and interactive job scheduling. It works through a text-based interface on the Linux terminal.

Slurm provides the following to help you make use of the cluster (a quick command preview follows this list):

# What resources are available on the cluster.
# Queuing and allocation of jobs based on the resources you specify.
# Job monitoring and status reporting.
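
These three functions map onto the core Slurm commands used throughout this guide (a quick preview; each command is covered in its own section below):

 sinfo     # 1. show the partitions and resources available on the cluster
 sbatch    # 2. submit a job script for queuing and resource allocation
 squeue    # 3. monitor the status of queued and running jobs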
 
  
 

== Simple commands with SLURM ==

You can obtain information on the Slurm "partitions" that accept jobs using the sinfo command:

   $ sinfo
   PARTITION AVAIL  TIMELIMIT  NODES  STATE NODELIST
   test         up       1:00      1   idle gnt-usiu-gpu-00.kenet.or.ke
   gpu1         up 1-00:00:00      1   idle gnt-usiu-gpu-00.kenet.or.ke
   normal*      up 1-00:00:00      1   idle gnt-usiu-gpu-00.kenet.or.ke


The test partition is reserved for testing and has a very short time limit. The normal partition is to be used for CPU-only jobs, and the gpu1 partition is reserved for GPU jobs. Both production partitions have a time limit of 24 hours per job.
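
If you only need specific columns, for example the partition names and their time limits, sinfo accepts an output format string (a small sketch; the exact values will come from your cluster):

 # %P = partition name, %l = time limit, %D = node count, %t = node state
 sinfo -o "%P %l %D %t"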

==== Showing The Queue ====

The Slurm squeue command lists all submitted jobs, giving you an indication of how busy the cluster is, as well as the status of all running or waiting jobs. Jobs that have completed leave the queue and will not appear in this list.

   $ squeue 
   JOBID PARTITION     NAME     USER ST       TIME  NODES NODELIST(REASON)
    63    normal     gpu1   jotuya  R       0:03      1 gnt-usiu-gpu-00.kenet.or.ke
   $
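
On a busy cluster the full listing can be long. squeue accepts filters to narrow it down, for example (a short sketch using standard squeue options):

 squeue -u $USER      # show only your own jobs
 squeue -j 63         # show a single job by its JOBID
 squeue --start       # estimated start times for pending jobs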

== Submitting Your first Job ==

==== Create a submission script ====

You require a submission script, which is a plain text file with all the instructions for the command you intend to run. Create a working directory in your scratch directory:

 cd ~/localscratch/
 mkdir test

and in this directory we will place the following content in a file:

 #!/bin/bash

 #SBATCH -J testjob            # Job name
 #SBATCH -o job.%j.out         # Name of stdout output file (%j expands to jobId)
 #SBATCH -e %j.err             # Name of stderr output file
 #SBATCH --partition=normal    # Queue (the CPU-only partition)
 #SBATCH --nodes=1             # Total number of nodes requested
 #SBATCH --ntasks=4            # Total number of MPI tasks (matches mpirun -np 4)
 #SBATCH --cpus-per-task=1     # CPU cores per MPI task
 #SBATCH --time=00:03:00       # Run time (hh:mm:ss) - 3 minutes
 
 # Launch MPI-based executable
 module load applications/qespresso/7.3.1
 
 cd $HOME/localscratch/test
 mpirun -np 4 pw.x < input.in > output.out

Put this in a file called <code>test.slurm</code>.
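
You can create the file with any terminal text editor, for example nano (assuming it is installed on the login node). Before submitting for real, sbatch can also dry-run the script, checking it for errors without creating a job:

 nano test.slurm                  # paste the script above, then save and exit
 sbatch --test-only test.slurm    # validate the script without submitting it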

==== Submitting the Job to the Queue ====

The Slurm sbatch command submits batch jobs to the queue:

   $ sbatch  test.slurm 
   Submitted batch job 64
   $
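
Once submitted, you can follow the job using the job ID that sbatch printed (64 in the example above). A few standard Slurm commands for this (a sketch; substitute your own job ID):

 squeue -j 64           # check whether the job is pending (PD) or running (R)
 scontrol show job 64   # full details of the job and its allocation
 scancel 64             # cancel the job if something went wrong
 cat job.64.out         # inspect stdout once the job has started writing it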

This will run pw.x on 4 cores. Note that the parallelism is built into the program; if the program itself is not parallelised, running it on multiple cores will provide no benefit.

Next: [[Basic_Usage:_GPU_Based_Resources_With_Slurm]]

Up: [[HPC_Usage]]