Basic Usage: CPU Based Resources With Slurm

From KENET Training
Revision as of 13:21, 1 April 2025 by Atambo (talk | contribs)
Jump to: navigation, search

Introduction

Slurm [1] is a workload manager for clusters, offering both batch and interactive job scheduling. It works over a text based interface on the linux terminal.

Slurm will provide you with the following to help you make use of the cluster;

  1. What resources are available on the cluster.
  2. Queuing and allocation of jobs based on specified resources.
  3. Job monitoring and status reporting.

Simple commands with SLURM

You can obtain information on the Slurm "Partitions" that accept jobs using the sinfo command:

$ sinfo PARTITION AVAIL TIMELIMIT NODES STATE NODELIST test up infinite 1 idle gnt-usiu-gpu-00.kenet.or.ke gpu1 up infinite 1 idle gnt-usiu-gpu-00.kenet.or.ke normal* up infinite 1 idle gnt-usiu-gpu-00.kenet.or.ke