sde-cheatsheet

slurm - Command

Show Status of Cluster Nodes

sinfo

Show Detailed Information of a Node

scontrol show node [node_tag]

Login to a Node

ssh [node_tag]

Run a Task

srun

Submit a Batch Task

# Command
sbatch \
    --comment [comment] \
    --timeout [max_execution_time] \
    --exclusive \
    --nodes [node_number] \
    --cpus-per-task [cpu_number] \
    --wrap [command]

# Example
sbatch \
    --comment "feicheng" \
    --timeout 60000 \
    --exclusive \
    --nodes 32 \
    --cpus-per-task 128
    --wrap "./run_inference_demo.sh"

Show Tasks Scheduled in Queue

squeue

Cancel a Task in Queue

scancel [task_id]