...
Node: A single machine in a supercomputer. This will be either a physical machine or a virtual machine.
Scheduler: The application on our end that assigns compute resources for jobs.
Slurm: The brand name of our scheduler which manages and allocates resources.
MPI: Message Passing Interface (MPI) is a standardized and portable function used on parallel computing architecturesSBATCH: This is a means of submitting a batch job, from which tasks will be executed, to execute on allocated resources.
What is a Job?
A job is any work submitted to the supercomputer that requests or uses its resources. There are three types of jobs that a user can request:
...
This reduced example will still allocate 50 cores with 5 cores per task on any number of available nodes. Note, since there is no MPI capability in the Aloe environment, you will likely always add -N 1
. This will ensure that each job’s worker has the lowest latency to each other.
Info |
---|
As a general rule, CPU-only nodes have 128 cores and GPU-present nodes have 48 cores. |
...