The IHPC clusters are using Torque as the queuing system.
Users should always run their computations on the compute nodes, not the login nodes. This is done by using the queuing system.
To run a job on the system, you need to create a job script. A job script is a reqular shell script either bash or csh with some directives which specifies number of cpus, memory etc. Then, this will be interpreted by the batch system on submission. Below is a very basic job sample script:
#PBS -N ExampleJobName
#PBS -l nodes=1
#print hostname on which the job is running
Once you have your job script ready, you can use qsub command as follows:
qsub <your job script filename>
To check the status of your job you can then do:
qstat -u <username>
Other options of note for job scripts
- -l nodes=2:ppn=4
- Requests 2 nodes with 4 cores each for the job
- -q himem [or -q gpu]
- Requests to use the high memory nodes or gpu nodes [Garpur only]
- Joins the standart error stream and standart output of a job into a single file
- -v n=10
- Passes variable n to the job
If you want to run a job on the compute nodes interactivly, you can use the command
To leave this interactive session use the “exit” command.
Some additional resources: