Monitoring your jobs
There are various ways for you to monitor and check up on your running and completed jobs.
See the status of the nodes
The easiest way to see what is happening on the cluster is to firstly check ganglia. This is a web based monitoring application that displays statistics about the cluster and its nodes. To view this, simply visit;
http://bert.ibers.aber.ac.uk/ganglia
There are a variety of statistics to view. Most useful is load_one which shows you the cpu load average on each node. You can also monitor the overall averages along with memory and network usage.
Check on what is running
Once you have submitted your job scripts, you may want to check on the progress of what is running. This is achieved using the qstat command.