200 likes | 318 Views
Job Submission. Using PBSPro and Globus Job Commands. Overview. Computational Resources Queuing Systems PBSPro Globus Toolkit. Computational Resources. Hard Disks (permanent storage) Number of CPUs (processing power) CPU time (processing time) Physical memory (program size)
E N D
Job Submission Using PBSPro and Globus Job Commands
Overview • Computational Resources • Queuing Systems • PBSPro • Globus Toolkit
Computational Resources • Hard Disks (permanent storage) • Number of CPUs (processing power) • CPU time (processing time) • Physical memory (program size) • All computers have limited resources • These resources may span across multiple processors/machines • How to allocate these resources fairly, amongst the many users?
Queuing Systems • Holding area for pending requests • A method for allocating the needed resources based on a user request • Processing of requests dynamically, quickly and fairly • Many differentimplementations
PBSPro • Queuing System used to control the allocation of computational resources to user submitted jobs • Allows optimal sharing of all resources • Ensures that the limited resources aren’t over-run and exceeded • Unattended processing of requests
PBS Queues • Allows distribution of resources into clearly defined groups, called queues • Queues can be defined by: • Maximum CPU time • Number of CPUs available • Memory needed • Concurrently executing jobs • Queuing schemes evolve over time as user requests and workload vary
Interacting With PBS • qsub – Submit a job to the queues • qstat – Check the status of a job • qdel – Delete a job from the queues • qmgr – Create/modify queue settings • xpbs – Monitor all queues and jobs
qsub – Submit A Job • To submit a job to the Queuing System, use “qsub” • E.g. qsub test.sub • Test.sub is a script file containing the commands to be executed • qsub returns your “QueueID”, if the job was submitted successfully:61606.master
Submit File For A Serial Job • !/bin/bash#PBS -l walltime=2:00:00#PBS -l mem=5mb#PBS -j oe#PBS –m becd Serial./test
Submit File For A Parallel Job • #!/bin/sh#PBS -l nodes=4:ppn=2#PBS -l walltime=48:00:00#PBS -j oecd ./MPI/Examplesmpiexec ./test
qstat – Check Job Status • To check the status of a job submitted to the Queuing System, use “qstat”Job id Name User Time S Queue------------------ ---------- ------------ ------------ - --------61395.master G5D2C ngs0140 70:00:40 R cpu1661494.master STDIN ngs0227 17:58:40 R cpu161495.master STDIN ngs0227 18:15:42 R cpu161496.master STDIN ngs0227 18:15:02 R cpu161497.master STDIN ngs0227 18:13:42 R cpu161498.master STDIN ngs0227 18:14:13 R cpu861555.master Co_V ngs0133 20:57:53 R cpu2461567.master 20_12 ngs0234 00:31:17 R cpu161576.master 20_21 ngs0234 00:11:59 R cpu461578.master 20_23 ngs0234 00:05:51 R cpu161580.master 20_25 ngs0234 00:03:09 R cpu1
qdel – Delete A Job • To delete a job submitted to the Queuing System, use “qdel” • E.g. qdel QueueID qdel QueueID1 QueueID2
Globus Toolkit • An open source toolkit for developing Grid based applications and connectivity • Allocating computational resources on remote (Globus aware) machines for the execution of user submitted jobs
Globus Job Commands • globus-job-run <options> • globus-job-submit <options> • globus-job-status URL • globus-job-get-output URL
globus-job-run • Allows you to run a job as though it were interactive, on a local or remote machine • Don’t actually need to log on to the machine itself • Not submitted to the Queuing System • Returns the programs output as though you were running interactively
globus-job-run Examples • globus-job-run grid-data.man.ac.uk /bin/date • globus-job-run grid-data.rl.ac.uk ./test • globus-job-run \grid-compute.leeds.ac.uk/jobmanager-pbs \-np 8 -x ‘(jobtype=mpi)(environment= \(NGSMODULES clusteruser))’ ./MPI/test • globus-job-run grid-data.rl.ac.uk -s ./test
globus-job-submit • To submit a job through a Globus Job Manager • Commands returns a URL to the program’s outputhttps://grid-compute.leeds.ac.uk:64167/5291/1094639422/ • The status and output of the job can be tested through this URL
globus-job-submit Examples • globus-job-submit \grid-data.rl.ac.uk/jobmanager-pbs ./test • globus-job-submit \grid-compute.leeds.ac.uk/jobmanager-pbs \-x ‘(jobtype=mpi)(directory=/home/bob/mpi) \(environment=(NGSMODULES clusteruser)) \(count=8)’ ./mpi_program • globus-job-submit \grid-data.man.ac.uk/jobmanager-pbs -s ./test
globus-job-status URL • To get the status of a submitted job • globus-job-status \https://gridcompute.leeds.ac.uk:64167/5291/1094639422/ • Returns: • Pending, Active, Done, Failed
globus-job-get-output URL • To retrieve the output of a job submitted through a Globus Job Manager • globus-job-get-output \https://gridcompute.leeds.ac.uk:64167/5291/1094639422/ • Returns the output of the program to the console