CONDOR-G Installation
• July 2004 : one independent PC set up as a GridFTP client to UCL
• August 2004 : complete installation of Globus 3.2 on the farm
• GridFTP server -> another picture to download
• Interface with CONDOR 6.6.6 for batch submission
• Local test with the independent PC
    globus-url-copy -v gsiftp://cms01.umh.ac.be/tmp/newlab_we_are_ready.jpg file://${HOME}/newlab_we_are_ready.jpg
J. Hanton - P. Herquet - F. Lequeux - A. Romeyer

Cluster architecture
• OS : Redhat CERN 7.3.3
• Independent PC : CONDOR + Globus
• Server cms01.umh.ac.be : 193.190.171.101 towards the outer world, 10.0.0.11 on the internal network, RAID disk (2.4 TB)
• Computers behind the router, static IPs 10.0.0.XXX : cms02 (10.0.0.12), cms03 (10.0.0.13), …
• Link speeds shown in the diagram : 100 Mb/s and 1 Gb/s

Globus setup
On both the indep. PC and the public machine CMS01 :
• Configure the GridFTP server on port 2811
• Configure the gatekeeper service on port 2119
• Set up the grid-mapfile :
    "/C=BE/O=BELGRID/OU=TESTBED/OU=umh.ac.be/CN=cmsuser" cmsuser
    "/C=BE/O=BELGRID/OU=TESTBED/OU=localdomain/CN=cmsuser" cmsuser
    "/C=BE/O=BELGRID/OU=TESTBED/OU=fynu.ucl.ac.be/CN=Alain NINANE" cmsuser
• Set up the grim port types :
    <authorized_port_types>
      <port_type username="globus">http://www.globus.org/namespaces/managed_job/managed_job/ManagedJobPortType</port_type>
      <port_type username="cmsuser">http://www.globus.org/namespaces/managed_job/managed_job/ManagedJobPortType</port_type>
    </authorized_port_types>

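Each grid-mapfile line pairs a quoted certificate subject with a local account. As an illustration only, the Python sketch below reads such entries into a dictionary; the function name `parse_grid_mapfile` is hypothetical, and it assumes exactly one account per line (a simplification of what Globus accepts).

```python
import shlex


def parse_grid_mapfile(text):
    """Return a dict mapping certificate subject (DN) -> local account.

    Assumption: each non-empty, non-comment line has the form
    "<quoted DN>" <account>, as in the entries shown on the slide.
    """
    mapping = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#"):
            continue
        # shlex.split keeps the quoted DN (which may contain spaces) as one token
        parts = shlex.split(line)
        if len(parts) != 2:
            raise ValueError(f"unexpected grid-mapfile line: {raw!r}")
        dn, account = parts
        mapping[dn] = account
    return mapping


# The three entries from the slide
SAMPLE = '''
"/C=BE/O=BELGRID/OU=TESTBED/OU=umh.ac.be/CN=cmsuser" cmsuser
"/C=BE/O=BELGRID/OU=TESTBED/OU=localdomain/CN=cmsuser" cmsuser
"/C=BE/O=BELGRID/OU=TESTBED/OU=fynu.ucl.ac.be/CN=Alain NINANE" cmsuser
'''

if __name__ == "__main__":
    for dn, account in parse_grid_mapfile(SAMPLE).items():
        print(f"{account} <- {dn}")
```

A check of this kind can catch a missing quote around a subject containing spaces, such as "CN=Alain NINANE", before the gatekeeper has to.
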
Globus setup…
• Let's test it :
    globus-job-run cms01.umh.ac.be /bin/date
    GRAM Job submission failed because data transfer to the server failed (error code 10)
• Nothing strange in the globus-gatekeeper.log
• Solution (only a trick…) : remove the gsi-authz.conf in /etc/grid-security
    Local test :
    cmsuser@cms01:~> globus-job-run cms01.umh.ac.be /bin/date
    Thu Aug 19 10:12:34 CEST 2004
    Remote test :
    cmsuser@cms-test:~> globus-job-run cms01.umh.ac.be /bin/date
    Thu Aug 19 10:12:16 CEST 2004
• Build the condor scheduler in Globus :
    gpt-build scheduler-condor-3.2-src_bundle.tar.gz gcc32dbg
    gpt-postinstall

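When a submission fails, one inexpensive thing to rule out is plain connectivity to the gatekeeper (port 2119) and GridFTP (port 2811) services. The sketch below is an assumption-laden illustration using only the Python standard library: `port_is_open` and `check_services` are hypothetical helper names, and a successful TCP connection says nothing about GSI authentication or authorization.

```python
import socket


def port_is_open(host, port, timeout=3.0):
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts and name-resolution failures
        return False


def check_services(host, services, timeout=3.0):
    """Map each service name to True/False depending on port reachability.

    `services` is a dict such as {"gatekeeper": 2119, "gridftp": 2811}.
    """
    return {name: port_is_open(host, port, timeout)
            for name, port in services.items()}
```

For the setup described here, a call could look like `check_services("cms01.umh.ac.be", {"gatekeeper": 2119, "gridftp": 2811})`. In the case shown on the slide, the failure was worked around by removing gsi-authz.conf, so both ports may well have been reachable all along.
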
Test of CONDOR-G
• Start with a CONDOR example : io.c
• CONDOR .cmd file :
    ########################
    # Test Condor command file
    ########################
    universe = globus
    globusscheduler = cms01.umh.ac.be/jobmanager-condor
    executable = io.remote
    output = io.out
    error = io.err
    log = io.log
    requirements = CMSFARM =?= True
    arguments = 200
    queue
• Job path (diagram) : Indep. PC (CONDOR + Globus, eth0) -> Globus -> CMS01 (CONDOR + Globus, eth1 / eth0) -> CONDOR -> CMS02, CMS03 (CONDOR, eth0)

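When many similar jobs have to be submitted, a command file like the one above can be generated rather than edited by hand. The following Python sketch is only an illustration: `render_submit_file` and `IO_JOB` are hypothetical names, and the keys simply mirror the io example from the slide.

```python
def render_submit_file(settings, queue_count=1):
    """Render a Condor submit description from an ordered dict of settings.

    Each entry becomes a "key = value" line; a final "queue" statement
    is appended ("queue N" when more than one job is requested).
    """
    lines = [f"{key} = {value}" for key, value in settings.items()]
    lines.append("queue" if queue_count == 1 else f"queue {queue_count}")
    return "\n".join(lines) + "\n"


# Settings copied from the io example on the slide
IO_JOB = {
    "universe": "globus",
    "globusscheduler": "cms01.umh.ac.be/jobmanager-condor",
    "executable": "io.remote",
    "output": "io.out",
    "error": "io.err",
    "log": "io.log",
    "requirements": "CMSFARM =?= True",
    "arguments": "200",
}

if __name__ == "__main__":
    # Print the description; redirect to io.cmd before running condor_submit
    print(render_submit_file(IO_JOB), end="")
```

Keeping the settings in a dictionary makes it easy to vary only `executable` or `arguments` between jobs while the `globusscheduler` target stays fixed.
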
Test of CONDOR-G…
• Launch the job from the indep. PC :
    cmsuser@cms-test:~> condor_submit io.cmd

    cmsuser@cms-test:/scratch/examples> condor_q -globus
    -- Submitter: cms-test.umh.ac.be : <10.102.192.19:32771> : cms-test.umh.ac.be
     ID      OWNER    STATUS       MANAGER  HOST             EXECUTABLE
     239.0   cmsuser  UNSUBMITTED  condor   cms01.umh.ac.be  /tmp/Scratch/examp

    cmsuser@cms-test:~/Script/Condor/Analyse/Reco_1> condor_q -globus
    -- Submitter: cms-test.umh.ac.be : <10.102.192.19:32771> : cms-test.umh.ac.be
     ID      OWNER    STATUS       MANAGER  HOST             EXECUTABLE
     239.0   cmsuser  PENDING      condor   cms01.umh.ac.be  /tmp/Scratch/examp

    cmsuser@cms-test:~/Script/Condor/Analyse/Reco_1> condor_q -globus
    -- Submitter: cms-test.umh.ac.be : <10.102.192.19:32771> : cms-test.umh.ac.be
     ID      OWNER    STATUS       MANAGER  HOST             EXECUTABLE
     239.0   cmsuser  ACTIVE       condor   cms01.umh.ac.be  /tmp/Scratch/examp

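The job above moves through the states UNSUBMITTED, PENDING and ACTIVE. To follow that progression from a script, the status column can be pulled out of the listing. The Python sketch below assumes the six-column layout shown in this output and whitespace-free fields; `parse_condor_q_globus` is a hypothetical helper, not part of Condor.

```python
def _is_job_id(token):
    """True for identifiers of the form <cluster>.<proc>, e.g. 239.0."""
    cluster, dot, proc = token.partition(".")
    return dot == "." and cluster.isdigit() and proc.isdigit()


def parse_condor_q_globus(output):
    """Extract job rows from `condor_q -globus` text output.

    Assumes the columns ID, OWNER, STATUS, MANAGER, HOST, EXECUTABLE
    as displayed on the slide; header and submitter lines are skipped.
    """
    jobs = []
    for line in output.splitlines():
        fields = line.split()
        if len(fields) < 6 or not _is_job_id(fields[0]):
            continue
        jobs.append({
            "id": fields[0],
            "owner": fields[1],
            "status": fields[2],
            "manager": fields[3],
            "host": fields[4],
            "executable": fields[5],
        })
    return jobs


SAMPLE = """\
-- Submitter: cms-test.umh.ac.be : <10.102.192.19:32771> : cms-test.umh.ac.be
 ID      OWNER    STATUS       MANAGER  HOST             EXECUTABLE
 239.0   cmsuser  PENDING      condor   cms01.umh.ac.be  /tmp/Scratch/examp
"""

if __name__ == "__main__":
    for job in parse_condor_q_globus(SAMPLE):
        print(job["id"], job["status"], "on", job["host"])
```

Splitting on whitespace is fragile if a column ever contains spaces, so this is best treated as a quick monitoring aid rather than a robust parser.
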
Test of CONDOR-G…
• On cms01 :
    cmsuser@cms01:~> condor_q
    -- Submitter: cms01.umh.ac.be : <10.0.0.11:32773> : cms01.umh.ac.be
     ID      OWNER     SUBMITTED    RUN_TIME    ST PRI SIZE CMD
     755.0   cmsuser   8/19 10:34   0+00:00:51  R  0   0.0  data 1 250 0 Lumi2
    1 jobs; 0 idle, 1 running, 0 held

    cmsuser@cms01:~> condor_q -r
    -- Submitter: cms01.umh.ac.be : <10.0.0.11:32773> : cms01.umh.ac.be
     ID      OWNER     SUBMITTED    RUN_TIME    HOST(S)
     755.0   cmsuser   8/19 10:34   0+00:01:00  cms02
• The job is running on cms02
• Remark : the test has also been done with a CMS reconstruction job