1 / 7

CONDOR-G Installation

CONDOR-G Installation. July 2004 : one independent PC for Grid FTP as a client to UCL August 2004 : complete installation of Globus 3.2 on the farm GridFTP server -> another picture to download Interface with CONDOR 6.6.6 for batch submission Local test with the independent PC.

schuyler
Télécharger la présentation

CONDOR-G Installation

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. CONDOR-G Installation • July 2004 : one independent PC for Grid FTP as a client to UCL • August 2004 : complete installation of Globus 3.2 on the farm • GridFTP server -> another picture to download • Interface with CONDOR 6.6.6 for batch submission • Local test with the independent PC Globus-url-copy –v gsiftp://cms01.umh.ac.be/tmp/newlab_we_are_ready.jpg file ${HOME}/newlab_we_are_ready.jpg J. Hanton - P. Herquet - F. Lequeux - A. Romeyer

  2. Cluster architecture Independent PC CONDOR Globus Outer world OS : Redhat CERN 7.3.3 100 Mb/s cms01.umh.ac.be 193.190.171.101 Server Raid disk (2.4 TB) 10.0.0.11 Static IP 10.0.0.XXX Router 1 Gb/s … Cms02 10.0.0.12 Cms03 10.0.0.13 Computer J. Hanton - P. Herquet - F. Lequeux - A. Romeyer

  3. Globus setup • On both the indep. PC and CMS01 public machine : • Configure the GridFTP server on port 2811 • Configure the gatekeeper service on port 2119 • Setting up the grid-mapfile : • Setting up the grim port types : "/C=BE/O=BELGRID/OU=TESTBED/OU=umh.ac.be/CN=cmsuser" cmsuser "/C=BE/O=BELGRID/OU=TESTBED/OU=localdomain/CN=cmsuser" cmsuser "/C=BE/O=BELGRID/OU=TESTBED/OU=fynu.ucl.ac.be/CN=Alain NINANE" cmsuser <authorized_port_types> <port_type username="globus">http://www.globus.org/namespaces/managed_job/manage d_job/ManagedJobPortType</port_type> <port_type username="cmsuser">http://www.globus.org/namespaces/managed_job/manag ed_job/ManagedJobPortType</port_type> </authorized_port_types> J. Hanton - P. Herquet - F. Lequeux - A. Romeyer

  4. Globus setup… • Lets test it : • Nothing strange in the globus-gatekeeper.log • Solution (only a trick…) : remove the gsi-authz.conf in /etc/grid-security • Build the condor scheduler in Globus Globus-job-run cms01.umh.ac.be /bin/dateGram Job submission failed because data transfer to the server failed (error code 10) cmsuser@cms01:~> globus-job-run cms01.umh.ac.be /bin/date Thu Aug 19 10:12:34 CEST 2004 cmsuser@cms-test:~> globus-job-run cms01.umh.ac.be /bin/date Thu Aug 19 10:12:16 CEST 2004 Local test Remote test • gpt-build scheduler-condor-3.2-src_bundle.tar.gz gcc32dbg • gpt-postinstall J. Hanton - P. Herquet - F. Lequeux - A. Romeyer

  5. Test of CONDOR-G Indep. PC CONDOR + Globus eth0 • Start with a CONDOR example : io.c • CONDOR .cmd file : ######################## Test Condor command file######################universe        = globusglobusscheduler = cms01.umh.ac.be/jobmanager-condorexecutable      = io.remoteoutput          = io.outerror           = io.errlog             = io.logrequirements    = CMSFARM=?=Truearguments       = 200queue Globus eth1 CMS01CONDOR + Globus eth0 CONDOR eth0 CMS02, CMS03 CONDOR J. Hanton - P. Herquet - F. Lequeux - A. Romeyer

  6. Test of CONDOR-G… • Launch the job from the indep. PC : cmsuser@cms-test:~> condor_submit io.cmd cmsuser@cms-test:/scratch/examples> condor_q -globus -- Submitter: cms-test.umh.ac.be : <10.102.192.19:32771> : cms-test.umh.ac.be ID OWNER STATUS MANAGER HOST EXECUTABLE 239.0 cmsuser UNSUBMITTED condor cms01.umh.ac.be /tmp/Scratch/examp cmsuser@cms-test:~/Script/Condor/Analyse/Reco_1> condor_q -globus -- Submitter: cms-test.umh.ac.be : <10.102.192.19:32771> : cms-test.umh.ac.be ID OWNER STATUS MANAGER HOST EXECUTABLE 239.0 cmsuser PENDING condor cms01.umh.ac.be /tmp/Scratch/examp cmsuser@cms-test:~/Script/Condor/Analyse/Reco_1> condor_q -globus -- Submitter: cms-test.umh.ac.be : <10.102.192.19:32771> : cms-test.umh.ac.be ID OWNER STATUS MANAGER HOST EXECUTABLE 239.0 cmsuser ACTIVE condor cms01.umh.ac.be /tmp/Scratch/examp J. Hanton - P. Herquet - F. Lequeux - A. Romeyer

  7. Test of CONDOR-G… • On cms01 : cmsuser@cms01:~> condor_q -- Submitter: cms01.umh.ac.be : <10.0.0.11:32773> : cms01.umh.ac.be ID OWNER SUBMITTED RUN_TIME ST PRI SIZE CMD 755.0 cmsuser 8/19 10:34 0+00:00:51 R 0 0.0 data 1 250 0 Lumi2 1 jobs; 0 idle, 1 running, 0 held cmsuser@cms01:~> condor_q -r -- Submitter: cms01.umh.ac.be : <10.0.0.11:32773> : cms01.umh.ac.be ID OWNER SUBMITTED RUN_TIME HOST(S) 755.0 cmsuser 8/19 10:34 0+00:01:00 cms02 Job is running on cms02 Rem : the test has also been done with CMS reconstruction job J. Hanton - P. Herquet - F. Lequeux - A. Romeyer

More Related