
LHCb Development

Glenn Patrick, Raja Nandakumar. GridPP18, 20 March 2007.


Presentation Transcript


  1. LHCb Development. Glenn Patrick, Raja Nandakumar. GridPP18, 20 March 2007.

  2. LHCb, December 2006. [Detector figure; labels: p, p, RICH1, VELO, Trackers, Calorimeters, Muon, RICH2, Magnet.]

  3. LHCb Computing Model. [Trigger diagram.] Level-0 (hardware): 40 MHz. Level-1 (software): 1 MHz. HLT (software): 40 kHz. Output: 2 kHz @ 30 kB/event, 60 MB/s.
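
The 60 MB/s figure on slide 3 follows from the quoted event rate and event size. A minimal check, assuming decimal units (1 kB = 1000 bytes, 1 MB = 10^6 bytes); the function name is illustrative and not part of any LHCb software:

```python
def storage_bandwidth_mb_per_s(event_rate_hz: float, event_size_kb: float) -> float:
    """Return the sustained data rate in MB/s, assuming decimal units."""
    bytes_per_second = event_rate_hz * event_size_kb * 1_000
    return bytes_per_second / 1_000_000


# Rate and event size quoted on the slide: 2 kHz at 30 kB/event.
rate = storage_bandwidth_mb_per_s(event_rate_hz=2_000, event_size_kb=30)
print(f"{rate:.0f} MB/s")  # 60 MB/s
```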

  4. DIRAC Production and Analysis. (Next talk.) [Architecture diagram; labels as follows.]
     • User interfaces: Job monitor, Production manager, GANGA UI, User CLI, BK query webpage, FileCatalog browser.
     • DIRAC services: DIRAC Job Management Service, BookkeepingSvc, FileCatalogSvc, JobMonitorSvc, InformationSvc, MonitoringSvc, JobAccountingSvc, AccountingDB.
     • Agents: Agent, Agent, Agent.
     • DIRAC resources: DIRAC Storage (DiskFile; gridftp, bbftp, rfio), LCG Resource Broker (CE 1, CE 2, CE 3), DIRAC Sites (DIRAC CE, DIRAC CE, DIRAC CE).

  5. DIRAC Workload Management. [Workflow diagram; labels as follows.]
     • DIRAC services: Job Receiver, Data Optimizer, JobDB, Task Queue, Sandbox, Matcher, Agent Director, Agent Monitor, Job Monitor, WMS Admin.
     • LCG services: LFC, RB, CE, SE, VO-box.
     • Workload on WN: Pilot Job, Pilot Agent, Job Wrapper, User Application; fork, execute (glexec).
     • Inputs and operations: Job JDL, Job Input, checkData, checkJob, checkPilot, getReplicas, getProxy, getSandbox, uploadData, putRequest.

  6. DIRAC3 Revision and Roadmap. Gennady Kuznetsov (RAL).
     • Operation in multiplatform environment: various Linux flavours, 32-bit/64-bit, Windows(!)
     • Need to separate generic and LHCb behaviour.
     • Need for new functionality affecting multiple components (e.g. job state machinery).
     • DIRAC3 will be the result of this major code revision and reorganisation.
     Roadmap:
     • Dec 2006: Brainstorming meeting at CERN amongst developers.
     • Jan 2007: Barcelona workshop.
     • Feb – April 2007: Re-implementation of the code base according to new design.
     • May 2007: Integration of DIRAC3 system and thorough testing.
     • June 2007: Release of DIRAC3.

  7. AMGA-Bookkeeping Architecture. Carmine Cioffi (Oxford): AMGA now used in production; old system retired. New production machine (volhcb01) for bookkeeping. [Architecture diagram; labels: GANGA application, AMGA Client, AMGA, BookkeepingSvc, BookkeepingQuery, BK Service, JDBC Driver, Oracle DB, Web Browser, Tomcat, Servlet, Jython Server, XML-RPC, volhcb01; links marked Read, Write, Read/Write.]

  8. Software Modules and Data Flow.
     • Monte-Carlo Production Job, Gauss Step: SoftwareInstallation module, GaussApplication module, BookkeepingUpdate module.
     • Monte-Carlo Production Job, Boole Step: SoftwareInstallation module, BooleApplication module, BookkeepingUpdate module.
     • Applications: Simulation (Gauss), Digitisation (Boole), Reconstruction (Brunel), Stripping and Analysis (DaVinci).
     • Data: Raw Data, rDST, DST+RAW, MC Truth, Event Tag Collection.
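
The step and module layout on slide 8 can be pictured as a job made of ordered steps, each running its modules in sequence. The sketch below is only an illustration of that structure: `Step`, `make_module`, and `run_job` are hypothetical names, not DIRAC classes, and the modules are stand-ins that merely record that they ran.

```python
from dataclasses import dataclass
from typing import Callable, List

# Hypothetical module signature: receives the step name and a shared log.
Module = Callable[[str, List[str]], None]


def make_module(module_name: str) -> Module:
    """Build a stand-in module that only records that it ran."""
    def run(step_name: str, log: List[str]) -> None:
        log.append(f"{step_name}:{module_name}")
    return run


@dataclass
class Step:
    name: str
    modules: List[Module]

    def execute(self, log: List[str]) -> None:
        # Modules run strictly in the order they were declared.
        for module in self.modules:
            module(self.name, log)


def build_mc_production_job() -> List[Step]:
    """Steps and module names as listed on the slide."""
    return [
        Step("Gauss", [make_module("SoftwareInstallation"),
                       make_module("GaussApplication"),
                       make_module("BookkeepingUpdate")]),
        Step("Boole", [make_module("SoftwareInstallation"),
                       make_module("BooleApplication"),
                       make_module("BookkeepingUpdate")]),
    ]


def run_job(steps: List[Step]) -> List[str]:
    """Run every step in order and return the execution log."""
    log: List[str] = []
    for step in steps:
        step.execute(log)
    return log


for entry in run_job(build_mc_production_job()):
    print(entry)
```

Keeping software installation and bookkeeping as separate modules, as the slide does, lets each step reuse them around a different application module.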

  9. Last Month Activity. Raja Nandakumar (RAL).
     • Average of 7.5K running jobs in the last month.
     • Temporary problems at PIC and RAL.
     • Record of running jobs: 9715.
     • Plot annotation: bugs found, stop and restart the production.
     [Plots of running jobs; panels: ALL, CERN, PIC, CNAF, GRIDKA, NIKHEF, RAL, IN2P3.]

  10. CPU Use since Dec. 2006. [Chart by country; labels: CERN, Germany, UK, Spain, France, Italy.]

  11. CPU Use since Dec. 2006. CPU use: 40% at Tier 1s. [Chart by site; labels: QMUL, CERN, CNAF, Manchester, GRIDKA.]

  12. Reconstruction since Dec. 2006. [Pie chart by site; labels in extracted order: GRIDKA 2%, RAL, NIKHEF, 10%, 0%, PIC 32%, IN2P3 17%, CERN, CNAF, 19%, 20%.]
      • Data access problems are the main cause of delays to reconstruction.
      • RAL dCache unstable since December.
      • Problems with file staging through SRM.
      • Some GridFTP problems.
      • New staging component in DIRAC.

  13. Data Transfer 1: Failed. Problems with transfers: when a job fails to transfer data to one or more Tier 1s, the transfer request is queued through the VO box. Storage is not always available at Tier 1s, so the number of pending transfer requests increases.

  14. Data Transfer 2: Success. Improvements: temporary replication to a fail-over SE (all Tier 1s). Replication to the final destination is queued in the VO box, which retries until the transfer succeeds. Extremely reliable (multi-threaded transfer agent required).
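
The fail-over scheme on slides 13 and 14 can be sketched as two small functions: one that parks a file on a fail-over SE when the destination is unavailable, and one that replays the queued replication requests. This is an illustrative sketch only, not the DIRAC implementation. The names `store_output` and `drain_requests` and the `upload`/`replicate` callables are hypothetical, the retry loop is capped by `max_passes` so the example always terminates (the slide describes retrying until success), and the multi-threaded agent mentioned on the slide is left out.

```python
from collections import deque
from typing import Callable, Deque, List, Tuple

# A queued request: (file name, SE currently holding it, final destination SE).
Request = Tuple[str, str, str]


def store_output(file_name: str,
                 destination: str,
                 failover_ses: List[str],
                 upload: Callable[[str, str], bool],
                 pending: Deque[Request]) -> str:
    """Upload to the destination, or park the file on a fail-over SE.

    Returns the SE that now holds the file. When a fail-over SE is used,
    a replication request to the real destination is queued.
    """
    if upload(file_name, destination):
        return destination
    for se in failover_ses:
        if upload(file_name, se):
            pending.append((file_name, se, destination))
            return se
    raise RuntimeError(f"no storage element accepted {file_name}")


def drain_requests(pending: Deque[Request],
                   replicate: Callable[[str, str, str], bool],
                   max_passes: int) -> int:
    """Retry queued replications; return the number of passes performed."""
    passes = 0
    while pending and passes < max_passes:
        passes += 1
        # Visit each request once per pass; failed ones go back on the queue.
        for _ in range(len(pending)):
            request = pending.popleft()
            if not replicate(*request):
                pending.append(request)
    return passes
```

In this sketch the job finishes as soon as the file is safe on any SE, and the replication to the final destination is handled afterwards from the queue.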

  15. Castor-2 Next Steps.
      • Data Stripping.
        • Delayed to ~June because of late availability of high-performance pre-selection algorithms.
        • Stripped DSTs to be shipped to all Tier 1 centres.
        • Analysis using Ganga.
        • Output used for LHCb “Physics Book”.
      • Castor Migration.
        • LHCb tests progressing at RAL (Raja).
        • Once jobs run and are stable, aim to switch and replicate existing data from dCache to Castor.
        • End-June deadline for Castor approaching fast!

  16. Alignment Challenge.
      • First release of alignment framework: March.
      • First Alignment Challenge using tracking detectors: end April for production of datasets, ~June for alignment demonstration.
      • Second Alignment Challenge using all sub-detectors: September?
      • VELO is the most precise device in LHCb, but it moves! Retracted by ~3 cm in between fills. 21 tracking stations, 4 sensors per station (r/φ).
      • Different configurations: Magnet OFF, VELO Open; Magnet OFF, VELO Closed; Magnet ON, VELO Open; Magnet ON, VELO Closed.
      • Grid test of Conditions Database: streaming of data constants and running of LHCb applications.
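
The four configurations listed on slide 16 are every combination of two magnet states and two VELO positions. A small sketch that enumerates them; the constant and function names are illustrative only:

```python
from itertools import product
from typing import List

MAGNET_STATES = ("OFF", "ON")
VELO_STATES = ("Open", "Closed")


def alignment_configurations() -> List[str]:
    """Enumerate every magnet/VELO combination, magnet state varying slowest."""
    return [f"Magnet {magnet}, VELO {velo}"
            for magnet, velo in product(MAGNET_STATES, VELO_STATES)]


for config in alignment_configurations():
    print(config)
```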

  17. 2007 Timetable. [Timeline running from December 2007 back to January 2007.]
      • November: First data!
      • September: Second Alignment Challenge.
      • September: Re-reconstruction of b and Min Bias events.
      • June/July: Full chain DAQ - Tier 0 - Tier 1 tests.
      • June: Release of DIRAC3.
      • End April - June: First Alignment Challenge.
      • From March/April: DAQ - Tier 0 throughput tests.
      • Jan - March: DC06 Production Phase.
