
PROOF system for parallel MPD event processing

Gertsenberger K. V., Joint Institute for Nuclear Research, Dubna.


Presentation Transcript


  1. PROOF system for parallel MPD event processing. Gertsenberger K. V., Joint Institute for Nuclear Research, Dubna

  2. NICA scheme

  3. Multipurpose Detector (MPD). The MPDRoot software is developed for MPD event simulation, reconstruction of experimental or simulated data, and the subsequent physics analysis of heavy-ion collisions registered by the Multipurpose Detector at the NICA collider.

  4. Prerequisites of the parallel processing
     • high interaction rate (up to 6 kHz)
     • high particle multiplicity: about 1000 charged particles per central collision at NICA energies
     • reconstruction of one event currently takes tens of seconds in MPDRoot, so 1M events take months
     • large data stream from the MPD: estimated at 5 to 10 PB of raw data per year (1M simulated events ≈ 50 TB)
     • MPD event data can be processed concurrently
     • the ability to use multicore/multiprocessor machines, computing clusters and, subsequently, a GRID system
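The "tens of seconds per event, months per 1M events" estimate above can be checked with a few lines of arithmetic. The sketch below is only an illustration: the value of 20 s per event is an assumed midpoint for "tens of seconds", the 34-worker figure is borrowed from the cluster size quoted in the conclusions, and ideal linear scaling is assumed.

```python
SECONDS_PER_DAY = 24 * 60 * 60


def sequential_days(n_events: int, seconds_per_event: float) -> float:
    """Wall-clock days needed to reconstruct n_events one after another."""
    return n_events * seconds_per_event / SECONDS_PER_DAY


def ideal_parallel_days(n_events: int, seconds_per_event: float, n_workers: int) -> float:
    """Same job under ideal linear scaling (no overhead) on n_workers workers."""
    return sequential_days(n_events, seconds_per_event) / n_workers


def megabytes_per_event(total_bytes: float, n_events: int) -> float:
    """Average event size in MB (1 MB = 10**6 bytes)."""
    return total_bytes / n_events / 1e6


if __name__ == "__main__":
    n_events = 1_000_000
    seconds_per_event = 20.0  # ASSUMPTION: stand-in for "tens of seconds"
    print(f"sequential: {sequential_days(n_events, seconds_per_event):.0f} days")
    print(f"34 workers: {ideal_parallel_days(n_events, seconds_per_event, 34):.1f} days")
    # 50 TB for 1M simulated events, as stated on the slide
    print(f"event size: {megabytes_per_event(50e12, n_events):.0f} MB")
```

Under these assumptions the sequential run takes roughly 231 days, which is consistent with the "months" quoted on the slide.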

  5. Current NICA cluster in LHEP

  6. Data storage on the NICA cluster: the GlusterFS distributed file system
     • it aggregates existing file systems into a common distributed file system
     • automatic replication runs as a background process
     • a background self-checking service restores corrupted files after a hardware or software failure

  7. Parallel MPD event processing (concurrent event processing)
     • PROOF server: parallel data processing in ROOT macros on parallel architectures
     • MPD-scheduler: a scheduling system that distributes tasks to parallelize data processing across the cluster nodes

  8. Parallel data processing with PROOF
     • PROOF (Parallel ROOT Facility) is part of the ROOT software; no additional installation is needed
     • PROOF uses data-independent parallelism, based on the lack of correlation between MPD events, which gives good scalability
     • Parallelization for three parallel architectures:
       • PROOF-Lite parallelizes data processing on one multiprocessor/multicore machine
       • PROOF parallelizes processing on a heterogeneous computing cluster
       • parallel data processing in a GRID system
     • Transparency: the same program code can execute both sequentially and concurrently
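The two ideas above, event-level data parallelism and transparency, can be sketched outside ROOT. This is not PROOF code: `reconstruct` and `process_events` are hypothetical names, the per-event work is a placeholder, and a thread pool is used only to keep the sketch self-contained (PROOF itself runs separate worker processes).

```python
from concurrent.futures import ThreadPoolExecutor
from typing import List, Optional


def reconstruct(event_id: int) -> int:
    """Hypothetical stand-in for reconstructing one event.

    It depends only on its own event, which is what makes
    event-level parallelism possible.
    """
    return event_id * event_id


def process_events(start: int, count: int, workers: Optional[int] = None) -> List[int]:
    """Run the same per-event code sequentially or concurrently.

    workers=None -> plain sequential loop
    workers=N    -> N concurrent workers; results keep event order
    """
    event_ids = range(start, start + count)
    if workers is None:
        return [reconstruct(e) for e in event_ids]
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(reconstruct, event_ids))


if __name__ == "__main__":
    # Same code path, same results, with or without parallelism.
    print(process_events(0, 5) == process_events(0, 5, workers=3))
```

Because events do not depend on each other, the caller only chooses how many workers to use; the per-event function is unchanged.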

  9. Using PROOF in MPDRoot
     • The last parameter of the reconstruction macro is run_type (default: "local").
       Speedup on the user's multicore machine:
         $ root 'reco.C("evetest.root", "mpddst.root", 0, 1000, "proof")'
           parallel processing of 1000 events, with the thread count equal to the number of logical processors
         $ root 'reco.C("evetest.root", "mpddst.root", 0, 500, "proof:workers=3")'
           parallel processing of 500 events with three concurrent threads
       Speedup on the NICA cluster:
         $ root 'reco.C("evetest.root", "mpddst.root", 0, 1000, "proof:mpd@nc10.jinr.ru:21001")'
           parallel processing of 1000 events on all cores of the PoD farm
         $ root 'reco.C("evetest.root", …, 0, 500, "proof:mpd@nc10.jinr.ru:21001:workers=15")'
           parallel processing of 500 events on the PoD cluster with 15 workers
     • XRootD files are supported
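The run_type strings shown above follow a small grammar: "local", or "proof" optionally followed by a master address and a "workers=N" suffix. The sketch below shows one way such a string could be decoded. It is a hypothetical illustration, not the parsing used inside MPDRoot; the `RunType` class and `parse_run_type` function are invented names.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class RunType:
    mode: str               # "local" or "proof"
    master: Optional[str]   # e.g. "mpd@nc10.jinr.ru:21001"; None means the local machine
    workers: Optional[int]  # None means "use all available workers"


def parse_run_type(run_type: str = "local") -> RunType:
    """Decode strings such as "proof:mpd@nc10.jinr.ru:21001:workers=15"."""
    if run_type == "local":
        return RunType("local", None, None)
    head, _, rest = run_type.partition(":")
    if head != "proof":
        raise ValueError(f"unknown run_type: {run_type!r}")
    master: Optional[str] = None
    workers: Optional[int] = None
    if rest:
        parts = rest.split(":")
        # An optional trailing "workers=N" limits the worker count.
        if parts[-1].startswith("workers="):
            workers = int(parts[-1][len("workers="):])
            parts = parts[:-1]
        # Whatever remains is the master address (user@host:port).
        if parts:
            master = ":".join(parts)
    return RunType("proof", master, workers)


if __name__ == "__main__":
    for s in ("local", "proof", "proof:workers=3",
              "proof:mpd@nc10.jinr.ru:21001",
              "proof:mpd@nc10.jinr.ru:21001:workers=15"):
        print(s, "->", parse_run_type(s))
```

The worker suffix is checked before the master address because the address itself contains a colon before the port number.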

  10. The speedup of the reconstruction on a 4-core machine

  11. PROOF on the NICA cluster
        $ root 'reco.C("evetest.root", "mpddst.root", 0, 3, "proof:mpd@nc10.jinr.ru:21001")'
      [Diagram: the "event count" argument (3) in the command; GlusterFS storing evetest.root, mpddst.root and *.root files; a Proof On Demand cluster with a PROOF master server and PROOF slave nodes, labelled (14), (10) and (10), handling events №0, №1 and №2.]
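How events might be shared among the slave nodes can be illustrated with a static, proportional split. This sketch rests on two assumptions: that the labels (14), (10) and (10) are per-node worker counts (they add up to the 34 cores quoted in the conclusions), and that a fixed split is an acceptable simplification. PROOF itself hands out work dynamically through its packetizer, so `split_events` is a hypothetical helper, not PROOF's scheduling.

```python
from typing import List, Sequence


def split_events(n_events: int, weights: Sequence[int]) -> List[int]:
    """Split n_events among nodes in proportion to their weights.

    Uses the largest-remainder method so the shares always sum to n_events.
    """
    total = sum(weights)
    if total <= 0:
        raise ValueError("weights must sum to a positive number")
    # Integer part of each node's proportional share.
    shares = [n_events * w // total for w in weights]
    # Fractional parts, kept as integer remainders to avoid float error.
    remainders = [n_events * w % total for w in weights]
    leftover = n_events - sum(shares)
    # Hand the leftover events to the nodes with the largest remainders.
    order = sorted(range(len(weights)), key=lambda i: remainders[i], reverse=True)
    for i in order[:leftover]:
        shares[i] += 1
    return shares


if __name__ == "__main__":
    nodes = [14, 10, 10]  # ASSUMPTION: worker counts per node, from the diagram labels
    print(split_events(3, nodes))     # the 3-event example from the slide
    print(split_events(1000, nodes))
```

With three events and three nodes, each node receives one event, which matches the picture of events №0, №1 and №2 being handled separately.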

  12. The speedup of the reconstruction on the NICA cluster

  13. The description of the PROOF system on mpd.jinr.ru

  14. Conclusions
      • The distributed NICA cluster (128 cores) was deployed on the LHEP farm for the NICA/MPD experiment (FairSoft, ROOT/PROOF, MPDRoot, Gluster).
      • Data storage (10 TB) was organized with the GlusterFS distributed file system: /nica/mpd[1-8].
      • A PROOF On Demand cluster comprising the nc10 (with the PoD server), nc11 and nc13 machines, with 34 processor cores, was implemented to parallelize event data processing for the MPD experiment. PROOF support was added to the reconstruction macro.
      • The web site mpd.jinr.ru presents the manual for the PROOF system in the section Computing – NICA cluster – PROOF parallelize.
