160 likes | 279 Vues
The ALICE Data Challenge aims to prepare for rapid data acquisition at a rate of 1.5 GB/s, significantly higher than other LHC experiments. This initiative requires the simulation of both online and offline data recording and processing chains, testing technologies like DATE, ROOT, HPSS, and various hardware setups. The first challenge involved storing 10 TB of raw data in a ROOT database within 5 days at 25 MB/s. By analyzing event models and optimizing data handling processes, the project seeks to ensure readiness for future data flows with minimal surprises.
E N D
ALICE Data Challenges Fons Rademakers
Data Challenge • When fully operational ALICE will take data with a rate of 1.5 GB/s • An order of magnitude higher than any of the other LHC experiments • Since we are unique we can not piggyback on ATLAS/CMS/IT R&D
Objectives • Simulate the ALICE on-line and off-line data recording and processing chain • Track technologies: • DATE, ROOT, MSS, networking, CPU's and OS'es • Repeat frequently to collect data points to see if we will be ready for 1.5 GB/s in 2005 Be ready for T=0 with as little surprises as possible
First Data Challenge • To store 10TB of raw data in a ROOT DB on HPSS within 5 days (i.e. 25 MB/s) • To test the following technologies: • DATE, ROOT, HPSS, GB ethernet, Linux and PC's • Very simple event model, but fairly realistic raw, tag and catalogue database environment • Executed in March 1999
ADC1 Data Flow Computer center NA57 Data source Intel/Linux PC PowerPC/AIX ROOT objectifier LDC pipe GDC Event Builder 5 MB/s switch GB eth rfcp LDC 5 MB/s disk GB eth DATE = LDC+GDC 5 MB/s HPSS RS6000/AIX 5 node cluster
DATE Data Source • DATE is developed by the ALICE DAQ team and used by NA57 and COMPASS • See talk by R. Divia (B293) this afternoon • LDC = Local Data Concentrator • GDC = Global Data Collector • In this setup the LDC's generated bit patterns simulating (NA57) events (size 400 - 500 KBytes/event)
RAW Data DB • Raw data DB contains AliEvent objects, which contain: • an AliEventHeader object • 16 data members (72 bytes) • an AliRawData object • char array (variable length, 400-500 kB) • AliEvent objects are stored in a single TTree container with two branches (for the two contained objects)
RAW Data DB (cont.) • No compression (need more CPU power) • Size of individual raw DB's typically between 250 MB and 1.5 GB (tunable parameter) • As soon as a raw DB is closed it is moved to HPSS via rfcp (part of CERN SHIFT software interfaced to HPSS)
Tag DB • Tag DB contains AliEventHeader objects, which contain: • size, type, run number, event number, burst number, trigger, time, detector id, etc. • AliEventHeader objects are stored in a single TTree with 16 branches (one for each header data member) • Compressed • Stays on disk • Used for fast event selection
Run Catalog DB • Run DB contains one AliStats object per produced raw data DB • An AliStats object contains: • filename of raw DB, number of events, begin/end run number, begin/end event number, begin/end time, file size of raw DB, compression factor, quality histogram (TH1F) • Compressed • Stays on disk • Used for matching run/event with raw DB
NA57 Data source Computer center 9 PowerPC/AIX LDC Intel/Linux PC Cluster 5 MB/s LDC Switch Event Filter 5 MB/s GDC Event Builder GB eth ROOT objectifier Switch Pipe 5 IntelPC/Linux 10 MB/s LDC Switch GB eth 10 MB/s LDC 10 MB/s Catalog Tag DB HPSS/ CASTOR MSS Alice DAQ Lab Data source DATE = LDC+GDC 20 node cluster