  1. How (and why) HEP uses the Grid.

  2. Overview • Major challenges • Scope of talk • MC production • Data transfer • Data analysis • Conclusions

  3. HEP in a nutshell • Workflows include: Monte Carlo production, data calibration, reconstruction of RAW data, skimming of RECO data, analysis of RAW/RECO/TAG data • ~1000 physicists per experiment • So far the main activities are MC production and user analysis

  4. Computing Challenges I • Large amounts of data: ~100 million electronics channels (per experiment), ~1 MB per event, 40 million events per second, record ~100 events per second, ~a billion events and ~15 PB per year • Trivially parallelisable workflows • Many users, O(1000), performing unstructured analysis • Each analysis requires non-negligible data access (<1 TB), plus similar amounts of simulated (Monte Carlo) data [Figure: a CD stack holding one year of LHC data would be ~20 km tall, compared with a balloon at 30 km, Concorde at 15 km and Mt. Blanc at 4.8 km]
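A quick back-of-envelope check of these numbers (assuming roughly 10^7 seconds of live running per year, a figure not stated on the slide):

```python
# Rough check of the per-experiment data volume quoted above.
EVENT_SIZE_MB = 1.0          # ~1 MB per recorded event
RECORD_RATE_HZ = 100         # ~100 events recorded per second
LIVE_SECONDS_PER_YEAR = 1e7  # assumed accelerator live time per year

events_per_year = RECORD_RATE_HZ * LIVE_SECONDS_PER_YEAR   # ~1e9 events
raw_pb_per_year = events_per_year * EVENT_SIZE_MB / 1e9    # 1 PB = 1e9 MB

print(f"~{events_per_year:.0e} events/year, ~{raw_pb_per_year:.0f} PB/year raw per experiment")
```

Scaling the ~1 PB/year of raw data per experiment up to all LHC experiments, plus reconstructed and simulated copies, is what takes the total to the ~15 PB/year quoted above.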

  5. Computing Challenges II • HEP requirements: a scalable workload management system with 10,000s of jobs, 1000s of users and 100s of sites worldwide • Usable by non-computing experts • High levels of data integrity / availability • PBs of data storage • Automatic/reliable data transfers between 100s of sites, managed at a high level • Of a 120 TB data transfer, Mr DiBona, open source program manager at Google, said: "The networks aren't basically big enough and you don't want to ship the data in this manner, you want to ship it fast." http://news.bbc.co.uk/1/hi/technology/6425975.stm • We have no choice

  6. Scope of talk • I know most about the LHC experiments, especially CMS • There are many Grid projects/organisations/acronyms; the focus here is EGEE/gLite == (mainly) Europe • NGS is not included, though there are plans for interoperability • Illustrate the different approaches taken by the LHC experiments • Attempt to give an idea of what works and what doesn't

  7. HEP approaches to grid • There are as many ways to use distributed computing as there are experiments; differences are due to computational requirements and available resources (hardware/manpower) • LCG systems are used in a mix 'n' match fashion by each experiment • Workload management: jobs are submitted to a Resource Broker (RB), which decides where to send each job, monitors it and resubmits on failure (see the sketch below) • Data management: similar syntax, with jobs submitted to copy files between sites; includes the concepts of transfer channel, fair share and multiple retries • A file catalogue maps files to locations (can have multiple instances for different domains)
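To make the workload-management path concrete, here is a minimal sketch of submitting one job through an RB using a gLite-style JDL description. The attribute values, file names and the CLI name are assumptions for illustration; the exact submission command varied across LCG/gLite releases.

```python
import subprocess
import textwrap

# Minimal JDL (Job Description Language) description of a single job.
# The Resource Broker matches the Requirements/Rank expressions against
# the information system to choose a site, then monitors the job and can
# resubmit it on failure.
jdl = textwrap.dedent("""\
    Executable    = "run_analysis.sh";
    Arguments     = "dataset.list";
    StdOutput     = "job.out";
    StdError      = "job.err";
    InputSandbox  = {"run_analysis.sh", "dataset.list"};
    OutputSandbox = {"job.out", "job.err"};
    Requirements  = other.GlueCEPolicyMaxWallClockTime > 720;
    Rank          = -other.GlueCEStateEstimatedResponseTime;
""")

with open("job.jdl", "w") as f:
    f.write(jdl)

# Submission command name depends on the middleware release in use
# (e.g. an edg-job-submit / glite-wms-job-submit style tool).
subprocess.run(["glite-wms-job-submit", "-a", "job.jdl"], check=True)
```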

  8. Computing model: ATLAS (also ALICE / CMS) [Diagram: tiered computing model. The Event Builder (~PB/s) feeds the Event Filter (~100 Gb/s in, ~7.5 MSI2k), which sends ~3 Gb/s of raw data to the Tier 0 at CERN (Castor MSS, ~5 MSI2k, ~5 PB/year, no simulation); the Tier 0 ships ~75 MB/s of raw data to each Tier 1. ~10 Tier 1 regional centres (e.g. RAL for the UK, plus US, Dutch and French centres; ~2 MSI2k and ~2 PB/year each, with MSS and 622 Mb/s links) reprocess data and house simulation and group analysis; some data for calibration and monitoring goes to institutes and calibrations flow back. Each of ~30 Tier 2s (~200 kSI2k, ~200 TB/year, ~20 physicists working on one or more channels; e.g. the London Tier 2: Imperial (~0.25 TIPS), QMUL, UCL, RHUL) holds the full AOD, TAG and relevant physics-group summary data and does the bulk of simulation. Desktops access a physics data cache over 100-1000 Mb/s links. Average CPU ≈ 1-1.5 kSpecInt2k.]

  9. MC generation • Over the last few years extensive analysis of simulated data has been conducted, requiring massive effort from many people • Only recently have we reached the stage of large-scale, automated production with the grid; it has taken a lot of work and is still not perfect • Each experiment has its own system, using LCG components in different ways: CMS adopts a "traditional" LCG approach, i.e. jobs go via the RB to a site; ATLAS bypasses the RB and sends jobs direct to known "good" sites; LHCb implements its own system, using the RB but managing its own load balancing

  10. MC generation (CMS) • LCG submission uses the RB; multiple instances, which can also be multi-threaded • Adopts an "if it fails, try, try again" approach to failures • Does not use the LCG file catalogues, due to performance/scalability concerns; instead uses a custom system with one entry per dataset, O(10-100 GB) [Diagram: ProdRequest / ProdMgr / ProdAgent architecture: user requests enter via a ProdRequest user interface; ProdAgents get work from the ProdMgr and submit jobs to their resources, reporting progress back to an accountant]
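The dataset-level catalogue idea can be sketched as follows (names and structure are illustrative, not CMS's actual schema): the unit of bookkeeping is a whole dataset rather than a file, so even a very large production only needs a modest number of catalogue entries.

```python
# Hypothetical dataset-level catalogue: one entry per dataset, O(10-100 GB),
# rather than one entry per file, keeping lookups and replication cheap.
dataset_catalogue = {
    "/MC/ttbar/RECO/v1": {"size_gb": 85, "n_files": 420,
                          "sites": ["T1_UK_RAL", "T2_UK_London_IC"]},
    "/MC/minbias/RAW/v3": {"size_gb": 40, "n_files": 200,
                           "sites": ["T0_CH_CERN"]},
}

def sites_with_dataset(name):
    """Return the sites hosting a complete copy of the dataset."""
    return dataset_catalogue[name]["sites"]

# A job over /MC/ttbar/RECO/v1 can be sent to either listed site; the
# per-file layout is resolved locally at the site, not in this catalogue.
print(sites_with_dataset("/MC/ttbar/RECO/v1"))
```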

  11. MC generation (CMS II)

  12. MC generation (CMS III) • Large-scale production round started 22 March

  13. MC generation (LHCb) • Completely custom workload management framework: "pilot" jobs, late binding, pull mechanism, dynamic job priorities • Single point of failure • Uses standard LCG file tools
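The pilot/late-binding pattern can be summarised with a short sketch (a simplified illustration, not LHCb's actual framework; the task-queue URL and payload format are invented): the job that travels through the grid is a generic pilot, and the real payload is only chosen from the central queue at the moment the pilot starts running, so priorities are applied late and a broken worker node simply never pulls any work.

```python
import json
import subprocess
import urllib.request

TASK_QUEUE_URL = "https://example.org/taskqueue"  # hypothetical central service

def fetch_next_task():
    """Ask the central queue for the highest-priority matching payload."""
    with urllib.request.urlopen(TASK_QUEUE_URL + "/next") as resp:
        return json.load(resp)  # None when the queue is empty

def run_pilot():
    # Late binding: the payload is decided here, on the worker node,
    # not at submission time. A broken node just pulls no work.
    while True:
        task = fetch_next_task()
        if not task:
            break  # queue drained: the pilot exits and frees the slot
        subprocess.run(task["command"], shell=True, check=False)
        # ...report the outcome back to the central service here...

if __name__ == "__main__":
    run_pilot()
```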

  14. MC generation (LHCb II) [Plot: production by site: ALL, CERN, PIC, GRIDKA, CNAF, NIKHEF, RAL, IN2P3]

  15. MC generation overview • A couple of different approaches • Workload: LCG can cope with the requirements, but there are concerns over reliability, speed and scalability: CMS uses multiple RBs with multiple (multi-threaded) submitters plus automatic retry; ATLAS bypasses the RB and submits direct to known sites (x10 faster); LHCb implements its own late binding • File handling: again, scalability and performance concerns over central file catalogues; the new LCG architecture allows multiple catalogues, but some experiments still have concerns • Instead of tracking individual files, entire datasets are used

  16. Data analysis • Generally less developed than the MC production systems • So far fewer jobs, but need to be ready for experiment start-up • Experiments use similar methodologies to their production systems: LHCb adopts a late-binding approach with pilot jobs; CMS submits via the Resource Broker • Generally jobs are sent to the data • Additional requirements beyond MC production: local storage throughput of 1-5 MB/s per job; ease of use (gentle learning curve, pretty interface, sensible defaults, etc.)

  17. Data analysis (ATLAS/LHCb) See talk by Ulrik Egede

  18. Data analysis (CMS) • Standard grid model, again using the RB • Requires a large software install (~4 GB) at each site: the site provides an NFS area visible to all worker nodes; software is installed with apt/rpm (over NFS); trivial to use tar etc. • User provides application + config; CRAB creates, submits and tracks the jobs • Output is returned to the user or stored at a site • Plans for a server architecture to handle retries
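A rough sketch of the "creates jobs" step (a simplified illustration; CRAB's real configuration keys and splitting logic differ): the user supplies an application plus configuration and names a dataset, and the tool splits the request into grid jobs covering a fixed number of events each before submitting them via the RB.

```python
import math

def split_into_jobs(dataset, total_events, events_per_job):
    """Split an analysis of `dataset` into per-job specifications,
    roughly what a CRAB-like tool does before submission."""
    n_jobs = math.ceil(total_events / events_per_job)
    jobs = []
    for i in range(n_jobs):
        first = i * events_per_job
        jobs.append({
            "dataset": dataset,
            "first_event": first,
            "max_events": min(events_per_job, total_events - first),
            "output": f"analysis_{i:04d}.root",
        })
    return jobs

# e.g. 1,000,000 events analysed in chunks of 25,000 -> 40 grid jobs,
# each sent to a site hosting the dataset ("send jobs to the data").
for job in split_into_jobs("/MC/ttbar/RECO/v1", 1_000_000, 25_000)[:3]:
    print(job)
```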

  19. Data analysis (CMS II) arda-dashboard.cern.ch/cms

  20. Data analysis summary • More requirements on sites, so harder for smaller sites to support • Non-expert users cause a large user-support workload

  21. Data Transfer • Require reliable, prioritisable, autonomous large-scale file transfers • LCG file-transfer functionality is relatively new and still under development • You can submit a job to a file management system that will attempt file transfers for you • All(?) experiments have created their own systems to provide high-level management and to overcome failures
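The "overcome failures" part usually amounts to wrapping the low-level copy tool in retry logic, roughly as sketched below. The command name `grid-copy` is a placeholder, not a real tool; the actual LCG utilities and their options vary by release.

```python
import subprocess
import time

def transfer_with_retries(source, destination, max_attempts=3, backoff_s=60):
    """Attempt a transfer with a low-level grid copy tool, retrying on failure."""
    for attempt in range(1, max_attempts + 1):
        result = subprocess.run(
            ["grid-copy", source, destination],  # placeholder command name
            capture_output=True,
        )
        if result.returncode == 0:
            return True
        print(f"attempt {attempt} failed: {result.stderr.decode(errors='replace')}")
        time.sleep(backoff_s)
    return False
```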

  22. Data Transfer (CMS) • PhEDEx: agents at each site connect to a central DB and receive work (transfers and deletions) • Web-based management of the whole system: subscribe data, delete data, all from any site in the system • Authentication with X509 certificates
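The agent pattern itself is simple and can be sketched as below (illustrative only, not the PhEDEx code or schema; the `central_db` client object is hypothetical): each site runs a lightweight agent that periodically asks the central database for work assigned to that site, performs the transfers or deletions, and reports the outcome back.

```python
import time

class SiteAgent:
    """Illustrative site-side agent: pull work from a central DB, act, report."""

    def __init__(self, site_name, central_db):
        self.site = site_name
        self.db = central_db  # hypothetical client for the central database

    def poll_once(self):
        for task in self.db.pending_tasks(self.site):  # transfers and deletions
            if task["action"] == "transfer":
                ok = self.do_transfer(task["source"], task["destination"])
            else:
                ok = self.do_delete(task["file"])
            self.db.report(task["id"], "done" if ok else "failed")

    def do_transfer(self, source, destination):
        return True  # call the site's preferred transfer tool here

    def do_delete(self, path):
        return True  # call the site's storage system here

def run_agent(agent, poll_interval_s=300):
    while True:
        agent.poll_once()
        time.sleep(poll_interval_s)
```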

  23. Data transfer (CMS II)

  24. Data transfer (CMS III)

  25. Data transfer overview • LCG provides tools for low-level (file) access and transfer • For higher-level management (i.e. multi-TB) you need to write your own system

  26. Conclusions • The (LCG) grid is a vast computational resource ready for exploitation • Still far from perfect: more failures and lower performance than local resources, but probably much larger! • The smaller your requirements, the more successful you will be

  27. Backup

  28. Computing model II (LHCb) • Similar, but places lower resource requirements on smaller sites • Allows uncontrolled user access to vital Tier-1 resources • Possibility for conflict

  29. MC generation (ATLAS) • Submission via the Resource Broker is slow: >5-10 s per job, with a limit of 10,000 jobs per day per submitter • LCG submission bypassing the RB goes direct to the site, with load balancing handled by experiment software
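As a quick check (assuming a single, serial submitter), the submission rate and the daily limit are of the same order:

```python
# At 5-10 seconds per submission, one serial submitter manages at most:
SECONDS_PER_DAY = 24 * 3600
for seconds_per_job in (5, 10):
    print(seconds_per_job, "s/job ->", SECONDS_PER_DAY // seconds_per_job, "jobs/day")
# 5 s/job  -> 17280 jobs/day
# 10 s/job -> 8640 jobs/day
# i.e. around the ~10,000 jobs/day/submitter limit, hence the direct-to-site
# submission route (x10 faster) mentioned on the overview slide.
```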

  30. Data Transfer (ATLAS) • Similar approach to CMS [Plots: transfer throughput (MB/s) and total errors]
