Aims of the analysis

Comparison between Gridice and Boss data related to LCG0 production. G. Maggi, M. Maggi, A. Pierro, N. De Filippis, G. Donvito, T. Coviello. Aims of the analysis: to extract some parameters from LCG0 production data, such as farm performance, job lifetime, CPU load and used memory.


Presentation Transcript


  1. Comparison between Gridice and Boss data related to LCG0 production. G. Maggi, M. Maggi, A. Pierro, N. De Filippis, G. Donvito, T. Coviello

  2. Aims of the analysis • To extract some parameters from LCG0 production data such as • farm performance • job lifetime • CPU load and used memory • LAN and WAN occupancy for the simulation of the CMS analysis environment • To select some reference distributions to be reproduced by the simulation

  3. User Interfaces & Boss databases • Bari • Padova • Bologna • Ecole Polytechnique (France)

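  4. CMS-LCG0 testbed

     | # | Farm | N PC | Type | HT | Clock (MHz) | Memory (Mbytes) |
     |---|------|------|------|----|-------------|-----------------|
     | 1 | Bari | 1 | dual P III | no | 1000 | 1000 |
     |   |      | 2 | dual P III | no | 1133 | 1000 |
     |   |      | 6 | dual P III | no | 1266 | 1000 |
     | 2 | Bologna | 10 | dual P III | no | 1000 | 500 |
     |   |      | 1 | dual P IV (Xeon) | no | 2400 | 1000 |
     | 3 | Padova | 9 | dual P III | no | 1000 | 500 |
     |   |      | 8 | dual P III | no | 1266 | 1000 |
     |   |      | 9 | dual P IV (Xeon) | no | 2400 | 1000 |
     | 4 | LNL | 13 | dual P III | no | 1000 | 500 |
     |   |      | 14 | dual P IV (Xeon) | no | 2400 | 1000 |
     | 5 | CERN | 10 | dual P IV (Xeon) | no | 2800 | 1000 |
     | 6 | CNAF | 2 | dual P III | no | 1000 | |
     | 7 | E. Polytechnique | 2.5 | dual P III | no | 800 | 500 |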

  5. Jobs per farm and computer type

  6. Performances

  7. Job lifetime (dataset CMSim mu03 bb2mu)

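  8. Table of job lifetimes (dataset CMSim mu03 bb2mu)

     | Farm | PC clock (MHz) | Average time (s) | RMS (s) | RMS/average | Measured Specint | Specint 2000 |
     |------|----------------|------------------|---------|-------------|------------------|--------------|
     | Bari | 1000 | 39920 | 2490 | 0.062 | 478 | 408 |
     |      | 1133 | 30970 | 1686 | 0.054 | 616 | 562 |
     |      | 1266 | 27540 | 3072 | 0.112 | 693 | 623 |
     | Bologna | 1000 | 42040 | 6944 | 0.165 | 454 | 408 (*) |
     |      | 2400 | 22280 | 1337 | 0.060 | 856 | 833 |
     | Padova | 1000 | 40940 | 4850 | 0.118 | 466 | 408 |
     |      | 1266 | 27980 | 2245 | 0.080 | 682 | 623 |
     |      | 2400 | 22790 | 1355 | 0.059 | 837 | 833 |
     | Cern | 2800 | 19390 | 944 | 0.049 | 984 | 984 |
     | Lnl | 1000 | 40110 | 2571 | 0.064 | 476 | 408 |
     |      | 2400 | 22610 | 1905 | 0.084 | 844 | 833 |

     (*) Long tail (however, the peak is at 40000 s)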

  9. Specint • “Specint 2000” is obtained from the web: http://www.specbench.org/osg/cpu2000/results/cint2000.html • The “Measured Specint” is obtained by this formula: [formula shown as an image on the slide]
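A hedged reconstruction: the values in the job lifetime table (slide 8) are consistent with scaling the CERN 2800 MHz entry, where measured and published Specint coincide at 984, by the ratio of average job times. The choice of reference and the name `measured_specint` are assumptions inferred from the table, not the authors' stated definition.

```python
# Assumption: the CERN 2800 MHz nodes are the reference point, because their
# measured and published Specint values coincide (984) in the slide 8 table.
REF_SPECINT = 984
REF_TIME_S = 19390  # average job time on the CERN 2800 MHz nodes, in seconds


def measured_specint(avg_time_s, ref_time_s=REF_TIME_S, ref_specint=REF_SPECINT):
    """Scale the reference Specint by the ratio of average job times."""
    return ref_specint * ref_time_s / avg_time_s


# (farm, clock in MHz) -> (average job time in s, measured Specint listed in the table)
TABLE = {
    ("Bari", 1000): (39920, 478),
    ("Bari", 1133): (30970, 616),
    ("Bari", 1266): (27540, 693),
    ("Bologna", 1000): (42040, 454),
    ("Bologna", 2400): (22280, 856),
    ("Padova", 1000): (40940, 466),
    ("Padova", 1266): (27980, 682),
    ("Padova", 2400): (22790, 837),
    ("Cern", 2800): (19390, 984),
    ("Lnl", 1000): (40110, 476),
    ("Lnl", 2400): (22610, 844),
}


def mismatches(table=TABLE):
    """Return the rows whose listed value differs from the rounded estimate."""
    return [
        key
        for key, (avg_time_s, listed) in table.items()
        if round(measured_specint(avg_time_s)) != listed
    ]


for (farm, clock), (avg_time_s, listed) in TABLE.items():
    estimate = measured_specint(avg_time_s)
    print(f"{farm:8s} {clock:5d} MHz  estimate={estimate:7.1f}  table={listed}")
```

Under this reading, a shorter average job time maps to a proportionally higher measured Specint, which is why the figure tracks the CPU rather than the farm.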

  10. Measured Specint

  11. Notes about measured Specint • The Padova farm, in datasets sim_mu03_MB and Kin_mu03_MB, did not behave like the others. Why? • The “measured Specint” does not depend on the farm, i.e. it does not depend on the number of elements in the farm. • The “measured Specint” depends on the assignment type (kin, sim, etc.)

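  12. Measured Specint used for job lifetime normalization

     | CPU | SPEC CPU2000 (obtained from www.specbench.org) | Measured Specint (from cmsim mu03_bb2mu) |
     |-----|------------------------------------------------|------------------------------------------|
     | 1000 MHz | 408 | 469 |
     | 1133 MHz | 562 | 616 |
     | 1266 MHz | 623 | 688 |
     | 2400 MHz | 833 | 846 |
     | 2800 MHz | 984 | 984 |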
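The measured Specint values on this slide look like per-clock averages over farms of the values in the job lifetime table on slide 8. For example, the four 1000 MHz entries (478, 454, 466, 476) average to 468.5, listed here as 469. The sketch below checks that reading. The averaging and the round-half-up rule are inferences from the numbers, not something the slides state.

```python
import math
from collections import defaultdict

# Measured Specint per (farm, clock in MHz), copied from the slide 8 table.
PER_FARM = {
    ("Bari", 1000): 478, ("Bari", 1133): 616, ("Bari", 1266): 693,
    ("Bologna", 1000): 454, ("Bologna", 2400): 856,
    ("Padova", 1000): 466, ("Padova", 1266): 682, ("Padova", 2400): 837,
    ("Cern", 2800): 984,
    ("Lnl", 1000): 476, ("Lnl", 2400): 844,
}

# Values listed on slide 12, keyed by clock in MHz.
LISTED = {1000: 469, 1133: 616, 1266: 688, 2400: 846, 2800: 984}


def average_by_clock(per_farm):
    """Average the measured Specint over all farms sharing a clock speed."""
    groups = defaultdict(list)
    for (_farm, clock), value in per_farm.items():
        groups[clock].append(value)
    return {clock: sum(values) / len(values) for clock, values in groups.items()}


def round_half_up(x):
    """Round to the nearest integer, with .5 going up (unlike built-in round)."""
    return math.floor(x + 0.5)


averages = average_by_clock(PER_FARM)
for clock in sorted(averages):
    print(f"{clock} MHz: average={averages[clock]:.1f}  listed={LISTED[clock]}")
```

Python's built-in `round` rounds halves to the nearest even integer, so it would turn 468.5 into 468; the explicit half-up helper is what matches the listed 469.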

  13. Measured and theoretical Specint

  14. dataset sim_mu03_bb2mu

  15. Job lifetime (dataset CMSim mu03 MB)

  16. dataset sim_mu03_MB using “measured specint” of sim_mu03_bb2mu

  17. Job lifetime (dataset CMKin mu03 MB)

  18. dataset kin using “measured specint” of sim_mu03_bb2mu

  19. Comparison between gridice and boss data • Number of jobs running and waiting according to gridice and boss • The two sources agree, except for a time shift due to imperfect synchronization between the gridice and UI clocks (a shift of about 7 minutes for the Bari farm)
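The slides do not say how the 7-minute shift was obtained. One way such a shift could be estimated is to slide one running-job series against the other and keep the lag with the smallest mismatch. The sketch below does that on synthetic data; the series, the one-minute sampling and the helper name `best_lag` are all hypothetical.

```python
def best_lag(reference, shifted, max_lag):
    """Return the lag (in samples) at which `shifted` best matches `reference`.

    A positive lag means `shifted` trails `reference` by that many samples.
    """
    best, best_cost = 0, float("inf")
    for lag in range(-max_lag, max_lag + 1):
        pairs = [
            (reference[i], shifted[i + lag])
            for i in range(len(reference))
            if 0 <= i + lag < len(shifted)
        ]
        # Mean squared difference over the overlapping samples.
        cost = sum((a - b) ** 2 for a, b in pairs) / len(pairs)
        if cost < best_cost:
            best, best_cost = lag, cost
    return best


# Synthetic running-job counts, one sample per minute (illustration only).
boss = [0] * 10 + [4] * 20 + [9] * 30 + [3] * 20 + [0] * 10
# The same curve delayed by 7 samples, standing in for the second source.
gridice = [boss[0]] * 7 + boss[:-7]

print("estimated shift (minutes):", best_lag(boss, gridice, max_lag=15))
```

The mean squared difference is used instead of a raw sum so that lags with fewer overlapping samples are not favoured automatically.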

  20. Running jobs on Padova farm, date: 01 September – 08 September

  21. Running jobs on Bari farm, date: 08 September – 15 September

  22. Running jobs on a single PC of the Bologna farm: CPU load and RAM load vs time (s)

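  23. Used memory

     | PC clock (MHz) | cmsim: average RAM used | cmkin: average RAM used |
     |----------------|-------------------------|-------------------------|
     | 1000 | 75 MB | 10 MB |
     | 1266 | 80 MB | |
     | 2400 | 90 MB | |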

  24. Running job vs Transfer rate (Byte)

  25. Summary • With this analysis we have • Measured the execution time of jobs on different CPUs • Modelled the CPU performance • Measured the farm performance • Measured the resource load (RAM, CPU, transfer rate) • Validated the Boss data against the Gridice data • We are now starting the simulation of the computing model (Ptolemy, Monarc, etc.)

  26. BO Ass3195, BA Ass3195, PD Ass3195: job lifetime - CPU time = I/O time, dataset CMSIM

  27. LNL Ass3340, CERN Ass3340: job lifetime - CPU time = I/O time, dataset CMSIM

  28. BA Ass2653, BO Ass2653, CERN Ass2653: job lifetime - CPU execution time = I/O time, dataset CMKIN

  29. PD Ass2653, LNL Ass2653: job lifetime - CPU execution time = I/O time, dataset CMKIN

  30. Notes about I/O Time • For CMSIM, the I/O time peaks at about 60-100 seconds on all farms except CERN, where the peak is at 35 seconds • For CMKIN, the I/O time is about ten seconds, depending on the kind of assignment
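The I/O time used on slides 26 to 30 is the job lifetime minus the CPU time. As a minimal sketch of how the peaks quoted above could be read off, the code below computes that difference per job and returns the most populated histogram bin. The job records, the 20 s bin width and the helper names are hypothetical and do not come from the Boss database.

```python
from collections import Counter


def io_time(lifetime_s, cpu_time_s):
    """I/O time as defined on the slides: job lifetime minus CPU time."""
    return lifetime_s - cpu_time_s


def peak_bin(values, bin_width_s):
    """Return the lower edge of the most populated histogram bin."""
    counts = Counter(int(v // bin_width_s) * bin_width_s for v in values)
    return max(counts, key=counts.get)


# Hypothetical (lifetime, CPU time) pairs in seconds, for illustration only.
jobs = [
    (40000, 39920),
    (41000, 40925),
    (39500, 39410),
    (42000, 41760),  # an outlier in the tail
    (40200, 40115),
]

io_times = [io_time(lifetime, cpu) for lifetime, cpu in jobs]
print("I/O times (s):", io_times)
print("peak bin starts at (s):", peak_bin(io_times, bin_width_s=20))
```

Reporting the peak rather than the mean keeps a few long-tailed jobs from dominating the figure.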
