240 likes | 252 Views
Learn how INFN is moving beyond SI2K as a benchmark for worker nodes, exploring new evaluation methods. Discover the challenges faced and proposed solutions by Michele Michelotto.
E N D
Moving out of SI2K How INFN is moving out of SI2K as a benchmark for Worker Nodes performance evaluation Michele Michelotto at pd.infn.it
INFN Problem • INFN needs to procure computing power for the CNAF Tier1 and for several Tier2 • Which benchmark should we use to specify our requirements? • Since at least Hepix Fall 2006 is clear the SI2000 is close to meaningless • INFN started a working group in charge of proposing a solution michele michelotto - INFN
What it is SI2K • SI2K is actually a nickname for SPECCPUInt2000 benchmark • The geometric mean of a dozen of benchmark which stress the integer performance of a cpu • Delivered by SPEC in 2000, it came after Spec89, Spec Int 92 and Spec Int 95 • Declared obsolete by SPEC in 2006 • Replaced by CPUInt2006 michele michelotto - INFN
Why is it important for us? • SI2K is the benchmark used up to now to measure the computing power of all the HEP experiments • Computing power requested by experiment, e.g. in the Computing TDR • Computing power provided by a Lab or a Tier-x • Computing power a funding agency is committed to pay for • Well, you have seen it yourself this week. Several site reported the Kilo or Mega SI2K available at their sites (other talked only of cores available) michele michelotto - INFN
Transition phase • Impossible to find SPEC Int 2000 pubblished results for the new processors • (e.g. the not so new intel Clovertown 4core) • Impossible to find pubblished SPEC Int 2006 for old processor (before 2006) • E.g. Old P4 Xeon, P4, AMD 2xx • There is no conversion constant from SI2000 to SI2006 • the ratio for x86 architectures is in the 137 – 172 range michele michelotto - INFN
SI2K/SI2K6 ratio michele michelotto - INFN
The SI2K inflaction • The main problem with SI2000 in our community is that it not (any more) proportional to HEP codes performance • You can buy new processors with huge SI2K number but you gain only a smaller increase in real performances • The face value is not the real value michele michelotto - INFN
Nominal SI vs real SI • SI2K results for the last generation processor affected by inflation • So CERN (and FZK) started to use a new currency: SI2K measured with “gcc”, the gnu C compiler and using two flavour of optimization • High tuning: gcc –O3 –funroll-loops–march=$ARCH • Low tuning: gcc –O2 –fPIC –pthread michele michelotto - INFN
Nominal SI vs real SI • CERN Proposal: Use as site rating the “Real SI” obtained by SI measured with gcc-low and increased by 50% • This make sense but only for a short period of time and for the last generation of processor (2006-2007) • Actually is has been proposed as a short term solution • Run n copies in parallel • Where n is the number of cores in the worker node • To take in account the drop in performance of a multicore machine when fully loaded. • Preferred to the use of Spec INT 2000 Rate which had syncronized access to memory, unlike our batch farm worker nodes michele michelotto - INFN
Too many SI2K • An example: how to evalutate a worker node with two Intel Woodcrest dual core 5160 at 3.06 GHz ? • SI2K (1 core): 2929 – 3089 (min – max) • SI2K on 4 cores: 11716 - 12536 • SI2K on 4 cores gcc-low : 5523 • SI2K on 4 cores gcc-high: 7034 • SI2K on 4 cores gcc-low + 50%: 8284 • SI2K on 4 cores gcc-high + 50%: 10551 michele michelotto - INFN
Even more • Actually all the gcc results in the previous slide are on i386 (32bit) • if you would like to know how your code is running on 64 bit machine, you can measure Spec INT 2000 with gcc on x86_64 • So the worker node with two Intel Woodcrest dual core 5160 at 3.06 GHz • SI2K nominal: 2929 – 3089 (min – max) • SI2K on 4 cores: 11716 - 12536 • SI2K on 4 cores gcc-low: 6021 • SI2K on 4 cores gcc-high: 6409 • SI2K pn 4 cores gcc-low + 50%: 9031 michele michelotto - INFN
A scale factor • All these numbers would be only annoying in a world with a unique architecture, in which only clock improves in time • You would be able to find a fixed ratio between all those number. • But in the real world, the ratio depends on both the CPU producer (intel vs AMD) and processor generation (old xeon vs new “core” Xeon ) michele michelotto - INFN
The nominal SI2K Big Differences between Intel and AMD when SI2K/GHz are plotted michele michelotto - INFN
The nominal SI2Krate SI2K RATE differentiate the dual core (TOP) from the 4 core (bottom) Smaller differences between Intel and AMD using SI2K Rate /GHz / core michele michelotto - INFN
The “CERN” SI • The intel-amd gap is now smaller • Intel Xeon 5xxx scores -50% wrt nominal • Amd Opteron scores only -29% • Big differences between 32 and 64 bit on same core • SI2K/core/Ghz derived by M.Alef page on the HEPIX site at Caspur michele michelotto - INFN
SI2K6rate michele michelotto - INFN
Which is the better? • At INFN we started to measure performances of HEP codes on several machines • Our goal was to find a “commercial” mantained benchmark to replace SI2K • We compared HEP code with • SI2K pubblished results • SI2K CERN (gcc low) • SI2006 and SI2006 rate pubblished results • SI2006 and SI2006 with gcc4 (32 and 64 bit) michele michelotto - INFN
Babar TierA Results • If you normalize by core and clock all new processors have the same performance • Doubling the older generation cpu • SI2006 matches this pattern (pubblished and gcc ratio constant) • SI2000-cern better than SI2K nominal • SI2000 clearly does’nt work michele michelotto - INFN
CMS MC SIM and Pythia • CMS Montecarlo simulation (32bit) and Pythia (64bit) show the same performance once normalized • Both Specint 2006 pubblished and Specint 2006 with gcc show the same behaviour • SI2K pubbished does not match HEP sw • SI2K cern better but not as good as SI2006 michele michelotto - INFN
Atlas • Here 100% is Xeon5160 • Few results for SI2006+gcc but no diff from CMS and babar • Few results also from SI2006 pubblished because of several old architectures • SI2K+gcc not bad • SI2K pubblished heavily overstimate new Xeon • Atlas simulation normalized performs the same on the new intel “core” or amd “opteron” (confirm the behaviour seen with CMS, Pithya, Babar) michele michelotto - INFN
What INFN should be do? • The Working Group proposed to infn: • Stop using SI2000 pubblished because it gives meaningless number • SI2k CERN is good as a temporary solution but it is still using a 7 years old benchmark that is not mantained by SPEC (v1.3) • You can use it in a tender this year • We should move to a benchmark of the SPEC CPU 2006 suite • Best long term solution michele michelotto - INFN
SI2006 Rate Best candidate is Spec INT 2006RATEmeasured withgcc • Spec INT 2006 because it’s new, you can find pubblished results to compare, will have bug fixes, and will be portable to new platform and new compiler • RATE because you can take in account of scalability issues with the whole machine architecture • Measured with GCC: to keep the environment as close as possible as the experiments. michele michelotto - INFN
What is missing • All the official number are still expressed in terms of SI2K (nominal) • So funding agencies are still bounded to use SI2K for all formal agreements • A general agreements on the next benchmark is still missing but work is in progress • Example: use gcc4.1 with CERN “low” tuning and 32bit • NB SI2006 rate with gcc 64 bit too big for my machines. It needs more than 1 GB per core michele michelotto - INFN
Thanks • The working group: A. Crescente, L. Dell’Agnello, D. Salomoni, F. Rosso, A. Sartirana, R.Stroili, D.Salomoni, M.Morandin, A.De Salvo, T.Zani • Thanks to CCR: the INFN Computing Committee • Thanks for input from LCG MB michele michelotto - INFN