1 / 12

Virtual MSS The first steps

Virtual MSS The first steps. Some new directions About the new ALICE::CERN::SE Fabrizio Furano CERN IT/GS 22-July -08 ALICE TF Meeting http://savannah.cern.ch/projects/xrootd http://xrootd.slac.stanford.edu. xrootd. xrootd. xrootd. xrootd. cmsd. cmsd. cmsd. cmsd.

armen
Download Presentation

Virtual MSS The first steps

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Virtual MSSThe first steps Some new directions About the new ALICE::CERN::SE FabrizioFurano CERN IT/GS 22-July-08 ALICE TF Meeting http://savannah.cern.ch/projects/xrootd http://xrootd.slac.stanford.edu

  2. xrootd xrootd xrootd xrootd cmsd cmsd cmsd cmsd BasicScalla/XRootD working principle Client A small 2-level cluster. Can hold Up to 64 servers Fabrizio Furano - The Scalla suite and the Xrootd

  3. xrootd xrootd xrootd xrootd xrootd xrootd xrootd xrootd xrootd xrootd xrootd xrootd xrootd xrootd xrootd xrootd xrootd cmsd cmsd cmsd cmsd cmsd cmsd cmsd cmsd cmsd cmsd cmsd cmsd cmsd cmsd cmsd cmsd cmsd Simple LAN clusters Simple cluster Up to 64 data servers 1-2 mgr redirectors Advanced cluster Up to 4096 (2 lvls) or 262K (3 lvls) data servers cmsd Everything can have hot spares Fabrizio Furano - The Scalla suite and the Xrootd

  4. Virtual MSS • Purpose: • A request for a missing file comes at cluster X, • X assumes that the file ought to be there • And tries to get it from the collaborating clusters, from the fastest one • Note that X itself is part of the game • And it’s composed by many servers • The idea is that • Each cluster considers the set of ALL the others like a very big online MSS • This is much easier than what it seems • Slowly Into production for ALICE Fabrizio Furano - The Scalla suite and the Xrootd

  5. ALICE global redirector (alirdr) root://alirdr.cern.ch/ Includes CERN, GSI, and others xroot clusters Prague NIHAM … any other xrootd xrootd xrootd xrootd GSI CERN all.role manager all.manager meta alirdr.cern.ch:1312 all.role manager all.manager meta alirdr.cern.ch:1312 all.role manager all.manager meta alirdr.cern.ch:1312 cmsd cmsd cmsd cmsd Cluster Globalization… an example all.role meta manager all.manager meta alirdr.cern.ch:1312 Meta Managers can be geographically replicated Can have several in different places for region-aware load balancing Fabrizio Furano - The Scalla suite and the Xrootd

  6. Cluster globalization • Up to now, xrootd clusters could be populated • With xrdcp from an external machine • Writing to the backend store (e.g. CASTOR/DPM/HPSS etc.) • E.g. FTD in ALICE now uses the first. It “works”… • Load and resources problems • All the external traffic of the site goes through one machine • Close to the dest cluster • If a file is missing or lost • For disk and/or catalogscrewup • Job failure • ... manual intervention needed • With 107 online files finding the source of a trouble can be VERY tricky Fabrizio Furano - The Scalla suite and the Xrootd

  7. Many pieces • Global redirector acts as a WAN xrootd meta-manager • Local clusters subscribe to it • And declare the path prefixes they export • Local clusters (without local MSS) treat the globality as a very big MSS • Coordinated by the Global redirector • Load balancing, negligible load • Priority to files which are online somewhere • Priority to fast, least-loaded sites • Fast file location • True, robust, realtime collaboration between storage elements! • Very attractive for tier-2s Fabrizio Furano - The Scalla suite and the Xrootd

  8. ALICE global redirector Prague NIHAM … any other xrootd xrootd xrootd xrootd GSI CERN all.manager meta alirdr.cern.ch:1312 all.role manager all.role manager all.manager meta alirdr.cern.ch:1312 all.role manager all.manager meta alirdr.cern.ch:1312 cmsd cmsd cmsd cmsd The Virtual MSS Realized all.role meta manager all.manager meta alirdr.cern.ch:1312 But missing a file? Ask to the global metamgr Get it from any other collaborating cluster Note: the security hats could require you use xrootd native proxy support Local clients work normally Fabrizio Furano - The Scalla suite and the Xrootd

  9. The ALICE::CERN::SE July trick • A particular way to use the same pieces of the vMSS • In order to phase out an old SE • Keeping its content! • Advantages • Files are spread evenly  load balancing is effective • More used files are fetched typically first • Default vMSSconfig will be restored soon • Fetch from the global rdr • But it’s already subscribed to the global rdr Fabrizio Furano - The Scalla suite and the Xrootd

  10. ALICE global redirector xrootd xrootd xrootd New SE (starting empty) Old CERN::ALICE::SE (full) cmsd cmsd cmsd The ALICE::CERN::SE July trick Grid Shuttle other LOAD Fabrizio Furano - The Scalla suite and the Xrootd

  11. Virtual MSS • The mechanism is there, fully “boxed” • The new setup does almost everything it’s needed • A (good) side effect: • Pointing an app to the “area” global redirector gives complete, load-balanced, low latency view of all the subscribed SEs • An app using the “smart” WAN mode can just run • Probably now a full scale production/analysis won’t • But what about an interactive small analysis on a laptop? • After all, HEP sometimes just copies everything, useful and not • I cannot say that in some years we will not have a more powerful WAN infrastructure • And using it to copy more useless data looks just ugly • If a web browser can do it, why not a HEP app? Looks just a little more difficult. • Better if used with a clear design in mind Fabrizio Furano - The Scalla suite and the Xrootd

  12. What’s missing • XrdCASTOR subscription to the Global redirector • Needs a complete xrd refurbishment, very old versions • The new xrootd packages will be published shortly • 1-2 weeks. Just some minor fixes, to avoid troubles • Verify that the ML info is there (should be) • Migration of the tier-2s? • They have very old versions too • This should be quite easy for pure-xrootd sites • Xrd-DPM refurbishment? • With consequent subscription to the global rdr • Needs a completexrd refurbishment, very old versions • 3rd party fetches development • Reduce load on FTD • Put the DCaches into the vMSS game in some way Fabrizio Furano - The Scalla suite and the Xrootd

More Related