
UKI-SouthGrid Overview and Oxford Status Report

Pete Gronbech, SouthGrid Technical Coordinator. HEPSYSMAN, RAL, 10th June 2010. Covers UK Tier 2 reported CPU (historical view to present) and SouthGrid sites accounting as reported by APEL.


Presentation Transcript


  1. UKI-SouthGrid Overview and Oxford Status Report. Pete Gronbech, SouthGrid Technical Coordinator. HEPSYSMAN, RAL, 10th June 2010.

  2. UK Tier 2 reported CPU: historical view to present. SouthGrid June 2010

  3. SouthGrid Sites: accounting as reported by APEL. Sites are upgrading to SL5 and recalibrating their published SI2K values. RALPP seems low, even after my compensation for publishing 1000 instead of 2500.
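
The compensation mentioned on slide 3 can be illustrated with a short sketch. It assumes that accounted work scales linearly with the published SI2K value, so work recorded while a site published 1000 instead of 2500 is multiplied by 2500/1000. The function name and the sample figure are hypothetical; the slide does not say how the correction was actually applied.

```python
def compensate_normalised_work(reported, published_si2k=1000.0, correct_si2k=2500.0):
    """Rescale accounted work recorded under a wrong published SI2K value.

    Assumption (not stated on the slide): the accounted work is directly
    proportional to the published SI2K figure, so a linear rescale is enough.
    """
    if published_si2k <= 0:
        raise ValueError("published_si2k must be positive")
    return reported * correct_si2k / published_si2k


if __name__ == "__main__":
    # 400.0 is a made-up figure used only to show the scaling.
    print(compensate_normalised_work(400.0))
```

With the slide's two values the correction factor is 2.5, so a site can still look low after compensation if its underlying usage was low.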

  4. SouthGrid June 2010

  5. Site Resources at Q4 2009

  6. Resources at Q1 2010

  7. GridPP3 h/w generated MoU for 2010, 2011 and 2012

  8. Oxford
     • Preparing tender to purchase h/w with the 2nd tranche of GridPP3 money.
     • ATLAS pool account problem on the DPM: it worked for some users but not for others. Increasing the number of pool accounts fixed it.
     • Ewan is working on KVM and iSCSI for VMs. SL5 has virtio support as both host and guest. Shared storage will give us VMware-style live migration for free, with no VMware tools hassle.

  9. Oxford: Grid Cluster setup, SL5 Worker Nodes. Diagram labels: t2torque02; T2wn40, T2wn5x, T2wn6x, T2wn7x, T2wn8x, T2wn85; T2ce04 (LCG-ce), T2ce05 (LCG-ce); Glite 3.2 SL5.

  10. Oxford: Grid Cluster setup, CREAM ce & pilot setup. Diagram labels: t2ce02 (CREAM), t2ce06 (CREAM), glexec enabled, Glite 3.2 SL5 (shown twice), t2scas01, T2wn41, T2wn40-87.

  11. Oxford: Grid Cluster setup, NGS integration setup. Diagram labels: ngsce-test.oerc.ox.ac.uk, ngs.oerc.ox.ac.uk, wn40, wn5x, wn6x, wn7x, wn8x. ngsce-test is a virtual machine which has the glite CE software installed. The glite WN software is installed via a tarball in an NFS shared area visible to all the WNs. PBSpro logs are rsync'ed to ngsce-test so that the APEL accounting can match which PBS jobs were grid jobs. This contributed 1.2% of Oxford's total work during Q1.
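
The matching step on slide 11 (working out which PBS jobs were grid jobs) can be sketched roughly as follows. This is not how APEL itself is implemented; it is a minimal illustration that assumes PBS accounting records of the form `date time;record_type;job_id;key=value ...` and a set of local job IDs known to the CE. The function names, host names and sample records are all hypothetical.

```python
def parse_pbs_end_records(lines):
    """Return {job_id: attributes} for job-end ('E') accounting records.

    Assumes each record looks like 'date time;type;job_id;key=value key=value'.
    Values containing spaces are not handled in this sketch.
    """
    jobs = {}
    for line in lines:
        parts = line.strip().split(";", 3)
        if len(parts) != 4 or parts[1] != "E":
            continue  # skip malformed lines and non-end records
        job_id, message = parts[2], parts[3]
        attrs = dict(item.split("=", 1) for item in message.split() if "=" in item)
        jobs[job_id] = attrs
    return jobs


def select_grid_jobs(pbs_jobs, ce_job_ids):
    """Keep only the finished PBS jobs whose IDs were submitted through the CE."""
    return {job_id: attrs for job_id, attrs in pbs_jobs.items() if job_id in ce_job_ids}


# Made-up sample records, for illustration only.
SAMPLE_LOG = [
    "06/01/2010 10:00:00;E;101.pbs.example;user=alice queue=grid resources_used.walltime=01:00:00",
    "06/01/2010 10:05:00;E;102.pbs.example;user=bob queue=local resources_used.walltime=00:30:00",
    "06/01/2010 10:06:00;S;103.pbs.example;user=carol queue=grid",
]
SAMPLE_CE_IDS = {"101.pbs.example", "103.pbs.example"}

if __name__ == "__main__":
    finished = parse_pbs_end_records(SAMPLE_LOG)
    print(select_grid_jobs(finished, SAMPLE_CE_IDS))
```

Only job 101 survives here: job 102 finished but did not come through the CE, and job 103 has a start record but no end record yet.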

  12. Oxford Tier-2 Cluster, Jan 2009, located at Begbroke. Tendering for upgrade. Originally installed April 04; decommissioned January 2009, saving approx 6.6 kW. 17th November 2008 upgrade. 26 servers = 208 job slots; 60TB disk. 22 servers = 176 job slots; 100TB disk storage.

  13. Oxford: Grid Cluster Network setup. Diagram labels: three 3com 5500 switches joined by backplane stacking cables (96Gbps full duplex); worker node; three T2se0n 20TB disk pool nodes. Dual channel bonded 1 Gbps links to the storage nodes. 10 gigabit is too expensive, so we will maintain a ratio of 1 gigabit per ~10TB with channel bonding in the new tender.
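
The 1 gigabit per ~10TB ratio on slide 13 is consistent with the existing 20TB pool nodes on dual bonded 1 Gbps links. The sketch below shows how the number of bonded links might be estimated for a different node size. Rounding up to a whole link is an assumption made here, and the function name is hypothetical; the slide only states the approximate ratio.

```python
import math


def bonded_links_needed(capacity_tb, tb_per_gigabit=10.0):
    """Estimate how many bonded 1 Gbps links keep roughly 1 Gbps per 10TB.

    Rounds up to a whole number of links (an assumption, since the slide
    gives the ratio only approximately) and always returns at least one.
    """
    if capacity_tb <= 0:
        raise ValueError("capacity_tb must be positive")
    return max(1, math.ceil(capacity_tb / tb_per_gigabit))


if __name__ == "__main__":
    # Existing pool nodes: 20TB each, served by two bonded links.
    print(bonded_links_needed(20))
```

Because the ratio is approximate, a real design might round down for capacities just above a multiple of 10TB rather than add another link.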

  14. Tender for new kit
     • Storage likely to be 36 bay × 2TB Supermicro servers
     • Compute node options based on twin squared Supermicro with
        • AMD 8 core Best Value for money
        • AMD 12 core
        • Intel 4 core
        • Intel 6 core
       3GB RAM per core, dual SAS local disks
     • APC racks, PDUs and UPSs
     • 3COM 5500G network switches to extend our existing infrastructure
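
The figures on slide 14 imply some simple sizing arithmetic, sketched below. The raw capacity is bays times disk size, before any RAID or filesystem overhead. The RAM calculation assumes two CPU sockets per node, which the slide does not state; the function names and the `sockets` parameter are hypothetical.

```python
def raw_storage_tb(bays=36, disk_tb=2):
    """Raw capacity of one storage server, before RAID or filesystem overhead."""
    return bays * disk_tb


def ram_per_node_gb(cores_per_cpu, sockets=2, gb_per_core=3):
    """RAM needed per compute node at 3GB per core.

    sockets=2 is an assumption; the slide only gives cores per CPU.
    """
    return cores_per_cpu * sockets * gb_per_core


# CPU options listed on the slide, keyed by cores per CPU.
CPU_OPTIONS = {"AMD 8 core": 8, "AMD 12 core": 12, "Intel 4 core": 4, "Intel 6 core": 6}

if __name__ == "__main__":
    print("Raw TB per storage server:", raw_storage_tb())
    for name, cores in CPU_OPTIONS.items():
        print(name, "->", ram_per_node_gb(cores), "GB RAM per node")
```

Under the two-socket assumption the options range from 24GB to 72GB of RAM per node, which is one reason the per-core memory requirement matters when comparing prices.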

  15. Atlas Monitoring

  16. PBSWEBMON

  17. Ganglia

  18. Command Line
     • showq | more
     • pbsnodes -l
     • qstat -an
     • ont2wns df -hl
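
Output from commands like those on slide 18 is often post-processed in scripts. The sketch below parses text in the shape that `pbsnodes -l` is assumed to produce, namely one problem node per line with the node name followed by its state. The helper names and the sample text are hypothetical, and the exact output format should be checked against the local batch system before relying on this.

```python
import subprocess


def parse_pbsnodes_l(text):
    """Map node name to state from 'name  state' lines (assumed format)."""
    nodes = {}
    for line in text.splitlines():
        fields = line.split()
        if len(fields) >= 2:
            nodes[fields[0]] = fields[1]
    return nodes


def run_pbsnodes_l():
    """Run 'pbsnodes -l' and parse its output; needs the PBS client tools installed."""
    result = subprocess.run(["pbsnodes", "-l"], capture_output=True, text=True, check=True)
    return parse_pbsnodes_l(result.stdout)


# Made-up sample output, for illustration only.
SAMPLE_OUTPUT = "t2wn41               down\nt2wn52               offline\n"

if __name__ == "__main__":
    print(parse_pbsnodes_l(SAMPLE_OUTPUT))
```

Keeping the parsing separate from the command invocation means the parser can be exercised on saved output without access to the cluster.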

  19. Local Campus Network Monitoring

  20. Gridmon

  21. Patch levels: Pakiti v1 vs v2

  22. Monitoring tools etc

  23. Conclusions
     • SouthGrid sites' utilisation is improving
     • Many have had recent upgrades; others are putting out tenders
     • Will be purchasing new hardware in the GridPP3 second tranche
     • Monitoring for production running is improving
