180 likes | 189 Views
This report outlines the activities and achievements of the HEP SysMan team over the past year, including performance, hardware purchases, issues faced, and plans for the future. It also discusses non-*nix related topics and goals for improving reliability and infrastructure.
E N D
RAL PPD Tier 2 (and stuff)Site Report Rob Harper HEP SysMan 30th June 2009 1
Outline • The last year • Things we did and stuff we bought • Where we are now • The next year • What is coming up • Some non *nix stuff
Over the last year • Performance • Hardware • Other stuff • Issues
Performance • 1,382,736 jobs in year to 2009-05-31 • Cluster has been underutilised much of the time • Availability 93% over last 6 months • Lower than we had hoped • But not entirely our fault!
Hardware • Most purchasing was for GridPP • 70 new SuperMicro twins (1120 cpu cores) • 25 new 20TB Viglen storage nodes (about 450TB of usable space) • Assorted new hardware for service nodes, etc. • Not all new kit is yet fully commissioned
Other Stuff • Tested SL5 WN • Logging with rsyslog (to 2 hosts) • Setting up machines and services for assorted projects • Desktop Linux • Talked to users • One test (SL5) box set up
Issues • Air conditioning • Several failures, including twice in one week after Xmas • Site power • Big power cut • Planned work (on short notice) • The Sun • Building Management System • R89 delays and movement schedules
Where We Are Now • ~613 TB Storage • dCache version 1.9.1-7 • 1584 CPU cores • 2740 kSI2k • Ahead of 2009 GridPP pledges
The next year • Consolidate storage in R1, CPU in lab R27 • Replacing RGMA and BDII hosts with virtual machines • Local private network for IPMI, RAID, APC, etc. • New machines to host VO software. • An SL5 CE • Getting ready to move to all SL5 some time. • CREAM for added yumminess. • Desktop Linux (again!) • LHC data? • Networking...
Networking: now • Nortel 55xx stack in R1 • Similar switches installed in R27 but not yet stacked • Interconnected through RAL site networking • 2 * 1GB links to R27 • 1 * 10GB to R1
Networking: planned • Establish stack in R27 • Connect machine rooms with direct 10GB fibre link • Hopefully add second 10GB link later
Not just *nix… • Windows • Macs • Other services
Windows • Moving to Windows 2008 domain (as I speak!) • Recent desktop machines running Vista Business • VMs for some legacy software • Laptops running Vista Ultimate with Bitlocker encryption (older laptops encrypted with Pointsec) • Remote reporting/updating to our own WSUS (Windows Updates) service and Sophos Enterprise server
Mac • An increasing number of Apple computers in the department • Support still not “official” but provided on a best efforts basis • Sophos antivirus • if you want to heckle this, wait for next year when we announce this for Linux! • Pointsec encryption
Some General Stuff • Visitors’ network • Authenticated wireless access coming up... • Promoting the use of WebDAV for remote file access, rather than PPTP (CERN already use this)
So… • We have the technology… • We’re doing pretty well, but are underutilised • Now we need to work on • Reliability • Getting everything commissioned promptly • Improving our infrastructure