STFC Cloud • Alexander Dibbo
Contents • Background • OpenNebula • OpenStack • Design Considerations • Capacity • Users
Background • Started as a graduate project using StratusLab • Funded to set up an OpenNebula-based cloud • Started evaluating and deploying OpenStack in 2016
Why run a cloud? • Provide self-service VMs to SCD and the wider STFC • Underpin Horizon 2020 projects • Support SCD’s Facilities program • Give “easy” access to computing for new user communities (a GridPP and UKT0 goal)
OpenNebula • Running stably for 3 years • Works well for individual users • Tricky to use programmatically • “Small” close-knit community • Should be decommissioned this year
OpenStack • Very large community • Very flexible • Complicated • Momentum in scientific communities • Strong API • Preexisting integrations • Jenkins, Grid Engine, LSF etc. • Running for 18 months • Already used for some production services
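As a rough illustration of what “strong API” means in practice, the sketch below boots a VM with the openstacksdk Python client; the cloud name, image, flavor and network names are placeholders, not the real RAL values.

```python
# A minimal sketch of driving the OpenStack API programmatically with
# openstacksdk. Credentials come from a clouds.yaml entry named "stfc"
# (assumed name); image/flavor/network names below are hypothetical.
import openstack

conn = openstack.connect(cloud="stfc")

image = conn.compute.find_image("scientific-linux-7")      # hypothetical image
flavor = conn.compute.find_flavor("m1.medium")             # hypothetical flavor
network = conn.network.find_network("project-net")         # hypothetical tenant network

server = conn.compute.create_server(
    name="worker-01",
    image_id=image.id,
    flavor_id=flavor.id,
    networks=[{"uuid": network.id}],
)
# Block until the instance is ACTIVE, then print its addresses.
server = conn.compute.wait_for_server(server)
print(server.name, server.addresses)
```

The same pattern is what pre-existing integrations such as Jenkins plugins or batch-system cloud bursting build on: they just call the Nova/Neutron APIs on the user’s behalf.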
OpenStack Design Considerations • Multi-tenancy • Multiple user communities, internal and external • Highly Available • Services should be as highly available as possible • Flexible • We want to accommodate all reasonable requests
Highly Available • OpenStack services should be highly available where possible • Ceph RBD is used for VM images
OpenStack Services • Multiple instances of each OpenStack service sit behind HAProxy load balancers
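From the client side the load balancing is invisible: each service exposes a single virtual endpoint and HAProxy spreads requests across the backends. The sketch below is only an illustration of that single-endpoint view (the hostname is a placeholder, not the real VIP); unauthenticated GETs to an OpenStack API root simply return a version document.

```python
# Illustrative only: clients talk to one load-balanced endpoint per service
# and never see the individual backend controllers behind HAProxy.
import requests

KEYSTONE_VIP = "https://openstack.example.ac.uk:5000/"  # hypothetical Keystone VIP

resp = requests.get(KEYSTONE_VIP, timeout=5)
resp.raise_for_status()
# Keystone's root returns its available API versions, whichever backend answered.
print(resp.json()["versions"]["values"][0]["id"])
```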
Ceph RBD • A replicated Ceph cluster called SIRIUS provides block storage for VMs and Volumes • 3x Replication • Optimised for lower latency
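For a sense of how block storage is consumed, here is a minimal sketch using the Ceph Python bindings (python-rados / python-rbd) to create an RBD image; the pool name is hypothetical. With 3x replication, a 10 GiB image consumes roughly 30 GiB of raw capacity.

```python
# A minimal sketch, assuming a reachable Ceph cluster and an RBD pool named
# "sirius-vms" (placeholder name) that the client keyring can access.
import rados
import rbd

cluster = rados.Rados(conffile="/etc/ceph/ceph.conf")
cluster.connect()
try:
    ioctx = cluster.open_ioctx("sirius-vms")              # hypothetical pool name
    try:
        rbd.RBD().create(ioctx, "test-image", 10 * 1024**3)  # 10 GiB image
        print(rbd.RBD().list(ioctx))                      # list images in the pool
    finally:
        ioctx.close()
finally:
    cluster.shutdown()
```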
Multi-Tenancy • Projects (tenants) need to be isolated • From each other • From the STFC site network • Security Groups • VXLAN private project networks • These bring their own problems
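A sketch of what this isolation looks like from the API side: each project gets its own VXLAN-backed network, and ingress is controlled with security groups. Names and the CIDR are placeholders.

```python
# Assumes a clouds.yaml entry named "stfc"; network/subnet/group names and the
# CIDR are illustrative. Neutron allocates a VXLAN segment behind the scenes.
import openstack

conn = openstack.connect(cloud="stfc")

net = conn.network.create_network(name="project-net")
subnet = conn.network.create_subnet(
    name="project-subnet",
    network_id=net.id,
    ip_version=4,
    cidr="192.168.10.0/24",
)

# Security group allowing SSH in; ingress is otherwise closed by default.
sg = conn.network.create_security_group(name="ssh-only")
conn.network.create_security_group_rule(
    security_group_id=sg.id,
    direction="ingress",
    protocol="tcp",
    port_range_min=22,
    port_range_max=22,
    remote_ip_prefix="0.0.0.0/0",
)
```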
Private Networks • Virtual machines connect to a private network • VXLAN is used to tunnel these networks across hypervisors • Ingress and egress are via a virtual router with NAT • Distributed Virtual Routing (DVR) minimises this bottleneck: every hypervisor runs a limited version of the virtual router agent
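The ingress/egress path described above corresponds, on the API side, to a Neutron router that SNATs the private network onto the external provider network, with floating IPs for inbound access. The network names below are placeholders, and whether routers come up as distributed (DVR) is a deployment-side default rather than something tenants normally choose.

```python
# A sketch of the NAT path, assuming the "project-subnet" from the previous
# sketch and an external provider network named "external" (placeholder).
import openstack

conn = openstack.connect(cloud="stfc")

external = conn.network.find_network("external")      # provider network (assumed name)
subnet = conn.network.find_subnet("project-subnet")   # tenant subnet from earlier sketch

router = conn.network.create_router(
    name="project-router",
    external_gateway_info={"network_id": external.id},  # SNAT out towards the site network
)
conn.network.add_interface_to_router(router, subnet_id=subnet.id)

# Floating IPs provide inbound access (DNAT) to individual instances.
fip = conn.network.create_ip(floating_network_id=external.id)
print(fip.floating_ip_address)
```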
VXLAN • VXLAN has significant overheads by default: performance is ~10% of line rate • Tuning memory pages, CPU allocation and using a mainline kernel raises this to ~40% of line rate • Hardware offload: VXLAN offload to the NIC gives ~80% of line rate • A high-performance routed network + EVPN: 99+% of line rate
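The hardware-offload step depends on the NIC advertising UDP tunnel segmentation offload. The helper below is a hypothetical sketch (the function and interface name are mine, not part of any deployment) that checks the relevant ethtool feature on a hypervisor.

```python
# A small sketch for checking whether a hypervisor NIC offloads VXLAN
# (UDP tunnel) segmentation. The interface name is a placeholder.
import subprocess

def vxlan_offload_enabled(iface: str = "eth0") -> bool:
    """Parse `ethtool -k` output for the tx-udp_tnl-segmentation feature."""
    out = subprocess.run(
        ["ethtool", "-k", iface], capture_output=True, text=True, check=True
    ).stdout
    for line in out.splitlines():
        if line.strip().startswith("tx-udp_tnl-segmentation:"):
            return "on" in line.split(":", 1)[1]
    return False

if __name__ == "__main__":
    print("VXLAN offload enabled:", vxlan_offload_enabled("eth0"))
```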
Flexible • Availability Zones across the site • The first will be in ISIS soon • GPU support • AAI • APIs • Nova, EC2, OCCI • Design decisions shouldn’t preclude anything • Pet VMs
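For completeness, availability zones surface directly through the API: tenants can list the zones the scheduler offers and target one when booting. This is a sketch with placeholder names (the zone name is hypothetical).

```python
# Listing availability zones with openstacksdk; "stfc" is an assumed cloud name.
import openstack

conn = openstack.connect(cloud="stfc")

for az in conn.compute.availability_zones():
    print(az.name, az.state)

# A specific zone can then be requested at boot time, e.g.
# conn.compute.create_server(..., availability_zone="isis-az")  # hypothetical zone
```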
AAI – EGI CheckIn – Almost • Not yet fully working at RAL
Hardware • 2014 • 28 hypervisors – 2x8C/16T, 128GB • 30 storage nodes – 8x4TB (1 used for OS) • 2015 (ISIS funded) • 10 hypervisors – 2x8C/16T, 128GB • 12 storage nodes – 12x4TB (1 used for OS) • 2016 (ISIS funded) • 10 hypervisors – 2x8C/16T, 128GB, 2x Nvidia Quadro K620 • 10 storage nodes – 12x4TB (1 used for OS) • 2017 • 108 hypervisors – 2x8C/16T, 96GB (UKT0 funded) • 12 storage nodes – 12x4TB disk + 1x3.6TB PCIe SSD + 2 OS disks (SCD funded)
Hardware Available • Once the 2017 hardware is deployed: • ~5000 logical cores • 20 Nvidia Quadro K620s • ~2PB raw storage (~660TB usable)
Use cases – January 2017 • Development and Testing • DLS TopCat sprint development server • LSF, ICAT and IJP Development and Testing • Tier 1 – Grid services (including CVMFS) development and testing • Tier 1 – GridFTP development environment • Testing for the Indigo DataCloud project • Development hosts for Quattor Releasing • Development work supporting APEL • EUDAT – federating object stores • CICT – Testing software packages before deploying into production, e.g. Moodle and LimeSurvey • Repository development for ePubs and eData • Building and Releasing • Build and integration hosts for Quattor Releasing • Building software for the CA • Building APEL packages for release • Testing/Production work • CCP4-DAAS, IDAAS – User interface machines to other department resources • Testing Aquilon sandboxes and personalities • CEDA – data access server • EUDAT.EU – Hosting a prototype graph database • Nagios monitoring system for the Database team • Dashboard and database for the database team’s backup testing framework • Blender Render Farm – Visualisation for the Supercomputing conference
User Communities • SCD • Self-service VMs • Some programmatic use • Tier 1 • Bursting the batch farm • ISIS • Data-Analysis-as-a-Service • SESC Build Service • Jenkins • CLF – OCTOPUS • Diamond/XChem • Cloud bursting Diamond GridEngine • XChem data processing using OpenShift • WLCG Datalake project • Quattor Nightlies • West-Life
Links • Data-Analysis-as-a-Service • https://indico.cern.ch/event/713848/contributions/2932932/attachments/1617174/2570708/ukt0_wp2_software_infrastructure.pdf • https://indico.cern.ch/event/713848/contributions/2933001/attachments/1617640/2571686/IDAaaS-Overview.pdf
Any Questions? Alexander.dibbo@stfc.ac.uk