Operational lessons from running Openstack and Ceph for cancer research at scale • George Mihaiescu, Senior Cloud Architect • Jared Baker, Cloud Specialist
OICR • Largest cancer research institute in Canada, funded by the government of Ontario • Together with its collaborators and partners, supports more than 1,700 researchers, clinician scientists, research staff and trainees • OICR hosts the ICGC's secretariat and its data coordination centre
Cancer Genome Collaboratory Project goals and motivation • Cloud computing environment built for biomedical research by OICR and funded by Government of Canada grants • Enables large-scale cancer research on the world’s largest cancer genome dataset, currently produced by the International Cancer Genome Consortium (ICGC) • Built entirely with open-source software such as Openstack and Ceph • Compute infrastructure goal: 3,000 cores and 15 PB of storage • Operated on a cost-recovery basis
Genomics workloads • Users first download large files (150-300 GB), then run workflows that analyze the data for days or even weeks • Resulting data can be as large as the input data (alignment) or much smaller (mutation calling, 5-10 GB) • Workloads are kept independent, so one VM failure doesn’t affect multiple analyses • Newly designed workflows and algorithms are packaged as Docker containers for portability
No frills design • Use high density commodity servers to reduce physical footprint & related overhead • Use open source software and tools • Prefer copper over fiber for network connectivity • Spend 100% of the hardware budget on the infrastructure that supports cancer research, not on licenses or “nice to have” features
Other design constraints • Limited datacenter space (12 racks) • Fixed hardware budget with high data storage requirements • There are no local backups for the large data sets, and re-importing the data, though possible, is not desirable (re-importing 500+ TB over the Internet takes a long time)
Control plane • Three controllers in HA configuration (2 x 6-core CPUs, 128 GB RAM, 6 x 200 GB Intel S3700 SSD drives) • Operating system and Ceph Mon on the first RAID 1 container • Mariadb/Galera on the second RAID 1 container • Ceilometer with Mongodb on the third RAID 1 container • Haproxy (SSL termination) and Keepalived • 4 x 10 GbE bonded interfaces, 802.3ad, layer 3+4 hash • Neutron + GRE, HA routers, no DVR
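The Galera cluster behind Haproxy is the piece of the control plane you least want to discover is degraded after the fact. As a minimal sketch (assuming the PyMySQL package and placeholder host/credentials, none of which come from the original deployment), a periodic check can confirm that all three controllers are still joined:

```python
#!/usr/bin/env python
"""Minimal sketch of a Galera liveness check. Host, user and password are
placeholders; adapt them to your monitoring account."""
import pymysql

EXPECTED_SIZE = 3  # three controllers in the HA control plane

conn = pymysql.connect(host="127.0.0.1", user="monitor", password="secret")
try:
    with conn.cursor() as cur:
        # wsrep_cluster_size reports how many nodes are joined to the cluster.
        cur.execute("SHOW GLOBAL STATUS LIKE 'wsrep_cluster_size'")
        _, size = cur.fetchone()
        if int(size) != EXPECTED_SIZE:
            raise SystemExit("Galera cluster degraded: %s/%d nodes" % (size, EXPECTED_SIZE))
        print("Galera cluster healthy: %s nodes" % size)
finally:
    conn.close()
```

The same idea extends naturally to checking Haproxy backend state or which node currently holds the Keepalived VIP.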
Networking • Brocade ICX 7750-48C top-of-rack switches configured in a stack ring topology • 6 x 40Gb Twinax cables between the racks, providing 240 Gbps non-blocking redundant connectivity (2:1 oversubscription ratio)
Custom object storage client developed at OICR • A client-server application for both uploading and downloading data using temporary pre-signed URLs from multiple object storage systems • Core features • Support for encrypted and authorized transfers • High-throughput: multi-part parallel upload/download • Resumable downloads/uploads • Download-specific features • Support for BAM slicing • Support for Filesystem in Userspace (FUSE) • https://github.com/icgc-dcc/dcc-storage • https://hub.docker.com/r/icgc/icgc-storage-client/
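The pre-signed URL mechanism the client relies on can be illustrated with a short sketch. Assuming a radosgw S3 endpoint, boto3, and placeholder credentials, bucket and key (none of which are the Collaboratory's actual values), the server-side signing and the direct, resumable download look roughly like this:

```python
#!/usr/bin/env python
"""Minimal sketch of the pre-signed URL flow. Endpoint, credentials, bucket
and key are placeholders, not the Collaboratory's actual values."""
import boto3
import requests

# Server side: a client holding credentials signs a temporary URL against the
# radosgw S3 API (or any S3-compatible endpoint).
s3 = boto3.client(
    "s3",
    endpoint_url="https://object-store.example.org",
    aws_access_key_id="ACCESS_KEY",
    aws_secret_access_key="SECRET_KEY",
)
url = s3.generate_presigned_url(
    "get_object",
    Params={"Bucket": "icgc-data", "Key": "some-object-id/part-0001"},
    ExpiresIn=3600,  # URL valid for one hour
)

# Client side: data is fetched over plain HTTPS; Range requests allow
# multi-part parallel transfers and resumable downloads.
resp = requests.get(url, headers={"Range": "bytes=0-1048575"}, stream=True)
resp.raise_for_status()
with open("part-0001", "wb") as out:
    for chunk in resp.iter_content(chunk_size=1 << 20):
        out.write(chunk)
```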
Cloud usage • 57,000 instances started in the last 2 years • 6,800 in the last three months • 50 users in 16 research labs across three continents • More than 500 TB of data stored in Ceph (1.5 PB raw)
Openstack upgrades (running on Ubuntu 14.04)
Deployments • Evolve the deployment with each iteration • Stay open to improvements • Avoid tedious manual steps
Operations details • On-site spares and technicians • Let Ceph heal itself • Monitor everything • Can you script that?
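"Can you script that?" in practice often starts with something as small as the sketch below (it assumes the script runs on a host with a Ceph client keyring; the alerting hook is just a non-zero exit here), which only escalates when Ceph cannot heal itself:

```python
#!/usr/bin/env python
"""Minimal sketch of a scripted Ceph check: stay quiet while Ceph heals
itself, escalate only on real errors."""
import subprocess

status = subprocess.check_output(["ceph", "health"]).decode().strip()

if status.startswith("HEALTH_OK"):
    print("Ceph is healthy, nothing to do")
elif status.startswith("HEALTH_WARN"):
    # Usually rebalancing, backfill or scrubbing: log it and let Ceph heal itself.
    print("Ceph warning (typically self-healing): %s" % status)
else:
    # HEALTH_ERR or unexpected output: wake somebody up.
    raise SystemExit("Ceph needs attention: %s" % status)
```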
Ceph monitoring dashboards • IOPS • Performance & integrity • Radosgw throughput • Rebalancing impact: network traffic, CPU, memory, IOPS, disk
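Dashboards like these can be fed by a very simple poller. As a sketch (the Graphite endpoint is a placeholder, and the pgmap field names are assumptions that vary slightly between Ceph releases), client and recovery counters from ceph status can be pushed over Graphite's plaintext protocol:

```python
#!/usr/bin/env python
"""Minimal sketch of a metrics poller behind rebalancing/IOPS dashboards.
Graphite endpoint is a placeholder; pgmap keys differ across Ceph releases."""
import json
import socket
import subprocess
import time

GRAPHITE = ("graphite.example.org", 2003)  # plaintext protocol endpoint

raw = subprocess.check_output(["ceph", "status", "--format", "json"])
pgmap = json.loads(raw.decode()).get("pgmap", {})

now = int(time.time())
lines = []
for key in ("degraded_objects", "misplaced_objects",
            "recovering_objects_per_sec", "read_op_per_sec", "write_op_per_sec"):
    if key in pgmap:  # only send counters present in this Ceph release
        lines.append("ceph.pgmap.%s %s %d\n" % (key, pgmap[key], now))

sock = socket.create_connection(GRAPHITE)
sock.sendall("".join(lines).encode())
sock.close()
```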
Rally • Smoke tests & load tests • Grafana integration
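A Rally smoke test can be as small as a single boot-and-delete scenario. The sketch below (flavor and image names are placeholders, and it assumes Rally is installed with a deployment already registered) writes a standard task definition and runs it:

```python
#!/usr/bin/env python
"""Minimal sketch of a Rally smoke test. Flavor/image names are placeholders;
assumes Rally is installed and a deployment is registered."""
import json
import subprocess
import tempfile

# Standard Rally task format: scenario name mapped to a list of workloads.
task = {
    "NovaServers.boot_and_delete_server": [{
        "args": {
            "flavor": {"name": "m1.small"},  # placeholder flavor
            "image": {"name": "cirros"},     # placeholder image
        },
        "runner": {"type": "constant", "times": 10, "concurrency": 2},
        "context": {"users": {"tenants": 1, "users_per_tenant": 1}},
    }]
}

with tempfile.NamedTemporaryFile("w", suffix=".json", delete=False) as f:
    json.dump(task, f, indent=2)
    path = f.name

# Run the task, then render an HTML report of the most recent run.
subprocess.check_call(["rally", "task", "start", path])
subprocess.check_call(["rally", "task", "report", "--out", "rally_report.html"])
```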
Lessons learned • If something needs to be running, test it • Simple tasks sometimes are not • Be generous with your specs for the monitoring and control plane (more RAM and CPU than you think will be needed) • More RAM and CPU on the Ceph storage nodes allows you to run larger nodes without being affected by small memory leaks • Monitor RAM usage aggregated per process type (see the sketch below) • It’s possible to run a stable and performant Openstack cluster with a small but qualified team, as long as you carefully design it and choose only the most stable (and absolutely needed) Openstack projects and configurations.
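Monitoring RAM aggregated per process type needs nothing more exotic than ps. A minimal sketch (it only prints the top ten to stdout; wiring the numbers into your metrics system is left out):

```python
#!/usr/bin/env python
"""Minimal sketch of 'RAM aggregated per process type': sum resident memory
by command name so slow leaks in ceph-osd or the neutron agents show up as a
trend rather than as an OOM surprise."""
import subprocess
from collections import defaultdict

# '=' after a column name suppresses the header line in ps output.
out = subprocess.check_output(["ps", "-eo", "rss=,comm="]).decode()

usage = defaultdict(int)
for line in out.splitlines():
    rss, comm = line.split(None, 1)
    usage[comm.strip()] += int(rss)  # ps reports RSS in KiB

# Print the ten heaviest process types.
for comm, kib in sorted(usage.items(), key=lambda kv: kv[1], reverse=True)[:10]:
    print("%-24s %10.1f MiB" % (comm, kib / 1024.0))
```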
Future plans • Upgrade to Ubuntu 16.04 and Openstack Newton • Build a new, larger environment with a similar design but a leaf-spine network topology • Investigate the stability of a container-based control plane (Kolla)
Thank you • Discovery Frontiers: Advancing Big Data Science in Genomics Research program (grant no. RGPGR/448167-2013, ‘The Cancer Genome Collaboratory’) • Natural Sciences and Engineering Research Council (NSERC) of Canada • Canadian Institutes of Health Research (CIHR) • Genome Canada • Canada Foundation for Innovation (CFI) • Ontario Research Fund of the Ministry of Research, Innovation and Science
Contact Questions? George Mihaiescu george.mihaiescu@oicr.on.ca Jared Baker jared.baker@oicr.on.ca www.cancercollaboratory.org