240 likes | 386 Views
Implementing and using IaaS cloud within the Flexible Services for the Support of Research project.
E N D
Implementing and using IaaS cloud within the Flexible Services for the Support of Research project David Wallom1, Matteo Turilli1, Chris Williams1, Steve Gough2, Neil Curran2, Richard Tarrant2, Dan Bretherton2, Andy Powell3, Matt Johnson3, Terry Harmer4, Peter Wright4, John Gordon5 1Oxford e-Research Centre, University of Oxford, 2Information Services, University of Reading, 3Eduserv 4EoverI Ltd 5Rutherford Appleton Lab, STFC
The Project • Flexible Services for the Support of Research JISC funded, 10 months • 6 Partners, 12 staff members: • academic and industrial; • 3 cloud infrastructures; • 3 solutions providers. • Goals: • building an integrated Public-Private hybrid cloud infrastructure; • Developing accounting tools for cloud • pilot 1, Multi Platform Software Development; • pilot 2, On demand Research data storage.
Aims • Develop Hybrid Public-Private cloud within HEI and trusted sector provider • Implement stand-alone policy driven management layer allowing user abstraction • Implement open standards (OCCI) interface within Canonical UEC • Develop two trial use-cases as exemplar of developing management interfaces of swarms of VM with different policies
Private/Public Multiple Clouds • NGS cloud • Institutional cloud • Amazon cloud • Users • Eduserv cloud • Globally distributed; • different resources/cost; • different applications; • non standardised: different AAA and UI. • EGI cloud
Mediated Private/Public Multiple Clouds • Amazon cloud • NGS cloud • Institutional cloud • Management • Interface • Users • eZEEL • Policy driven • Automation; • load balancing; • costs reduction; • usability. • Eduserv cloud • EGI cloud
Install the Clouds Eduserv Ubuntu Enterprise Cloud/Eucalyptus 1.6->2.0 Oxford Reading
Integrate Management and Accounting Eduserv STFC/NGS Accounting Database Oxford Reading eZEEL Management eZEEL Management
Develop our Pilot case studies Eduserv STFC/NGS Accounting Database Oxford Reading eZEEL Management eZEEL Management Software Development Framework Storage Manager Software Development Framework Storage Manager
Accounting the Cloud – Requirements & Implementation • Requirements • No modification of Eucalyptus code base; • Accounting based on local accounts with VM, EBS, Snapshot usage for each registered user; • Non-destructive treatment and permanent recording of accounting data; • Open standard publishing allowing for multiple types of consumption. • Implementation • Data sources: Eucalyptus logs (cc.log, output-cloud.log, debug-cloud.log); • 4 stage design: • Aggregation: logs are parsed. Records are date and time stamped, checked for duplication and saved in daily archive files; • Parsing, records are parsed and stored into an accounting database (MySql 5.1); • Mining, database is mined to create OGF standard usage records; • Uploading, Usage Records are uploaded to the RUS server.
The not so simple Accounting Database Schema Storage Virtual Machine Local User Network? Cloud Operation
Accounting the Cloud – Advantages & Challenges • Advantages: • Full regression chain from account record to original Eucalyptus logs; • Agnostic storage and representation of the accounting records (SQL); • Multiple, concurrent access of accounting records from different types of accounting clients: RUS + local web interface; • Multiple representation of the accounting logs (SQL DB + aggregated files) allows for consistency tests embedded in the parser; • Challenges: • High volume of daily logs: up to 500Mb daily for just 3 file types; • Highly redundant log information -> expensive duplicates checking; • Non uniform formatting of log records, lack or date stamping for 2/3 of them.
Multi Platform Software Development Zeel/i Broker Instance configuration manager Build manager CVS / SVN repository FleSSR cloud Build instance 1 Build instance 2 Build instance 3 Build instance 4 Build instance 5
Multi Platform Usecase analysis • Built on SKA developed Multi-Platform Publishing Tool • MPP is a tool developed within the radio astronomy community to ease the complexity and cost of deploying computer software across multiple-platforms. • Each platform has its own conventions, standards, package managers and packaging formats for deploying software. • MPP allows the user to describe a software product in generic terms (e.g this is a binary, this is a library, this is documentation, it requires these dependencies), • Produce a suitable package tailored to each supported platform, that can be easily installed by the user in the normal way native to that platform. • Three Process layers: • Build, • Test • Publish • Designed to support the entire release and testing processes. Each of these processes can be launched with a single mpp command.
FleSSR Use Case: On demand Research data storage Zeel/i Broker Volume Manager FleSSR cloud EBS Volume VM EBS Interface
Prebuilt ‘Storage VM’ • Include • SCP /SFTP • WebDAV • FTP • rsync (for cloud migration) • (GlusterFS) • Pre-configure as much as possible • Re-bundle image for UEC • Determine post-instantiation changes • Create script / command list to carry out changes • Embed instantiation of script in eZEEL call
Flexible Data Pilot analysis • Eucalyptus security features were flexible enough to allow both volumes to be mounted in ESSC’s hierarchical network file system • Local (Reading) FleSSRvolume indistinguishable from local storage volumes, with write speeds of ~30MB/sec measured on average for a range of file sizes up to several GB per file • Public (Eduserv) volume slower as expected, write speeds of around 5MB/sec. • This level of performance would not rule out direct access to the data by computer models and data analysis utilities, but it is more likely that such volume s would be more useful as archive repositories of infrequently used data.
eZEEL experiences • Pros • Single point of management and login. • Cloud provider API abstraction. • Support of a heterogeneous cloud. • The concept of regions and resource costing allows for scoping of datacentre and type of server resource granted to a user. • Cons • The API is predominantly focussed on instance management rather than EBS management, at times needing additions to the API to create functional parity between the instance and disk management capabilities. • Eucalyptus providers don’t currently report as much status information as would be desired for fine-grained control of EBS volumes. • Can only use one keypair at a time when interacting with an instance. If deploying an instance with a keypair not generated by Zeel/I then it is incapable of accessing that instance as it is missing the private half of the keypair (Zeel/I has no means of retrieving a private key from the cloud provider). A workaround was to manually embed an existing public key in the image prior to or during deployment.
System Management • Instance management and Eucalyptus management are separated issues. • In order to manage and monitor Eucalyptus we have used: • a Firefox add-on called Hybridfox; • shell scripts for parsing and monitoring the logs stored in /var/log/eucalyptus, especially cc.log, cloud.log, cloud-error.log and nc.log; • euca-tools, even if almost all their functionalities have been replaced by Hybridfox; • Ganglia; • Nagios with dedicated probes scripts (it should be noted that the Nagios sample scripts provided with the Eucalyptus source are outdated and basically useless for Eucalyptus version 2.*). • New extended system developed by EoverI using eZEEL.
Eucalyptus • Persistent problems identified with specific versions • Eucalyptus 1.6.3. Corruption of the public IP database. Clean restart of CC/SC required; • Eucalyptus 16.3/2.0/2.0.2: quiet failure of the CLC. Restart needed. • Eucalyptus 2.0/2.0.2: quiet failure on the e-scsi EBS attachment + corruption of the EBS volumes. Shutdown of all the running VMs + CC CLEAN=1/SC restart needed. Loss of EBS data. • On the base of this testing and experience, Eucalyptus 2.* is not ready for delivering a (pre)production service to the research community. The same should be concluded for UEC, especially when considering the issues introduced by the modification implemented on the standard distribution of Eucalyptus.
Conclusions • Building production private clouds using open source software is currently very very hard! • Problems of Grid that are solved by Cloud are replaced by new problems, including critically, security and trust • Using mediation as a mechanism by which we can achieve federation is extremely powerful, allowing institutions to run their own private cloud, utilise external providers or a mix without having to explain this to the end user community • Allows you to change IaaS provider when versions are not as reliable as required • Both pilots have been continued to be of interest to the communities, on different platforms themselves • Accounting of clouds -> accounting of services for which the community is only just starting to consider