190 likes | 514 Views
Private Cloud at Wipro. Cloud computing based on Condor. Agenda. Background. Wipro Private Cloud. System architecture. Use of Condor. 1. 2. 3. 4. Background. Need:
E N D
Private Cloud at Wipro Cloud computing based on Condor
Agenda Background Wipro Private Cloud System architecture Use of Condor 1 2 3 4
Background • Need: • Share physical infrastructure between multiple projects and CoE's (Center of Excellence) to reduce server sprawl and number of physical labs • Provide an environment for evaluating new technologies, developing solutions and enabling collaboration between multiple labs • Centralize infrastructure procurement and management • Reduce infrastructure cost of CoE's by enabling multiple development environments • Solution: • Setup a private cloud for virtual compute and application infrastructure • Build a self-service portal for on-demand provisioning to reduce process overheads • Support multiple types of virtualization software • Reuse existing physical infrastructure, procure minimal new infrastructure
Wipro Private Cloud Wipro Users SaaS User Intranet Developers SaaSApp Virtual Lab SaaS Mgmt SaaS Enablers Managed Network Managed Network Wipro Cloud Portal / Web Services API Layer Wipro Cloud Core Cloud Admin • AutomatedProvisioning • Multi-tenancy & Isolation • Cloud Accounting & Auditing • Performance & Fault Monitoring • Automated Network & Security • Virtual Machines • Shared Storage • Virtual Appliances • Application Services Cloud OA&M Portal Physical Resource Pool - Servers, Storage, Network
Levels of Service • L1 - Virtual Servers on demand • Virtual servers, desktops, storage • Migration assistance • Self-service portal • L2 – Application infrastructure on demand • Appliances of standard software • Managed backup, proactive monitoring and help-desk • Itemized billing and charge-back • L3 – Business service infrastructure on demand • Scalable business services • Multi-tenant application infrastructure (content management, identity management, database, load balancer, firewall, ...)
Private Cloud – in Action Service LB - Active LB - passive Inst 1 Inst 2 Inst n VM 1 VM 2 VM n Customer OA & M Access Developers Business Users Operations • Configuration &Change mgmt OA&M Portal & Web Service Gateway • Design, Test • Package, Deploy Network Control Alarms Cloud Management Provisioning • Service design • SLA’s, Policies, rules, priorities • Packaging • Custom agents • Shared Services • Billing parameters Service Layer Monitoring Load Balancer App Layer • Virtual M/c design • Standardize • Automate • Agile • Caching • Appliances Virtual Machine Layer • Monitoring • Performance • Availability • Alarms • Billing • Provisioning • Resource mgmt • Workload mgmt • Auto recovery • Task & Process Automation • Management • Service Governor • Policy enforcement • Incident mgmt • Optimizer • Contention Bare-metal Layer • Bare-metal design • Standardize • Automate • Re-provisioning
Architecture & Service layers Cloud service
System Components Customer Portal Web Service Gateway Charge-back Service Governor Alerts VM Caching Cloud State Metrics Monitor Grid Scheduler Workflow Manager VM Repo N/W Plug-in Storage Plug-in Bare-metal Plug-in VM Plug-in Nagios plug-in N/W provisioning Storage provisioning Bare-metal provisioning VM provisioning N/W (nagios) Monitoring Identity Management Legend: 3rd Party components Developed in Wipro In Development
Deployment Example Project X 192.168.5.0/24 Router, firewall VPN Server, IPS, IDS, NAT Project Y 192.168.6.0/24 VM VM Virtual Storage Virtual Machines Project Z 192.168.7.0/24 Cloud Backbone 10.201.72.0/24 VM Isolated network per project Storage Mgmt Server HA Pair Switch Fabric Cloud physical systems Cloud Mgmt 192.168.3.0/24
Why Condor? • Trusty old features • Flexibility – ClassAd mechanism, configurations and policies • Web Services API • High availability • Resource utilization of jobs • Newer features we like • VM Universe • Partitionable Slots • Lease management • Integration with Amazon EC2 (public cloud) • Proven in large scale deployments • Condor-users and condor-admin support • Open source
How are we using Condor? • Mostly standard configuration • A few custom class ads in jobs and machines • Schedd and Collector configured in HA mode • Condor spool for VM persistence • Virtual machine provision request handled by Condor • VM job to physical machine match-making, file transfer • Partitionable slots for dynamic partitioning of physical machine resources • Customized condor_vm_* files for configuring and starting VM's • VLAN control, Swap disk and additional storage creation, ... • Lease management for limiting the number of running instances of a licensed image
Observations, Workarounds, Wish list • Working with Condor: • With advanced Condor skills, a lot can be achieved without modifying condor code Workarounds: • Passing number of virtual CPUs to VMware • Patch to pass proxy username and password to gSOAP for EC2 integration • Patch to get VM resource usage details on ESX • Special configuration to handle 2 hour delay in detecting a few execute node failures (Thanks Todd!) Feature wish list: • Remote IWD support for VM universe, to avoid any file transfer • Live migration of VM jobs
Thank You nitin.narkhede@wipro.com sateesh.potturu@wipro.com