1.01k likes | 1.03k Views
WW Grid. Grid meets Economics: A Market Paradigm for “ Resource Management and Scheduling ” for World-Wide Grid Computing. Rajkumar Buyya. Melbourne, Australia www.buyya.com/ecogrid. WW Grid. Need Honest Answers!.
E N D
WW Grid Grid meets Economics:A Market Paradigm for “Resource Management and Scheduling” for World-Wide Grid Computing Rajkumar Buyya Melbourne, Australiawww.buyya.com/ecogrid
WW Grid Need Honest Answers! • I want to have access to your Grid resources & want to knowhow many of you are willing to give me access ? (following cases) • I am unable to give you access our Australian machines, but I want to have access to yours! • Want to solve academic problems • Want to solve business problems • I am willing to gift you Kangaroos! (bartering) • I am willing to give you access to my machines, if you want. (sharing, but no measure & no QoS) • I am willing to pay you dollars on usage basis. (economic incentive, market-based, and QoS)
Grid EconomyGrid Scheduling Economics Overview • A quick glance at today’s Grid computing • Resource Management challenges for next generation Grid computing • A Glance at Approaches to Grid computing. • Grid Architecture for Computational Economy • Economy Grid = Globus + GRACE • Nimrod-G -- Grid Resource Broker • Scheduling Experiments • Case Study: Drug Design Application on Grid • Conclusions
2100 2100 2100 2100 2100 2100 2100 2100 2100 Scalable HPC: Breaking Administrative Barriers & new challenges ? PERFORMANCE Administrative Barriers • Individual • Group • Department • Campus • State • National • Globe • Inter Planet • Universe Desktop SMPs or SuperComputers Global Cluster/Grid Inter Planetary Grid! Local Cluster Enterprise Cluster/Grid
Why Grids? Large Scale Explorations need them—Killer Applications. • Solving grand challenge applications using modeling, simulation and analysis Aerospace Internet & Ecommerce Life Sciences CAD/CAM Digital Biology Military Applications Military Applications Military Applications
data archives What is Grid ? • An infrastructure that logically couples distributed resources: • Computers– PCs, workstations, clusters, supercomputers, laptops, notebooks, mobile devices, PDA, etc; • Software– e.g., ASPs renting expensive special purpose applications on demand; • Catalogued data and databases– e.g. transparent access to human genome database; • Special devices – e.g., radio telescope – SETI@Home searching for life in galaxy. • People/collaborators. • and presents them as an integrated global resource. • It enables the creation of virtual enterprises (VEs) for resource sharing. Widearea
Grid Applications-Drivers • Distributed HPC (Supercomputing): • Computational science. • High-throughput computing: • Large scale simulation/chip design & parameter studies. • Content Sharing (free or paid) • Sharing digital contents among peers (e.g., Napster) • Remote software access/renting services: • Application service provides (ASPs). • Data-intensive computing: • Data mining, particle physics (CERN), Drug Design. • On-demand, realtime computing: • Medical instrumentation & network-enabled solvers. • Collaborative: • Collaborative design, data exploration, education.
Building and Using Grids require • Services that make our systems Grid Ready! • Security mechanisms that permit resources to be accessed only by authorized users. • (New) programming tools that make our applications Grid Ready!. • Tools that can translate the requirements of an application/user into the requirements of computers, networks, and storage. • Tools that perform resource discovery, trading, selection/allocation, scheduling and distribution of jobs and collects results. Globus ?
What users want ?Users in Grid Economy & Strategy • Grid Consumers • Execute jobs for solving varying problem size and complexity • Benefit by selecting and aggregating resources wisely • Tradeoff timeframe and cost • Strategy: minimise expenses • Grid Providers • Contribute “idle” resource for executing consumer jobs • Benefit by maximizing resource utilisation • Tradeoff local requirements & market opportunity • Strategy: maximise return on investment
Sources of Complexity in Resource Management for World Wide Grid Computing • Size (large number of nodes, providers, consumers) • Heterogeneity of resources (PCs, Workstations, clusters, and supercomputers, instruments, databases, software) • Heterogeneity of fabric management systems (single system image OS, queuing systems, etc.) • Heterogeneity of fabric management polices • Heterogeneity of application requirements (CPU, I/O, memory, and/or network intensive) • Heterogeneity in resource demand patterns (peak, off-peak, ...) • Applications need different QoS at different times (time critical results). The utility of experimental results varies from time to time. • Geographical distribution of users & located different time zones • Differing goals (producers and consumers have different objectives and strategies) • Unsecure and Unreliable environment
Traditional approaches to resource management & scheduling are NOT useful for Grid ? • They use centralised policy that need • complete state-information and • common fabric management policy or decentralised consensus-based policy. • Due to too many heterogenous parameters in the Grid it is impossible to define/get: • system-wide performance matrix and • common fabric management policy that is acceptable to all. • “Economics” paradigm proved to effective institution in managing decentralization and heterogeneity that is present in human economies! • Fall of USSR & Emergence of US as world superpower! (monopoly?) • So, we propose/advocate the use of computational economics principles in management of resources and scheduling computations on world wide Grid. • Think locally and act globally approach to grid computing!
Benefits of Computational Economies • It provides a nice paradigm for managing self interested and self-regulating entities (resource owners and consumers) • Helps in regulating supply-and-demand of resources. • Services can be priced in such a way that equilibrium is maintained. • User-centric / Utility driven • Scalable: • No need of central coordinator (during negotiation) • Resources(sellers) and also Users(buyers) can make their own decisions and try to maximize utility and profit. • Adaptable, • It helps in offering different QoS (quality of services) to different applications depending the value users place on them. • It improves the utilisation of resources • It offers incentive for resource owners for being part of the grid! • It offers incentive for resource consumers for being good citizens • There is large body of proven Economic principles and techniques available, we can easily leverage it.
New challenges of Computational Economy • Resource Owners • How do I decide prices ? (economic models?) • How do I specify them ? • How do I enforce them ? • How do I advertise & attract consumers ? • How do I do accounting and handle payments? • ….. • Resource Consumers • How do I decide expenses ? • How do I express QoS requirements ? • How I trade between timeframe & cost ? • …. • Any tools, traders & brokers available to automate the process ?
NetSolve mix-and-match Object-oriented Internet/partial-P2P Grid Computing Approaches Network enabled Solvers Market/Computational Economy Nimrod-G
Australia Economy Grid Nimrod-G Virtual Lab Active Sheets DISCWorld ..new coming up Europe UNICORE MOL Lecce GRB Poland MC Broker EU Data Grid EuroGrid MetaMPI Dutch DAS XW, JaWS and many more... Japan Ninf DataFarm and many more... USA Globus Legion Javelin AppLeS NASA IPG Condor Harness NetSolve AccessGrid GrADS and many more... Cycle Stealing & .com Initiatives Distributed.net SETI@Home, …. Entropia, UD, Parabon,…. Public Forums Global Grid Forum P2P Working Group IEEE TFCC Grid & CCGrid conferences Many Grid Projects & Initiatives http://www.gridcomputing.com
WW Grid Many Testbeds ? & who pays ?, who regulates demand and supply ? GUSTO (decommissioned) World Wide Grid Legion Testbed NASA IPG
Testbeds so far -- observations • Who contributed resources & why ? • Volunteers: for fun, challenge, fame, charismatic apps, public good like distributed.net & SETI@Home projects. • Collaborators: sharing resources while developing new technologies of common interest – Globus, Legion, Ninf, Ninf, MC Broker, Lecce GRB,... Unless you know lab. leaders, it is impossible to get access! • How long ? • Short term: excitement is lost, too much of admin. Overhead (Globus inst+), no incentive, policy change,… • What we need ? Grid Marketplace! • Regulates supply-and-demand, offers incentive for being players, simple, scalable solution, quasi-deterministic – proven model in real-world.
Building an Economy Grid(Next Generation Grid Computing!) To enable the creation of: Grid Marketplace (competitive) ASP Service Oriented Computing . . . And let users focus on their own work (science, engineering, or commerce)!
GRACE: A ReferenceGrid Architecture for Computational Economy Grid Bank Information Server(s) Grid Market Services Sign-on Health Monitor Info ? Grid Node N … Grid Explorer … Application Secure Job Control Agent Grid Node1 Schedule Advisor QoS Pricing Algorithms Trade Server Trading Trade Manager Accounting Resource Reservation Misc. services … Deployment Agent JobExec Resource Allocation Storage Grid User Grid Resource Broker … R1 R2 Rm Grid Middleware Services Grid Service Providers See PDPTA 2000 paper!
Economic Models for Trading • Commodity Market Model • Posted Prices Models • Bargaining Model • Tendering (Contract Net) Model • Auction Model • English, first-price sealed-bid, second-price sealed-bid (Vickrey), and Dutch (consumer:low,high,rate; producer:high, low, rate) • Proportional Resource Sharing Model • Shareholder Model • Partnership Model See SPIE ITCom 2001 paper!: with Heinz Stockinger, CERN!
Grid Components Applications and Portals Grid Apps. … Prob. Solving Env. Collaboration Engineering Web enabled Apps Scientific Grid Tools Development Environments and Tools … Web tools Libraries Languages Monitoring Resource Brokers Debuggers Grid Middleware Distributed Resources Coupling Services … QoS Security Information Process Resource Trading Market Info Local Resource Managers … TCP/IP & UDP Queuing Systems Operating Systems Libraries & App Kernels Grid Fabric Networked Resources across Organisations … Storage Systems Data Sources Clusters Scientific Instruments Computers
Economy Grid = Globus + GRACE Applications Grid Apps. … Science Engineering Commerce Portals ActiveSheet High-level Services and Tools Grid Status … Grid Tools Nimrod/G DUROC MPI-G CC++ globusrun Core Services Heartbeat Monitor Nexus GRACE-TS Grid Middleware GRAM Globus Security Interface GASS DUROC MDS GBank GARA GMD Grid Fabric Local Services GRD QBank JVM Condor TCP UDP eCash LSF PBS Linux Irix Solaris See IPDPS HWC 2001 paper!
GRACE components • A resource broker (e.g., Nimrod/G) • Various resource trading protocols for different economic models • A mediator for negotiating between users and grid service providers (Grid Market Directory) • A deal template for specifying resource requirements and services offers • Grid Trading Server • Pricing policy specification • Accounting (e.g., QBank) and payment management (GridBank, not yet implemented)
Call for Bid(DT) Grid Open Trading Protocols Trade Manager Trade Server Get Connected Pricing Rules Reply to Bid (DT) Negotiate Deal(DT) …. API Confirm Deal(DT, Y/N) DT - Deal Template - resource requirements (BM) - resource profile (BS) - price (any one can set) - status - change the above values - negotiation can continue - accept/decline - validity period Cancel Deal(DT) Change Deal(DT) Get Disconnected
Pricing, Accounting, Allocations and Job Scheduling Flow @ each site/Grid Level Pricing Policy GRID Bank (digital transactions) 0 0 2 DB@Each Site Trade Server QBank 1 3 5 8 0. Make Deposits, Transfers, Refunds, Queries/Reports 1. Clients negotiates for access cost. 2. Negotiation is performed per owner defined policies. 3. If client is happy, TS informs QB about access deal. 4. Job is Submitted 5. Check with QB for “go ahead” 6. Job Starts 7. Job Completes 8. Inform QB about resource resource utilization. Resource Manager 4 IBM-LL/PBS/…. 6 7 Compute Resources clusters/SGI/SP/...
Service Items to be Charged • CPU - User and System time • Memory: • maximum resident set size - page size • amount of memory used • page faults: with/without physical I/O • Storage: size, r/w/block IO operations • Network: msgs sent/received • Signals received, context switches • Software and Libraries accessed • Data Sources (e.g. Protein Data Bank)
How to decide Price ? • Fixed price model (like today’s Internet) • Dynamic/Demand and Supply (like tomorrow’s Internet) • Usage Period • Loyalty of Customers (like Airlines favoring frequent flyers!) • Historical data • Advance Agreement (high discount for corporations) • Usage Timing (peak, off-peak, lunch time) • Calendar based (holiday/vacation period) • Bulk Purchase (register 100 .com domains at once!) • Voting -- trade unions decide pricing structure • Resource capability as benchmarked in the market! • Academic R&D/public-good application users can be offered at cheaper rate compared to commercial use. • Customer Type – Quality or price sensitive buyers. • Can be Prescribed by Regulating (Govt.) authorities
Payments- Options & Automation • Buy credits in advance / GSPs bill the user later--”pay as you go” • Pay by Electronic Currency via Grid Bank • NetCash (anonymity), NetCheque, and Paypal • NetCheque: - http://www.isi.edu/gost/info/netcash/ • Users register with NC accounting servers, can write electronic cheques and send (e.g email). When deposited, balance is transferred from sender to receiver account. • NetCash - http://www.isi.edu/gost/info/netcheque/ • It supports anonymity and it uses the NetCheque system to clear payments between currency servers. • Paypal.com– account+email is linked to credit card. • Enter the recipient’s email address and the amount you wish to request. • The recipient gets an email notification and pays you at www.PayPal.com
Nimrod-G:The Grid Resource Broker Soft Deadline and Budget-based Economy Grid Resource Broker for Parameter Processing on P2P Grids
Parametric Computing(What Users think of Nimrod Power) Parameters Magic Engine Multiple Runs Same Program Multiple Data Killer Application for the Grid! See IPDPS 2000 paper! Courtesy: Anand Natrajan, University of Virginia
P-study Applications -- Characteristics • Code (Single Program: sequential or threaded) • High Resource Requirements • Long-running Instances • Numerous Instances (Multiple Data) • High Computation-to-Communication Ratio • Embarrassingly/Pleasantly Parallel
Sample P-Sweep Applications Bioinformatics: Drug Design / Protein Modelling Combinatorial Optimization: Meta-heuristic parameter estimation Ecological Modelling: Control Strategies for Cattle Tick Sensitivityexperiments on smog formation Data Mining Electronic CAD: Field Programmable Gate Arrays High Energy Physics: Searching for Rare Events Computer Graphics: Ray Tracing Finance: Investment Risk Analysis VLSI Design: SPICE Simulations Civil Engineering: Building Design Network Simulation Automobile: Crash Simulation Aerospace: Wing Design astrophysics
Thesis • Perform parameter sweep (bag of tasks) (utilising distributed resources) within “T” hours or early and cost not exceeding $M. • Three Options/Solutions: • Using pure Globus commands • Build your own Distributed App & Scheduler • Use Nimrod-G (Resource Broker)
Executing Remotely Choose Resource Transfer Input Files Set Environment Start Process Pass Arguments Monitor Progress Summary View Job View Event View Read/Write Intermediate Files Transfer Output Files +Resource Discovery, Trading, Scheduling, Predictions, Rescheduling, ...
Using Pure Globus commands Do all yourself! (manually) Total Cost:$???
Build Distributed Application & Scheduler Build App case by case basis Complicated Construction E.g., AppLeS/MPI based Total Cost:$???
Use Nimrod-G Aggregate Job Submission Aggregate View Submit & Play!
Nimrod & Associated Family of Tools Remote Execution Server (on demand Nimrod Agent) P-sweep App. Composition: Nimrod/ Enfusion Resource Management and Scheduling: Nimrod-G Broker Design Optimisations: Nimrod-O App. Composition and Online Visualization: Active Sheets Grid Simulation in Java: GridSim Drug Design on Grid: Virtual Lab File Transfer Server Upcoming?: HEPGrid (+U. Melbourne), GAVE(+Rutherford Appleton Lab) Grid (Un)Aware Virtual Engineering
Nimrod/G : A Grid Resource Broker • A resource broker for managing and steering task farming (parametric sweep) applications on computational Grids based on deadline and computational economy. • Key Features • A single window to manage & control experiment • Resource Discovery • Resource Trading • Scheduling & Predications • Transportation of data & results • Steering & data management • It allows to study the behaviour of some of the output variables against a range of different input scenarios.
A Glance at Nimrod-G Broker Nimrod/G Client Nimrod/G Client Nimrod/G Client Nimrod/G Engine Schedule Advisor Trading Manager Grid Store Grid Dispatcher Grid Explorer Grid Middleware Globus, Legion, Condor, etc. TM TS GE GIS Grid Information Server(s) RM & TS RM & TS RM & TS G C L G Legion enabled node. Globus enabled node. L G C L RM: Local Resource Manager, TS: Trade Server Condor enabled node. See HPCAsia 2000 paper!
Nimrod/G Grid Broker Architecture Legacy Applications Customised Apps (Active Sheet) Monitoring and Steering Portals Nimrod Clients P-Tools (GUI/Scripting) (parameter_modeling) XML? Farming Engine Meta-Scheduler XML Algorithm1 Programmable Entities Management Schedule Advisor . . . Resources Jobs Tasks Channels AlgorithmN Nimrod Broker IP hourglass ? AgentScheduler Agents JobServer Grid Explorer Trading Manager Database (Postgres) Dispatcher & Actuators . . . Legion-A P2P-A Globus-A . . . Condor GMD Middleware Globus Legion P2P GTS G-Bank . . . Computers Local Schedulers Storage Networks Instruments Fabric . . . PC/WS/Clusters Condor/LL/Mosix/ Database Radio Telescope
Cost A Nimrod/G Monitor Deadline Legion hosts Globus Hosts Bezek is in both Globus and Legion Domains
Active Sheet: Spreadsheet Processing on Grid Nimrod Proxy Nimrod/G See HPC 2001 paper!
I/O server File access Nimrod/G Interactions Resource Discovery Grid Info servers Scheduler Grid Trade Server Resource allocation (local) Farming Engine Queuing System Nimrod Agent User process Dispatcher Process server “Do this in 30min. for $10?” Root node Gatekeeper node Computational node