590 likes | 622 Views
Cluster and Grid Computing Information Session / Introduction. Subject Code: COMP90024 / 433-678. Clou d Computing and D istributed S ystems (CLOUDS) Lab . The University of Melbourne Melbourne, Australia www.cloudbus.org. Dr. Rajkumar Buyya. Agenda. Introductions
E N D
Cluster and Grid ComputingInformation Session / Introduction Subject Code: COMP90024 / 433-678 Cloud Computing and Distributed Systems (CLOUDS) Lab. The University of MelbourneMelbourne, Australiawww.cloudbus.org Dr. Rajkumar Buyya
Agenda • Introductions • Teaching Staff and Students • Why this subject and news items • Topics Covered • Books/References • Assessments • IT Resources • Expectations
Teaching Staff • Lecturer: Dr. Rajkumar Buyya • Email: raj@csse.unimelb.edu.au • Web: http://www.buyya.com • Phone: 834 41344 • Office: 5.31 (ICT Building) • Office hours: open most of the time except travel/meetings time. • Discussion: Prefer if you catch me after the lecture. • Teaching Associate: • Dr. Rodrigo Calheiros rnc@unimelb.edu.au • Phone: 41335, Office: 5.35 • Handles lectures in my absence and assists with labs/projects. • Office hours: open most of the time, but don’t disturb too much!
Web and Course Schedule • Course Web Site: • http://www.csse.unimelb.edu.au/678 • Lectures: • Time: • Fri: 2:15pm-5:15pm • We will have a 10 minutes break in between. We can always finish the class by 5.00pm, if you like to have early dinner! • Venue: • ICT Building – Theatre 3
Why learn/teach such subjects ? Futuristic, Challenging, Exciting, Solves world’s problems + Helps in Getting Jobs!
Why learn/teach such subjects ? 2008-10 World Order Parallel and networked!- multicore, green, Cloud 1 Economic crisis!- Aug 2008 1 2 Flood crisis!- 2011 in Australia
Top 10 for 2010(Grid in the form of Cloud is becoming real for business and consumer world!)
Open Data Center Alliance for Cloud Computinghttp://www.opendatacenteralliance.org/
Subscription Oriented Cloud Services: X{compute, apps, data, ..} as a Service (..aaS) Clients Other Cloud Services Govt. Cloud Services
Topics Covered (1) • Part A: Foundation • Socket Programming • Thread Programming • Elements of Parallel Computing • Part B: Cluster Computing • Elements of Cluster Computing • Cluster Architecture and Components • Single System Image: Concepts and Levels • Parallel Programming Models and Paradigms • MPI Programming
Topics Covered (2) • Part C: Grid Computing Cloud Computing • Elements of Grid Computing • Grid Resource Management and Grid Economy • Nimrod-G Grid Resource Broker • Economic Scheduling Algorithms • Grid Security Infrastructure • Grid Middleware: Globus and Gridbus Broker • Recent Advances: • Virtualization and Cloud Computing • Aneka Cloud Application Platform • Applications
Practical Skills Acquired (1) • Part A: Foundation • Socket Programming • Thread Programming • How to write multi-threaded servers and clients • Part B: Cluster Computing • Parallel Programming: • MPI Programming • Coordination of multiple processes and load balancing • Performance Evaluation on real Clusters
Practical Skills Acquired (2) • Part C: Grid (or Cloud) Computing • Report Writing • Team-based projects • Medium scale software development • Developing Network computing programming: • Writing a simple Grid resource management system • Client with coordination ability • QoS-based scheduling • User-Interface design for network applications
Reference Text Books (First Chapters are available free online)
Books and References • Books (Selected Chapters): • R. Buyya (ed), High Performance Cluster Computing, Volume 1 and 2, Prentice Hall, NJ, USA, 1999. • R. Buyya and K. Bubendorfer (eds.), Market Oriented Grid and Utility Computing, Wiley Press, New York, USA, 2009. • R. Buyya, J. Broberg, and A. Goscinski (eds), Cloud Computing: Principles and Paradigms, Wiley Press, New York, USA, Feb. 2011. • Programming Reference: • R. Buyya, S. Selvi, X. Chu, “Object Oriented Programming with Java: Essentials and Applications”, McGraw Hill, New Delhi, India, 2009. • Sample chapters at book website: http://www.buyya.com/java/ • Research Articles: • To be supplied by the Lecturer at free of cost!
Presentation Slides • Usually on the web before lecture • They may be updated slightly before the lecture to reflect recent developments • Lecturer is happy to print and distribute them in the class as an additional service (at no cost) whenever possible. • No need to read beforehand if you don’t want to! • Many of them are derived from prescribed references. • Others from latest literature (magazines, journals) – link to them from Course Webpage
Course Assessment • Project work and some short assignments: • During semester worth 50%; • Expected to take about 40+ hours. • Written examination: • A written examination (three hours) at the end of the semester worth 50%. • All components must be completed satisfactorily to pass the subject.
Internal Assignments • Assignment 1 (5 marks): • Multithreaded server for… • Assignment 2 (10 marks): • MPI programming • (Team) Assignment 3 / Project (35 marks): • A study on project related technologies and a report (10) • A project in Cluster/Grid computing area (25 marks) • Written Exam: 50 marks. Check out Model Exam Paper • NOTE: • (1) You should score 50% minimum in every assignment to pass! Therefore, attend all classes. (2) Marks also correspond to the % of assessment component.
Computational Resources • Dept. Computing Resources: • You all have access to many servers by now? • Talk to technical staff – for details and support. • Use them for simple assignments • CLOUDS Lab Computing Resources: • Mainly for projects. • Lab exercises/assignments requiring special software setup by Associate Lecturer. • Associate Lecturer is responsible for providing support. • Microsoft .NET Lab • Some may be able to do project based on .NET if you learn .NET concepts yourself.
What do we expect from you? • 1. Regular attendance of lectures: • Pay full attention, be enthusiastic, fully committed to learn new things, ask questions during the class, participate in discussions. • 2. Review previous lecture material before coming to the class. – read material from reference books and recommended papers. • 3. Start working on assignments right from the day they are announced and submit on time. • 4. Be a Team player during the Group Project. • 5. If you have some problem with the lectures/subject/??, please discuss with us early. • Don’t take out your frustrations on me during QoT
QoT (Quality of Teaching) Question • # Q1, I had a clear idea of what was expected of me in this subject 5. Strongly agree : 4. Agree : 3. Neutral : 2. Disagree : 1. Strongly disagree : Mean :
Evolution: Humans eHumans (eHugging, eSmell, eFood!), Science eScience, Business eBusiness
Computing and Communication Technologies Evolution: 1960-2010! * HTC * P2P * PDAs Minicomputers * * PCs * Workstations * Mainframes * Grids COMPUTING * PC Clusters * Computing as Utility (Cloud) * Crays * MPPs * WS Clusters * XEROX PARC worm * e-Science * e-Business * IETF * W3C * TCP/IP Communication * Ethernet * HTML * Mosaic * Web Services * Email * Sputnik * SocialNet * Internet Era * WWW Era * XML * ARPANET 2010 1960 1970 1975 1980 1985 1990 1995 2000 Control Decentralised Centralised
Moore’s Law. Transistors on a single chip doubles ~ every 18–24 months. Gilder’s Law. Aggregate bandwidth triples ~ every year. Metcalfe’s Law. The value of a network may grow exponentially with the number of participants. Three Laws of Computing:Defining the growth rate of network & computing speed, and their value. Source: Cambridge Energy Resource Associates
Resource Hungry Applications • Solving grand challenge applications using modeling, simulation and analysis Aerospace Internet & Ecommerce Life Sciences CAD/CAM Digital Biology Military Applications Military Applications Military Applications
How to Run Applications Faster ? • There are 3 ways to improve performance: • Work Harder • Work Smarter • Get Help • Computer Analogy • Using faster hardware • Optimized algorithms and techniques used to solve computational tasks • Multiple computers to solve a particular task
2100 2100 2100 2100 2100 2100 2100 2100 2100 Computing is Scaling: Towards Inter-Planetary Level SERV ICES + PERFORMANCE Administrative Barriers • Individual • Group • Department • Campus • State • National • Globe • Inter Planet • Universe Personal Device SMPs or SuperComputers Global Grid Inter Planet Grid Local Cluster Enterprise Cluster/Grid
Towards Clusters The promise of supercomputing to the average PC User ?
PDA Clusters 1990 1995+ 2000+ 1980s 1960 Cluster computing: Past, Present, Future
HPCC Books, 2 Volumes - Prentice Hall, 1999Edited by R.Buyya with contributions from over 100 leading researchers (www.buyya.com/cluster/)
Definition: What is a Cluster? • A cluster is a type of parallel or distributed processing system, which consists of a collection of interconnected stand-alone computers cooperatively working together as a single, integrated computing resource. • “stand-alone” (whole computer) computer that can be used on its own (full hardware and OS).
Cluster Applications • Numerous Scientific & engineering Apps. • Business Applications: • E-commerce Applications (Amazon, eBay); • Database Applications (Oracle on clusters). • Internet Applications: • ASPs (Application Service Providers); • Computing Portals; • E-commerce and E-business. • Mission Critical Applications: • command control systems, banks, nuclear reactor control, star-wars, and handling life threatening situations.
Cluster 1 Scheduler Master Daemon LAN/WAN Submit Graphical Control Cluster 3 Execution Daemon Scheduler Clients Master Daemon Cluster 2 Scheduler Submit Graphical Control Execution Daemon Master Daemon Clients Submit Graphical Control Execution Daemon Clients Cluster of Clusters - Hyperclusters
What does Grid mean?/ (It means different things to different people) • IBM • On Demand Computing • Microsoft • .NET • Oracle • 10g • Sun • N1 – Sun Grid Engine • HP • Adaptive Enterprise • Amazon • Elastic Compute Cloud Services • United Devices and related companies: • Harvesting Unused Desktop resources
What is Grid?[Buyya et. al.] • A type of parallel and distributed system that enables the sharing, exchange, selection, & aggregation of geographically distributed “autonomous” resources: • Computers– PCs, workstations, clusters, supercomputers, laptops, notebooks, mobile devices, PDA, etc; • Software– e.g., ASPs renting expensive special purpose applications on demand; • Catalogued data and databases– e.g. transparent access to human genome database; • Special devices/instruments – e.g., radio telescope – SETI@Home searching for life in galaxy. • People/collaborators. depending on their availability, capability, cost, and user QoS requirements. Widearea
What is Grid?(It means different things to different people) • IBM • On Demand Computing • Microsoft • .NET • Oracle • 10g • N1 – Oracle (former Sun) Grid Engine • HP • Adaptive Enterprise • United Devices and related companies: • Harvesting Unused Desktop resources
database A Bird Eye View of World-Wide Grid Environment Grid Information Service Grid Resource Broker Application R2 R3 R4 R5 RN Grid Resource Broker R6 R1 Resource Broker Grid Information Service
Prominent Grid Drivers: Emerging eScinece and eBusiness Apps • Next generation experiments, simulations, sensors, satellites, even people and businesses are creating a flood of data. They all involve numerous experts/resources from multiple organization in synthesis, modeling, simulation,analysis, and interpretation. ~PBytes/sec High Energy Physics Brain Activity Analysis Newswire & data mining: Natural language engineering Digital Biology Life Sciences Astronomy Quantum Chemistry Finance: Portfolio analysis Internet & Ecommerce
Traditional (e.g., Oracle 9i) Tight/Vertical Integration of Storage, Database, Application Hosting Server, and Application Elements They reside on a single computing resource. Enhancing capability means a new investment: Replace a machine by new one or upgrade it. Can’t leverage existing resources. Expensive approach. Grid Based (e.g., Oracle 10g, 11g) Disintegration of Storage, Database, Application Hosting Server, and Application Elements They reside on a different resources in a Grid environment. Enhancing capability means: Leveraging existing resources Dynamic provisioning Cost-effective approach Oracle 10g: Towards Enterprise Grid Model