340 likes | 555 Views
IBM Initiatives in Autonomic Computing. Alan Ganek Vice President, Autonomic Computing. The on demand Era. Accelerating advances in technology Deeper integration of IT with business systems Emergence of industry ecosystems. Sense and respond to business changes. Simplify infrastructure.
E N D
IBM Initiatives in Autonomic Computing Alan Ganek Vice President, Autonomic Computing
The on demand Era Accelerating advances in technology Deeper integration of IT with business systems Emergence of industry ecosystems Sense and respond to business changes Simplify infrastructure Ensure continuity Improve cost structure Deliver unique value to customers Protect privacy
The good news… Processor Storage Communications Systems
Business Challenges Missing or Loss of critical data is immeasurable Up to 40% of today’s outages are unscheduled stoppages Complex, heterogeneous environments Poorly documented legacy applications make it painful to diagnose and resolve complex cross-product problems 25-50% of IT resourcesare spent on problem determination and resolution The skills needed to do manual cross-product problem determination are scarce and expensive Outages of mission-critical systems cost quite a bit Outages & unscheduled work leads to saturation on backup systems & power systems
Autonomic Vision “Intelligent” open systems that: • Manage complexity • Know themselves • Continuously tune themselves • Adapt to unpredictable conditions • Prevent and recover from failures • Provide a safe environment Focus on Mission, not infrastructure! “Autonomic computing allows companies to operate more efficiently and achieve more from their existing IT environments, enabling increased responsiveness, business continuance and availability.” —Rick Sturm
Autonomic Computing Attributes Self-managing systems that deliver: Increased Responsiveness Adapt to dynamically changing environments Business Resiliency Discover, diagnose,and act to prevent disruptions Operational Efficiency Tune resources and balance workloads to maximize use of IT resources Secure Information and Resources Anticipate, detect, identify, and protect against attacks
Improved resiliency and quality of service Always there when you need it Safe and secure Time to Value Improves responsiveness Reduction in Risks “Autonomic computing is a vision that will take several years to realize, but with the model that IBM has outlined, there are benefits attainable at every step, which pay you back...fairly quickly for the investments you make.” —Mike Gilpin How does autonomic computing help organizations? • Reduction in Infrastructure Costs • Better asset utilization • More productive people • Reinvestment of IT productivity and cost savings
Deliver core infrastructure technologies that provide for an open framework for the industry • Deliver products with built-in autonomic capabilities • Create open standards for autonomic computing The Big Picture of Autonomic Computing Technology • Define a base reference architecture model which creates a common vernacular for autonomic computing Business policy Resource Provisioning Autonomic core capabilities Solution Install Policy Autonomic Computing Architecture Problem Determination Admin Console Products delivering autonomic features Open Standards
ISV Solutions System Mgmt Database Application Servers WebServices Open Grid Services Architecture - OGSA Servers Storage Network Industrystandards are key A Holistic Approach Customer-centric solutions, not just products • End-to-end automation enabled by consistent technology across all system elements • Based on open standards • Distributed globally into a single virtual operating environment Autonomic Capabilities Applications Professional Services Architecture Framework
Sensors Effectors Analyze Plan Monitor Execute Knowledge Autonomic Manager Action Data Effectors Sensors Resource Manager Managed Element Autonomic Computing Architecture Overview Manageability Interface
Autonomic Control Loops: next step evolution BUSINESS SLA POLICY USER RESPONSE TIME AVAIL. RESOURCE Globalenvironmentview and knowledge Autonomic features Local view
Customer Relationship Management Enterprise Resource Planning Business Solutions (Business Policies, Processes, Contracts) Multiple Contexts for Autonomic Behavior Server Farm Enterprise Network Storage Pool Groups of Elements (Inter-elementself-management) System Elements (Intra-element self-management) Network Devices Servers Storage Middleware Database Applications
ThinkPad Active Protection System… A patented hardware and software solution that can reduce hard drive failures by avoiding fall-induced damage • Hardware: Motion Detector (or “accelerometer”) on the system board senses movement that could cause system damage • Same chip used in automobile airbags • Software: Interprets signals from the accelerometer and signals the hard drive to temporarily park the drive head when rapid system motion or vibration is detected • Can differentiate between potentially harmful movements and repetitive motion (such as in a car or airplane) Standard on all T41 and R50 ThinkPads The system senses computer’s orientation and displays it on the screen real-time
TotalStorage:SAN File Systems and SAN Volume Controller Files are automatically provisioned based upon policy based rules Individual application servers do not manage their own file system W2K SUN AIX UNIXLinux The IBM TotalStorage Virtualization software is designed to automate configuration and provisioning across multiple servers. Tasks that historically have been done manually, are automated, thus reducing complexity and increasing productivity HP SAN File System and SAN Volume Controller Virtualization hides the physical topology from the application servers Storage attributes can be auto-discovered Storage can be changed without disruption to the application server Storage can be aggregated into storage pools with similar service class
Load Balancer Policy-driven Self-Optimizing Solution:IBM Server Allocation for WebSphere High priority Database Server WebSphere Transactional Grid Stock Trading Mid priority Account Manager Low priority Forecaster Advice Application • Multiple WebSphere transactional applications • Multiple Service Level objectives • Dynamic and automated application provisioning Application Provisioning Parallel Services IBM Server Allocation for WebSphere WebSphere Application Server v5
Tourcast SU Utilization Annika at the Colonial Allocated Thursday Friday Saturday Sunday 200 150 100 50 0 0:59 1:59 2:59 3:59 4:59 5:59 6:59 7:59 8:59 9:59 13:59 14:59 15:59 17:59 19:59 20:59 21:59 22:59 10:59 11:59 12:59 16:59 18:59 23:59 Hour of Day IT Infrastructure Orchestration Business challenge • Extend Events to millions of sports fans • Real-time “point-by-point” scores and statistics • Latest news delivered online On demand business benefits • Secure and scalable infrastructure (up to 50 times the previous infrastructure capacity) • Real-time access to all play-by-play tournament information on “as-needed” basis • Global reach with around-the-clock availability When resources not in use, allocation is given to biotechnology research Tivoli Intelligent Orchestrator
Intelligent Orchestration Saves Time and Costs: IBM Tivoli Intelligent Orchestrator before after Intelligence 100 Servers at 20% Capacity 50 Servers at 40% Capacity utilizing current infrastructure 100 Software Licenses 50 Software Licenses 3 Administrators 1 Administrator Maintenance Costs $$$$$ Maintenance Costs $ Add/Remove Resources in Days Add/Remove Resources in Minutes
Core Capabilities Enable Autonomic Behavior Service Support Solution install & maintenance Problem determination Common System Administration Service Delivery Policy-based security and management Autonomic monitoring Complex analysis Provisioning Heterogeneous workload management Business policy Resource Provisioning Autonomic core capabilities Solution Install Policy Autonomic Computing Architecture Problem Determination Admin Console Products delivering autonomic features Open Standards
Building an Open Framework for the Industry… Solution Install & Maintenance Creation of Installable Units: descriptors and artifacts Managing changes through the lifecycle of the solution • One consistent software installation technology across all products • Consistent and up-to-date configuration and dependency data • Reduced deployment time with less errors • Reduced software maintenance time, improved analysis of failed system components • Component-based product install Tooling IU IU IU IU Installer: Dependency Checkers, Install, Configuration & Verification Actions, Updates IU IU Standards-based: OGSA, Web Services,CIM Solution Installed in Hosting Environment
Building an Open Framework for the Industry… Integrated Solutions Console One Stop Shopping for all Administrative Tasks • One consistent user interface across product portfolio • Common runtime infrastructure and development tools based on industry standards, component reuse • Provides a presentation framework for other autonomic core technologies • Through a unified portal ... n Standards-based: J2EE, JSR168 Unified end-to-end customer tasks from hardware management thru middleware
Building a Self-Healing System Self Healing Remediation • Consult Policy • Take corrective action • Closed Loop Analysis • Complex analysis • Suggested Corrective Action Data capture • Symptom databases • Correlation • Simple analysis • Common log format • Standardization • Adapters Level 1 Increasing level of automation
Applications Adapters Database • Disparate pieces and parts • Tools focused on individual products • No common interfaces among tools • No synergies in building tools OR in creating log entries ApplicationServer Servers • Generic Log Adapters • Common formats for log files • Common set of tools • Common interfaces among tools Storage devices Adapters Networks Log Format Tomorrow Common Base Event submitted to OASIS Log Format Today common base event
Log and Trace Analyzer for Autonomic Computing • Viewing, analysis, and correlation of log files • Consolidated environment that deals with logs and traces produced by various components • Easier and faster for developers and support personnel to debug and resolve problems • Link to WebSphere symptom database available today
Symptomservice Policies action situation actionable cause validate Knowledge Knowledge In development In development Tivoli Autonomic Monitoring Engine Resource Model: Process Analysis Resource Model: Event Log Processing Resource Model: Memory Management Analyze Plan data analysis information analysis knowledge reasoning Data e.g.: Memory, CPU Knowledge e.g.: Restart, command Information e.g.:App stopped, Mem leak Monitor Execute Action Data
Adapters Adapters autonomic manager Autonomic Computing Self-Healing Systems Tivoli Autonomic Monitoring Engine Feedback Applications Database ApplicationServer Knowledge Data Policies Servers common base event Storage devices Symptomservice Policyengine Networks
Innovation Needed!! Research Challenges • Life cycle of autonomic elements • Multi-agent learning & negotiation/conflict resolution • Software tools • Testing, verification, robustness • Policies and SLA’s • Availability, fault tolerance & recovery • Continuous operations • Problem determination • Optimization & prediction • End to end security • Distributed resource management & scaling • Peer system interaction • Context awareness • Human computer interface • Metering, monitoring & control • Cultural change & trust • . . . .
Autonomic Computing alphaWorks Zone • Get started developing autonomic solutions now • Available on alphaWorks: • Log and Trace Tool • Business Workload Management Developer Kit • Tivoli Resource Model Builder • Agent Building and Learning Environment (ABLE) • IBM Grid Toolbox • Web Services Tools • Coming soon – components from: • Autonomic Computing Toolkit • Solution Install • Policy-based Management • and more! www.alphaworks.ibm.com/autonomic
the journey has started…… • Products, services available today • Architecture and core technologies emerging • IBM is working with business partners and standards organizations to develop open standards for self-managing systems • Broad IT industry participation is needed – this is an industry-wide initiative • Innovation & Collaboration are a must!! Aggressive research is essential!! Freeing people to focus on their business instead of their infrastructure
Questions? • Web sites: www.ibm.com/autonomicwww.research.ibm.com/autonomicwww.alphaworks.ibm.com • A Vision of Autonomic Computing • IEEE Computer, January 2003 • IBM Systems Journal special issue on Autonomic Computing • http://www.research.ibm.com/journal/sj42-1.html