300 likes | 465 Views
Performance Management and Capacity Planning Solution GigaWorld 2004. TeamQuest Solution. Complete software suite for Performance Management & Capacity Planning Cross-platform support for all major vendors (Sun Solaris, IBM AIX, HP-UX, etc) Integrates with SNMP console applications
E N D
Performance Management andCapacity Planning SolutionGigaWorld 2004
TeamQuest Solution • Complete software suite for Performance Management & Capacity Planning • Cross-platform support for all major vendors (Sun Solaris, IBM AIX, HP-UX, etc) • Integrates with SNMP console applications • Flexible software that is easy to customize and adapt to heterogeneous IT environments • Extremely low resource consumption • Little manual labor required for daily operation
Data Collection Application Agents #!/bin/bash User Agents OS/Kernel Agents
Data Collection Aggregation Real time sampling 10 Minute Data Points 1 Hour Data Points 8 Hour Data Points Event Logs & Process Table 1 Minute Data Points TabularData NumericalData 1 Day 8 Hours 8 Days 5 Weeks 13 Months
Complete Process Information 08:01 08:02 08:03 08:04 08:05 TQ TQ TQ TQ TQ /var/adm/pacct
Trend Alarm Linear Trend Analysis (LTA) 1 Hour Projection Data Points 5 Weeks CPU runq_sz, 21 day projection
Exception Alarm Short Term Analysis (STA) 1 Hour Data Points Day-of-Week Data Points Percent CPU by workload, typical week Mon-Fri 5 Weeks 7 Days
Enterprise Solution Analysis SummaryReports CapacityPlanning
Customer Reports Internal External
Alarms Thresholds and severities “All-clear” notification Qualifiers
Alarm Notification Options TQ Alarm Table SNMP trap to console Run command
Central Administration TeamQuest Administration Server Workloads Reductions Alarms Solaris Reductions AIX HP-UX Systems HP-UX Workloads Workloads Alarms Sun Reductions AIX HP-UX AIX Systems Alarms Workloads Sun Alarms AIX Reductions HP-UX Solaris Systems
Deployment • Central deployment with replicated installation ”Prototype” system Network drive with: TQ install binaries “Response” file Key file “Response” file ./install.sh ./install.sh ./install.sh ./install.sh ./install.sh
Capacity Planning • Overview • Basics of Modeling • Example: Server Consolidation
Overview Performance Analysis Capacity Planning • Daily Follow-up • Baselining • Troubleshooting • Tracing • Bottlenecks • Correlation Analysis • Historical & Real-time Analysis • Optimization of OS/apps • Stress Testing Trending Modeling • Statistical Trends • Regression-based • Mathematical Simulation • Model-based • Growth Predictions • Server Consolidation TQView TQModel TQView TQWeb TQAlert
Stretch Factor • In the simplest terms, modeling is about finding the point where the server starts to slow down. • This state occurs when the StretchFactor reaches 2 in TeamQuest Model • Stretch Factor is a general indication of the performance of the server. • At Stretch Factor 2 the performance of the server starts to degrade exponentially.
Example • Six stand-alone servers will be consolidated into one • All existing applications will be moved to the sixth • The sixth server should also be able to handle some growth
Server Utilization • TeamQuest View and Workloads allow us to analyze each server in order to get a complete picture of the resource utilization on the servers involved in the consolidation exercise. • The resource utilization is measured as a TeamQuest Workload, which is a logical grouping of the work being done on the server.
Peak Hour • Select a time for each server. • Our goal is to size the sixth server for the worst case, i.e. all application may peak at the same time.
Get input file • Extract the chosen time frame, download the data file and open it in TeamQuest Model Web Browser TeamQuest Model
Calibrate • Calibrate the downloaded data file in TeamQuest Model. • A calibrated model can be manipulated and allows us ask what-if questions, simulate different growth scenarios and test different hardware. Raw data Calibration Calibrated Model
Merge models • Merge the calibrated models into the target system model. By merging the models we are in effect moving all applications and load to one server. Merge
Stretch factor • Solve the merged model and verify the results. Generally speaking, a stretch factor of less than 2 signifies a successful consolidation. The sixth server will be able to handle the extra load from other five with existing hardware. Before the consolidation After the consolidation
Increase load • Question: How will our consolidated server respond to a 5% increase per month of the “Order” workload?
Predicted stretch factor • Answer: the server will hit its limit in June when the stretch factor reaches 2. • It is important to follow-up on predictions made so that future predictions are more realistic and accurate.
Summary • Optimize utilization levels on existing servers • Predict performance problems with trend alarms and exception reports • Test new applications, consolidate servers, and upgrade hardware with confidence • Create workloads and know exactly how much resources are consumed by different customers • Consolidate performance data into enterprise databases for cross-server reporting • Spend less time monitoring your servers and more time on productive work