vC Ops Manager v5.0 Jim Davidge – Solution Architect Joel Stephens – Solution Architect
vCenter Operations Management Suite: Automated Operations Management • Market segments: Small and Mid-size Business; Enterprise • Target environments: Smaller vSphere environments; Larger vSphere environments; Virtual and cloud infrastructure; Virtual, cloud and heterogeneous environments • Editions (in slide order): Enterprise Edition; Standard Edition; Enterprise Plus Edition; Advanced Edition • Edition taglines (in slide order): Maximize the performance, workload and health of vSphere. Ensure service levels, enforce configuration compliance and optimize for efficiency and cost. Everything you need to ensure optimal performance, efficiency and compliance in the infrastructure and inside the guest. Ensure best performance, highest resource efficiency and plan for future capacity needs. • Components (in slide order): VC Ops Mgr 5.0 (incl. CapIQ); VC Ops Mgr 5.0 – Std.; VC Ops Mgr 5.0 (incl. CapIQ); VC Ops Mgr 5.0 (incl. CapIQ); VCM for vSphere **; VC Configuration Mgr; VC Infra Navigator **; VC Infra Navigator **; Chargeback Mgr; Chargeback Mgr
What’s New in vC Ops Mgr 5.0 • New Name – VMware vCenter Operations Manager • Key part of the VMware vCenter Operations Management Suite • CapacityIQ merged with vC Ops • CapacityIQ gets vC Ops features • *New* Dashboard • New Badges (11, up from 3) • Improved Details Page • Greater Emphasis on the Datastore (First Class Object) • Performance Management and Capacity Management • New Integrations • VCM and vC Ops • vC Ops and Chargeback
vC Ops Mgr 5.0 – vApp Architecture [Diagram: the vCenter Operations Manager vApp consists of an Analytics VM and a UI VM. Labeled components: Collector; Custom WebApp; vSphere WebApp; Admin WebApp; OpenVPN; ActiveMQ; Capacity Analytics; Performance Analytics; Postgres DB (two instances); FSDB. Labeled data: rolled up capacity data; metric data.]
vC Ops Mgr 5.0 – High Level Architecture [Diagram: the vCenter Operations Manager vApp (Analytics VM and UI VM) shown with its users and data sources. User interfaces: vC Ops Mgr vSphere UI; vC Ops Mgr Custom UI. vApp components: Collector; Custom WebApp; vSphere WebApp; Admin WebApp; OpenVPN; ActiveMQ; Capacity Analytics; Performance Analytics; Postgres DB (two instances); FSDB. Labeled data: rolled up capacity data; metric data. Data sources: vCenter (vSphere environments), with vCenter communications over SSL; VMware Cloud / vCenter; vCenter Configuration Manager; 3rd party data sources.]
vC Ops Mgr 5.0 – UIs Accessing vC Ops Mgr – Three (3) UIs • *NEW* – vC Ops vSphere UI • Available in all Editions • Summary and Deep Dive view into vSphere • Used by the VI Admin and Infrastructure Teams • https://<UI VM IP> or https://<UI VM IP>/vcops-vsphere • vC Ops Custom UI • Available in the Enterprise editions of the suite • What you would know as the vC Ops Enterprise 1.x UI • Used by Operations • Provides a view into the ENTIRE enterprise • https://<UI VM IP>/vcops-enterprise • vApp Admin UI • https://<UI VM IP>/admin
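The three entry points listed on this slide all hang off the UI VM address. A minimal Python sketch of building them follows; the function name `vcops_ui_urls` and the example address are hypothetical, and only the URL paths come from the slide.

```python
def vcops_ui_urls(ui_vm_ip: str) -> dict:
    """Return the three vC Ops Mgr 5.0 UI URLs for a given UI VM address.

    The paths are the ones listed on the slide; the helper itself is
    illustrative and not part of any VMware tooling.
    """
    base = f"https://{ui_vm_ip}"
    return {
        "vsphere": f"{base}/vcops-vsphere",    # vSphere UI
        "custom": f"{base}/vcops-enterprise",  # Custom UI
        "admin": f"{base}/admin",              # vApp Admin UI
    }


if __name__ == "__main__":
    # 192.0.2.10 is a documentation-only example address.
    for name, url in vcops_ui_urls("192.0.2.10").items():
        print(f"{name}: {url}")
```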
What’s New in vC Ops Mgr 5.0 • Brand New UI Navigation • Tons of new features and functions • Tabs and Sub-Tabs for Simpler Navigation • Left Pane Navigation Window matches vCenter Topology Tree
vC Ops vSphere UI – Unified Dashboard • Launching Pad • Click to drill down • Focused on problems • Click to drill into details! • Almost everything is clickable • Main Themes • Health • Risk • Efficiency • New Concepts • Faults • Weekly Stress Profile • Reclaimable Waste • Density
vC Ops vSphere UI – Two Different Users • Short and Long Term Capacity • Forward Looking • Are there areas that I should be concerned about from a capacity perspective? • Have I deployed my VI in the most efficient manner? • Operations • Immediate problems • What is happening right now? • What do I need to pay attention to?
vC Ops Default UI – Major and Minor Badges • Major Badges (x 3) • High-level understanding • Calculated from scores of Minor Badges • Minor Badges (x 8) • Specifics • Guidance
Operations: Major Badge – Health • “How is this object doing right now?” • Identifies current problems in the system • Issues that need to be resolved immediately to avoid problems • High Health is good (100-0) • Heatmap • Provides quick view of many objects at once • Shows Health of all parent and child objects • Go back in time (6 hours) and see the “weather” of the virtual infrastructure • Health Score is calculated from its Minor Badges • Workload • Anomalies • Faults
Operations: Health Minor Badge – Workload • Measures how hard an object is working • High Workload is bad (0-100 or more!) • Percentage of Demand divided by effective capacity • As workload approaches (and exceeds) 100% • Performance problems! • Object is starved for resources! • Focused attention • CPU • Memory • Disk I/O • Network I/O • Improved Network and Disk I/O calculations • Eliminates idle networks and storage from showing High Workload • Limits the erroneous 100% Workload scores
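The Workload definition on this slide (demand as a percentage of effective capacity) can be sketched in a few lines of Python. The function names and sample figures are hypothetical, and reporting the most constrained resource as the object's workload is an assumption made for illustration; the slide does not say how the per-resource values are combined.

```python
def workload_percent(demand: float, effective_capacity: float) -> float:
    """Workload = demand / effective capacity, as a percentage.

    Values above 100 are possible: the object wants more than it can get.
    """
    if effective_capacity <= 0:
        raise ValueError("effective capacity must be positive")
    return 100.0 * demand / effective_capacity


def most_constrained(resources: dict) -> tuple:
    """Return (resource name, workload %) for the busiest resource.

    `resources` maps a name to a (demand, effective_capacity) pair.
    Picking the maximum is an assumption, not a documented vC Ops rule.
    """
    scores = {name: workload_percent(d, c) for name, (d, c) in resources.items()}
    worst = max(scores, key=scores.get)
    return worst, scores[worst]


if __name__ == "__main__":
    sample = {
        "cpu": (9.0, 10.0),      # GHz demanded, GHz available
        "memory": (52.0, 48.0),  # GB demanded, GB available
    }
    print(most_constrained(sample))
```

In the sample, memory demand exceeds effective capacity, so its workload is above 100%, which is the "starved for resources" case the slide warns about.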
Operations: Health Minor Badge – Anomalies • Measures how normally the object is behaving • Equivalent to the vC Ops 1.x Health score, but inverted • Derived from the number of metrics that are outside of their “Normal” trended ranges • Learns dynamic ranges of “Normal” for each metric • Identifies metric abnormalities • Low Anomalies is good (0-100) • Zero means the object is performing exactly the way vC Ops expects it to for that time of the day and that day of the week • A high number of anomalies is usually an indication of a problem • Anomalies Chart • Current number of Abnormal Metrics • Problem/Noise Threshold • Crossing the problem threshold will increase the Anomalies Score • Does not generate an alert in the vSphere UI
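The idea behind the Anomalies chart (count the metrics that sit outside their learned "Normal" range and compare that count with the problem/noise threshold) can be illustrated as follows. This is a sketch under stated assumptions: the `MetricSample` type, the fixed ranges, and the threshold value are hypothetical, and the way vC Ops learns ranges or converts the count into a 0-100 score is not described in the slides.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class MetricSample:
    name: str
    value: float
    normal_low: float   # lower bound of the learned "Normal" range
    normal_high: float  # upper bound of the learned "Normal" range


def abnormal_metrics(samples: List[MetricSample]) -> List[str]:
    """Names of metrics whose current value is outside its normal range."""
    return [s.name for s in samples
            if not (s.normal_low <= s.value <= s.normal_high)]


def crosses_noise_threshold(samples: List[MetricSample], threshold: int) -> bool:
    """True when the abnormal-metric count exceeds the noise threshold."""
    return len(abnormal_metrics(samples)) > threshold


if __name__ == "__main__":
    samples = [
        MetricSample("cpu_usage", 92.0, 20.0, 60.0),
        MetricSample("mem_usage", 45.0, 30.0, 70.0),
        MetricSample("disk_latency_ms", 35.0, 1.0, 15.0),
    ]
    print(abnormal_metrics(samples))
    print(crosses_noise_threshold(samples, threshold=1))
```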
Operations: Health Minor Badge – Faults • Best Practices: • Do not change the Faults Threshold • Use Alerts View to manage Faults • Faults shown in Widget • Measures the degree of faults or problems the object is experiencing • Pulled from active vCenter events • VMware specific knowledge of which vCenter Events affect Availability and Performance (examples): • Loss of redundancy in NICs or HBAs • Memory checksum errors • HA failover problems • Low Faults is good (0-100) • Each fault has a default score (e.g. 25, 50, 75, 100) • The highest individual Fault score drives the object's Faults score
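The last point on this slide, that the highest individual fault score drives the object's Faults score, amounts to taking a maximum. A minimal sketch follows; the function name is hypothetical, and returning 0 when no faults are active is an assumption consistent with "Low Faults is good (0-100)".

```python
from typing import Iterable


def faults_score(active_fault_scores: Iterable[int]) -> int:
    """Object Faults score = highest score among its active faults.

    Returns 0 when there are no active faults (assumed, not stated
    on the slide).
    """
    return max(active_fault_scores, default=0)


if __name__ == "__main__":
    # Three active faults with default scores 25, 75 and 50.
    print(faults_score([25, 75, 50]))
```

One consequence of this rule is that several minor faults never add up to a worse score than a single severe one.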
Capacity Planning: Major Badge – Risk • Are there future risks to my systems and VI? • Identifies potential problems that could eventually hurt the performance • Low Risk is good (0-100) • Risk Score is calculated from its Minor Badges • Time Remaining • Capacity Remaining • Stress • Risk Chart • Shows Risk score over the last 7 days
Capacity Planning: Risk Minor Badge – Time Remaining • Measures time remaining before each resource type reaches its capacity • CPU • Memory • Disk • Network I/O • Early warning of upcoming provisioning needs • Avoid future performance issues • High Time Remaining is good (100-0) • Graph shows resource utilization trends
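One simple way to picture "time remaining before a resource reaches its capacity" is to fit a straight line to recent usage and extrapolate. The sketch below does exactly that with an ordinary least-squares slope. It is an assumption for illustration only: the slides do not state which trending method vC Ops uses, and the function name and sample data are hypothetical.

```python
from typing import List, Optional


def days_remaining(daily_usage: List[float], capacity: float) -> Optional[float]:
    """Estimate days until usage reaches capacity via a linear trend.

    `daily_usage` holds one usage value per day, oldest first.
    Returns None when usage is flat or falling (no exhaustion in sight).
    """
    n = len(daily_usage)
    if n < 2:
        raise ValueError("need at least two daily samples")
    mean_x = (n - 1) / 2
    mean_y = sum(daily_usage) / n
    sxx = sum((x - mean_x) ** 2 for x in range(n))
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in enumerate(daily_usage))
    slope = sxy / sxx  # usage growth per day
    if slope <= 0:
        return None
    # Extrapolate from the most recent sample; never report negative time.
    return max(0.0, (capacity - daily_usage[-1]) / slope)


if __name__ == "__main__":
    # Usage grows by 10 units per day towards a capacity of 100.
    print(days_remaining([10.0, 20.0, 30.0, 40.0], capacity=100.0))
```

The sketch returns days rather than a 100-0 badge score; the mapping from time remaining to a score is not given in the slides.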
Capacity Planning: Risk Minor Badge – Capacity Remaining • Measures how many more VMs can be placed on the object • Percentage of Total VM “Slots” Remaining • Based on the average size of the VM on the object (e.g. VM profile) • Each object has its OWN VM profile size: Host, Cluster, Datacenter, etc. • High Capacity Remaining is good (100-0) • Zero means no room left for more VMs • Example: 333 more VMs correlates to 77% Capacity Remaining for this object
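The "slots" idea on this slide can be sketched for a single resource: divide capacity by the average VM size to get slots, then report the remaining slots as a percentage of the total. The function name, the single-resource simplification, and the sample figures are assumptions; the sample is chosen only so that it reproduces the slide's example of 333 remaining VMs at about 77%.

```python
def capacity_remaining(total_capacity: float, used_capacity: float,
                       avg_vm_size: float) -> tuple:
    """Return (remaining VM slots, percent of total slots remaining).

    A slot is the average VM size on the object (its VM profile).
    Only one resource is modelled here, which is a simplification.
    """
    if avg_vm_size <= 0:
        raise ValueError("average VM size must be positive")
    total_slots = int(total_capacity // avg_vm_size)
    if total_slots == 0:
        return 0, 0.0
    free = max(0.0, total_capacity - used_capacity)
    remaining_slots = int(free // avg_vm_size)
    return remaining_slots, 100.0 * remaining_slots / total_slots


if __name__ == "__main__":
    # Hypothetical units: 432 slot-sized units in total, 99 in use.
    slots, percent = capacity_remaining(432.0, 99.0, 1.0)
    print(slots, round(percent))
```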
Capacity Planning: Risk Minor Badge – Stress • Stress measures long-term or chronic workload • Workload shows an instantaneous value • Stress looks over a longer period of time • Quickly find and resolve • Undersized objects • Population contention • Low Stress is good (0-100) • Stress score encompasses a six (6) week period • Workloads > 70% = “Stressed” • Threshold is configurable • Chart shows a weekly breakdown of Stress for each day and hour, averaged over the last six (6) weeks
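To make the distinction between instantaneous Workload and chronic Stress concrete, the sketch below folds hourly workload samples into a one-week day/hour profile and reports the share of hours above the 70% threshold. This is an illustrative assumption: the slides give the six-week window and the 70% threshold but not the exact scoring formula, and the function names are hypothetical.

```python
from typing import List

HOURS_PER_WEEK = 7 * 24


def weekly_profile(hourly_workload: List[float]) -> List[float]:
    """Average workload per day/hour slot across whole weeks of samples.

    Expects a non-empty list whose length is a multiple of 168,
    starting at the first hour of a week.
    """
    if not hourly_workload or len(hourly_workload) % HOURS_PER_WEEK != 0:
        raise ValueError("need whole weeks of hourly samples")
    weeks = len(hourly_workload) // HOURS_PER_WEEK
    return [
        sum(hourly_workload[w * HOURS_PER_WEEK + slot] for w in range(weeks)) / weeks
        for slot in range(HOURS_PER_WEEK)
    ]


def stressed_share(hourly_workload: List[float], threshold: float = 70.0) -> float:
    """Percentage of hourly samples with workload above the threshold."""
    if not hourly_workload:
        raise ValueError("no samples")
    stressed = sum(1 for w in hourly_workload if w > threshold)
    return 100.0 * stressed / len(hourly_workload)


if __name__ == "__main__":
    # Two weeks of data: one busy week (80%) and one quieter week (60%).
    data = [80.0] * HOURS_PER_WEEK + [60.0] * HOURS_PER_WEEK
    print(weekly_profile(data)[0], stressed_share(data))
```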
Capacity Planning: Major Badge – Efficiency • Are there optimization opportunities in my systems? • How to run a leaner datacenter • Save $$$ by better utilizing resources • High Efficiency is good (100-0) • Efficiency Score calculated from Minor Badges • Reclaimable Waste • Density • Graph Depicts VMs by Percent • Optimal – Optimally Provisioned VMs • Waste – Over Provisioned VMs • Stress – Under Provisioned VMs • Not used in Efficiency Calculation (see Risk) • Three Resources Considered • CPU • Memory • Disk Space • Note: VMs can appear in Stress and Waste
Capacity Planning: Efficiency Minor Badge – Reclaimable Waste • Measures the over-provisioning for an object • It identifies the amount of reclaimable resources • CPU • Memory • Disk • Low Reclaimable Waste is good (0-100) • Reclaimable Waste = Reclaimable Capacity / Deployed Capacity • Score depicts the MAX of the CPU, Memory and Disk calculation • Disk calculation can also include old snapshots and templates • Graph shows breakdown of the Waste section of the Efficiency Badge pie chart • % Idle VMs (based on configured settings) • % Powered Off VMs • % Oversized VMs
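The formula on this slide (Reclaimable Waste = Reclaimable Capacity / Deployed Capacity, with the score taken as the maximum across CPU, memory and disk) translates directly into code. The function name, dictionary keys, and sample figures below are hypothetical.

```python
from typing import Dict

RESOURCES = ("cpu", "memory", "disk")


def reclaimable_waste(reclaimable: Dict[str, float],
                      deployed: Dict[str, float]) -> float:
    """Reclaimable Waste score (0-100): max over CPU, memory and disk of
    reclaimable capacity as a percentage of deployed capacity."""
    ratios = []
    for resource in RESOURCES:
        if deployed[resource] <= 0:
            raise ValueError(f"deployed {resource} capacity must be positive")
        ratios.append(100.0 * reclaimable[resource] / deployed[resource])
    return max(ratios)


if __name__ == "__main__":
    reclaimable = {"cpu": 10.0, "memory": 30.0, "disk": 200.0}
    deployed = {"cpu": 100.0, "memory": 120.0, "disk": 1000.0}
    # CPU 10%, memory 25%, disk 20%: memory drives the score.
    print(reclaimable_waste(reclaimable, deployed))
```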
Capacity Planning: Efficiency Minor Badge – Density • Contrasts Actual vs. Ideal Density • Identify Optimal Resource Deployment Before Contention Occurs • Greater Consolidation $$$ • High Density is good (100-0) • Measures consolidation ratios: • VMs/Host Ratios • vCPU/Physical CPU Ratios • vMem/Physical Memory Ratios
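The three consolidation ratios named on this slide are simple quotients. The sketch below computes them; the function name, parameter names, and sample inventory are hypothetical, and how vC Ops compares actual with ideal density to produce the 100-0 score is not covered in the slides, so it is not modelled here.

```python
from typing import Dict


def consolidation_ratios(vms: int, hosts: int,
                         vcpus: int, physical_cpus: int,
                         vmem_gb: float, physical_mem_gb: float) -> Dict[str, float]:
    """Compute the consolidation ratios that the Density badge measures."""
    if hosts <= 0 or physical_cpus <= 0 or physical_mem_gb <= 0:
        raise ValueError("physical quantities must be positive")
    return {
        "vms_per_host": vms / hosts,
        "vcpu_per_physical_cpu": vcpus / physical_cpus,
        "vmem_per_physical_mem": vmem_gb / physical_mem_gb,
    }


if __name__ == "__main__":
    # Hypothetical cluster: 120 VMs on 8 hosts.
    print(consolidation_ratios(120, 8, 240, 64, 480.0, 512.0))
```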
Dashboard Focus for VM and Datastore • Dashboard is focus driven • VM: • Health widget no longer has a heatmap, as a VM has no child objects • Datastore: • Focused on disk metrics only • Stress & Density are not shown
Operations: Environment • Operations Badges • New World Object • Multi vCenter Support • Left Pane Navigation Drives Focus (e.g. Datastore) • Relationship to the Datastore
Operations: Details • Health Badge Focus • Overview of the 3 Minor Health Badges
Operations: Details • Workload Badge Focus: VM Example • Reserved, Limits and Entitlement Highlighted on Graphs
Operations: Details • Workload Badge Focus: Datastore Example • Space Available • Throughput • IOPS • Latency
Operations: Details • Anomalies Badge Focus • Subset of the Anomalies for an object • Visualize magnitude and impact • Help with any troubleshooting efforts
Operations: Details • Fault Badge Focus • Details of vCenter Faults
Operations: Events • Updates to the 1.0 Events View • Choose Badge • Overlay Badge Alerts • For which objects should I show Alerts and Events? • Overlay Change Events • Health Score Line
Operations: All Metrics • New Metrics Available • Badge Metrics • Capacity Planning Metrics
Planning: Environment • Planning Badges • New World Object • Multi vCenter Support • Left Pane Navigation Drives Focus (e.g. Datastore) • Relationship to the Datastore
Planning: Scoreboard • Identical to the 1.0 Scoreboard View
Planning: Summary • “Classic CapIQ” Dashboard rolled up under Summary tab • Summary view context sensitive to object selected • Network I/O trending and forecasting • Usable Capacity supports Network I/O • What-if Modeling allows CPU & Memory Reservations and Limits configuration
Planning: Views • Reports Organized by “Badge” • 5 different categories – one for each minor badge under Risk and Efficiency • New List Reports • VM List • Datastores List • Datastores Waste List • Views associated with Datastores
Planning: Events • Identical to Operations: Events Tab • Overlay Badge Alerts • Choose Badge • Overlay Change Events • For which objects should I show Alerts and Events? • Risk Score Line
Smart Alerts – Overview • New Alerting Functionality • Smart Alerts available in EACH vC Ops Suite edition • Different Types of Smart Alerts • vSphere UI Badge Alerts • Threshold Based • Driven by Badge Color Change Thresholds • Only alert on Minor Badges • Workload YES – Health NO • Good for alerts on single objects (e.g. VM) • Custom UI Alerts • Can show vSphere UI Badge Alerts • Alerts driven by • Problem/Noise Threshold Anomaly Breaches • KPI Threshold Breaches • Very useful for groups of objects (e.g. Application Monitoring)
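The vSphere UI badge alerts described on this slide are driven by badge color changes and fire only for minor badges. A minimal sketch of that logic follows. The threshold values, color names, and function names are hypothetical; the sketch treats the score as "low is good" (as for Workload), and alerting on any color change rather than only on a worsening one is also an assumption.

```python
import bisect
from typing import Optional

# The eight minor badges named in the deck.
MINOR_BADGES = {
    "workload", "anomalies", "faults",
    "time_remaining", "capacity_remaining", "stress",
    "reclaimable_waste", "density",
}

COLORS = ["green", "yellow", "orange", "red"]
THRESHOLDS = (80, 90, 95)  # hypothetical color-change thresholds


def badge_color(score: float) -> str:
    """Map a low-is-good badge score to a color using the thresholds."""
    return COLORS[bisect.bisect_right(THRESHOLDS, score)]


def badge_alert(badge: str, previous: float, current: float) -> Optional[str]:
    """Return an alert message when a minor badge changes color, else None."""
    if badge not in MINOR_BADGES:
        return None  # major badges such as Health do not alert
    old, new = badge_color(previous), badge_color(current)
    if old == new:
        return None
    return f"{badge}: {old} -> {new}"


if __name__ == "__main__":
    print(badge_alert("workload", 70, 92))
    print(badge_alert("health", 70, 92))
```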
Smart Alerts Details • Double click on an alert to see the details • Details view differs based on the alert type (e.g. Workload vs. Anomalies)
Analysis – Heatmaps • Heatmaps like in vC Ops Std 1.0 • We now have the Capacity badges and metrics available in the heatmaps • Examples: • Which Clusters are Healthy and have available Capacity? • Which hosts have a Low Workload and a low Density?
Reports • CapIQ Reports merged into Reports Tab
vCM and vC Ops: Change Events Correlated with Performance • Overview • Integration between vCM and vC Ops Mgr for change events • Overlay Guest OS configuration changes from vCM on vC Ops performance trend graphs • Launch in context into vCM to see full details of changes and potentially remediate them • Benefits • Enable Operations to quickly understand and resolve performance issues arising from configuration changes (reduce MTTR) • Drive efficient and effective troubleshooting by correlating Guest OS configuration changes with VM performance degradations
Thank You