vC Ops Manager v5.0

vC Ops Manager v5.0 Jim Davidge – Solution Architect Joel Stephens – Solution Architect

vCenter Operations Management Suite Automated Operations Management Small and Mid-size Business Enterprise Smaller vSphere environments Larger vSphere environments Virtual and cloud infrastructure Virtual, cloud and heterogeneous environments Enterprise Edition Standard Edition Enterprise Plus Edition Advanced Edition Maximize the performance, workload and health of vSphere. Ensure service levels, enforce configuration compliance and optimize for efficiency and cost. Everything you need to ensure optimal performance, efficiency and compliance in the infrastructure and inside the guest. Ensure best performance, highest resource efficiency and plan for future capacity needs. VC Ops Mgr 5.0 (incl. CapIQ) VC Ops Mgr 5.0 – Std. VC Ops Mgr 5.0(incl. CapIQ) VC Ops Mgr 5.0(incl. CapIQ) VCM for vSphere ** VC Configuration Mgr VC Infra Navigator ** VC Infra Navigator ** Chargeback Mgr Chargeback Mgr

What’s New in vC Ops Mgr 5.0 • New Name – VMware vCenter Operations Manager • Key part of the VMware vCenter Operations Management Suite • CapacityIQ Merged with vC Ops • CIQ gets VCOPs features • *New* Dashboard • New Badges (11 – Up from 3) • Improved Details Page • Greater Emphasis on the Datastore (First Class Object) • Performance Management and Capacity Management • New Integrations • VCM  vC Ops • vC Ops  Chargeback

vC Ops Mgr 5.0 – vApp Architecture vCenter Operations Manager vApp Analytics VM UI VM Collector Custom WebApp vSphere WebApp Admin WebApp OpenVPN ActiveMQ Capacity Analytics Performance Analytics Rolled up capacity data Metric Data Postgres DB Postgres DB FSDB

vC Ops Mgr 5.0 – High Level Architecture vC Ops Mgr vSphere UI vCenter Operations Manager vApp Analytics VM UI VM vSphere vSphere vSphere Collector Custom WebApp vSphere WebApp Admin WebApp OpenVPN vCenter Communications over SSL ActiveMQ VMware Cloud / vCenter vCenter Configuration Manager Capacity Analytics Performance Analytics Rolled up capacity data Metric Data 3rd Party Data Sources Postgres DB Postgres DB FSDB vC Ops Mgr Custom UI

vC Ops Mgr 5.0 – UIs Accessing vC Ops Mgr – Three (3) UIs • *NEW* – vC Ops vSphere UI • Available in all Editions • Summary and Deep Dive view into vSphere • Used by the VI Admin and Infrastructure Teams • https://<UI VM IP> or https://<UI VM IP> /vcops-vsphere • vC Ops Custom UI • Available at the Enterprise Suite • What you would know as the vC Ops Enterprise 1.x UI • Used by Operations • Provides a view into the ENTIRE enterprise • https://<UI VM IP>/vcops-enterprise • vApp Admin UI • https://<UI VM IP>/admin

What’s New in vC Ops Mgr 5.0 • Brand New UI Navigation • Tons of new features and functions Tabs and Sub-Tabs for Simpler Navigation Left Pane Navigation Window matches vCenter Topology Tree

Dashboards & Badges

Vc Ops vSphere UI – Unified Dashboard • Launching Pad • Click to Drill down • Focused on problems • Click to drill into details! • Almost everything is clickable • Main Themes • Health • Risk • Efficiency • New Concepts • Faults • Weekly Stress Profile • Reclaimable Waste • Density

vC Ops vSphere UI – Two Different Users Short and Long Term Capacity Operations • Forward Looking • Are there areas that I should be concerned about from a capacity perspective? • Have I deployed my VI in the most efficient manner? • Immediate problems • What is happening right now? • What do I need to pay attention to?

vC Ops Default UI – Major and Minor Badges • High level Understanding • Calculated from scores of Minor Badges Major x 3 • Specifics • Guidance Minor x 8

Operations: Major Badge – Health • “How is this object doing right now?" • Identifies current problems in the system • Issues that need to be resolved immediately to avoid problems • High Health is good (100-0) • Heatmap • Provides quick view of many objects at once • Shows Health of all parent and child objects • Go back in time (6 hours) and see the “weather” of the Virt Infrastructure • Health Score is calculated from its Minor Badges • Workload • Anomalies • Faults

95 Operations: Health Minor Badge – Workload • Measures how hard an object is working? • High Workload is bad (0-100 or more!) • Percentage of Demand divided by effective capacity • As workload approaches (and exceeds) 100% • Performance Problems! • Starving object for resources! • Focused attention • CPU • Memory • Disk I/O • Network I/O • Improved Network and Disk I/O calculations • Eliminates idle networks and storage from showing High Workload • Limit the erroneous 100% Workload scores

Operations: Health Minor Badge – Anomalies • Measures how normal is this object behaving? • Is what the vC Ops 1.x Health score was, but now inversed • Derived from the number of metrics that are outside of their “Normal” trended ranges • Learns dynamic ranges of “Normal” for each metric • Identifies metric abnormalities • Low Anomalies is good (0-100) • Zero meaning the object is performing exactly the way vC Ops expects it to for that time of the day, that day of the week • A high number of anomalies are usually an indication of a problem • Anomalies Chart • Current number of Abnormal Metrics • Problem/Noise Threshold • Crossing problem threshold will increase the Anomalies Score • Does not generate an alert in this vSphere UI

Operations: Health Minor Badge – Faults • Best Practices: • Do not change the Faults Threshold • Use Alerts View to manage Faults • Faults shown in Widget • Measures the degree of faults or problems the object is experiencing • Pulled from active vCenter events • VMware specific knowledge of which vCenter Events affect Availability and Performance (examples): • Loss of redundancy in NICs or HBAs • Memory checksum errors • HA failover problems • Low Faults is good (0-100) • Each fault has a default score (e.g. 25, 50, 75, 100) • Highest individual Fault Score drives the Fault object Score

Capacity Planning: Major Badge – Risk • Are there future risks to my systems and VI? • Identifies potential problems that could eventually hurt the performance • Low Risk is good (0-100) • Risk Score is calculated from its Minor Badges • Time Remaining • Capacity Remaining • Stress • Risk Chart • Shows Risk score over the last 7 days

Capacity Planning: Risk Minor Badge – Time Remaining • Measures time remaining before each resource type reaches its capacity • CPU • Memory • Disk • Network I/O • Early warning of upcoming provisioning needs • Avoid future performance issues • High Time Remaining is good (100-0) • Graph shows resource utilization trends

Capacity Planning: Risk Minor Badge – Capacity Remaining • Measures how many more VMs can be placed on the object • Percentage of Total VM “Slots” Remaining • Based on the average size of the VM on the object (e.g. VM profile) • Each object has its OWN VM profile size: Host, Cluster, Datacenter, Etc. • High Capacity Remaining is good (100-0) • Zero mean no room left for more VMs • 333 More VMs correlates to 77% Capacity Remaining for this object

Capacity Planning: Risk Minor Badge – Stress • Stress measures long-term or chronic workload • Workload shows an instantaneous value • Stress looks over a longer period of time • Quickly find and resolve • Undersized objects • Population contention • Low Stress is good (0-100) • Stress score encompasses a six (6) week period • Workloads > 70% = “Stressed” • Threshold Configurable • Chart shows weeks break down of Stress for each day/hour averaged over the last six (6) Weeks

Capacity Planning: Major Badge – Efficiency • Are there optimization opportunities in my systems? • How to run a leaner datacenter • Save $$$ by better utilizing resources • High Efficiency is good (100-0) • Efficiency Score calculated from Minor Badges • Reclaimable Waste • Density • Graph Depicts VMs by Percent • Optimal – Optimally Provisioned VMs • Waste – Over Provisioned VMs • Stress – Under Provisioned VMs • Not used in Efficiency Calculation (see Risk) • Three Resources Considered • CPU • Memory • Disk Space • Note: VMs can appear in Stress and Waste

Capacity Planning: Efficiency Minor Badge – Reclaimable Waste • Measures the over-provisioning for an object • It identifies the amount of reclaimable resources • CPU • Memory • Disk • Low Reclaimable Waste is good (0-100) • Reclaimable Waste = Reclaimable Capacity / Deployed Capacity • Score depicts the MAX of the CPU, Memory and Disk calculation • Disk calculation can also include old snapshots and templates • Graph shows breakdown of the Waste section of the Efficiency Badge pie chart • % Idle VMs (based on configured settings) • % Powered Off VMs • % Oversized VMs

Capacity Planning: Efficiency Minor Badge – Density • Contrasts Actual vs. Ideal Density • Identify Optimal Resource Deployment Before Contention Occurs • Greater Consolidation  $$$ • High Density is good (100-0) • Measures consolidation ratios: • VMs/Host Ratios • vCPU/Physical CPU Ratios • vMem/Physical Memory Ratios

Dashboard Focus for VM and Datastore • Dashboard is focus driven • VM: • Health widget no longer has a heatmap as a VM has no children objects • Datastore: • Focused on disk metrics only • Stress & Density are not shown

Operations Tab

Operations: Environment Operations Badges New World Object Multi vCenter Support Left Pane Navigation Drives Focus (e.g. Datastore) Relationship to the Datastore

Operations: Scoreboard

Operations: Details • Health Badge Focus Overview of the 3 Minor Health Badges

Operations: Details • Workload Badge Focus : VM Example Reserved, Limits and Entitlement Highlighted on Graphs

Operations: Details • Workload Badge Focus : Datastore Example Space Available Throughput IOPS Latency

Operations: Details • Anomalies Badge Focus Subset of the Anomalies for an object Visualize magnitude and impact Help with any troubleshooting efforts

Operations: Details • Fault Badge Focus Details of vCenter Faults

Operations: Events • Updates to the 1.0 Events View Choose Badge Overlay Badge Alerts For which objects should I show Alerts and Events? Overlay Change Events Health Score Line

Operations: All Metrics • New Metrics Available Badge Metrics Capacity Planning Metrics

Planning Tab

Planning: Environment Planning Badges New World Object Multi vCenter Support Left Pane Navigation Drives Focus (e.g. Datastore) Relationship to the Datastore

Planning: Scoreboard • Identical to the 1.0 Scoreboard View

Planning: Summary • “Classic CapIQ” Dashboard rolled up under Summary tab • Summary view context sensitive to object selected • Network I/O trending and forecasting • Usable Capacity supports Network I/O • What-if Modeling allows CPU & Memory Reservations and Limits configuration

Planning: Views • Reports Organized by “Badge” • 5 different categories – one for each minor badge under Risk and Efficiency • New List Reports • VM List • Datastores List • Datastores Waste List • Views associated with Datastores

Planning: Events • Identical to Operations: Events Tab Overlay Badge Alerts Choose Badge Overlay Change Events For which objects should I show Alerts and Events? Risk Score Line

Alerts Tab

Smart Alerts – Overview • New Alerting Functionality • Smarts Alerts Available in EACH vC Ops Suite edition • Different Types of Smart Alerts • vSphere UI Badge Alerts • Threshold Based • Driven by Badge Color Change Thresholds • Only Alert on Minor Badges • Workload YES – Health NO • Good for Alerts on single objects (e.g. VM) • Custom UI Alerts • Can show vSphere UI Badge Alerts • Alerts driven by • Problem/Noise Threshold Anomaly Breaches • KPI Threshold Breaches • Very useful for groups of objects (e.g. Application Monitoring)

Smart Alerts Details • Double click on an alert to see the details • Details view differs based on the alert type (e.g. Workload vs. Anomalies)

Analysis Tab

Analysis – Heatmaps • Heatmaps like in vC Ops Std 1.0 • We now have the Capacity badges and metrics available in the heatmaps • Examples: • Which Clusters are Healthy and have available Capacity? • Which hosts have a Low Workload and a low Density?

Reports Tab

Reports • CapIQ Reports merged into Reports Tab

vCM vC Ops Integration

vCM  vC Ops : Change Events Correlated with Performance • Overview • Integration between vCM and vC Ops Mgr for change events • Overlay Guest OS configuration changes from vCM in vC Ops performance trend graphs • Launch in context into vCM to see full details of changes and potentially remediate them • Benefits • Enable Operations to quickly understand and resolve performance issues arising from configuration changes (reduce MTTR) • Drive efficient & effective troubleshooting by correlating Guest OS configuration changes w/ VM performance degradations

Thank You Thank You

vC Ops Manager v5.0