1 / 13

Google-Wide Profiling: A Continuous Profiling Infrastructure For Data Centers

Google-Wide Profiling: A Continuous Profiling Infrastructure For Data Centers. Gang Ren, Eric Tune, Tipp Moseley, Yixin Shi Silvius Rus, Robert Hundt Google. Presented by Siddarth Asokan. Agenda. What is continuous profiling? Infrastructure Collector Profiles Symbolization

Download Presentation

Google-Wide Profiling: A Continuous Profiling Infrastructure For Data Centers

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Google-Wide Profiling:A Continuous Profiling Infrastructure For Data Centers Gang Ren, Eric Tune, Tipp Moseley, Yixin Shi Silvius Rus, Robert Hundt Google Presented by Siddarth Asokan

  2. Agenda • What is continuous profiling? • Infrastructure • Collector • Profiles • Symbolization • Profile Storage • User Interface • Reliability Analysis • Questions

  3. Continuous Profiling • GWP is a continuous profiling infrastructure for data centers & provides performance insights for cloud applications • The applications of these profile ranges from platform affinity measurements and identification of platform – specific micro architectural peculiarities

  4. Infrastructure of GWP

  5. GWP collector • GWP samples in two dimensions. At any moment, profiling occurs only on a small subset of all machines in the fleet, and event-based sampling is used at the machine level • Each event sampling rate is chosen high enough to provide meaningful machine-level data while still minimizing the distortion caused by the profiling on critical applications

  6. Profiles and profiling interfaces • Collects two categories of profiles: • Whole – machine • Per – process • Users without root access cannot directly invoke most of the whole – machine profiling systems, so lightweight daemons are deployed on every machine to let remote users to access the profiles

  7. Symbolization • To provide meaningful information profiles must correlate to source code • The code is not available offline and can no longer be symbolized • It’s too resource intensive and sometimes impossible for applications whose source is not ready. The alternative is to permanently store binaries that contain debug information before they are stripped

  8. Profile storage • To make the data useful and accessible, the samples are loaded into a read only dimensional database that is distributed across hundreds of machines • The database supports a subset of SQL like semantics • Most queries are seen frequently, so the profile server uses aggressive caching to hide database latency

  9. User Interfaces • GWP deploys a webserver to provide a user interface on top of the profile database • It makes it easy to access profile data and construct ad hoc queries for the traditional use of application profiles • Various views: • Query view • Call graph view • Source annotation

  10. Reliability analysis • To conduct continuous profiling on datacenter machines serving real traffic, extremely low overhead is paramount, so we sample in both time and machine dimensions • Two indirect methods are to evaluate the soundness of applications’ profiles • Study the stability of aggregated profiles using different metrics • Correlate profiles with the performance data from other sources to cross – validate both

  11. The number of samples and the entropy of daily application – level profiles. The primary y-axis (bars) is the total number of profile samples. The secondary y-axis (line) is the entropy of the daily application – level profile

  12. The Manhattan distance between daily application level profiles for various profile types The correlation between the number of samples and the Manhattan distance of profiles

  13. Questions?

More Related