1 / 24

An Application Monitoring System in Grid

Explore the efficient and scalable Grid Application Monitoring OCM-G system, including components, services, request types, and message routing. Learn about the benefits of this system in handling multiple applications and users.

tyer
Download Presentation

An Application Monitoring System in Grid

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. An Application Monitoring System in Grid Bartosz Baliś Tomasz Szepieniec

  2. AGENDA • PART I Monitoring – introduction (B. Baliś) • PART II Concepts and problems in Grid application monitoring (T. Szepieniec)

  3. PART I - OUTLINE • Application Monitoring • OMIS • Grid Monitoring System – OCM-G • Instrumentation

  4. Application monitoring • Monitor = obtain information on or manipulate target application • e.g. read status of application’s processes, suspend application, read / write memory, etc. • Monitoring module needed by tools • Debuggers • Performance analyzers • Visualizers • ...

  5. Monitoring – integrated module • Monitoring module integrated with GUI • Usual case

  6. Monitoring – autonomous system • Separate monitoring system • Tool / Monitor interface – OMIS

  7. Monitoring system – benefits • Modularity • GUI development separated from monitoring module development • Single monitoring system • multiple tools do not conflict each other • coordination of access to shared objects • enables tool interoperability

  8. Interoperability • Multiple tools • Common monitoring system • Interoperability • cooperation • e.g. debugger + visualizer

  9. OMIS • Universal tool-mon.sys. interface • Target system view • hierarchical set of objects • nodes, processes, threads • grid: additional objects – sites • objects identified by tokens, e.g. n_1, p_1, etc. • Three types of services • information services • manipulation services • event services

  10. OMIS – services • Information services • obtain information on target system • e.g. node_get_info = obtain information on nodes in the target system • Manipulation services • perform manipulations on the target system • e.g. thread_stop = stop specified threads • Event services • detect events in the target system • e.g. thread_started_libcall = detect infocations of specified functions

  11. OMIS – requests • Services can be combined into monitoring requests Two types of requests: • Unconditional requests • to be executed immediately • executed only once • Conditional requests • to execute actions whenever event occurs • actions can be executed multiple time

  12. OMIS – unconditional requests :thread_stop(t_1) Actions Operands = stop thread t_1

  13. OMIS – conditional requests thread_started_libcall(t_1, „MPI_Send”): counter_inc(c_1) Event Operands Actions = whenever thread t_1 invokes MPI_Send, increment counter c_1

  14. Grid-enabled OMIS-compliant Monitoring System – OCM-G • Scalable • distributed • decentralized • Efficient • local buffers  Three types of components • local monitors (LM) • service managers (SM) • application monitors (AM)

  15. Service Managers and Local Monitors • Service Managers • one or more in the system • request distribution • reply collection • Local Monitors • one per node • handle local objects • actual execution of requests

  16. Application monitors • Embedded in applications • Handle some actions locally • buffering data • filtering of instrumentation • monitoring requests • E.g. REQ: read variable a, REP: value of a • asynchronous • no OS mechanisms involvedion

  17. PART II – OUTLINE Concepts and problems in Grid application monitoring • OCM-G as Grid service • OCM-G component discovery • Handling Multiple Applications • OMIS message routing • Application start-up

  18. OCM-G as Grid service • Permanently running in Grid • Global service • multiple applications • multiple users based on Globus users • multiple tools

  19. OCM-G component discovery • Use cases • Assemble individual parts into a single monitoring system • Enabling attachment of application processes and tools • External mechanisms based on • local file system • MDS • R-GMA • ...

  20. Handling Multiple Applications • Create „virtual” monitoring system • One SM becomes MainSM for each application • which one? • Expansion and localization mechanism works in „virtual” mode

  21. OMIS message routing Two options: • route via Main SM(option 1) • SM communicate directly(option 2) Where localization info should be stored and how updated?

  22. Application start-up –requirements • Application started as usual (e.g. PORTAL,globusrun) • Independent of communication protocol the application uses • Use GRAM to placeapplication processes where OCM-G is enabled

  23. Application start-up – HOWTO • Monitoring parametersare put in command line • Initial OMIS requests can also be put in command line (e.g. breakpoints) Obligatory starting parameters • application token chosen apriori • MainSM token should be known apriori (we work on how to avoid this)

  24. Conclusion • OMIS as communication protocol • Autonomous application monitoring system • OCM-G use Globus services • Normal application startup can be achieved Some problems are still to be addressed...

More Related