230 likes | 424 Views
Grid Computing & Tuple Space. Presented by Nelson Chu. What is Grid?. Computing Grid is a very large distributed computing network, Virtual Organizations (VO) Grid is an infrastructure to collaborate many diverse computing resources in a network to accomplish a task(s). ESG. What is Grid?.
E N D
Grid Computing & Tuple Space Presented by Nelson Chu
What is Grid? • Computing Grid is a very large distributed computing network, Virtual Organizations (VO) • Grid is an infrastructure to collaborate many diverse computing resources in a network to accomplish a task(s).
ESG What is Grid? • Classification: • Computational Grid (eg. WestGrid) • Desktop Grid (eg. SETI@HOME) • Data Grid (eg. Earth Systems Grid) • Global Grid Forum (GGF), Globus, etc. • Open Grid Services Architecture - OGSA • Web Service Resource Framework - WSRF (OGSI)
Grid Architecture Fabric: Physical resources Resource & Connectivity Protocols: secured access Collective Services: brokering, monitoring User Applications/Tools
Fabric layer: resource management e.g. computational resources, storage resources, and network resources. Connectivity layer: Communication and authentication Collective layer: global-naming, brokering, scheduling, workload and workflow management, and data replication. Application layer: user defined tools and application
OGSA Architecture Architect Service
OGSA provides a service oriented architecture to simplify grid services development and to build a transparent grid structure Physical resources (for example: storage, network, and computer) and logical resources services Web services (including WSRF) OGSA architect services (such as security management, data management, workflow management, file system management, etc.) Grid applications Architect Service
Research Opportunities: Grid Security • Multi-institutional users and heterogeneous resources • Authentication & authorization mechanisms • Different local name spaces, credentials, and/or accounts • Trust mechanisms: domain-based, user-based, service-based, network-based, and scheduler-based Fault Tolerance • Detect and notify a fault occurrence • Handle any failure
Research Opportunities: Grid Scheduling • Workflows in a service-based environment • Split and distribute jobs to resources • Time, cost, performance, completeness, fairness, effectiveness, and security policies. • Resource discovery • System selection • Job submission: resources reservation and supporting operation • Monitoring and cleaning up
Current Research Trends • Scheduling, Security, Fault Tolerance • Replica location services • Metadata catalog services • Reliable file transfer services • Globus Toolkit 3 / 4: Replica Catalog, GridFTP, and (GASS) Global Access to Secondary Storage . • Globus, IBM, Oracle 10g, Sun, etc.
Research Opportunities: Data • Data Versioning • Global name space: data characterization, heterogeneous systems • Data Security • Fault tolerance: recovery • New challenges: inter-domains replication, data synchronization, replica registration, querying, and meta-catalog.
Grid Environment • Resources collaboration • Loosely coupling • Time, space, destination • Communication allows significant degree of interoperability, synchronization, and extensibility • Simple, flexible, and powerful mechanism
Grid System • Open issues: data synchronization, fault tolerance, persistence, concurrency, transparency, etc. • Tuple Space paradigm • Simple operations provide powerful inter-process mechanism for communication and synchronization. • Flexible for function expansion: ordered tuple, leasing, …
Tuple Space • Developed in a parallel and distributed programming system, Linda, at Yale U • Basic element ~ Tuple • Tuple: a vector of typed values / fields / variables • Operations: Writing, Reading, Consuming & Matching
Tuple Space • TS Vs DBS Producers Take “B” … … … wait/abort SpacesOne A Read C Write Consumers Processes SpacesTwo
Tuple Space • Globally shared memory space (Communication buffer) • Tuples are associatively addressed by matching • Loosely coupling • Mechanism for inter-process communication and synchronization • Simple, flexible, powerful, … ~> Grid Resource Collaboration
Tuple Space on the Grid • Service Registration • Service Discovery • Service Lookup • Job Scheduler • Data Mgmt Consumers Tuple Spaces (Services / Tasks / Data) Producers / Processes
Tuple Space on the Grid with Java Technology • Look up Service • Discovery Mgmt • Leasing • Plug and Work • Resource Mgmt (GRAM) • Security Feature • Fault Tolerant • Data Mgmt Grid Enabled • Resources / Services Collaboration
Grid Job Scheduling • Eg) WestGrid, … • Centralized System • Load balance • Synchronization • Fault Tolerance • Policy • Meta-Scheduler Grid I Scheduler/ Resources Broker Resource Resource Meta-Scheduler Grid II
Users Resource 1 Agent WriteResult() JobSpace LeaseRenew () JobSubmit() TakeJob() Resources List PolicyCheck() Resource N Agent GetResult() SecurityCheck() ResourcesList() Registrar JobExchange() Register() ResourcesListExchange() Domain II Registrar Result Job Desc. Resources Required Job Desc. Resources Required Figure 1: Tuple Spaces Based Grid Scheduler Tuple Spaces Based Gird Job Scheduler • Fault Tolerance Management • Leasing Structure for Dynamic Participation • On-Demand Service • Automatic Load Balance • Global Name Space • Security model ?
View 1 … … SearchView() View 3 Query ViewSite … … UpdateView() GetJobTicket() JobExchange () TicketBox QueueTicket() UpdateData() GridFTP() / TransferSchedule() PriorityCheck() OpenSpace TakeJob() SecCheck() New Data RenewL() PolicyCheck() Derived Data Update() Ticket: View(s)/Query(ies) Certificate User information Job description … … Update() EventNotify() EventNotify() Domain II Register() Replica EventNotify() Registrar Registrar ResourcesListExchange() Resource Agent Figure 2: Data Access Management with Tuple Spaces Gird Data Management System with Tuple Spaces
References • Ian Foster, Carl Kesselman, and Steven Tuecke, “The Anatomy of the Grid: Enabling Scalable Virtual Organizations”, Intl J. Supercomputer Applications, 2001 • Ian Foster, Carl Kesselman, Jeffrey M. Nick, and Steven Tuecke, “The Physiology of the Grid: An Open Grid Services Architecture for Distributed Systems Integration”, http://www.globus.org, 2002 • Gregory T. Byrd, and Praveenkumar Cheruvu, “Tuple Space Computing on the Grid”, North Carolina State University • K.A. Hawick, H.A. James, and L.H. Pritchard, “Tuple-Space Based Middleware for Distributed Computing”, Technical Report DHPC-128, University of Wales • Globus, http://www.globus.org • IBM, www.ibm.com/grid
Discussions Thank you !