1 / 10

Data Grid Federation

Data Grid Federation. Arcot Rajasekar Michael Wan Reagan Moore (sekar, mwan, moore)@sdsc.edu. Data grids provide the ability to name, organize, and manage data on distributed storage resources Use case - BaBar experiment Implement independent data grids at SLAC and Lyon, France

willis
Download Presentation

Data Grid Federation

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Data Grid Federation Arcot Rajasekar Michael Wan Reagan Moore (sekar, mwan, moore)@sdsc.edu

  2. Data grids provide the ability to name, organize, and manage data on distributed storage resources Use case - BaBar experiment Implement independent data grids at SLAC and Lyon, France Implement federation environment for controlled sharing of files between the data grids Data Grid federation provides a way to share resources, user names, data and metadata between multiple data grids (Virtual Organizations) Data Grid Federation

  3. Cross-register a digital entity from one collection into another Who manages the access control lists? Who maintains the context (metadata)? Consistency constraints on updates Who manages the updates (system or individual)? Do differences in constraints lead to standard federation approaches? What types of federations are possible? Federation Constraints

  4. Each SRB zone (data grid) uses a metadata catalog (MCAT) to manage the context associated with digital content Context includes: Storage resource names User names Logical name space for files Administrative, descriptive, integrity attributes Data Grids - SRB Zones

  5. Mechanisms to impose consistency and access constraints on: Resources Controls on which zones may use a resource User names (user-name / domain / SRB-zone) Users may be registered into another domain, but retain their home zone, similar to Shibboleth Data files Controls on who specifies replication of data Context metadata Controls on who manages updates to metadata SRB Federation Constraints

  6. C, C++, Java Libraries Unix Shell Databases DB2, Oracle, Sybase, SQLserver,Postgres, mySQL, Informix File Systems Unix, NT, Mac OSX Archives - Tape, HPSS, ADSM, UniTree, DMF, CASTOR,ADS Data Grid Federation - zoneSRB Application DLL / Python, Perl Linux I/O OAI, WSDL, OGSA Java, NT Browsers HTTP Federation Management Consistency & Metadata Management / Authorization-Authentication-Audit Logical Name Space Latency Management Data Transport Metadata Transport Catalog Abstraction Storage Abstraction Databases DB2, Oracle, Sybase, Postgres, mySQL, Informix ORB SRM

  7. Occasional Interchange - for specified users Replicated Catalogs- entire state information replication Resource Interaction- share resources Replicated Data Zones- no user interactions between zones Master-Slave Zones- slaves replicate data from master zone Snow-Flake Zones- hierarchy of data replication zones User / Data Replica Zones- user access from remote to home zone Nomadic Zones “SRB in a Box”- synchronize local zone to parent zone Free-floating “myZone” - synchronize without a parent zone Archival “BackUp Zone”- synchronize to an archive SRB Version 3.0.1 released December 19, 2003 Types of Federation

  8. Characterizing federation approaches (1536 possible combinations)

  9. Federation Approaches Peer-to-Peer Zones Free Floating Partial User-ID Sharing Occasional Interchange Partial Resource Sharing Replicated Data Hierarchical Zone Organization One Shared User-ID No Metadata Synch System Set Access Controls Complete User-ID Sharing System Controlled Complete Synch Resource Interaction Nomadic System Managed Replication System Set Access Controls System Controlled Partial Metadata Synch No Resource Sharing User and Data Replica System Managed Replication Connection From Any Zone Complete Resource Sharing Snow Flake Super Administrator Zone Control Master Slave Replicated Catalog System Controlled Complete Metadata Synch Complete User-ID Sharing Replication Zones Archival Hierarchical Zones

  10. For More Information Reagan W. Moore San Diego Supercomputer Center moore@sdsc.edu http://www.npaci.edu/DICE http://www.npaci.edu/DICE/SRB http://www.npaci.edu/dice/srb/mySRB/mySRB.html

More Related