170 likes | 272 Views
Scientific Data Management Group LBNL. SRM related demos SC 2002 Demos Robust File Replication of Massive Datasets on the Grid GridFTP-HPSS access Provided through HRM HRM enables NCAR’s MSS to be accessed on the Grid Uniform Grid Access to Different Mass Storage Systems Projects
E N D
Scientific Data Management Group LBNL SRM related demos SC 2002 • Demos • Robust File Replication of Massive Datasets on the Grid • GridFTP-HPSS access Provided through HRM • HRM enables NCAR’s MSS to be accessed on the Grid • Uniform Grid Access to Different Mass Storage Systems • Projects • Earth Science Grid: Use of HRMs in ESG Demo, new HRM-NCAR included • Particle Physics Data Grid: File replication for STAR project – Demo
Demo: Robust File Replication of Massive Datasets on the Grid • This demo illustrates: • Replicating Thousands of Files Robustly between Two remote HPSS installations • Using a single command for a request to replicate all files • Using an intermediate middleware storage management (called Hierarchical Storage Manager - HRM) • Using a client HRM command-line module • Without any modifications to the MSS • Synchronizing pre-staging, file transfer, and archiving • Robustness: recovers from MSS and network transient failures Junmin Gu, Vijaya Natarayan, Alex Sim, Arie ShoshaniLBNL http://sdm.lbl.gov/srm
Disk Cache Disk Cache SRMs in Action : PPDG Anywhere HRM-Client Command-line Interface HRM-COPY (thousands of files) BNL LBNL HRM-GET (one file at a time) HRM (performs writes) HRM (performs reads) GridFTP GET (pull mode) Network transfer archive files stage files
Web-Based File Monitoring Tool • Shows: • Files already transferred- Files during transfer • Files to be transferred • Also shows for • each file: • Source URL • Target URL • Transfer rate
Recent Measurements of large multi-file replication Shows that the network is the bottleneck
Demo: GridFTP-HPSSAccess Provided through HRM • This demo illustrates the capability of: • Providing GridFTP access to Mass Storage Systems (MSSs) • Using an intermediate middleware storage management (called Hierarchical Storage Manager - HRM) • Without any modifications to the MSS • Managing queues of multiple requests to the MSS • Minimizing tape mounts • Recovers from MSS transient failures • Alex Romosan,Alex Sim, Junmin Gu,Arie Shoshani • http://sdm.lbl.gov/srm
Where does this service belongin the Grid architecture? : . G N O R 2 S O T R O I E E O T Request Workflow or C C N V I A L Application- Community Consistency Services I I I F Interpretation Request C A T V A I I Specific Data Authorization (e.g., Update Subscription, C C R U M L and Planning Management E T E E P O E Discovery Services Services Versioning, Master Copies) P L S R P V Services Services D I L S I A V T O C C E L L G : R O 1 S N O C I E E E L F T V L Data Filtering or C A Data Data General Data Storage Compute Monitoring/ A I S P R R T N I E Transformation E Transport Federation Discovery Management Scheduling Auditing U T I C C D L N O E I Services Services Services Services (Brokering) (Brokering) Services R U E V L S L M G O R E O E O R C S C E L : S G E E N Hierarchical C Resource I C File Transfer Data Filtering or Database Compute R S R Monitoring/ U Service Resource Transformation Management Resource U G O O N Auditing (GridFTP) Manager Services Services Management S I S R E E A R R H S Y T I V I T Communication Authentication and C E Protocols (e.g., Authorization N TCP/IP stack) Protocols (e.g., GSI) N O C C I Other Storage R Mass Storage System (HPSS) Compute B Networks A This figure based on the Grid Architecture paper by Globus Team Systems F systems
HRMs and GridFTP Using HRM protocol New: GridFTP-HPSS through HRM Client Client GridFTP-API SRM-API GridFTP entry HRM SRM-API HRM GridFTP-API GridFTP GridFTP move
GridFTP-HRM-Layerimplementation detail Client GridFTP-API 1a 1b GridFTP entry GridFTP move GridFTP exit FTP- HRM Layer 2a Shared memory 2b HRM Corba 3b 3a 1a: stor/retv 1b: hrm_get/hrm_put 2b: call_back 2a: unblock semaphore 3a: success_code 3b: hrm_release
Demo: HRM enables NCAR’s MSS to be accessed on the Grid • This demo illustrates: • The application of Storage Resource Management (SRM) technology to make NCAR’s MSS Grid-enabled • NCAR’s MSS is a unique system developed in house • Making HRM-HPSS grid-enabled is a big task • HRM is a specialized SRM designed for Mass Storage Systems (MSS). HRM-HPSS already exists • Adapting HRM-HPSS to work with HRM-HPSS is a small task • Demo illustrates success and ease of this approach Alex Romosan,Junmin Gu,Alex Sim, Arie Shoshani LBNL http://sdm.lbl.gov/srm
Disk Cache Disk Cache How Was It Done?only MSS-specific module modified HRM-HPSS HRM-HPSS Disk Resource Manager (DRM) Disk Resource Manager (DRM) Tape Resource Manager (TRM) Tape Resource Manager (TRM) HPSS-specific Access Module NCAR-specific Access Module Specialize for NCAR-MSS NCAR-MSS
Demo details: File Replication from NCAR MSS to NERSC HPSS controlled at NCAR
LBNL HPSS High Performance Storage System disk ANL CAS Community Authorization Services NCAR SRM Storage Resource Management gridFTP Striped server gridFTP server openDAPg server Tomcat servlet engine MyProxy server LLNL disk MCS client MyProxy client CAS client RLS client SRM Storage Resource Management gridFTP server GRAM gatekeeper ORNL gridFTP server gridFTP SRM Storage Resource Management ISI gridFTP gridFTP server SRM Storage Resource Management MCS Metadata Cataloguing Services SOAP HPSS High Performance Storage System RLS Replica Location Services RMI MSS Mass Storage System disk disk SRMs in Action : ESG
Demo: Uniform Grid Accessto Different Mass Storage Systems • This demo illustrates: Uniform Grid Access to: • three MSSs: HPSS, Enstore, JASMine • Using the Storage Resource Manager (SRM) interface • SRM modules were developed independently for the 3 MSSs • Shows the power of standard SRM functionality • Contributors: • JLAB: Bryan Hess, Andy Kowalski • FNAL: Don Petravick, Timur Perelmutov, Rich Wellner • LBNL: Junmin Gu, Vijaya Natarayan,Ekow Otoo, • Alex Romosan,Alex Sim, Arie Shoshani
High Level View of Demo Setup Client (USER/APPLICATIONS) uniform interface SRM SRM JASMine SRM Enstore
Where do SRMs belongin the Grid architecture? : . G N O R 2 S O T R O I E E O T Request Workflow or C C N V I A L Application- Community Consistency Services I I I F Interpretation Request C A T V A I I Specific Data Authorization (e.g., Update Subscription, C C R U M L and Planning Management E T E E P O E Discovery Services Services Versioning, Master Copies) P L S R P V Services Services D I L S I A V T O C C E L L G : R O 1 S N O C I E E E L F T V L Data Filtering or C A Data Data General Data Storage Compute Monitoring/ A I S P R R T N I E Transformation E Transport Federation Discovery Management Scheduling Auditing U T I C C D L N O E I Services Services Services Services (Brokering) (Brokering) Services R U E V L S L M G O R E O E O R C S C E L : S G E E N C Resource I Storage C File Transfer Data Filtering or Database Compute R S R Monitoring/ U Service Resource Transformation Management Resource U G O O N Auditing (GridFTP) Manager Services Services Management S I S R E E A R R H S Y T I V I T Communication Authentication and C E Protocols (e.g., Authorization N TCP/IP stack) Protocols (e.g., GSI) N O C C I Other Storage R Mass Storage System (HPSS) Compute B Networks A Systems This figure based on the Grid Architecture paper by Globus Team F systems
Main advantages of using SRMs • Same uniform interface to all types of storage systems • Disk Storage System • Mass Storage System • For Mass Storage Systems it provides: • Queuing and pre-staging • Queued multi-file “get” requests (avoid flooding MSS) • Pre-staging of files (concurrent with transfer) • Queued archiving of “put” requests (avoid flooding MSS) • Robustness and efficiency • Recover in case of transient failures from MSS • Reorders pre-staging requests to minimize tape mounts • Recovery from failed GridFTP transfers • Re-issues requests in case of failure