160 likes | 183 Views
Data Management: a Grid Enablers Perspective. Mark Santcroos e-BioScience group Bioinformatics Laboratory Academic Medical Center, Amsterdam, NL EGI Technical Forum – Lyon – 19 september 2011. Academic Medical Center. e-BioScience group. Part of Bioinformatics Laboratory
E N D
Data Management:a Grid Enablers Perspective Mark Santcroos e-BioScience group Bioinformatics Laboratory Academic Medical Center, Amsterdam, NL EGI Technical Forum – Lyon – 19 september 2011
e-BioScience group • Part of Bioinformatics Laboratory • Medical researchers BiG Grid • “Grid Enablers” for domains: • Medical Imaging • (Next Genation) Sequencing • Proteomics • Virus Discovery • etc. • Life-Science Grid Community (VRC)
Access Requirements • Normal users ask for normal protocols • Non-data center environments ask for normal protocols • Inter-infrastructure ask for normal protocols • Standards (WEBDAV?) • Abstractions (SAGA?)
Access Requirements (2) • Direct I/O (or at least I) • File access privileges • Integrity (checksums) • Non-file (Databases)
Transfer vs Management • Lifetime, persistency • Provenance • Usage statistics • Coupling with catalogs from domains (syncat?) • Meta-data
Catalog vs Storage Element • One-way only • Out of sync privileges (try to explain that to users!)
Misc observations • Is “grid-storage” write-once or not? • Error reporting … • Testing • Decoupling of client and server • Software developers vs Operators vs Users
API vs Language binding • Protocol / API is just part of the story • Need proper language bindings too • (Will mostly get solved by “normal” protocols)