170 likes | 322 Views
The Odum Institute. Oldest Institute or Center at UNC-CH Founded 1924 Mission: Teaching, research, & service for social sciences Cross-disciplinary focus. Rules-Based Distributed Storage. Rules-Based policy enforcement iRODS grid based technology
E N D
The Odum Institute • Oldest Institute or Center at UNC-CH Founded 1924 • Mission: Teaching, research, & service for social sciences • Cross-disciplinary focus
Rules-Based Distributed Storage • Rules-Based policy enforcement • iRODS grid based technology • OAI-PMH harvesting from Odum Dataverse Network
Project Phases • Ingest Odum collections into iRODS • Break apart Odum preservation policies • Code these policies into series of iRODS rules
*From http://unc.renci.org/wordpress/wp-content/uploads/2009/05/irods-slide41.jpg
*From http://unc.renci.org/wordpress/wp-content/uploads/2009/05/nara-grid22.jpg
*From http://unc.renci.org/wordpress/wp-content/uploads/2009/05/irods-slide5.jpg
*From http://wiki.cetis.ac.uk/images/9/9d/OAI-PMH_overview.png
*From http://developers.sun.com/learning/javaoneonline/2007/pdf/TS-4656.pdf
<codeBookxmlns="http://www.icpsr.umich.edu/DDI"xsi:schemaLocation="http://www.icpsr.umich.edu/DDI http://www.icpsr.umich.edu/DDI/Version1-3.xsd"><docDscr><citation><titlStmt><titl>Harris 1986 Disabled Americans - Employment Survey, Study no. 864009</titl><IDNo agency="handle">hdl:1902.29/H-864009</IDNo></titlStmt> … <?xml version="1.0" encoding="utf-8"?><metadata><AVU><Attribute>Global Identifier</Attribute><Value>hdl:1902.29/H-864009</Value><Unit>handle</Unit></AVU><AVU><Attribute>Study Title</Attribute><Value>Harris 1986 Disabled Americans - Employment Survey, Study no. 864009</Value><Unit/></AVU> …
iRODS Rule ==== parseDDI.ir ==== Format DDI and extract metadata|| msiXsltApply(*xsltObjPath, *ddiObjPath, *BUF)## (XSLT transformation) msiDataObjCreate(*xmlObjPath,null,*DEST_FD)## (Create XML file) msiDataObjWrite(*DEST_FD,*BUF,*Written)## (Write XML file) msiDataObjClose(*DEST_FD,*junk)## (Close XML file) msiLoadMetadataFromXml(*ddiObjPath, *xmlObjPath)|nop (Load into iCAT) Input parameters *ddiObjPath=$1% Example: /odum/home/rods/1902.29/H-339/ddi.xml *xmlObjPath=$2% Example: /odum/home/rods/1902.29/H-339/AVUs.xml *xsltObjPath=/odum/home/rods/prototype/formatDDI.xsl Output parameters ruleExecOut *From https://www.irods.org/index.php/iRODS_Videos
Breaking apart preservation policies using approach modeled on MIT Pledge project • Step 1 = define policy areas • Step 2 = create policy declaration statements for each policy area; state the requirements for operation, not technical specifics • Step 3 = each entity in a policy statement is defined in language descriptions: humans and machine-readable references • Step 4 = deontic statements: logical statements define actors, actions, and constraints that enforce a policy statement. • Step 5 = Write iRODS rules for each statement Wolfe, Robert. 2007. PLEDGE policy list. MIT Libraries. <http://pledge.mit.edu/images/1/13/PLEDGEPolicies20070927.pdf>
Initial Rules Developed • Organization, Environment, and Legal Policies • Defined dataset succession plan • Defined access policies • Log access for accountability • Reference TRAC criteria • Community and Usability Policies • Require a deposit agreement • Process and Procedure Policies • Defined iCAT to DDI discovery crosswalk • Store dataset’s DDI metadata as object • Defined persistent identifiers • Defined UNF’s and Checksums • Provide reporting of preservation network • Technology and Infrastructure Policies • Defined number of replication copies • Defined geographic location for the copies • Provide authentication policy • Provide versioning • Provide control for deletion/replacement • Defined replica validation frequency via UNF’s and Checksums
Video Demo http://ils.unc.edu/~wardj/share/eScience2009/2009-11-16-IEEE_demo.swf
Acknowledgements This work is funded by the NSF grant OCI-0848296 and is a collaboration with NARA on the development of the "NARA Transcontinental Persistent Archive Prototype". The initial work on this project was funded by the NARA supplement to NSF SCI 0438741, “Cyberinfrastructure; from Vision to Reality” – Transcontinental Persistent Archive Prototype (TPAP) (2005-2008).