190 likes | 381 Views
The Fedora Project Where we’ve been and where we’re going. Mellon OS Retreat March 2005. Sandy Payette Cornell University. First… a quick review of the architecture. Fedora Repository Service. rdbms. files. RDF. Fedora Digital Object Model. Component View. Digital object identifier.
E N D
The Fedora ProjectWhere we’ve been and where we’re going Mellon OS Retreat March 2005 Sandy Payette Cornell University
Fedora Repository Service rdbms files RDF
Fedora Digital Object Model Component View Digital object identifier Persistent ID ( PID ) Relations (RELS-EXT) Reserved Datastreams Key object metadata Dublin Core (DC) Audit Trail (AUDIT) Datastream Datastreams Set of content or metadata items Datastream Disseminators Pointers to service definitions to provide service-mediated views Default Disseminator Disseminator
“Fedora Inside” Known Use Cases • Digital Library Collections • Institutional Repository • Educational Software • Information Network Overlay • Digital Archives and Records Management • Digital Asset Management • File Cabinet / Document Management • Scholarly publishing
Fedora – where we’ve been(versions 1.2 – 2.0) • FOXML (Fedora Object XML) • Simple XML format directly expresses Fedora object model • Easily adapts to Fedora new and planned features • Easily translated to other well-known formats • Enhanced Ingest/Export of objects • FOXML, METS (Fedora extension) • Extensible to accommodate new XML formats • Planned: METS 1.4, MPEG21 DIDL
Fedora 1.2 - 2.0 (continued) • Object-to-object Relationships • Ontology of common relationships (RDF schema) • Relationships stored in special datastream (RELS-EXT) • Resource Index (RI) • RDF-based index of repository (Kowari triple-store) • Graph-based index includes: • Object properties and Dublin Core • Object Relationships • Object Disseminations • RI Search • Powerful querying of graph of inter-related objects • REST-based query interface (using RDQL or ITQL) • Results in different formats (triples, tuples, sparql)
Fedora Digital Objects Resource Index View
Fedora 1.2 - 2.0 (continued) • New Utilities • Batch Modify Utility • Repository Administrator Reporting • Performance Tuning (~1 million objects) • Ingest testing (800K objects; 40 millisec/object) • Concurrency testing (access requests) • Communications and Outreach • New Fedora Web Site • Improved Documentation • Tutorials
Fedora 2.1 (May 2005) • Authentication plug-ins • HTTP basic authentication and SSL • Plug-in #1 : Tomcat user/password file/db • Plug-in #2 : LDAP tie-in • Plug-in #3 : Radius Authentication • Authorization module • XML-based policies using XACML • Fine-grained policy enforcement (API actions X subject attrs X object attrs) • Repository-wide policies • Object-specific policies • Policy Builder Client • New Services …
Other Planned Work • Fedora Showcase and Contributed Tools (website) • Object Creation • “Content Model” Specification Language • Advanced Workbenches • Workflow engine • Tools for RDF browse and graph traversal • Performance Tune – millions of objects • Web services security and Shibboleth • Code Refactoring • Fedora web app (.war) • MVC2 pattern for REST-based web exposures • Other misc.
Fedora Development Process • Core Development Team (10 dev+lead) • Cornell University • University of Virginia • Eclipse IDE • GForge and CVS (at Cornell) • Communications • fedora-dev, codewatch, users email lists • Bugzilla • Full Team call (1X/week) • Technical design/dev call (1X/week) • Full Team meeting (2X/year)
Fedora Development Consortium • Advisory Board • University of Virginia • Tufts • VTLS • ARROW (Monash University and Nat’l Lib Australia) • Harris Corp. • Danish Royal Library and DTU • Northwestern University • NSDL – Core Integration • Mission • Requirements Definition, Specifications. Joint Development • Commission of Working Groups • Content Modeling • Outreach and Education • Workflow and Service-Oriented Processes • Recommendation for Long-Term sustainability model • Governance and Funding • Set Fedora Free – full open source model (e.g., public SourceForge) • Code Maintenance (UVA until 2012; plan for beyond)
Recent News • Downloads ~20K; 52 countries • Growth – lots of new interest • Fedora Users Conference (May 13-14) • Interesting new adopters • OhioLink • DISA (South Africa history) • Interesting new proposals • Company X finalist for large government contract • Cornell Lab of Ornithology (data + tools + documents) • Recent Article • XML CoverPages http://xml.coverpages.org/ni2005-03-18-a.html
Selected Fedora Adopters • University of Virginia • VTLS • Tufts University • OhioLink • Northwestern: Library and Academic Technologies • National Science Digital Library (NSDL): Core Integration • ARROW: National Library of Australia and Monash University • Royal Library Denmark, National Library, and DTU • Rutgers University • Indiana University • American Geophysical Union • Library of Congress: I Hear America Singing • University of Delaware • Hamilton College • Cornell CIT • Tibetan Buddhist Resource Center • Yale University • DISA – South Africa, History of Apartheid resistance