1 / 31

Trident Project Managing an Extensible Digital Repository

This project discusses the development of Trident, an extensible digital repository at Duke University. It covers the scaling up process, organization, Apache Solr search server, cross-searching of collections, integration with other platforms, and the future plans for mass digitization and more multimedia content. The project aims to create a robust and extensible repository with a scalable metadata tool that can be reused with any schema.

cwalter
Download Presentation

Trident Project Managing an Extensible Digital Repository

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Trident ProjectManaging an Extensible Digital Repository David Kennedy

  2. Digital Collections @ Duke • Scaling Up • Trident Project • What’s new

  3. Digital Collections @ Duke • Digital Scriptorium - 1995 • 1997 – DynaWeb • Early 2000s – mainstream digital collections • 2007 - Tripod Boy covered by dirt smoking cigarette with one hand, holding can of tobacco in other. http://library.duke.edu/digitalcollections/gedney.KY0178

  4. Scaling Up • Organization • DPC, DCAG, DCIT • Digital Collections Coordinator View showing interior of Chapel. Large window in the rear of Choir. http://library.duke.edu/digitalcollections/duc.ducpp19310417WC0421

  5. Apache Solr Search server / index engine with XML API Apache Cocoon App framework / XSLT engine XML Document base METS, TEI

  6. Tripod • Cross searching of collections • Faceted browsing, term clouds, 3D Wall • Image, Video, Audio, Text • Content – advertisements, ration coupons, photographs, maps, sports footage, sheet music, letters, diaries, etc. • Integration with other platforms – Flickr, YouTube, iTunes

  7. 2008 Tripod til now • Migrated 90,000 items • Added 13 new collections and 12,000 items • 35 collections total, 28 cross searchable • Digitization rate = 5 min per image (scan + QC) • 1.5 TB Vica contre le service secret anglais http://library.duke.edu/digitalcollections/vica.viccb01001

  8. Looking ahead • Mass digitization! • On demand • 156 TB/year • Real time publication • More multimedia Laborers Working http://library.duke.edu/digitalcollections/gamble.172-969

  9. Needs • Robust and extensible repository • Software to aid process

  10. Trident • “Metadata tool that scales…” • Hire 2 programmers Dave TJ Will

  11. Modest goals • Create a metadata tool that can be reused with any schema • Develop modular architecture with parts that could be swapped out • Open source whatever we do • Lower cost of ownership of Fedora

  12. Architecture

  13. Web Services API

  14. Repository

  15. Metadata Application Profiles • Metadata editing page built on the fly • Built using metadata application profile • Includes metadata validation • Includes instruction on how to edit metadata MetadataFormDefinition + Metadata = MetadataForm

  16. MetadataFormDefinition

  17. ValidationRules

  18. Metadata

  19. MetadataForm

  20. Edit Form

  21. Edit Form

  22. Editor

  23. Editor

  24. Editor

  25. Editor

  26. Trident Project • Prototype = Broadsides Collection • Migrate existing collections • Package software • Digitization/QC workflow tool • Discovery layer redesign…

  27. Demo http://www.youtube.com/watch?v=uI1DKgX5ZuU (demo by Sean Aery)

  28. Wrap up • Does it scale beyond Duke’s walls?

  29. Questions • David Kennedy (david.kennedy@duke.edu) • Sean Aery (sean.aery@duke.edu)

More Related