430 likes | 539 Views
FEDORA at Northwestern University. Bill Parod Academic Technologies Northwestern University bill-parod@northwestern.edu. General Background. Academic Technologies Faculty projects Library partnerships Institutional partnerships Diverse clientele Diverse content “One-off” projects.
E N D
FEDORAatNorthwestern University Bill Parod Academic Technologies Northwestern University bill-parod@northwestern.edu
General Background • Academic Technologies • Faculty projects • Library partnerships • Institutional partnerships • Diverse clientele • Diverse content • “One-off” projects Northwestern University
Current FEDORA Projects • Block Museum of Art • The Last Expression Art Collection • Introduction to Asian Art History • BBC Spoken Word Archive • Paris Map Collection • Encyclopedia of Chicago • WordHoard Text Analysis Project Northwestern University
Art collections Wall murals Photographs Historical maps GIS maps Newspapers Book page images Digital video Spoken word Literary works Encyclopedias Lexical data Census data Event data Diversity of Content Northwestern University
Wavelet Image Servers Vector Image Processors Streaming Media Servers RDBMS XML Databases XSLT Processors GIS Servlet Engines Diversity of Systems Northwestern University
Art collections Wall murals Photographs Historical maps GIS maps Newspapers Book page images Digital video Spoken word Literary works Encyclopedias Lexical data Census data Event data Abstract Image Models Northwestern University
Core behavior getCoverpage getThumbnail Basic image (UVa) getThumbnail getMedium getHigh getVeryHigh Addressable image getRegion(rgn,size) getViewer Layered image getRegion(,,layers) getViewer(layers) Geographic image getRegion(,,, coords) getViewer(, coords) 4 Image Behavior Classes Northwestern University
Core behavior XML Metadata HTML XSLT script Thumbnail Image Basic image (UVa) Thumbnail jpeg Medium Res jpeg High Res jpeg Very High Res jpg Addressable image Image metadata Viewer XSLT script Layered image Layer metadata Geographic image World file for projection 4 Image Content Models Northwestern University
BDEF Interface Definition Northwestern University
BMECH Description • Method bindings to implementation • HTTP URL templates to image servlet • Accepts image server metadata stream • Accepts specific user parameters • Provides implementation flexibility • Currently using TrueSpectra/Scene7 image server Northwestern University
getCoverPage() for simple image – Block Museum Collection Northwestern University
getCoverPage() for zoomable image – History of Asian Art class Northwestern University
Ingesting Images • Imaging person deposits master TIFF images in WebDAV enabled file stor • Image server configured with “virtual path” to WebDAV stor for master image tiff. • TIFF master is converted to FlashPix and cached in image server • Image server handles request for FEDORA dissemination Northwestern University
Image Workflow: FEDORA – TrueSpectra – Xythos Department Academic Technologies Users Dissemination Requests Metadata in Excel METS FEDORA TrueSpectra Image Server Data flow Requests Tiffs in Xythos • Catalog in Excel converted to METS for FEDORA ingest • Tiff Masters deposited in collection’s Xythos directory • Access to Xythos directory enabled for TrueSpectra virtual paths • METS/FEDORA record includes link to TrueSpectra image • Access to image is through FEDORA image behaviors Northwestern University
Physical Collection Management Scenario: FEDORA – Content Service – Xythos Integration Faculty or Support Academic Technologies Users Dissemination Requests Files in Xythos Auto-ingester FEDORA TrueSpectra Streaming Server Search Data flow Requests Metadata update • FEDORA collection object attached to Xythos directory • Xythos notifies collection object of changes in the directory • File added – collection creates new member item • File updated – item accepts new version for file stream • File removed – item is set dormant in FEDORA • Metadata added/updated online or batch Northwestern University
Collection behavior getSearchForm performSearch() getItem() getItems() addItem() deleteItem() reindex() displayItem() Core behavior getCoverpage getThumbnail Block Museum of Art The Last Expression Vesalius Figures BBC Audio History of Asian Art Basic Collection Object Northwestern University
Collection Content Model • Search Form • XSLT for search results • Index • Header/footer XML for result stream • Member PIDs Northwestern University
Search Implementation • FEDORA METS files currently indexed offline • Plan to integrate update notification and indexing • Search Engine • Have 3 implementations: • FEDORA native search • Sgrep • OpenText • Investigating SRW/CQL • Search results passed through XSLT • Easy to provide search capability to collections Northwestern University
FEDORA – External Service External Services Cache data Dissemination Requests FEDORA Image Server Search Engine BMECH Data Request Dissemination Northwestern University
link Northwestern University
Virtual Collections • Collection maintenance • Topical galleries • Ad-hoc or dynamic collections • For classes... • personal collections… • special exhibits… Northwestern University
Database Integration • SQL/XQuery for object “data streams” • SQL/XQuery for object disseminations Northwestern University
Encyclopedia of Chicago • In active development • Metadata continually updated by research staff in Microsoft Access • New content continually added to MS Access and file stor • Varied entry types • All have dynamic “See Also”s Northwestern University
SQL Datastreams • “See Also” and “Content” datastreams • Cocoon urls that perform SQL queries on dynamic research data and convert to XML. • Dynamic updates during development • When project finished will consider moving to more robust database or “freeze” streams in the repository as “managed”. Northwestern University
FEDORA – External Service External Services Cache data Dissemination Requests Image Server FEDORA Search Engine BMECH Data Request Dissemination RDBMS Data stream Northwestern University
WordHoard Text Analysis • Large TEI XML Etext corpora • Word level grammatical and frequency data • Text requests via Xquery • Word level lexical queries via SQL Northwestern University
Basic Text Behavior BMECH Backed by eXist database Northwestern University
Viewer Object Presentation uncoupled from data object Northwestern University
Example Book Model Northwestern University
TEXT TOC Service • Request for TOC keyed by text PID • TOC XML requested from text • TOC DOM cached in service • User requests with “open nodes” parameter • Pruned DOM styled with XSLT from Viewer content model Northwestern University
Art collections Wall murals Photographs Historical maps GIS maps Newspapers Book page images Digital video Spoken word Literary works Encyclopedias Lexical data Census data Event data Abstract Text Model Northwestern University
Structured text (UVa) getHeading getTOC(level) getChunk(idref) getPage(idref) Core behavior getCoverpage getThumbnail Text Methods Northwestern University
Digital video Spoken word Literary works Encyclopedias Lexical data Census data Event data Art collections Wall murals Photographs Historical maps GIS maps Newspapers Book page images Time-based Media Model Northwestern University
Core behavior getCoverpage getThumbnail Time-based media Play playSection() Time-based Media Behaviors Northwestern University
Behaviors by Type Northwestern University
Next Steps • Implement more object types • Event, video, tabular data • Transactions • Ad-hoc groupings of repository objects • Asset management, Annotation • Access control for user editing • Interoperability • Search protocols and repository interactions • Consider application models • Specialized clients Northwestern University
Specialized Clients Northwestern University
Viewer Object Northwestern University
Summary • Code reuse through object abstraction • Flexible implementation binding • Comprehensible APIs for applications • Stable APIs for Content reuse Northwestern University
Thank You Bill Parod Academic Technologies Northwestern University bill-parod@northwestern.edu