Granularity: Subdocuments in Ensemble • Lois Delcambre, Dave Maier, Dave Archer, Jeremy Steinhauer, Kelson Luc, Vikram Ramesh, with assistance from Va. Tech
Outline • A vision • What we have implemented • A demo • Questions/issues
The Authoring Experience • [Diagram: a cycle with steps numbered 1 to 4; labels: Ensemble, Find, Source, New work (mashup), Save & ingest, Ensemble] • Q: How many source types do we want to support? • Q: How many different “mashup” tools do we want?
Basic Authoring Needs • Search/browse repository for documents • Identify resources (docs/subdocs/mashups) of interest; place them in a “workspace” • Create new works (mashups) from pre-existing documents or subdocuments and original material • Store new works (mashups) in repository in appropriate format: .PPT, .DOC, .HTM, ... (Keep track of the subdocuments and how/where they were used.)
Features • With subdocuments in the repository, we can: • Automatically generate citation lists • Explain where information came from (show bibliographic details of sources; show subdocuments in original context) • Explain how information is used by others, e.g., “Have I (or anyone else) used this question in an exam before?” • Note: subdocuments can overlap; we need recursive processing.
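The citation-list feature, together with the note about recursive processing, can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the `Resource` class, its `parent` and `uses` fields, and the sample titles are hypothetical and are not part of Ensemble or of the implementation described in these slides.

```python
from __future__ import annotations
from dataclasses import dataclass, field


@dataclass
class Resource:
    """A document, subdocument, or mashup (hypothetical model)."""
    title: str
    parent: Resource | None = None          # enclosing document, if a subdocument
    uses: list[Resource] = field(default_factory=list)  # subdocuments used in this work


def root_document(res: Resource) -> Resource:
    """Walk up the parent chain to the top-level document."""
    while res.parent is not None:
        res = res.parent
    return res


def citation_list(work: Resource, _seen: set[int] | None = None) -> list[str]:
    """Titles of every source a work draws on, directly or through other mashups."""
    seen = set() if _seen is None else _seen
    cites: list[str] = []
    for sub in work.uses:
        if id(sub) in seen:          # guard against cycles and repeated use
            continue
        seen.add(id(sub))
        source = root_document(sub)
        # Recursive step: the source may itself be a mashup that used other material.
        for title in [source.title] + citation_list(source, seen):
            if title not in cites:
                cites.append(title)
    return cites


# Hypothetical example: a review sheet reuses an exam question,
# and the exam itself reused a paragraph from lecture notes.
lecture = Resource("Lecture notes")
paragraph = Resource("Paragraph 4", parent=lecture)
exam = Resource("Exam", uses=[paragraph])
question = Resource("Question 3", parent=exam)
review = Resource("Review sheet", uses=[question])
print(citation_list(review))
```

The same traversal, run in the opposite direction, would answer the "has anyone used this question before?" query.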
Make this easy to use • Use familiar tools to create mashups • Use copy-and-paste or other mashup-creation mechanisms • Don’t introduce unnecessary additional mouse clicks
For One Subdocument: “Where did this come from and where else did it go?” • Review details of the subdoc enclosing this selection • Review details of parent/ancestor documents • Show this subdoc's: Metadata, Provenance, Parent context, Other known uses, Overlap with others • See enclosing subdoc in context of original/parent document • Show other documents that incorporate enclosing subdoc • Show overlapping selections in repository
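One way to think about "show overlapping selections in repository" is as an interval-overlap test. The sketch below assumes, purely for illustration, that each subdocument is recorded as a character range within its parent document; the `Selection` type, its field names, and the sample identifiers are hypothetical, and the slides do not say how selections are actually stored.

```python
from typing import NamedTuple


class Selection(NamedTuple):
    """A subdocument as a character range in its parent (assumed representation)."""
    subdoc_id: str
    parent_id: str
    start: int  # inclusive character offset
    end: int    # exclusive character offset


def overlaps(a: Selection, b: Selection) -> bool:
    """Two selections overlap if they share a parent and their ranges intersect."""
    return a.parent_id == b.parent_id and a.start < b.end and b.start < a.end


def overlapping_selections(target: Selection, repository: list[Selection]) -> list[Selection]:
    """All other selections in the repository that overlap the target."""
    return [s for s in repository
            if s.subdoc_id != target.subdoc_id and overlaps(target, s)]


# Hypothetical repository contents.
repo = [
    Selection("s1", "docA", 0, 100),
    Selection("s2", "docA", 50, 150),   # overlaps s1
    Selection("s3", "docA", 100, 200),  # starts where s1 ends: no overlap
    Selection("s4", "docB", 0, 100),    # same range, different parent
]
print([s.subdoc_id for s in overlapping_selections(repo[0], repo)])
```

A linear scan is enough for a sketch; a large repository would more likely index selections per parent document.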
For Resources (including mashups) • Review details of parent document (if this doc is a subdocument) • Show this document's: Parent doc, Derived subdocs, Referenced subdocs, Mashups that this doc contributed to • Save & ingest into Ensemble (save this document and referenced subdocs) • Show the pieces of this doc that were used elsewhere • Show pieces of other docs included here • Show documents that use content from this one
Added Value While Searching / Browsing Repository • See document relationships in repository search results • Show doc/subdoc hierarchy in result: Subdocs, Parents, Instances of use
Relationship Types • doc/subdoc: HasSubDoc, HasParent • mashup/subdoc: References, UsedIn • Recursive doc/subdoc: chains of HasSubDoc ... HasSubDoc and HasParent ... HasParent • Recursive mashup/subdoc: chains of References ... References and UsedIn ... UsedIn • Using both relationships: HasSubDoc, HasParent, References, UsedIn
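The four relationship names can be read as two inverse pairs that are followed recursively. The sketch below is one possible in-memory model, not the Ensemble or Fedora implementation: the `RelationshipStore` class and the sample identifiers are hypothetical, and the edge directions (a document HasSubDoc a subdocument, a mashup References a subdocument) are assumptions drawn from the relationship names.

```python
from collections import defaultdict

# Assumed inverse pairs: doc/subdoc and mashup/subdoc.
INVERSE = {
    "HasSubDoc": "HasParent",
    "HasParent": "HasSubDoc",
    "References": "UsedIn",
    "UsedIn": "References",
}


class RelationshipStore:
    """Stores each relationship together with its inverse."""

    def __init__(self):
        self._edges = defaultdict(set)  # (subject, relation) -> set of objects

    def add(self, subject, relation, obj):
        self._edges[(subject, relation)].add(obj)
        self._edges[(obj, INVERSE[relation])].add(subject)

    def related(self, subject, relation):
        """Resources one step away along a single relationship."""
        return sorted(self._edges.get((subject, relation), ()))

    def closure(self, subject, relations):
        """Resources reachable by following any of the given relationships repeatedly."""
        seen, stack = set(), [subject]
        while stack:
            node = stack.pop()
            for rel in relations:
                for nxt in self._edges.get((node, rel), ()):
                    if nxt != subject and nxt not in seen:
                        seen.add(nxt)
                        stack.append(nxt)
        return sorted(seen)


# Hypothetical example: a figure inside a section of a paper, reused in slides.
store = RelationshipStore()
store.add("paper", "HasSubDoc", "section2")
store.add("section2", "HasSubDoc", "figure3")
store.add("slides", "References", "figure3")

print(store.related("figure3", "UsedIn"))              # one step
print(store.closure("figure3", ["HasParent"]))         # recursive doc/subdoc
print(store.closure("paper", ["HasSubDoc", "UsedIn"])) # using both relationships
```

Storing the inverse at insert time keeps "where did this come from" and "where else did it go" equally cheap to answer.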
Tool Architecture • [Diagram] Components: Source Document, Target Document, Authoring Application, Browsing Application, subdoc workspace, Ensemble Repository • Actions: Copy, Paste, Ingest, GetInfo • Labeled flows: subdocument; subdocument & metadata; Document; subdocuments; Fetch relationship data on pre-existing subdocs; Fetch relationship data on subdocument
Current Implementation • We have “assembly language” for our vision: • Subdocument selection using Copy, in MS Word and OpenOffice Writer; creates 3 streams for Fedora (FOXML, text, subdoc) • Subdocument ingestion to Fedora, including relationship creation • Fedora search showing parent/child • Note: we are using the existing Fedora relationships IsPartOf and HasPart (we need subdocument-specific relationships for the final implementation) • No mashup creation (yet)
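As a rough illustration of the relationship-creation step, the sketch below builds the part/whole statements for one subdocument as plain subject-predicate-object triples. It assumes the `isPartOf` and `hasPart` predicates of Fedora's external-relations namespace and the `info:fedora/<pid>` URI form; the helper functions and the PIDs `demo:1` and `demo:2` are hypothetical, and no Fedora API is called.

```python
# Assumed namespace of Fedora's built-in relationship predicates.
FEDORA_REL = "info:fedora/fedora-system:def/relations-external#"


def fedora_uri(pid: str) -> str:
    """URI form of a Fedora object identifier (PID)."""
    return f"info:fedora/{pid}"


def part_whole_triples(parent_pid: str, subdoc_pid: str) -> list[tuple[str, str, str]]:
    """Both directions of the parent/subdocument relationship."""
    parent, subdoc = fedora_uri(parent_pid), fedora_uri(subdoc_pid)
    return [
        (subdoc, FEDORA_REL + "isPartOf", parent),
        (parent, FEDORA_REL + "hasPart", subdoc),
    ]


def to_ntriples(triples: list[tuple[str, str, str]]) -> str:
    """Serialize triples whose terms are all URIs, one statement per line."""
    return "\n".join(f"<{s}> <{p}> <{o}> ." for s, p, o in triples)


# Hypothetical PIDs for a parent document and one subdocument.
triples = part_whole_triples("demo:1", "demo:2")
print(to_ntriples(triples))
```

Subdocument-specific relationships, as the note above calls for, would replace the two predicate names without changing the shape of the statements.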
Selection Using Digital Pens • Summer project for a high school intern (Kelson Luc) • Worked with Anoto digital pens, which have a digital camera near the tip and are used with special paper printed with a dot pattern • Print a PDF file on the dotted paper • Circle the selected text with the pen • Write a comment in the margin with the pen • We are lacking software from the pen manufacturer to put the whole system together
Subdocument Selection • [Insert movie here]
Subdocument Ingestion • [Insert movie here]
Subdocument: text file I wish I had never concealed it. For I, and I only, know what manner of fear lurked on that spectral and desolate mountain. In a small motor-car we covered the miles of primeval forest and hill until the wooded ascent checked it. The country bore an aspect more than usually sinister as we viewed it by night and without the accustomed crowds of investigators, so that we were often tempted to use the acetylene headlight despite the attention it might attract. It was not a wholesome landscape after dark, and I believe I would have noticed its morbidity even had I been ignorant of the terror that stalked there.
Searches With Relationships • [Insert movie here]
Discussion • Will Ensemble users want to download/use our plug-ins, e.g., for MS Word? • Which “mashup” creation tool should we provide? • Where should subdocuments be stored? (Metadata records for) subdocuments, their parent documents, and their mashups need to be in the same repository. • We could extend the browse/search interface for Ensemble to be subdocument-aware.
An Idea for the November Launch • Develop an example in which documents from Ensemble are used to create one or more mashup documents. (This will induce some number of subdocuments … with their relationships.) • Use it to demonstrate how the rewards could track use of materials across mashups. • Let people browse and search the example.