290 likes | 452 Views
Digital Preservation: The Multimedia Standards way. Mario Döller Assistant Professor University of Passau, Germany 1st International Digital Preservation Interoperability Framework (DPIF) Symposium. Agenda. Heterogeneity in Digital Preservation and Multimedia Retrieval in general
E N D
Digital Preservation: The Multimedia Standards way Mario Döller Assistant Professor University of Passau, Germany 1st International Digital Preservation Interoperability Framework (DPIF) Symposium
Agenda • Heterogeneity in Digital Preservation and Multimedia Retrieval in general • Selected solutions based on Multimedia Standards • MPEG Query Format • JPEG JPSearch • Conclusion Universität Passau, Lehrstuhl für Verteilte InformationssystemeProf. Dr. Harald Kosch
Digital Preservation (DP) Efforts • National Programs (US, EU, Australia, etc.) • NDIIPP, RAMA, ORION, CASPAR, PLANETS, PADI, PANIC , … • Digital Preservation Europe (DPE) • improve collaboration and synergies between existing preservation initiatives across Europe • Developed Metadata Formats • General: MPEG-7, Dublin Core, … • DP: VRA (Visual Resources Association) Core 4.0, CIDOC Conceptual Reference Model (CRM, (ISO 21127:2006)), museumdat, … Universität Passau, Lehrstuhl für Verteilte InformationssystemeProf. Dr. Harald Kosch
Multimedia RetrievalCurrent Situation Universität Passau, Lehrstuhl für Verteilte InformationssystemeProf. Dr. Harald Kosch
Example for heterogeneous Image Annotation MPEG-7 <Creator> <Role href=„urn:…:CS:creator"></Role> <Agent xsi:type="PersonType"> <Name> <GivenName>Mario</GivenName> <FamilyName>Döller</FamilyName> </Name> </Agent> </Creator> Dublin Core <metadata> … <title>Alps</title> <creator>Mario Döller</creator> … </metadata> Universität Passau, Lehrstuhl für Verteilte InformationssystemeProf. Dr. Harald Kosch
Current Standardization Efforts • MPEG – Query Format (ISO/IEC SC29 WG11) (ISO/IEC 15938-12) • JPEG - JPSearch(ISO/IEC SC29 WG1) (ISO/IEC 24800) • W3C – Media Annotations Working Group (http://www.w3.org/2008/WebVideo/Annotations/) Universität Passau, Lehrstuhl für Verteilte InformationssystemeProf. Dr. Harald Kosch
The MPEG Query Format (MPQF) • International Standard since end of 2008 • standardizes messages from and to multimedia repositories and provides extended functionalities for service discovery, service selection and service capability description. • General Concepts • bases on XML and is defined by an XML Schema • decoupled from any other metadata standard (also MPEG-7) • support for any XML based MM metadata description • integration of limited XQuery functionality • MPQF divided into 3 main categories • Management • Input Query Format • Output Query Format Universität Passau, Lehrstuhl für Verteilte InformationssystemeProf. Dr. Harald Kosch
MPQF Scenario Universität Passau, Lehrstuhl für Verteilte InformationssystemeProf. Dr. Harald Kosch
MPQF ConceptsQuery I • How to query MMRS satisfactorily? Query Design • MPQF supports: • Synchronous/Asynchronous mode • Timeout functionality • MPQF combines: • Exact matches • Fuzzy requests Universität Passau, Lehrstuhl für Verteilte InformationssystemeProf. Dr. Harald Kosch
MPQF ConceptsQuery - Condition • assign preferenceValue and thresholdValue to every condition • assign scoringFunction to every „Boolean Operator“ (AND, OR, XOR) (recommended to follow t-norm, t-conorm rules) • result in rank and confidence evaluation for every item Universität Passau, Lehrstuhl für Verteilte InformationssystemeProf. Dr. Harald Kosch
MPQF ExamplesManagement I • request: give me all available MMRS! • request: give me all available MMRS fitting to my desired requirements Universität Passau, Lehrstuhl für Verteilte InformationssystemeProf. Dr. Harald Kosch
MPQF ExamplesQuery I • Browsing Query • QueryByFree Text Universität Passau, Lehrstuhl für Verteilte InformationssystemeProf. Dr. Harald Kosch
MPQF Examples • Assume: DB contains images annotated with the Corine Land Cover specification combined with MPEG-7. • Example images show industrial or commercial units (121) in the area of Sines/Portugal [European ] Universität Passau, Lehrstuhl für Verteilte InformationssystemeProf. Dr. Harald Kosch
MPQF ExampleSpatial Query Give me all satellite images that show an industry unit in the south of something else! <MpegQuery mpqfID=""> … <QueryCondition> <Condition xsi:type="SpatialQuery"> <SpatialRelation relationType="urn:...:SpatialRelationCS:2008:south" sourceResource="ID1"> </SpatialRelation></Condition> </QueryCondition> … </MpegQuery> ID1 Universität Passau, Lehrstuhl für Verteilte InformationssystemeProf. Dr. Harald Kosch
JPEG - JPSearch (ISO/IEC SC29 WG1) (ISO/IEC 24800)current status
JPSearch Objectives • provide a standard for interoperability for image search and retrieval systems • by defining the interfaces and protocols for data exchange between them • provide an abstract framework and flexible search architecture that allows: • adding, updating or querying metadata of images and image collections • federated search across different systems • the integration of best-of-breed independent search components, provided by different companies Universität Passau, Lehrstuhl für Verteilte InformationssystemeProf. Dr. Harald Kosch
JPSearch Overall Structure Universität Passau, Lehrstuhl für Verteilte InformationssystemeProf. Dr. Harald Kosch
Schema identification: identified by a URI and XML Namespace Schema and Transformation Rules management Central authority hosted by JPEG Create a single core schema Definition of Transformation Rules perform semantic, structural and syntactic mapping rules between XML-encoded metadata descriptions from different formats Part 2: Registration, Identification and Management of Metadata Schema (1) Universität Passau, Lehrstuhl für Verteilte InformationssystemeProf. Dr. Harald Kosch
Workflow of a JPSearch request Universität Passau, Lehrstuhl für Verteilte InformationssystemeProf. Dr. Harald Kosch
Query Transformation Universität Passau, Lehrstuhl für Verteilte InformationssystemeProf. Dr. Harald Kosch
AIR: ARCHITECTURE FOR INTEROPERABLE RETRIEVAL ON DISTRIBUTED ANDHETEROGENEOUS MULTIMEDIA REPOSITORIES Universität Passau, Lehrstuhl für Verteilte InformationssystemeProf. Dr. Harald Kosch
Summary - Outlook • Introduced MPEG Query Format and the JPSearch approach for improving interoperability during multimedia retrieval • Future Work • Establish the standards for cultural heritage projects? Universität Passau, Lehrstuhl für Verteilte InformationssystemeProf. Dr. Harald Kosch
Questions? Universität Passau, Lehrstuhl für Verteilte InformationssystemeProf. Dr. Harald Kosch
Part 3: JPSearch Query Format (FCD) • Derived from MPEG Query Format • Only minor changes (namespaces, etc.) • Restrictions: • No TemporalQueryType • only image domain allowed • Modifications: • QueryByROI • QueryByMedia • MIME-Type Universität Passau, Lehrstuhl für Verteilte InformationssystemeProf. Dr. Harald Kosch
Part 4: JPSearch File Format (CD) • Extension of JPEG-1/JPEG2000 file format • Fully compatible to JPEG-1/JPEG2000and provides additional functionality carrying associated metadata within a file Overall structure of JPEG-1-compliant version of JPSearch file format. Universität Passau, Lehrstuhl für Verteilte InformationssystemeProf. Dr. Harald Kosch
Part 5: Data Interchange Format between Image Repositories (CD) • should be able to perform synchronization and consolidation/aggregation of repositories • Synchronization of: • Meta part: to identify the content of data part • Data part: image, collection of images, metadata, ontology or URI • Relys on image collection format of ISO/IEC 23000-3 (Photo player MAF), MP4 Universität Passau, Lehrstuhl für Verteilte InformationssystemeProf. Dr. Harald Kosch
Workflow of a JPSearch system (1) • JPQF query receives • Metadata based on Core Schema is transformed to the metadata of the N native schemas • Additional Metadata is transformed to N native Schemas (if possible else the information is discarded) • N times Query transformation (optional) • Forward N queries to N native systems where the interpreter transforms it to the native query language and executes it • Transforms N individual result sets to core schema. • Aggregate N result sets to 1 result set • Forward final result set to user Universität Passau, Lehrstuhl für Verteilte InformationssystemeProf. Dr. Harald Kosch
AIR: Planned search concepts Universität Passau, Lehrstuhl für Verteilte InformationssystemeProf. Dr. Harald Kosch