220 likes | 330 Views
Scientific Data Discovery with XMC Cat Pushing Back on the Data Deluge: Advancements in Metadata, Archival and Workflows Scott Jensen, PhD Senior Researcher. XMC Cat. Need to capture detailed metadata for discovery and re-use Must be able to capture domain-specific metadata
E N D
Scientific Data Discovery with XMC Cat Pushing Back on the Data Deluge: Advancements in Metadata, Archival and Workflows Scott Jensen, PhD Senior Researcher
XMC Cat • Need to capture detailed metadata for discovery and re-use • Must be able to capture domain-specific metadata • Metadata standards implemented in XML • Adaptable to XML schemata from different scientific communities • Able to communicate results based on community schema • Detailed data discovery search capabilities • Describe data products in a broader experiment context • Capture metadata incrementally and early in the scientific process • Concept based partitioning of metadata schema • Incremental and asynchronous metadata capture
Logical ID Query Data Data XMC Cat: Standalone Front-end Metadata Catalog to Backend Storage Repositories XMC Cat Workspace XMC Cat Metadata Location Transparency name resolver Data Objects iRODS OPeNDAP Fedora
Scientific Metadata Captured as Concepts Concepts Enable: • Incremental capture • Detailed discovery • Fast response
Detailed Relational Search + XML Concept CLOBs Configurable to Varied Scientific Domains
Query Interface • Point & click query construction adapts to the user’s community schema • User builds query through point & click interface • =can be added • Strongly typed metadata allows for more precise search criteria • Results are returned using the community XML schema
Point & Click XMC Cat Configuration • Prompts for required schemas, determines dependencies and builds the necessary XML Bean jars. • Concepts identified through a point & click interface. • Default pushes concepts down as far as possible • Automatically adjusts other concept definitions • Human readable descriptions can be added • Wizard-based approach “remembers” your configuration through annotations. • All configurations saved for future sessions. • Configuration files are automatically generated and downloadable.
Try it Out! • XMC Cat is available through the D2I Website: http://pti.iu.edu/d2i/xmccat (Also check out our other D2I projects!) • Additional schemata being added as pre-packaged configurations. • Post-processing plug-ins available. • If your project has a metadata management need, please contact us: • Scott Jensen scjensen@cs.indiana.edu • Beth Plale plale@cs.indiana.edu