
GeoSciGraph: Interoperable Inventory of EarthCube Resources for Geoscience

GeoSciGraph is an ontology management system that integrates and searches multiple data resources in Earth Science. It offers semantic processing, validation, provenance recording, and content enhancement components for metadata aggregation in CINERGI domain inventories.

susanwilson

Presentation Transcript


  1. Community Inventory of EarthCube Resources for Geoscience Interoperability. Ilya Zaslavsky, Jeffrey Grethe, Amarnath Gupta, Burak Ozyurt, Thomas Whitenack, David Valentine, Adam Schahne (University of California San Diego); Stephen Richard (Arizona Geological Survey); Kerstin Lehnert, Leslie Hsu (LDEO, Columbia University); Tanu Malik (University of Chicago); Luis Bermudez (Open Geospatial Consortium). RDA 9/2016 - Denver

  2. Metadata aggregation in CINERGI • Domain Inventories • RCN (Research Coordination Networks) • CINERGI Metadata Pipeline • Domain workshops • High-level assets • Catalogs

  3. Content enhancement components • Common enhancer API • Provenance recording: W3C PROV and Neo4j • Spatial enhancer (bounding boxes) • Keyword enhancer (Materials; Processes; Equipment; Methods; Features; Activities; Science Domains; Geologic age; Organizations; Resource types) • GeoSciGraph API for semantic processing • Validation and provenance components
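The slide does not show what the common enhancer API looks like, so the following is only a minimal sketch of the idea, assuming each enhancer is a function that takes a metadata record and returns an enriched copy. The function names, the record fields (`points`, `description`), the two-term vocabulary, and the provenance log format are all hypothetical; the real pipeline records provenance with W3C PROV in Neo4j rather than in a Python list.

```python
from typing import Callable, Dict, List, Tuple

Record = Dict[str, object]
Enhancer = Callable[[Record], Record]


def spatial_enhancer(record: Record) -> Record:
    """Add a (west, south, east, north) bounding box from (lon, lat) points."""
    points = record.get("points", [])
    out = dict(record)
    if points:
        lons = [lon for lon, _ in points]
        lats = [lat for _, lat in points]
        # Naive min/max; does not handle boxes crossing the antimeridian.
        out["bounding_box"] = (min(lons), min(lats), max(lons), max(lats))
    return out


def keyword_enhancer(record: Record) -> Record:
    """Tag the record with vocabulary terms found in its description."""
    # Hypothetical two-term vocabulary: term -> facet category.
    vocabulary = {"basalt": "Materials", "erosion": "Processes"}
    text = str(record.get("description", "")).lower()
    out = dict(record)
    # Plain substring matching, standing in for real semantic processing.
    out["keywords"] = sorted(
        (term, category) for term, category in vocabulary.items() if term in text
    )
    return out


def run_pipeline(
    record: Record, enhancers: List[Enhancer]
) -> Tuple[Record, List[Tuple[str, List[str]]]]:
    """Apply enhancers in order, logging which fields each one added."""
    provenance = []
    for enhancer in enhancers:
        before = set(record)
        record = enhancer(record)
        provenance.append((enhancer.__name__, sorted(set(record) - before)))
    return record, provenance


record = {
    "description": "Basalt samples showing erosion",
    "points": [(-117.2, 32.7), (-110.9, 32.2)],
}
enhanced, provenance = run_pipeline(record, [spatial_enhancer, keyword_enhancer])
```

Keeping every enhancer behind the same call signature is what lets the pipeline log provenance uniformly, regardless of what a given enhancer adds.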

  4. GeoSciGraph and Ontologies. GeoSciGraph: an ontology management system that provides the semantic infrastructure to integrate and search multiple data resources across sub-disciplines of Earth Science. Some included ontologies: • SWEET • ENVO • CHEBI • YAGO (geo features) • NASA GCMD (equipment, providers) • GeoSciML • Geochronology • EDAM Bioinformatics (software terms and operations) • Also: VIAF

  5. GeoSciGraph Services API • GeoSciGraph Services: The GeoSciGraph API exposes a set of web services for querying and exploring the CINERGI ontology. • Lexical Services are used to break text into sentences and perform sentence parsing using lightweight NLP techniques. • Vocabulary Services are used to find concepts, synonyms, term categories, autocomplete search, and term suggestions based on similarity.
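To make the Vocabulary Services description concrete, here is a small in-memory sketch of prefix autocomplete and synonym lookup. It does not call the GeoSciGraph web services, whose endpoints are not given in the slides; the `Vocabulary` class, its method names, and the `ex:` concept identifiers are all hypothetical.

```python
from bisect import bisect_left
from typing import Dict, List, Tuple


class Vocabulary:
    """Toy concept store: concept ID -> labels (preferred label first)."""

    def __init__(self, concepts: Dict[str, List[str]]) -> None:
        self._concepts = concepts
        # Sorted (lowercased label, concept ID) pairs allow binary search by prefix.
        self._index = sorted(
            (label.lower(), cid)
            for cid, labels in concepts.items()
            for label in labels
        )

    def autocomplete(self, prefix: str, limit: int = 5) -> List[Tuple[str, str]]:
        """Return up to `limit` (label, concept ID) pairs starting with `prefix`."""
        prefix = prefix.lower()
        start = bisect_left(self._index, (prefix, ""))
        results = []
        for label, cid in self._index[start:]:
            if not label.startswith(prefix) or len(results) == limit:
                break
            results.append((label, cid))
        return results

    def synonyms(self, term: str) -> List[str]:
        """Return the other labels of the first concept that carries `term`."""
        term = term.lower()
        for labels in self._concepts.values():
            if term in (label.lower() for label in labels):
                return [label for label in labels if label.lower() != term]
        return []


vocab = Vocabulary({
    "ex:1": ["basalt", "basaltic rock"],
    "ex:2": ["basin"],
    "ex:3": ["granite"],
})
suggestions = vocab.autocomplete("bas")
alternatives = vocab.synonyms("Basalt")
```

Similarity-based term suggestions, which the slide also mentions, would need a distance measure on top of this exact-prefix index and are left out of the sketch.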

  6. GeoSciGraph Services API • Graph Services are used to navigate the graph by following user-specified relationships and finding neighborhoods. Another service locates the head of a clique (a subgraph in which every pair of nodes is connected) in an ontology graph. • Refine Services provide a gateway to OpenRefine (formerly Google Refine), a tool for matching entries in a data table to an ontology. • Cypher Utility Service is a pass-through service that sends a user-specified Cypher query directly to the underlying Neo4j system. • Analyze Services provide a way to add custom-defined analyses to the GeoSciGraph system.
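The two Graph Services ideas, neighborhoods reached through chosen relationships and cliques, can be illustrated on a toy edge list. This is a sketch under stated assumptions, not the GeoSciGraph implementation: the triple format, the function names, the relationship labels, and the choice to treat edges as undirected are all illustrative.

```python
from collections import deque
from itertools import combinations
from typing import Iterable, List, Set, Tuple

Edge = Tuple[str, str, str]  # (source, relationship, target)


def neighborhood(
    edges: List[Edge], start: str, relationships: Set[str], depth: int = 1
) -> Set[str]:
    """Nodes within `depth` hops of `start`, following only `relationships`."""
    adjacency = {}
    for source, rel, target in edges:
        if rel in relationships:
            # Assumption: relationships are traversed in both directions.
            adjacency.setdefault(source, set()).add(target)
            adjacency.setdefault(target, set()).add(source)
    seen = {start}
    queue = deque([(start, 0)])
    while queue:
        node, dist = queue.popleft()
        if dist == depth:
            continue
        for nxt in adjacency.get(node, ()):
            if nxt not in seen:
                seen.add(nxt)
                queue.append((nxt, dist + 1))
    seen.discard(start)
    return seen


def is_clique(edges: List[Edge], nodes: Iterable[str]) -> bool:
    """True if every pair of `nodes` is directly linked by some edge."""
    linked = {frozenset((source, target)) for source, _, target in edges}
    return all(frozenset(pair) in linked for pair in combinations(list(nodes), 2))


edges = [
    ("basalt", "subClassOf", "igneous rock"),
    ("igneous rock", "subClassOf", "rock"),
    ("basalt", "equivalentTo", "basaltic rock"),
    ("basaltic rock", "equivalentTo", "basalt rock"),
    ("basalt", "equivalentTo", "basalt rock"),
]
parents = neighborhood(edges, "basalt", {"subClassOf"}, depth=2)
equivalents_form_clique = is_clique(edges, ["basalt", "basaltic rock", "basalt rock"])
```

In a deployed system the same neighborhood question would more likely be posed as a Cypher query through the pass-through service; the breadth-first search here only shows what such a query computes.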

  7. Manual Review of Keyword and Location Assignments (CINERGI Metadata Annotator)

  8. Interesting issues… • Re-publishing linked data • ISO 19115? RDF? JSON-LD? • Semantic conflicts • Selecting which ontology IDs to use when conflicts arise • Our ability to detect concepts and assign keywords may not match the ontology’s level of detail • Lots of tricks in the bridge ontology • Enabling faceting and search • Pre-defining upper facets; adjusting underlying ontology fragments for consistency (cinergiParent, cinergiFacet annotations) • Generating a corpus of text to analyze (crawling, introspection) • Curating keyword assignments • Manual; Tool-assisted; Community curation; Automated (machine learning; rules) • Adding usage metadata (eventually a facet?) • Communities may promote their own facets
