130 likes | 263 Views
Gary Berg-Cross SOCoP Executive Secretary Nov. 29-30, 2012 U. S. Geological Survey National Center 12201 Sunrise Valley Dr., Reston VA. SOCoP 2012 Workshop: Vision and Strategy. Outline for a VoCamp-Style Workshop. VoCamp Style is informal Workshop Ingredients Goals Workgroup Teams
E N D
Gary Berg-Cross SOCoP Executive Secretary Nov. 29-30, 2012 U. S. Geological Survey National Center 12201 Sunrise Valley Dr., Reston VA SOCoP 2012 Workshop: Vision and Strategy
Outline for a VoCamp-Style Workshop VoCamp Style is informal • Workshop Ingredients • Goals • Workgroup Teams • Logic of Phased Structure Sessions • Lightweight Methods • From Conceptualizations to Formalizations • Cautions and Tradeoffs • Ontologies and Ontology Patterns We are not here to teach a new discipline, but to employ multiple disciplines as a team.
Goals: “Group Work on Semantic Models of Interest” Building on past workshops and interests: • We seek clarified agreement & reduced ambiguities/conflicts on geospatial/earth science phenomena that can be formally represented in: • Constrained, engineered models to support understanding, reasoning & data interoperability and/or • Creation of general patterns that provide a common framework to generate ontologies that are consistent and can support interoperability. • We like data-grounded work since: • Much of the utility of geospatial ontologies will likely come from their ability to relate geospatial data to other kinds of information.
Workgroups Include Multiple Roles:Semantic Engineering is a Social Process Domain/Data Expert Ontologist Note taker Facilitator
Logic of Work Sessions At end of day Report back to whole and wrap up At end of day Group Reports on status After break Prepare Report After break Group Work on Draft Models After lunch Group Work on Concepts, Vocabulary & Model(s) After lunch firm up products and test against data Start Group organization & Introductions, goals and process After break Work Groups polish, formalize models Day One Intro, Topics, Methods.. 2nd day draft final model & initial formalizations
Lightweight Methods & Products • Choose lightweight approaches grounded by scenarios and application needs. • Low hanging fruit leverages initial vocabularies and existing conceptual models to ensure that a semantics-driven infrastructure is available for use in early stages of work • Reduced entry barrier for domain scientists to contribute data Triple like parts Simple parts/patterns & direct relations to data Constrained not totally Specified. Grounded Ecological..
How Ontologies Arise from “Analysis and Conceptualization” Pragmatic validation .. Stream Reaches are segments of surface water with similar hydrologic characteristics Perceived Distinctions & Situated Experience Real World Semantics Conceptualization starts to adaptively model (part of) the world 2Abstraction 1World Situations Stream Reach distinction Rationalized Intuition expressed in Some conceptual language Things D in world state W with conceptual relations R C= <SW, H, S> SW =surface water H=has S = Segments Domain folks and Modelers Adapted liberally from Guarino’s 1998 Formal Ontology in Information Systems
Intended Model Fitting C Ontological Engineering – Formalization 3. To express C we need LanguageL (Term StreamX corresponds to entity in world) We assign interpretative Functions I & Commitment K. K=<C,I) ..That is a type of Stream segment C Conceptualization model (part of) the world Say D Hydrology In some Conceptual Domain space Interaction UML OWL 1 WorldSituations Models for Domain D Expressible In L Logical Model Theories are consistent sets of sentences; closed under logical consequence Ontology Models for D Expressed In L using K Models defines relationship between L syntax and interpretations Adapted liberally from Guarino’s 1998 Formal Ontology in Information Systems
Caution(s) • No single (general or domain) taxonomy or ontology could satisfy all needs. • That’s part of the reason we have so many alternative ones. • We can make quality, useful ones for one or more purposes. • We have intended models of focus. • But if we make commitments that make strong claims that are greater our understanding and focus we may be making a mistake(s). Smith and Mark describe a good/quality ontology as one that is: • “neutral”, • And/or perhaps explicit about what it is not neutral about • computationally tractable, • acceptable to and reusable by domain people • One might add that it is rationalized.
Many Challenges – more on this later…. • These include tradeoffs between generality and precision. • Tom Bittner, for example, notes: • “The need for geometric representations, many of which rely on relatively precise boundaries, conflicts with the need for sophisticated classification systems for scientific and integration purposes. “ • Vagueness and the tradeoff between the classification and delineation of geographic regions - an ontological analysis • http://www.geoinfo.info/geoinfo2012/programa.php • But these, of course, make the work interesting…..
Ontologies and Ontology Patterns Dolce Ultra Lite? • Many levels & types of ontologies • Something like DOLCE is quite complex and so are domain ontologies like SWEET • One may pick some small repeating patterns (ODPs) out of large ontologies. • ODPs, like OWL, are tools for ontologies • They are more easily understandable with good explicit documentation for design rationales • Robust • Can be used to build on modularly for reoccurring problems needing representation • Capture best practices • Should help bridging/integrating ontologies • We focus on content ODPs using domain expertise rather than logical ODPs etc. • owl:Class:_:x rdfs:subClassOf owl:Restric(on:_:y • Inflammation>on rdfs:subClassOf (localizedIn some BodyPart) Patterns “Geo” Top “Geo”-Domain Apps ( Uses Cases & Demos) 11
Ontology-ODP Relations Small DUL Portion • <owl:Class rdf:ID="SocialObjectAttribute"> • <rdfs:label xml:lang="en">Social attribute</rdfs:label> • <rdfs:subClassOf> • <owl:Restriction> • <owl:onProperty> • <owl:ObjectProperty rdf:ID="isRegionFor"/> • </owl:onProperty> • <owl:allValuesFrom> • <owl:Class rdf:ID="SocialObject"/> • </owl:allValuesFrom> • </owl:Restriction> • </rdfs:subClassOf> • <rdfs:label xml:lang="it">Caratteristica sociale</rdfs:label> • <rdfs:subClassOf> • <owl:Class rdf:ID="Region"/> • </rdfs:subClassOf> • <rdfs:comment>Any Region in a dimensional space that is used to represent some characteristic of a SocialObject, e.g. judgment values, social scalars, statistical attributes over a collection of entities, etc.</rdfs:comment> • </owl:Class> • <owl:Class rdf:ID="WorkflowExecution">… ODP Abstract From Use for New Ontology Design “Unfriendly logical structures, some large, hardly comprehensible Ontologies” (Aldo Gangemi)
ODP Rationale –Reuse, Minimal Constraints.. • Problem • It is hard to reuse only the “useful pieces" of a comprehensive (foundational) ontology, and • the cost of reuse may be higher than developing a scoped ontology for particular purpose from scratch • “For solving semantic problems, it may be more productive to agree on minimal requirements imposed on .. Notion(s) • Werner Kuhn (Semantic Engineering, 2009) • Solution Approach • Use small, well engineered, modular starter set ontologies with • explicit documentation of design rationales, and • best reengineering practices • These serve as an initial constraining network of “concepts” with vocabularywhich people may build on/from for various purposes.