1 / 15

Ontology Framework for Knowledge Representation Diversity

Providing a comprehensive overview of ontologies, their characteristics, dimensions, and semantic and pragmatic aspects to ensure sharability and reusability in a diverse knowledge representation landscape.

wfranco
Download Presentation

Ontology Framework for Knowledge Representation Diversity

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Michael Gruninger University of Toronto, Canada Leo Obrst MITRE, McLean, VA, USA January 6, 2020 Ontology Summit: An Ontology FrameworkNIST, Gaithersburg, MDApril 22-23, 2007

  2. Agenda • Caveats: • The dimensions characterized in this briefing evolved during the course of discussions at the Ontology Summit • This briefing approximates this evolution • A Brief Inclusive Characterization of Ontologies • Towards a Common Set of Characteristics for Ontologies • Initial Ontology Dimensions that have Evolved: • Semantic • Degree of Formality and Structure • Expressiveness of the Knowledge Representation Language • Representational Granularity • Pragmatic • Intended Use • Role of Automated Reasoning • Descriptive vs. Prescriptive • Design Methodology • Governance • Conclusions

  3. A Brief Inclusive Characterization of Ontologies (in Computer Science & Engineering) • Ontologies are used to support sharable and reusable representations of knowledge • An early definition of Ontology: “a specification of a conceptualization” (Gruber, 1994) • Nevertheless, the sheer range of current work in ontologies: • Including taxonomies, thesauri, topic maps, conceptual models, and formal ontologies specified in various logical languages • Raises the possibility of ontologies being developed without a common understanding of their definition, implementation and applications • Our objective: • To provide a framework that ensures that we can support diversity without divergence • So that we can maintain sharability and reusability among the different approaches to ontologies

  4. A Common Set of Characteristics for Ontologies • An ontology includes: • A vocabulary together with a specification of the meanings of the terms in the vocabulary • This specification includes: • Identification of the fundamental categories in the domain • Identification of the ways in which members of the categories are related to each other • Constraining the ways in which the relationships can be used

  5. Ontology Dimensions: Semantic & Pragmatic • We propose a set of dimensions that can be used to distinguish among different approaches • Semantic • Pragmatic • Semantic Dimensions: These constrain how a given approach specifies the meaning of the terms • Degree of Structure and Formality • Expressiveness of the Knowledge Representation Language • Representational granularity • Pragmatic Dimensions: These cover the context in which the ontology is designed and used • Intended Use • Role of Automated Reasoning • Descriptive vs. Prescriptive • Design Methodology • Governance

  6. Ontology Semantic Dimension: (1) Degree of Formality & Structure • Related to but not the same as the expressive power required of a representation language used to specify the ontology • How elaborate (or well organized) are the semantics encoded by the ontology • Informal: An ontology can be specified in English or some other natural language in a document • This is an informal ontology, although it can be rich, unambiguous, precise • More Formal: A taxonomy can be term or concept based • Term-based: A topic hierarchy from more general terms at the top of the hierarchy to more specific terms as one descends through the hierarchy; • Concept-based: A hierarchy of classes in which the necessary and distinguishing properties of classes and their subclasses are represented • But, even if formalized, these models are very simply structured, i.e., structured with a subsumption relation (narrower_than or subclass) • Very Formal: An ontology for engineering equations • Would specify the formal semantics of its terms (such as quantity and unit of measure) in a language enabling precision & unambiguous expression

  7. Ontology Semantic Dimension: (2) Expressiveness of the Knowledge Representation Language • This is a dimension partially dependent on the first • Since an informal ontology will be expressed only as a list of terms or enumerated definitions in a natural language such as English • Although not a characteristic in itself for a given content, the language or framework that the content is modeled in: • Greatly affects, and in fact is the primary constraint on what can be expressed by the content • Example: Predicates and instances cannot be expressed in propositional logic • Example: Relationships besides subclass or narrower-than cannot be expressed in a strict (partially formalized) taxonomy, although they can be in a less strict (informal) taxonomy • In general, only if the KR language is logic-based will the ontology being machine-interpretable

  8. Ontology Semantic Dimension: (3) Representational Granularity: • An ontology may contain terms and limited inter-relationship representation • Example: a simple taxonomy • Example: a very formal ontology expressed in Common Logic but which contains only 3 classes and 2 properties • Or it may contain much more detail including many restrictions concerning how terms can relate to each other • Example: a very detailed English description of biological classes and discriminating properties • Example: a very formal ontology expressed in KIF, CL, Cyc-L, or OWL+SWRL which contains thousands of classes, thousands of properties, thousands of rules, and billions of instances/individuals • Some quantifiable metrics may give indications of the representational granularity: • Examples: average subclass/subproperty depth, average density/bushiness, average number of axioms per term, etc.

  9. Ontology Pragmatic Dimension: (1) Intended Use, Application Focus • The intended use may be (among others): • To enable communication among software agents • To help integrate disparate data sets and share knowledge bases • To represent a natural language vocabulary • To help provide knowledge-enhanced search • To provide a starting-point for building knowledge systems • To provide a conceptual framework for indexing/tagging content, etc. • Typically, there is some application that is envisioned for which the ontology is being developed • Categorization: one might want to situate documents within a framing topic taxonomy that roughly characterizes the primary content of the document and helps one semantically loosely organize document collections • Search Enhancement: one might want to use a thesaurus, its synonyms and narrower-than terms, to enhance a search engine that can employ query term expansion, expanding the user’s text search terms to include synonyms or more specific terms and thus increasing the recall (i.e., total set of relevant items) of retrieved documents • Enterprise Modeling, Question-Answering, Semantic Web Service Discovery • Semantic Search: one might to use “concept” based rather than “term” based search in specific domains or across domains • Complex Decision-Making, Intelligence Analysis: requiring machine reasoning

  10. Ontology Pragmatic Dimension: (2) Role of Automated Reasoning • Automated reasoning can range from simple to complex • Simple automated reasoning can mean: • Machine semantic interpretability of the content, which only requires that the language that the content is modeled in is a logic • This is a principled or standards-based approach • Or it can mean that a special interpreter/inference engine has been constructed that knows how to interpret the content • This is an ad hoc and often proprietary approach

  11. Ontology Pragmatic Dimension: (2) Role of Automated Reasoning (cont’d) • Simple automated reasoning: • The machine may be able to make inferences using the subclass relation by which properties defined at the parent class are inherited down to the children classes; this is the property of transitivity • More complex reasoning: • The machine may be able to take an arbitrary assertion in the KR language and classify it with respect to the taxonomic backbone of the ontology; e.g., description logics perform classificational reasoning • Very complex automated reasoning: • The use of deductive rules, i.e., inference rules or expressions that combine information from across the ontology; theorem-proving • These characterize dependencies much like if-then-else statements in programming languages • Business rules that try to characterize things that have to hold in an enterprise but which can’t typically be expressed in relational databases or object models • A logic-based KR language needed: ‘validly concludes’ or ‘X is consistent with Y’ are not expressible generally in ad hoc implementations

  12. Ontology Pragmatic Dimension: (3) Descriptive vs. Prescriptive • Descriptive: • Does the content describe, i.e., characterize the entities and their relationships as a user or an expert might characterize those objects? • Descriptive often takes a looser, less strict notion of characterization, perhaps allowing arbitrary objects into the model, which might not exist in the real world but which are significant conceptual items for the given user community • Potential partial synonym: multiplicative, i.e., concepts can include anything that reality seems to require or any distinction that is useful to make • Prescriptive: • Does the content prescribe, i.e., mandate the way that those entities and their relationships are characterized? • Prescriptive often takes a stricter notion of characterization, stating that only objects which actually exist or that represent natural kinds or types of things in the real world should be represented in the content of the engineering model • Potential partial synonym: reductionist,i.e., concepts are reduced to the fewest primitives from which it is possible to generate complex reality

  13. Ontology Pragmatic Dimension: (4) Design Methodology • Methodology employed in the construction of the ontology? • Bottom-up: (sometimes called “empirical”) methodology places strong emphasis on: • Either solely analyzing the data sources so that the resulting ontology covers their semantics • Emergent: or enabling arbitrary persons to characterize their content as they personally see fit, using terminology or metadata and whatever structuring relations (or not) that they desire to use, with perhaps an auxiliary notion or assumption that in by doing so, patterns of characterizations may emerge or be preferred by a large group or community of persons • Can be conceptually profligate, tolerable of redundancy and partial overlap • Top-down: (sometimes called “rationalist”) methodology places strong emphasis on: • Developing the ontology using known notions about the world or domain • Independent of existing data sources whose semantics will be covered by the resulting ontology • Considering a range of questions that a domain expert might want to ask about the domain • Typically preferring a rigorous methodology focused on consistency, parsimony, etc.

  14. Ontology Pragmatic Dimension: (5) Governance • How decisions concerning the structure and especially the content of an ontology are made • Ontologies with legal or regulatory implications will need to defer to existing legal, regulatory, and professional organizations concerning the natural language definitions of concepts and semantic relationships • It is necessary to record the provenance of every definition, etc. incorporated into an ontology

  15. Conclusions • We wanted to avoid positing definition, allow definition to emerge: AND IT EMERGED • The framework is intended to offer dimensions for comparison • How our “ontologies” are alike, how they are different but comparable • Goal: Converge!

More Related