360 likes | 377 Views
Defining Vocabularies, Ontological and Linguistic: A Tool for Ontologizing the Ontolog. Patrick Cassidy MITRE Corporation* Presented to the Ontolog Forum July 13, 2006
E N D
Defining Vocabularies,Ontological and Linguistic:A Tool for Ontologizing the Ontolog Patrick Cassidy MITRE Corporation* Presented to the Ontolog Forum July 13, 2006 * NOTE: The author’s affiliation with The MITRE Corporation is provided for identification purposes only, and is not intended to convey or imply MITRE’s concurrence with, or support for, the positions, opinions or viewpoints expressed by the author.
Outline • A Common Upper Ontology is a Good Thing to have. The minimum upper ontology will represent the set of basic concepts sufficient to specify the meanings of all other more specialized concepts. • A Linguistic Defining Vocabulary that parallels the Ontological Defining Vocabulary will make the Upper Ontology a lot easier to build, understand, and exploit. • To help in ontologizing the Ontolog, we can use the Linguistic Defining Vocabulary right now.
Problem Solving with Computers:Single Applications Persistent Stored Data Short-Term Task-specific Memory Procedural Program Specification: Solve Current Problem. Report results Interface
Performance = k / Integration_Flexibility Modal Policies Internet Semantic Mappings Semantic Brokers OWL-S Agent Programming RDF/S, OWL Peer-to-peer Web Services: UDDI, WSDL Web Services: SOAP Community Applets XML, XML Schema Data Application N-Tier Architecture EAI Workflow Ontologies Same Intranet Conceptual Models Enterprise Middleware Web Data Marts Same Wide Area Network Client-Server Data Warehouses Same Local Area Network Federated DBs Distributed Systems OOP Systems of Systems Same DBMS Same OS Same Address Space Same CPU Linking From Synchronous Interaction to Asynchronous Communication Same Programming Language Compiling Same Process Space 1 System: Small Set of Developers Tightness of Coupling & Semantic Explicitness Explicit, Loose Far Semantics Explicitness Local Looseness of Coupling Implicit, TIGHT Source: Leo Obrst
Problem Solving with Computers:Application Suite Stored Data 1 Short-Term Task-specific Memory Common Operating System for Stored data Access Stored Data 2 Message Format and Protocol App 1 App 2 App 3 Interface
Where Does The Upper Ontology Fit? Long-Term Knowledge Base: Ontology uses Upper Ontology for Concept Specifications Short-Term Memory: Ontology uses Upper Ontology Task Control: Select Processes To Solve Current Problem. Report results Upper Ontology: Provides defining concepts to specify conceptual message Content NLP Understanding and generation Case-Based Reasoning Robotics Expert Systems Informtn. Retrieval Probabilistic Reasoning Spatial Reasoning Interfaces
Where Does The Upper Ontology Fit? Overlord: Situation Awareness Goal-Directed Action Selection EGO: Self Awareness Knowledge Base: Ontology uses Upper Ontology for Concept Specifications Task Control: Select Processes To Solve Current Problem. Report results Episodic Memory Upper Ontology: Provides defining concepts to specify conceptual message Content NLP Understanding and generation Case-Based Reasoning Robotics Expert Systems Informtn. Retrieval Probabilistic Reasoning Spatial Reasoning Interfaces
Agents: Expectations & Norms OL Is Goal Possible Within Time Constraints? Yes. Request OL Overlord action No. Report Impediments Current Goal • Rules: • Laws • Community • (3) User • (4) Supervisor/Owner • (5) Self-generated • (6) Cultural Expectations Goal Stack OL Add/Modify/Delete Goals • Commitment: • Generate or Modify • Reference • Explain when asked OL
Where Does The Upper Ontology Fit? Long-Term Knowledge Base: Ontology uses Upper Ontology for Concept Specifications Short-Term Memory: Ontology uses Upper Ontology Task Control: Select Processes To Solve Current Problem. Report results Upper Ontology: Provides defining concepts to specify conceptual message Content NLP Understanding and generation Case-Based Reasoning Robotics Expert Systems Learning Probabilistic Reasoning Spatial Reasoning Interfaces
Is a Common Language EnoughFor Module Integration? Some doubt has been expressed: “Merely encapsulating this machinery into modules that pass messages among each other does not allow for the internal operation of one algorithm to be influenced by another” • Nicholas Cassamatis, A Cognitive Substrate for Achieving Human-Level Intelligence, AI Magazine, Summer 2006, pp. 45-56. However, the use of the Upper Ontology not only permits effective message passing, but fuses all components into a unified knowledge base. The ontology has not only a message format, but logical inference which may be used by multiple modules.
Modules May Have Tight Coupling Long-Term Knowledge Base: Ontology uses Upper Ontology for Concept Specifications Short-Term Memory: Ontology uses Upper Ontology Task Control: Select Processes To Solve Current Problem. Report results Upper Ontology: Provides defining concepts to specify conceptual message Content NLP Understanding and generation Case-Based Reasoning Robotics Expert Systems Learning Probabilistic Reasoning Spatial Reasoning Interfaces
Where Does Upper Ontology Fitin Natural Language Understanding? Knowledge Base: Ontology uses Upper Ontology for Concept Specifications NLP Understanding and generation TC Upper Ontology: Provides defining concepts to specify conceptual message Content Parsing Disambiguation Word Experts Learning Entity Extraction Metaphoric Reasoning Interfaces
What About Ontology Mapping Rather Than a Common UO? • OK if: • Modest accuracy (10-80%) is acceptable • Very shallow reasoning (e.g. taxonomy only) is to be used
First Order Logic? • In a multimodule message/blackboard system, FOL is only one of potentially many reasoning mechanisms. • Decidability and inference efficiency are not rate-limiting unless FOL alone is to be used on the full knowledge base.
So, What’s the Problemwith existing Upper Ontologies? • It is time-consuming to learn how to use them effectively. • If an interface that uses language people already know can be developed, it will make upper ontologies easier to exploit. • Human language is easy for people to use. • A defining vocabulary can serve as the intermediate phase to improve usability.
Outline • A Common Upper Ontology is a Good Thing to have • A Linguistic Defining Vocabulary that parallels the Ontological Defining Vocabulary will make the Upper Ontology a lot easier to build, understand, and exploit. • To help in ontologizing the Ontolog, we can use the Linguistic Defining Vocabulary right now.
What is a “Defining Vocabulary”? • For lexicographers, a controlled list of words which are the only words allowed to be used in creating definitions (e.g. in LDOCE). • makes definitions easier to understand, especially for learners of a language • For Ontologists, the set of basic concepts (and their ontological representations) which are sufficient to specify the meanings of any other concepts (or terms) by combinations of the basic concepts – a special type of “Upper Ontology”.
How Big is the Defining Vocabulary? • Longman’s Dictionary of Contemporary English (LDOCE) uses about 2000 root words, some of which are used in more than one sense. With morphological variants, there are over 9000 words. • For the conceptual defining vocabulary, probably at least 4000 senses will be needed. • The vocabulary will probably grow over time.
Is There a Relation Between theLinguistic Defining Vocabulary and The Conceptual Defining Vocabulary (Upper Ontology)? • Hypothesis: yes, we should be able to use a linguistic controlled vocabulary like that of LDOCE and have definitions in that vocabulary translate directly and automatically into logical specifications using the conceptual inventory of the Upper Ontology.
How Does this Differ from other Ontology Projects? • By emphasizing the primary importance of developing the defining vocabulary – ontological and linguistic – and creating relations between them, before attempting representation of complex domain-specific concepts. • The automatic conversion of linguistic to logical specifications is an essential element.
So, This has no Immediate Real-World Application? • Right! We do not already know the minimum set of conceptual components for representing everything; which basic concepts are required needs to be discovered by using a common upper ontology in generating other concepts.
Is This Just Basic Research? • NO! The need for information communication is immediate, and clear descriptions of information content can have immediate benefits. • There are no negative side-effects to creating clear and comprehensible descriptions of information content.
Specialists Will Want to Use Specialized Terms in Definitions • The “Controlled Defining Vocabulary” is infinitely expandable. • Probably, at least three levels will emerge: • the basic irreducible defining vocabulary • the general defining vocabulary, having terms which are defined by use of the basic vocabulary • specialized defining vocabularies, containing terms of interest to specific domains
How Will We Know When We Have Succeeded in Building The Linguistic Conceptual Defining Vocabulary? When almost all new terms can be defined using intuitive linguistic phrases, and the words are already in the defining vocabulary.
How Will We Know When We Have Succeeded in Building A Correct Mapping of Linguistic and Conceptual Defining Vocabularies? • When people can enter and retrieve information of a basic nature using intuitive linguistic phrases. • The most honest measure of correct representation is a correct answer to a straightforward question.
Example Definitions from Longman’s • Raspberry • a soft sweet red berry, or the bush that this berry grows on • Obligation • a moral or legal duty to do something • Automobile -- a car • Car • a vehicle with four wheels and an engine, that can carry a small number of passengers
Language Logic • Duty -- Longman: something that you have to do because it is morally or legally right • More specific: • An action that an intelligent agent must perform or refrain from; the failure to perform or refrain from that action carries some undesirable consequence for that agent; the undesirable consequence may be enforced by the authority that assigned the duty.
KIF “Duty” (=> (hasDuty ?AGENT ?DUTY)) (and (instance ?DUTY ActionOrInaction) (exists (?AUTHORITY ?CONSEQUENCE) (and (hasAttribute ?CONSEQUENCE (UndesirableFor ?AGENT)) (imposed ?AUTHORITY ?DUTY) (=> (not (performed ?AGENT ?DUTY)) (hasLiability ?AGENT (enforces ?AUTHORITY ?CONSEQUENCE)))))))
Spatial Metaphors • Orientation: Up, Down, Front, Back, Side • Proximity: in Space, In time • Basic shapes: compact, filament, sheet, multi-armed, network, hollow (container) • Motion: origin, goal, path • Edge: limit, obstacle, support • Contact: Force, gravity, causing motion • Body shape: Head, Foot, Arm • Parts and structural relations
What About Specialized Applications That Don’t Need a High-Level Ontology? • They can interoperate with other applications if they map the concepts they do use to the precisely defined concepts in the upper ontology. • This principle can be applied transitively, through multiple levels of expressiveness.
Definition Acceptance Hierarchy Executable Specification: Methods, Sequence, States is used in OpenCycSUMO DOLCE Axiomatic Ontology: Quasi-2nd Order, Function Terms accepts Restricted FOL: OWL accepts Taxonomy/Thesaurus/Terminology
What Can We Do Now? • Begin immediately to define terms in Knowledge Organization Systems (ontologies, taxonomies, glossaries, etc.) using the basic English defining vocabulary. • Add terms to the supplemental defining vocabulary as needed, with their definitions created from the basic terms • Add terms to the community domain vocabularies, with their definitions created from the basic or supplemental defining vocabulary.
Where is the Defining Vocabulary? • A version that runs in Windows XP is on the ONTACWG web site, along with a Java utility to check definitions against the controlled vocabulary. http://colab.cim3.net/file/work/SICoP/ontac/reference/ControlledVocabulary/CheckCV.ZIP Unzip in a separate directory and run the .bat file to use the utility (Opening screen shot, next slide).
And if The English Logic Translation Project Lags? • We will have terminologies and knowledge classifications with well-defined terms, understandable to anyone with a basic knowledge of English. • In itself, this result will be worth the effort.