Inference Tasks and Computational Semantics

Inference Tasks and Computational Semantics

Key Concepts • Inference tasks • Syntactic versus semantic approach to logic • Soundness & completeness • Decidability and undecidability • Technologies: • Theorem proving versus model building

QUERYING • Definition: • Given: Model M and formula P • Does M satisfy P? • P is not necessarily a sentence, so have to handle assignments to free variables. • Computability: yes if models are finite

Consistency Checking • Definition: Given a formula P, is P consistent? • Idea: consistent iff satisfiable in a model M, so task becomes discovering whether a model exists. • This is a search problem. • Computationally undecidable for arbitrary P.

Informativity Checking • Definition: given P, is P informative or uninformative? • Idea (which runs counter to logician's view) • informative = invalid • uninformative = valid (true in all possible models) • Informativity: is genuinely new information being conveyed? Useful concept from PoV of communication • Computability: validity worse than consistency checking since all models need to be checked for satisfiability.

Relations between Concepts • P is consistent (satisfiable) iff –P is informative (invalid) • P is inconsistent (unsatisfiable) iff –P uninformative (valid). • P is informative (invalid) if –P is consistent (satisfiable). • P is uninformative (valid) if –P is inconsistent (unsatisfiable).

Consistency within Discourse Mia smokes. Mia does not smoke. • Should be possible to detect the inconsistency in such discourses • To avoid detecting inconsistency in superficially similar discourses such as Mia smokes. Mia did not smoke

Consistency of Discourse w.r.t. Background Knowledge Discourse: Mia is a beautiful woman. Mia is a tree Background Knowledge: All women are human All trees are plants -Ex: human(x) and plant(x)

Consistency Checking for Resolving Scope Ambiguity Every boxer has a broken nose • Ax(boxer(x) -Ey(broken-nose(y) & has(x,y))) • Ey(broken-nose(y) & Ax(boxer(x) → has(x,y))) Second reading is inconsistent with world knowledge • What world knowledge? • How represented and used?

Informativity Checking • Make your contribution as informative as is required (for the current purposes of the exchange). H. P. Grice. Mia smokes. Mia smokes. Mia smokes • Is not informative • Informativity checking also wrt background knowledge

Informativity a `soft' signal Mia is married She has a husband • Superficially uninformative wrt background knowledge. • But nevertheless we can imagine contexts when such a discourse makes sense. • Technically uninformative utterances can be used to “make a point”

Consistency Checking Task(CCT) in FOL • Let Φbe the FOL semantic representation of the latest sentence in some ongoing discourse • Suppose that the relevant lexical knowledge L, world knowledge W, natural language metaphysical assumption M, and the information from the previous discourse D has been represented in FOL • CCT can be expressed: L U W U M U D |= ¬Φ

To put it another way… All-Our-Background-Stuff |= ¬Φ hence |= All-Our-Background-Stuff → ¬Φ (Deduction Theorem) Consequence: we can reduce CCT todeciding the validity of a single formula.

Informativity Checking Task(ICT) in FOL • Let Φbe the FOL semantic representation of the latest sentence in some ongoing discourse • Suppose that the relevant lexical knowledge L, world knowledge W, natural language metaphysical assumption M, and the information from the previous discourse D has been represented in FOL • ICT can be expressed: L U W U M U D |= Φ

To put it another way… All-Our-Background-Stuff |= Φ hence |= All-Our-Background-Stu → Φ (Deduction Theorem) Consequence: we can also reduce ICT todeciding the validity of a single formula.

Yes but … • This definition is semantic, i.e. given in terms of models. • This is very abstract, and • defined in terms of all models. • There are a lot of models, and most of them are very large. • So is it of any computational interest whosoever?

Proof Theory • Proof theory is the syntactic approach to logic. • It attempts to define collections of rules and/or axioms that enable us to generate new formulas from old • That is, it attempts to pin down the notion of inference syntactically. • P |- Q versus P |= Q

Examples of Proof Systems • Natural deduction • Hilbert-style system (often called axiomatic systems) • Sequent calculus • Tableaux systems • Resolution • Some systems (notably tableau and resolution) are particularly suitable for computational purposes.

Connecting Proof Theory toModel Theory • Nothing we have said so far makes any connection between the proof theoretic and the model theoretic ideas previously introduced. • We must insist on working with proof systems with two special properties • Soundness • Completeness.

Soundness • Proof Theoretic Q is provable in proof theoretic system|- Q. • Model Theoretic Q is valid in model theoretic system|= Q • A PT system is sound iff |- Q implies |= Q • Every theorem is valid

Remark on Soundness • Soundness is typically an easy property to prove. • Proofs typically have some kind of inductive structure. • One shows that if the first part of proof is true in a model then the rules only let us generate formulas that are also true in a model. • Proof follows by induction

Completeness • Proof Theoretic Q is provable in proof theoretic system|- Q. • Model Theoretic Q is valid in model theoretic system|= Q • A PT system is sound iff |= Q implies |- Q • Every valid formula is also a theorem

Remark on Completeness • Completeness is a much deeper property that soundness,and is a lot more difficult to prove. • It is typically proved by contraposition. We show that if some formula P is not provable then is not valid. • This is done by building a model for ¬P • The 1st completeness proof for a 1st-order proof system was given by Kurt Godel in his 1930 PhD thesis.

Sound and Complete Systems • So if a proof system is both sound and complete (which is what we want) we have that: |=Φ if and only if |-Φ • That is, syntactic provability and semantic validity coincide. • Sound and complete proof system, really capture the our semantic reality. • Working with such systems is not just playing with symbols.

Blackburn’s Proposal • Deciding validity (in 1st-order logic) is undecidable, i.e. no algorithm exists for solving 1st-order validity. • Implementing our proof methods for 1st-order logic (that is, writing a theorem prover only gives us a semi-decision procedure. • If a formulas is valid, the prover will be able to prove it, but if is not valid, the prover may never halt! • Proposal • Implement theorem provers, • but also implement a partial converse tool: model builders.

Computational Tools • Theorem prover: A tool that, when given a 1st-order formula Φattempts to prove it. • If Φ is in fact provable a (sound and complete) 1st-order prover can (in principle) prove it. • Model builder: a tool that, when given a 1st-order formula Φ, attempts to build a model for it. • It cannot (even in principle) always succeed in this task, but it can be very useful.

Theorem Provers and Model Checkers • Theorem provers: a mature technology which provides a negative check on consistency and informativity • Theorem provers can tell us when something is not consistent, or not informative. • Model builders: a newer technology which provides a (partial) positive check on consistency and informativity • That is, model builders can tell us when something is consistent or informative.

A Possible System Let B be all our background knowledge, and Φthe representation of the latest sentence: • Partial positive test for consistency: give MB B & Φ • Partial positive test for informativity: give MB B & ¬Φ • Negative test for consistency: give TP B → Φ • Negative test for informativity: give TP B → ¬Φ • And do this in parallel using the best available software!

Clever Use of Reasoning Tools(CURT) • Baby Curt No inference capabilities • Rugrat Curt: negative consistency checks (naive prover) • Clever Curt: negative consistency checks (sophisticated prover) • Sensitive Curt: negative and positive informativity checks • Scrupulous Curt: eliminating superfluuous readings • Knowledgeable Curt: adding background knowledge • Helpful Curt: question answering

Baby Curt computes semantic representations Curt: 'Want to tell me something?' > every boxer loves a woman Curt: 'OK.' > readings 1 forall A (boxer(A) > exists B (woman(B) & love(A, B))) 2 exists A (woman(A) & forall B (boxer(B) > love(B, A)))

Baby Curt accumulates information > mia walks Curt: 'OK.' > vincent dances Curt: 'OK.' > readings 1 (walk(mia) & dance(vincent))

But Baby Curt is stupid > mia walks Curt: 'OK.' > mia does not walk Curt: 'OK.' > ?- readings 1 (walk(mia) & - walk(mia))

Add Inference Component • Key idea: use sophisticated theorem provers and model builders in parallel. • The theorem prover provides negative check for consistency and informativity. • The model builder provides positive check for consistency and informativity. • The 1st to find a result, reports back, and stops the other

Example > Vincent is a man Message (consistency checking): mace found a result. Curt: OK. > ?- models 1 model([d1], [f(1, man, [d1]), f(0, vincent, d1)])

Example continued > Mia likes every man. Message (consistency checking): mace found a result. Curt: OK. > Mia does not like Vincent. Message (consistency checking): bliksem found a result. Curt: No! I do not believe that!

Example 2 > ?- every car has a radio Message (consistency checking): mace found a result. Message (consistency checking): bliksem found a result. Curt: 'OK.' > ?- readings 1 forall A (car(A) > exists B (radio(B) & have(A, B)))

Issues • Is a logic-based approach to feasible? How far can it be pushed? • Is 1st-order logic essential? • Are there other interesting inference tasks? • Is any of this relevant to current trends in computational linguistics, where shallow processing and statistical approaches rule? • Are there applications?

Inference Tasks and Computational Semantics

Inference Tasks and Computational Semantics

Presentation Transcript

Computational Lexical Semantics

Semantics and discourse

Computational Semantics

Computational Semantics

Computational Semantics coli.uni-sb.de/cl/projects/milca/esslli

QuASI: Question Answering using Statistics, Semantics, and Inference

Computational Approaches to Haplotype Inference

Don’t Take Shortcuts! Computational Lexical Semantics and the Turing Test

First Order Logic (Syntax, Semantics and Inference)

Computational Semantics Chapter 18

An Applied Ontological Approach to Computational Semantics

Issues in Computational Linguistics: Semantics

QuASI: Question Answering using Statistics, Semantics, and Inference

Semantics and Inference Part II Johan Bos

QuASI: Question Answering using Statistics, Semantics, and Inference

Computational Inference STAT 440 / 840 CM 461

Computational Semantics

Managing Parallel Computational Tasks in a Grid Environment