Generalized Inference with Multiple Semantic Role Labeling Systems

Generalized Inference withMultiple Semantic Role Labeling Systems Peter Koomen, Vasin Punyakanok, Dan Roth, (Scott) Wen-tau Yih Department of Computer Science University of Illinois at Urbana-Champaign

Outline • System Architecture • Pruning • Argument Identification • Argument Classification • Inference [main difference from other systems] • Inference with Multiple Systems • The same approach used by the SRL to assure a coherent output is used with input produced by multiple systems.

System Architecture • Identify argument candidates • Pruning • Argument Identifier • Binary classification • Classify argument candidates • Argument Classifier • Multi-class classification • Inference • Use the estimated probability distribution given by the argument classifier, and • Expressive structural and linguistic constraints. • Infer the optimal global output – modeled as a constrained optimization problem

Pruning [Xue&Palmer 2004] • Significant errors due to PP attachment • Consider PP as attached to both NP and VP

Modified Pruning

Argument Identification • Argument identifier is trained with a phrase-based classifier. • Learning Algorithm – SNoW • A sparse network of linear classifiers • Weight update: a regularized variation of the Winnow multiplicative update rule • When probability estimation is needed, we use softmax

Argument Identification (Features) • Parse tree structure from Collins & Charniak’s parsers • Clauses, chunks and POS tags are from UPC processors

Argument Classification • Similar to argument identification, using SNoW as a multi-class classifier • Classes also include NULL

Inference • Occasionally, the output of the argument classifier violates some constraints. • The inference procedure [Punyakanok et al., 2004] • Input: the probability estimation (by the argument classifier), and structural and linguistic constraints • Output: the best legitimate global predictions • Formulated as an optimization problem and solved via Integer Linear Programming. • Allows incorporating expressive (non-sequential) constraints on the variables (the arguments types).

Integer Linear Programming Inference • For each argument ai • Set up a Boolean variable: ai,tindicating if ai is classified as t • Goal is to maximize • i score(ai = t ) ai,t • Subject to the (linear) constraints • Any Boolean constraints can be encoded this way. • If score(ai = t ) = P(ai = t ), the objective is find the assignment that maximizes the expected number of arguments that are correct and satisfies the constraints

Constraints • No overlapping or embedding arguments ai,aj overlap or embed: ai,NULL +aj,NULL  1

Constraints • Constraints • No overlapping or embedding arguments • No duplicate argument classes for A0-A5 • Exactly one V argument per predicate • If there is a C-V, there must be V-A1-C-V pattern • If there is an R-arg, there must be arg somewhere • If there is a C-arg, there must be arg somewhere before • Each predicate can take only core arguments that appear in its frame file. • More specifically, we check for only the minimum and maximum ids

Results

Inference with Multiple Systems • The performance of SRL heavily depends on the very first stage – pruning [IJCAI 2005] • which is derived directly from the full parse trees • Joint Inference allows improvement over semantic role labeling classifiers • Combine different SRL systems through joint inference • Systems are derived using different full parse trees

Inference with Multiple Systems • Multiple Systems • Train and test with Collins’ parse outputs • Train with Charniak’ best parse outputs • Test with 5-best Charniak’ parse outputs

Naïve Joint Inference ..., traders say, unable to cool the selling panic in both stocks and futures. traders the selling panic in both stocks and futures a1 a1 a4 traders the selling panic in both stocks and futures b1 b2 b3

Joint Inference – Phantom Candidates a1 a1 a4 a2 a3 b1 b2 b3 b4 Default Priors

Results of Joint Inference

Results of Different Combination

Conclusion • The ILP inference can naturally be extended to reason over multiple SRL systems.

Thank You

Generalized Inference with Multiple Semantic Role Labeling Systems

Generalized Inference with Multiple Semantic Role Labeling Systems

Presentation Transcript

CS 388: Natural Language Processing: Semantic Role Labeling

Multiple Regression Analysis: Inference

Causal Inference with Multiple Treatments

SEMANTIC ROLE LABELING BY TAGGING SYNTACTIC CHUNKS

Semantic Role Labeling

Semantic Role Labeling of Implicit Arguments for Nominal Predicates

Starting from Scratch in Semantic Role Labeling

Semantic Role Labeling

Automatic Semantic Role Labeling

A Memory-Based Approach to Semantic Role Labeling

Two-Phase Semantic Role Labeling based on Support Vector Machines

SRL via Generalized Inference

Class-based nominal semantic role labeling: a preliminary investigation

DEPENDENCY PARSING ， Framenet , SEMANTIC ROLE LABELING, SEMANTIC PARSING

Semantic Role Labeling with support vector machines

Forest-based Semantic Role Labeling

Automatic Labeling of Semantic Roles

CS 388: Natural Language Processing: Semantic Role Labeling

Semantic Role Labeling on Nouns

Robust Semantic Role Labeling for Nominals