120 likes | 143 Views
Discourse Connectives and Their Argument Structure: Annotating a discourse treebank ARAVIND K. JOSHI Department of Computer and Information Science August 22 2003. Outline. Introduction Some properties of the discourse connectives
E N D
Discourse Connectives and Their Argument Structure: Annotating a discourse treebank ARAVIND K. JOSHI Department of Computer and Information Science August 22 2003
Outline • Introduction • Some properties of the discourse connectives • Some example annotations (preliminary) with comments
Introduction • Extending the notion of lexical anchors (such as verbs) and their arguments beyond sentences into discourse • Discourse connectives such as -- and, or, but, because, since, while, when, however, instead, although, also, for example, then, so that, insofar as, nonetheless, … , Empty Connectives, -- they take clauses as their arguments and express relations between clauses, i.e,, relations between propositions, events, situations, … associated with the clauses • Towards computing a class of inferences associated with discourse connectives, hence relevant to complex NLP tasks • Towards discourse structure - discourse understanding
Project: • Annotate discourse connectives and their argument structure for the Penn Treebank corpus • Discourse Lexicalized TAG parser (DLTAG) People: Eleni Miltsakaki, Rashmi Prasad, Annotators, Aravind Joshi Collaborator: Bonnie Webber (Edinburgh University) Consultants: Mitch Marcus, Martha Palmer, Ellen Prince, Fernando Pereira
Some properties of discourse connectives • Discourse connectives have argument structure (analogous to verbs and their argument structure) • However, there are crucial differences • arity of connectives is fixed, they are binary (some apparent exceptions) • One argument is in the same sentence in which the connective appears. The other argument may or may not be in the same sentence. It can be in the preceding or following discourse • one of the arguments can be anaphoric • Very little is known about the semantics of discourse connectives
Some properties of discourse connectives • Detailed annotation of the argument structure for a large corpus is providing new insights into the semantics of connectives • No known abstract semantic categories such as agent, patient, theme, etc. for discourse connectives -- New opportunities • At present arguments are labeled by noncommittal labels Cc for the clause containing the connective Cc’ for the clause not containing the connective • Example of semantics:John flunked the exam although he studied hardCc’ although Cc( Cc normally entails ~ Cc’ ) & Cc’
Subordinate: because [The federal government suspended sales of U.S. savings Bonds] because [Congress hasn’t lifted the ceiling on government debt.] Adverbial: however [Both Newsweek and U.S. News have been gaining circulation in recent years without heavy use of electronicgiveaways to subscribers, such as telephone or watches.] However, [none of the big three weeklies recordedcirculation gains recently.] • Both arguments are in the same sentence • The two arguments in different sentences
Adverbial: for example [The computers were crude by today’s standards.] [Apple II owners, for example, had to use their television|sets as screens and stored data on audiocassetts.] [The computers were crude by today’s standards.] [Apple II owners, for example, had to use their televisionsets as screens and stored data on audiocassetts.] • An argument can be a discontiguous string • Problems with aligning arguments with Penn Treebank constituents
Adverbial: instead [No price for the new shares has been set.]Instead, [the companies will leave it up to the marketplaceto decide.] • “No” is not a part of the left argument • Left argument must indicate the unselected alternative and the right argument indicates the selected alternative • Negation is the licensing context for the left argument * [Price for the new shares has been set.]Instead, [the companies will leave it up to the marketplaceto decide.] • Modalities, non-factivity are other licensing contextsJohn wanted [to go to New York.] Instead, [he went to Washington.]
Adverbial: still [Some senior advisors argue that with further fights overa capital-gains tax cut and a budget-reduction bill Mr.Bush already has enough pending confrontations withcongress. They prefer to put off the line-item veto untilat least next year.] Still, [Mr. Bush and some other aides are strongly drawnto the idea of trying out a line-item veto.] • The left argument has two sentences
Adverbial: also [On the Big Board, Crawford & Co., Atlanta, (CFD)begins trading today.] Crawford evaluates health careplans, manages medical and disability aspects of worker’scompensation injuries and is involved in claims adjustments for insurance companies. Also, [beginning trading today on the Big Board are ElPaso Refinery Limited Partnership, El Paso, Texas, (ELP)and Franklin Multi-Income Trust, San Mateo, Calif., (FMI).] • The sentence (in blue) after the left argument of “also” can be regarded as a kind of adjunct of the left argument • Discourse connectives have a fixed arity (2) and no adjuncts
Empty connective: EMPTY [El Paso owns and operates a petroleum refinery.] EMPTY= whereas [Franklin is a closed-end managementinvestment company.] • “whereas” is the connective that one annotator thought best described the relation expressed by the empty connective • Analogous to the empty relation in a noun-noun compound at the sentence level