1 / 29

PDT: Tectogrammatical Representation

PDT: Tectogrammatical Representation. Jan Haji č Institute of Formal and Applied Linguistics School of Computer Science Faculty of Mathematics and Physics Charles University, Prague Czech Republic. Tectogrammatical Annotation (t-layer). Underlying (deep) syntax 4 sublayers ( integrated ):

santo
Download Presentation

PDT: Tectogrammatical Representation

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. PDT:Tectogrammatical Representation Jan Hajič Institute of Formal and Applied Linguistics School of Computer Science Faculty of Mathematics and Physics Charles University, Prague Czech Republic Companions Semantic Representation and Dialog Interfacing Workshop - Tectogrammatics

  2. Tectogrammatical Annotation (t-layer) • Underlying (deep) syntax • 4 sublayers (integrated): • dependency structure, (detailed) functors • valency annotation • topic/focus and deep word order • coreference (mostly grammatical only)‏ • all the rest (“grammatemes”): • detailed functors, underlying gender, number, ... • Total • 39 attributes (vs. 5 at m-layer, 2 at a-layer)‏ Companions Semantic Representation and Dialog Interfacing Workshop - Tectogrammatics

  3. Underlying verb + tense Deep function Elided Actor in Another ellipsis... Prepositions out Analytical vs. Tectogrammatical representation (TR: sublayer 1 only shown)‏ Companions Semantic Representation and Dialog Interfacing Workshop - Tectogrammatics

  4. Layer 3: Tectogrammatical • Underlying (deep) syntax • 4 sublayers: • dependency structure, (detailed) functors • topic/focus and deep word order • coreference (mostly grammatical only)‏ • all the rest (grammatemes): • detailed functors • underlying gender, number, ... Companions Semantic Representation and Dialog Interfacing Workshop - Tectogrammatics

  5. Tectogrammatical Functors • “Actants”: ACT, PAT, EFF, ADDR, ORIG • modify: verbs, nouns, adjectives • cannot repeat in a clause, usually obligatory • Free modifications (~ 50), semantically defined • can repeat; optional, sometimes obligatory • Ex.: LOC, DIR1, ...; TWHEN, TTILL,...; RSTR; BEN, ATT, ACMP, INTT, MANN; MAT, APP; ID, DPHR, ... • Special • Coordination, Rhematizers, Foreign phrases,... semantic syntactic Companions Semantic Representation and Dialog Interfacing Workshop - Tectogrammatics

  6. Tectogrammatical Example • Analytical verb form: • he would be allowed to be enrolled Collapsed Additional attributes (grammatemes): conditional + “allow” Companions Semantic Representation and Dialog Interfacing Workshop - Tectogrammatics

  7. Tectogrammatical Example • Predicate with copula (state)‏ • you were fired Companions Semantic Representation and Dialog Interfacing Workshop - Tectogrammatics

  8. Tectogrammatical Example • Passive construction (action)‏ • (The) book has been translated [by Mr. X] Disappeared Added Companions Semantic Representation and Dialog Interfacing Workshop - Tectogrammatics

  9. Tectogrammatical Example • Object • he gave Mary a book Obj goes into ACT, PAT, ADDR, EFF or ORIG based on governor’s valency frame Companions Semantic Representation and Dialog Interfacing Workshop - Tectogrammatics

  10. Tectogrammatical Example • Relative clause (embedded)‏ • the woman, who had a French accent, was very pretty Companions Semantic Representation and Dialog Interfacing Workshop - Tectogrammatics

  11. Tectogrammatical Example • Incomplete phrases • Peter works well, but Paul badly Added Companions Semantic Representation and Dialog Interfacing Workshop - Tectogrammatics

  12. Layer 3: Tectogrammatical • Underlying (deep) syntax • 4 sublayers: • dependency structure, (detailed) functors • topic/focus and deep word order • coreference (mostly grammatical only)‏ • all the rest (grammatemes): • detailed functors • underlying gender, number, ... Companions Semantic Representation and Dialog Interfacing Workshop - Tectogrammatics

  13. Deep Word Order, Topic/Focus • Example: • Baker bakes rolls. vs. BakerIC bakes rolls. Analytical dep. tree: Companions Semantic Representation and Dialog Interfacing Workshop - Tectogrammatics

  14. Deep Word OrderTopic/Focus • Deep word order: • from “old” information to the “new” one (left-to-right) at every level (head included)‏ • projectivity by definition (almost...)‏ • i.e., partial level-based order -> total d.w.o. • Topic/focus/contrastive topic • attribute of every node (t, f, c)‏ • restricted by d.w.o. and other constraints Companions Semantic Representation and Dialog Interfacing Workshop - Tectogrammatics

  15. Layer 3: Tectogrammatical • Underlying (deep) syntax • 4 sublayers: • dependency structure, (detailed) functors • topic/focus and deep word order • coreference (mostly grammatical only)‏ • all the rest (grammatemes): • detailed functors • underlying gender, number, ... Companions Semantic Representation and Dialog Interfacing Workshop - Tectogrammatics

  16. Coreference(intro only: see Silvie’s part)‏ • Grammatical (easy)‏ • relative clauses • which, who • Peter and Paul, who ... • control • infinitival constructions • John promised to go ... • reflexive pronouns • {him,her,thme}self(-ves)‏ • Mary saw herself in ... • promise • PRED • go • John • PAT • ACT • home • he • DIR3 • ACT Companions Semantic Representation and Dialog Interfacing Workshop - Tectogrammatics

  17. Coreference • Textual • Ex.: Peter moved to Iowa after he finished his PhD. Companions Semantic Representation and Dialog Interfacing Workshop - Tectogrammatics

  18. Layer 3: Tectogrammatical • Underlying (deep) syntax • 4 sublayers: • dependency structure, (detailed) functors • topic/focus and deep word order • coreference (mostly grammatical only)‏ • all the rest (grammatemes): • detailed functors • underlying gender, number, ... Companions Semantic Representation and Dialog Interfacing Workshop - Tectogrammatics

  19. “Grammatemes”‏ • Detailed functors (subfunctors)‏ • only for some functors: • TWHEN: before/after • LOC: next-to, behind, in-front-of, ... • also: ACMP, BEN, CPR, DIR1, DIR2, DIR3, EXT • Lexical (underlying)‏ • number (SG/PL), tense, modality, degree of comparison, ... • strictly only where necessary (agreement!)‏ Companions Semantic Representation and Dialog Interfacing Workshop - Tectogrammatics

  20. Tectogrammatical attributes I • node typing • complex, coap, qcomplex, root, atom, ... • functor, subfunctor • TWHEN: TWHEN.basic, TWHEN.before • is_member, is_generated, is_parenthesis, is_dsp_root, is_state, quot_type, ... • grammatemes (16): • aspect, degcmp, deontmod, sempos, tense, indeftype, politeness, person, ... Companions Semantic Representation and Dialog Interfacing Workshop - Tectogrammatics

  21. Tectogrammatical attributes II • topic/focus: • tfa, deepord • valency: t_lemma, val_frame.rf • bookkeeping: id • coref_gram.rf, coref_text.rf, compl.rf • reference to TR node, type of coreference • sentmod • Linking to analytical layer • a.lex.rf (“main” anal. node), a.aux.rf (others) Companions Semantic Representation and Dialog Interfacing Workshop - Tectogrammatics

  22. Fully Annotated Sentence He spends his days sketching passers-by, or trying to. Companions Semantic Representation and Dialog Interfacing Workshop - Tectogrammatics

  23. Definition of Valency • Ability (“desire”) of words (verbs, nouns, adjectives) to combine themselves with other units of meaning • Properties of valency: • Specific for every word meaning (in general) • leave: sb left sth for sbvs. sb left from somewhere • same as in PropBank leave.02 vs. leave.01 • Typically strongly correlates with surface form • morphological case (~ ending), preposition+case, ... • Semantic constraints are very dangerous Companions Semantic Representation and Dialog Interfacing Workshop - Tectogrammatics

  24. vyměnit (to replace) vyměnit1 ACT PAT EFF Nom. Acc. za+Acc. vyměnit2 ... Structure of Valency • word (lemma) • word sense group 1 • valency frame: • slot1 slot2 slot3 • surface expression • word sense group 2 • ... • PDT VALLEX (Cz), EngVallex (En) Companions Semantic Representation and Dialog Interfacing Workshop - Tectogrammatics

  25. PDT-VALLEX Entry • dosáhnout: “to reach”, “to get [sb to do sth]” • browser/user-formatted example: Companions Semantic Representation and Dialog Interfacing Workshop - Tectogrammatics

  26. Sentence 15345: Sentence 51042: Sentence 2035: Corpus <-> Valency Lexicon • Corpus: ENTRY: uzavřít vf1: ACT(.1) CPHR({smlouva}.4) ex: u. dohodu (close a contract) vf2: ACT(.1) PAT(.4) ex.: u. pokoj (close a room, house) • Lexicon: Companions Semantic Representation and Dialog Interfacing Workshop - Tectogrammatics

  27. Valency & Form: Constraints • Tree structure: • (Sets of) Constraints: • n1: lemma=uvažovat mode=active • n2: case=Nom afun=Sb • n3: lemma=o afun=AuxP • n4: case=Loc afun=Obj n1 n2 n3 n4 Companions Semantic Representation and Dialog Interfacing Workshop - Tectogrammatics

  28. Example: Valency & Form • 1:2 • relative clause lemma=say mode=active afun=AuxC lemma=that to_say: ACT EFF afun=Sb case=Nom afun=Obj POS=verb • linear representation: EFF(that[.v]) Companions Semantic Representation and Dialog Interfacing Workshop - Tectogrammatics

  29. Valency and Text Generation • Using valency for... • ...getting the correct (lemma, tag) of verb arguments • Example: VALLEX entry: starat (se) ACT(.1) PAT(o.[.4]) starat V.............. starat_se PRED “to take care of” o ............... Martin ....1.......... se ............... Martin ACT tygr PAT “tiger” “Martin takes care of tigers.” tygr ....4.......... Martin se stará o tygry. Companions Semantic Representation and Dialog Interfacing Workshop - Tectogrammatics

More Related