1 / 67

Computational Paninian Grammar for Dependency Parsing

Outline. BackgrondPaninian Grammar :The Basic FrameworkSome Example CasesConclusion. Background. Indian languagesRich morphology Relatively flexible word order For example, 1. a) baccaa phala khaataa hai child' fruit' eat hab' pres' b) phala baccaa k

karli
Download Presentation

Computational Paninian Grammar for Dependency Parsing

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


    1. Computational Paninian Grammar for Dependency Parsing Dipti Misra Sharma LTRC, IIIT, Hyderabad NLP Winter School 25-12-2008

    2. Outline Backgrond Paninian Grammar :The Basic Framework Some Example Cases Conclusion

    3. Background Indian languages Rich morphology Relatively flexible word order For example, 1. a) baccaa phala khaataa hai child fruit eat+hab pres b) phala baccaa khaataa hai c) phala khaataa hai baccaa d) baccaa khaataa hai phala

    4. Basic Structure in PS

    5. PS for 1(b)?

    6. Problems Complex tree In what ways subject (baccaa) is different from object (phala) ? Agreement does not hold Position does not hold

    7. How to Draw PSs for 1 (c-d) ? 1 c) baccaa khaata hai phala 'child' 'eat+hab' 'pres' 'fruit' 1 d) phala khaata hai baccaa 'fruit' 'eat+hab' 'pres' 'child' Simple and perfectly natural sentences - difficult to handle in Phrase Structure Dependency structures make it easy

    8. Dependency Structure

    9. Paninian Grammatical Formalism A dependency grammar based approach Motivation for following the Paninian approach Inspired by inflectionally rich language (Sanskrit)? Better suited for handling ILs Provides the level of syntactico-semantic interface for parsing Various linguistic phenomena handled seamlessly ( Refer Akshar Bharati et al Natural Language Parsing - a Paninian Perspective (1995) http://ltrc.iiit.net/showfile.php?filename=downloads/nlpbook/index.html)

    10. Panian Grammar Contd. The grammar facilitates realisation of the intended meaning as an 'expression' of what the speaker wants to communicate (vivaksha)?

    11. The Basic Framework Treats a sentence as a series of modifier-modified relations A sentence has a primary modified (generally a verb)? Provides a blueprint to identify these relations Syntactic cues help in identifying the relation types

    12. Levels of Representation (1) Semantic information Assignment of karakas (Th-roles) and of abstract tense (2) Morphosyntactic representation Morphological spellout rules (3) Abstract morphological representation Allomorphy and phonology (4) Phonological output form (From Kiparsky, Lectures in CIEFL, Hyderabad, pg2)?

    13. Some Concepts Speaker's intention (vivakshaa)? Root + Suffix (prakriti + pratyaya)? Expectancy (aakaankshaa)? Eligibility (yogyataa)? Proximity (sannidhi)? Karaka vibhakti

    14. Speakers Intention (vivakshaa)? Each sentence reflects speakers intention Various sub-actions come into focus Participants are assigned various relations accordingly key gets assigned karta, karana based on the kind of sub-action under focus Syntax reflects vivaksha

    15. Prakriti and Pratyaya (root and suffix)? The premise Every word is composed of two parts 1. Content part (root morpheme)? 2. Functional part (affix)? For languages such as English and Hindi the auxiliaries can be treated as the functional morphemes Morph analysers or Local word groupers can provide this information

    16. aakaankshaa (Expectation/Demand)? Every word has certain demands to be fulfilled. For Parsing, verb is the most critical element The demand frames (karaka frames) for the verbs list out their demands

    17. For Example, frame of Hindi verb 'khaa' Verb ? khaa Sense ? to eat Sense ID ??? Eg ? raam seb khaataa hai Ram ate an apple ---------------------------------------------------------------------------------- arc-label necessity vibhakti lextype reln ---------------------------------------------------------------------------------- k1 m 0 n c k2 m 0 n c ----------------------------------------------------------------------------------------- k1 ? karta; k2 ? karma; m ? mandatory; n ? noun; c ?child

    18. Yogyataa (Eligibility)? Selectional Restrictions For example, baccaa phala khaataa hai 'phala' (fruit) does not have the eligibility to become the 'karta' of the verb 'khaa' (eat)? Constraints based on yogyata require semantic knowledge for each lexical item This knowledge can be obtained from a lexical resource such as a 'WordNet'

    19. Sannidhi (Proximity)? The modifier and the modified tend to occur in close proximity in a sentence For example, 'rAma ne kelaa khaayaa, mohana ne duudha piyaa Ora Hari ne film dekhii' This Hindi example cotains three verbs - khAyA (ate), piyA (drank) and dekhI (saw)? Respective arguments of each of these verbs would tend to occur in close proximity to it

    20. Karaka and Vibhakti Two levels of analysis Syntactico-sematic relations : Direct participants of the action denoted by a verb (Karaka)? Other relations : purpose, genitive, reason etc Relation markers (Vibhaktis)?

    21. Semantics of the verb A verbal root denotes: The activity The result Locus of activity : karta Locus of result : karma

    22. karta - karma The boy opened the lock k1 karta k2 karma karta, karma sometimes correspond to agent/theme Not always

    23. Action bundle of sub-actions The boy opened the lock with the key The key opened the lock The lock opened Notion of vivaksha Realization of speakers intention in a sentence

    24. Sub-actions - Opening of lock

    25. Sub-actions - Opening of lock Action 1 The boy opened the lock with the key Action 2 The key opened the lock Action 3 The lock opened Each sentence reflects speakers intention

    26. Sub-actions - Opening of lock

    27. Basic karaka relations Only six karta subject/agent/doer karma object/patient karana instrument sampradaan beneficiary apaadaan source adhikarana location in place/time/other

    28. Basic karaka relations

    29. Basic karaka relations

    30. Basic karaka relations

    31. Other relations Other dependency relations Purpose, reason, direction etc Causatives, associatives, comparatives etc Genitive, adjective

    32. Vibhaktis : Markers for karaka Relations Relation markers (Vibhaktis)? raama ne caakuu se seba kaaTaa 'Ram 'erg' 'knife 'with' 'apple' 'cut' | | | karta(doer) karana(instrument) karma (theme)? raama ne mohana ke_liye seba kaaTaa Ram erg Mohan for apple cut Ram cut the apple for Mohan (purpose)? maiM mohana ke_saatha baazaara gayaa I Mohan with market went I went to the market with Mohan (associative)

    33. However No one-to-one correspondence between relations and relation markers

    34. Syntactic Cues Verbal inflections (Tense Aspect Modality (TAM))? Passive : verb agrees with the karma Some other cases raama ko jaanaa paDaa I+to go had to I had to go raama ko calanaa caahiye Ram to walk should I should leave

    35. Example Raama jaataa hai Ram go+hab pres Ram goes jaa karta raama

    36. Some Examples Relative Clause MWEs Change of state verbs Conjuncts Ellipsis

    37. Relative Clause A noun is modified by a clause with a relative pronoun as its co-referent Example meraa bhaaii jo dillii meM rahataa hai kala aa my brother who Delhi in live+hab pres tomorrow come rahaa hai prog pres My brother who lives in Delhi is coming tomorrow How to represent this ? Two possible representations

    38. Alternative 1 aa meraa bhaaii kala jo raha dillii

    39. Alternative 2 Aa meraa bhaaii kala coref raha jo dillii

    40. Other Relative-Corelative Constructions Adjective having a clausal modifier tuma aisaa sundara ghar banaao jaisaa unakaa hai you such beautiful house build such-that theirs is You build a house as beautiful as theirs banaao build k1 k2 tuma ghara adj sundara jjmod aisaa coref jo-vo-jjmod hai jaisaa unakaa jaisaa usakaa

    41. MWEs Conjunct Verbs ((raama ne)) ((bahuta dera)) ((ravi kii)) ((pratiikshaa kii))? 'rAma erg' 'very' 'late' 'ravi' of' 'wait did Ram waited for Ravi for a long time ((kaaryashaalaa ke liye)) ((biisa logoM kaa)) ((naamaaMkana kiyaa gayaa))? 'workshop 'for' 'twenty' 'people' of 'name registration' 'do+passive Twenty people were registered for the workshop

    42. Conjunct Verbs Conjunct verb prashna kiyaa below mohana ne ravi se prashna kiyaa 'Mohan' 'erg' 'Ravi' 'to' 'question' 'did' Mohan asked Ravi a question A conjunct verb can have partial modification mohana ne acchaa prashna kiyaa thaa 'Mohan' 'erg' 'good' 'question' 'do+perf' 'past The elements in a complex predicate can also be dis-continuous prashna to mohana ne kiyaa thaa 'question' 'part' 'Mohan' 'erg' 'do+perf' 'past'

    43. Conjunct Verbs However, Mohan ne ravi se acchaa prashna kiyaa prashna_kiyaa questioned k1 k2 ? mohan ne ravi se acchaa Mohan to Ravi good 'acchaa' is NOT a verb modifier, 'acchaa' modifies 'prashna' and not 'prashna kiyA', Solution ?

    44. Conjunct Verbs Solution Don't chunk a conjunct verb as a single verbal unit Thus, Mohan ne ravi se ((acchaa)) ((prashna kiyaa))_VG Revise to Mohan ne ravi se ((acchaa prashna))_NP ((kiyaa))_VG

    45. Conjunct Verbs Show 'part-of' relation between the noun and the verb Add a tag 'pof' to achieve the above Therefore, _kiyaa k1 k2 pof mohan ne ravi se prashna nmod acchaa

    46. DS for Discontinuous Elements

    47. MWEs Idioms ((kisaana kii)) ((patnii ko)) ((vaha ciDiyaa))? 'farmer' 'of' 'wife' 'to' 'that' 'bird' (( phuuTii aaMkha nahiiM suhaatii thii))? 'not appealed' The idiom (in bold) is functionally a verb.

    48. Idioms Two possible solutions phuuTii aazkha suhaa <fs tam=nahiiM+taa_thaa> not appealed k1 k2 patnii vaha ciDiyaa wife that bird r6 kisaana farmer Solution-1

    49. Idioms suhaa <fs tam=nahiiM+taa_thaa> not appealed k2 pof k1 vaha ciDiyaa phuuTii aazkha patnii that bird burst eye wife r6 kisaana farmer' Solution-2

    50. Change of State Verbs Change of state verbs such as raMganaa (colour) pose a problem such as, ((usane)) ((apanaa ghara)) ((piilaa)) ((raMgaa))? 'he/she' 'own' 'house' 'yellow' 'coloured' raMga colour k1 k2 ? usane ghara piilaa he/she house yellow Is 'piilaa' a complement of 'ghara' ? OR Is it the k2 of raMgaa ? If piilaa is the k2 of raMgaa then what is the relation of ghara with raMgaa ? Can they both be k2 ?

    51. Proposed Solution In Panini's framework, verbs denoting 'change of state' can have two 'karma' The object which is being changed The state after change Thus, raMga coloured k1 k2-1 k2-2 usane ghara piilaa he house yellow

    52. Conjuncts Need special treatment in a dependency representation (maiM baazaara gayaa)1 Ora (ve loga ghara para ruke)2 'I' 'market' 'went' 'and' 'those' people 'home' 'at 'stayed' I went to the market and those people stayed at home What is the head of a co-ordinate structure ? How to represent the equal status of 1 and 2 above ?

    53. Conjuncts Take Conjunct as the 'head' Label the relation as 'ccof' Ora and ccof ccof gayA went ruke stay k1 k2 k1 k7p mEM bAzAra loga ghara I market people home A subordinating conjunct will have a single child node

    54. Some Problem Cases Certain complex sentences pose problems For example : agara tuma aate to hama vahaaM jaate if you come then we there go Had you come, we would have gone there Counterfactual agara and to two connectives How to represent the dependencies ?

    55. Main Clause Subordinate Clause jaate go+? ? ? K1 k7p agara to hama vahaaM ccof aate k1 tuma This representation fails to capture the relation between agara-to

    56. Representation-Currently Followed to then ccof jaate go+? vmod k1 k7p agara hama vahaaM ccof we there aate 'come' k1 tuma 'you'

    57. Alternative Proposal agara-to pof pof agara to ccof ccof aate jaate k1 k1 k7p tuma hama vahaaM Treat agara-to as a complex conjunct

    58. Ellipsis How to show dependencies when the head is missing ? bacce baDe ho gaye hEM kisI kI bAta nahIM sunate The children have grown up, they don't listen to anyone No explicit conjunct !! Insert a NULL element to show the dependencies NULL_CCP ccof ccof bade_ho_gaye nahIM_sunate Insert a NULL node only if it is essential to represent the dependencies .

    60. Some English Examples English is : A configurational language Relatively fixed word order Relations are not realised in affixes Subject and object are positional Subject is sacrosanct

    61. Passive Rama ate a banana eat <fs tam=PAST> k1 k2 Rama banana A banana was eaten by Rama eat <fs tam=was_en> k2 k1 banana Rama Extend the notion of vibhakti to English subject, object positions

    62. Interrogatives Did Rama eat a banana ? A 'Yes-no' interrogative Structurally, Interrogative is realised through word order change Subject Auxiliary inversion No interrogative morpheme

    63. Interrogative Contd. Proposed solution: eat < fs stype=interrogative__yes-no> fragof k1 k2 Did Rama banana Position gives the cues for the constraints

    64. Interrogatives Contd. What did Rama eat ? Eat < fs stype=interrogative__wh> k2 fragof k1 What did Rama Question element 'what' and Auxiliary position provide the syntactic cues

    65. Control Verbs John persuaded Harry to leave persuade k1 k2 rt (?)? John Harry leave The object of persuade corefers to the 'missing' 'karta' of 'leave' John promised Harry to leave promise k1 k4 k2 John Harry leave The subject of promise corefers to the 'missing' 'karta' of 'leave'

    66. Verbs such as 'want' John wanted Harry to leave want k1 k2 John leave k1 Harry 'want' is a transitive verb and can take 'a clause' as its 'karma'

    67. Empty 'it' It is raining in Delhi rain <fs stype=expletive__it> k7p Delhi Possible representation Empty 'it' can be captured in the feature structure

    68. Conclusion Paninian Grammatical Formalism offers a depenency based approach for sentence parsing which suits better morphologically richer languages with relatively free word order such as Indian languages.

More Related