670 likes | 822 Views
Outline. BackgrondPaninian Grammar :The Basic FrameworkSome Example CasesConclusion. Background. Indian languagesRich morphology Relatively flexible word order For example, 1. a) baccaa phala khaataa hai child' fruit' eat hab' pres' b) phala baccaa k
E N D
1. Computational Paninian Grammar for Dependency Parsing
Dipti Misra Sharma
LTRC, IIIT,
Hyderabad
NLP Winter School
25-12-2008
2. Outline Backgrond
Paninian Grammar :The Basic Framework
Some Example Cases
Conclusion
3. Background Indian languages
Rich morphology
Relatively flexible word order
For example,
1. a) baccaa phala khaataa hai
child fruit eat+hab pres
b) phala baccaa khaataa hai
c) phala khaataa hai baccaa
d) baccaa khaataa hai phala
4. Basic Structure in PS
5. PS for 1(b)?
6. Problems Complex tree
In what ways subject (baccaa) is different from object (phala) ?
Agreement does not hold
Position does not hold
7. How to Draw PSs for 1 (c-d) ?
1 c) baccaa khaata hai phala
'child' 'eat+hab' 'pres' 'fruit'
1 d) phala khaata hai baccaa
'fruit' 'eat+hab' 'pres' 'child'
Simple and perfectly natural sentences - difficult to handle in Phrase Structure
Dependency structures make it easy
8. Dependency Structure
9. Paninian Grammatical Formalism
A dependency grammar based approach
Motivation for following the Paninian approach
Inspired by inflectionally rich language (Sanskrit)?
Better suited for handling ILs
Provides the level of syntactico-semantic interface for parsing
Various linguistic phenomena handled seamlessly
( Refer Akshar Bharati et al Natural Language Parsing - a Paninian Perspective (1995) http://ltrc.iiit.net/showfile.php?filename=downloads/nlpbook/index.html)
10. Panian Grammar Contd. The grammar facilitates realisation of the intended meaning as an 'expression' of what the speaker wants to communicate (vivaksha)?
11. The Basic Framework
Treats a sentence as a series of modifier-modified relations
A sentence has a primary modified (generally a verb)?
Provides a blueprint to identify these relations
Syntactic cues help in identifying the relation types
12. Levels of Representation (1) Semantic information
Assignment of karakas (Th-roles) and of abstract tense
(2) Morphosyntactic representation
Morphological spellout rules
(3) Abstract morphological
representation
Allomorphy and phonology
(4) Phonological output
form (From Kiparsky, Lectures in CIEFL, Hyderabad, pg2)?
13. Some Concepts Speaker's intention (vivakshaa)?
Root + Suffix (prakriti + pratyaya)?
Expectancy (aakaankshaa)?
Eligibility (yogyataa)?
Proximity (sannidhi)?
Karaka
vibhakti
14. Speakers Intention (vivakshaa)? Each sentence reflects speakers intention
Various sub-actions come into focus
Participants are assigned various relations accordingly
key gets assigned karta, karana based on the kind of sub-action under focus
Syntax reflects vivaksha
15. Prakriti and Pratyaya(root and suffix)? The premise
Every word is composed of two parts
1. Content part (root morpheme)?
2. Functional part (affix)?
For languages such as English and Hindi
the auxiliaries can be treated as the functional morphemes
Morph analysers or Local word groupers can provide this information
16. aakaankshaa(Expectation/Demand)? Every word has certain demands to be fulfilled. For Parsing, verb is the most critical element
The demand frames (karaka frames) for the verbs list out their demands
17. For Example, frame of Hindi verb 'khaa' Verb ? khaa
Sense ? to eat Sense ID ???
Eg ? raam seb khaataa hai
Ram ate an apple
----------------------------------------------------------------------------------
arc-label necessity vibhakti lextype reln
----------------------------------------------------------------------------------
k1 m 0 n c
k2 m 0 n c
-----------------------------------------------------------------------------------------
k1 ? karta; k2 ? karma; m ? mandatory; n ? noun; c ?child
18. Yogyataa(Eligibility)? Selectional Restrictions
For example,
baccaa phala khaataa hai
'phala' (fruit) does not have the eligibility to become the 'karta' of the verb 'khaa' (eat)?
Constraints based on yogyata require semantic knowledge for each lexical item
This knowledge can be obtained from a lexical resource such as a 'WordNet'
19. Sannidhi (Proximity)? The modifier and the modified tend to occur in close proximity in a sentence
For example,
'rAma ne kelaa khaayaa, mohana ne duudha
piyaa Ora Hari ne film dekhii'
This Hindi example cotains three verbs -
khAyA (ate), piyA (drank) and dekhI (saw)?
Respective arguments of each of these verbs would tend to occur in close proximity to it
20. Karaka and Vibhakti Two levels of analysis
Syntactico-sematic relations :
Direct participants of the action denoted by a verb (Karaka)?
Other relations : purpose, genitive, reason etc
Relation markers (Vibhaktis)?
21. Semantics of the verb A verbal root denotes:
The activity
The result
Locus of activity : karta
Locus of result : karma
22. karta - karma The boy opened the lock
k1 karta
k2 karma
karta, karma sometimes correspond to agent/theme
Not always
23. Action bundle of sub-actions
The boy opened the lock with the key
The key opened the lock
The lock opened
Notion of vivaksha
Realization of speakers intention in a sentence
24. Sub-actions - Opening of lock
25. Sub-actions - Opening of lock Action 1
The boy opened the lock with the key
Action 2
The key opened the lock
Action 3
The lock opened
Each sentence reflects speakers intention
26. Sub-actions - Opening of lock
27. Basic karaka relations Only six
karta subject/agent/doer
karma object/patient
karana instrument
sampradaan beneficiary
apaadaan source
adhikarana location in place/time/other
28. Basic karaka relations
29. Basic karaka relations
30. Basic karaka relations
31. Other relations Other dependency relations
Purpose, reason, direction etc
Causatives, associatives, comparatives etc
Genitive, adjective
32. Vibhaktis : Markers for karaka Relations Relation markers (Vibhaktis)?
raama ne caakuu se seba kaaTaa
'Ram 'erg' 'knife 'with' 'apple' 'cut'
| | |
karta(doer) karana(instrument) karma (theme)?
raama ne mohana ke_liye seba kaaTaa
Ram erg Mohan for apple cut
Ram cut the apple for Mohan (purpose)?
maiM mohana ke_saatha baazaara gayaa
I Mohan with market went
I went to the market with Mohan (associative)
33. However No one-to-one correspondence between relations and relation markers
34. Syntactic Cues Verbal inflections (Tense Aspect Modality (TAM))?
Passive : verb agrees with the karma
Some other cases
raama ko jaanaa paDaa
I+to go had to
I had to go
raama ko calanaa caahiye
Ram to walk should
I should leave
35. Example Raama jaataa hai
Ram go+hab pres
Ram goes
jaa
karta
raama
36. Some Examples Relative Clause
MWEs
Change of state verbs
Conjuncts
Ellipsis
37. Relative Clause A noun is modified by a clause with a relative pronoun as its co-referent
Example
meraa bhaaii jo dillii meM rahataa hai kala aa
my brother who Delhi in live+hab pres tomorrow come
rahaa hai
prog pres
My brother who lives in Delhi is coming tomorrow
How to represent this ?
Two possible representations
38. Alternative 1
aa
meraa bhaaii kala
jo
raha
dillii
39. Alternative 2
Aa
meraa bhaaii kala
coref
raha
jo dillii
40. Other Relative-Corelative Constructions Adjective having a clausal modifier
tuma aisaa sundara ghar banaao jaisaa unakaa hai
you such beautiful house build such-that theirs is
You build a house as beautiful as theirs
banaao build
k1 k2
tuma ghara
adj
sundara
jjmod
aisaa
coref jo-vo-jjmod
hai
jaisaa unakaa
jaisaa usakaa
41. MWEs Conjunct Verbs
((raama ne)) ((bahuta dera)) ((ravi kii)) ((pratiikshaa kii))?
'rAma erg' 'very' 'late' 'ravi' of' 'wait did
Ram waited for Ravi for a long time
((kaaryashaalaa ke liye)) ((biisa logoM kaa)) ((naamaaMkana kiyaa gayaa))?
'workshop 'for' 'twenty' 'people' of 'name registration' 'do+passive
Twenty people were registered for the workshop
42. Conjunct Verbs Conjunct verb prashna kiyaa below
mohana ne ravi se prashna kiyaa
'Mohan' 'erg' 'Ravi' 'to' 'question' 'did'
Mohan asked Ravi a question
A conjunct verb can have partial modification
mohana ne acchaa prashna kiyaa thaa
'Mohan' 'erg' 'good' 'question' 'do+perf' 'past
The elements in a complex predicate can also be dis-continuous
prashna to mohana ne kiyaa thaa
'question' 'part' 'Mohan' 'erg' 'do+perf' 'past'
43. Conjunct Verbs However,
Mohan ne ravi se acchaa prashna kiyaa
prashna_kiyaa questioned
k1 k2 ?
mohan ne ravi se acchaa
Mohan to Ravi good
'acchaa' is NOT a verb modifier,
'acchaa' modifies 'prashna' and not 'prashna kiyA',
Solution ?
44. Conjunct Verbs Solution
Don't chunk a conjunct verb as a single verbal unit
Thus,
Mohan ne ravi se ((acchaa)) ((prashna kiyaa))_VG
Revise to
Mohan ne ravi se ((acchaa prashna))_NP ((kiyaa))_VG
45. Conjunct Verbs Show 'part-of' relation between the noun and the verb
Add a tag 'pof' to achieve the above
Therefore,
_kiyaa
k1 k2 pof
mohan ne ravi se prashna
nmod
acchaa
46. DS for Discontinuous Elements
47. MWEs Idioms
((kisaana kii)) ((patnii ko)) ((vaha ciDiyaa))?
'farmer' 'of' 'wife' 'to' 'that' 'bird'
(( phuuTii aaMkha nahiiM suhaatii thii))?
'not appealed'
The idiom (in bold) is functionally a verb.
48. Idioms Two possible solutions
phuuTii aazkha suhaa <fs tam=nahiiM+taa_thaa> not appealed
k1 k2
patnii vaha ciDiyaa
wife that bird
r6
kisaana farmer
Solution-1
49. Idioms suhaa <fs tam=nahiiM+taa_thaa> not
appealed
k2 pof k1
vaha ciDiyaa phuuTii aazkha patnii
that bird burst eye wife
r6
kisaana
farmer'
Solution-2
50. Change of State Verbs Change of state verbs such as raMganaa (colour) pose a problem such as,
((usane)) ((apanaa ghara)) ((piilaa)) ((raMgaa))?
'he/she' 'own' 'house' 'yellow' 'coloured'
raMga colour
k1 k2 ?
usane ghara piilaa
he/she house yellow
Is 'piilaa' a complement of 'ghara' ? OR
Is it the k2 of raMgaa ?
If piilaa is the k2 of raMgaa then what is the relation of ghara with raMgaa ?
Can they both be k2 ?
51. Proposed Solution In Panini's framework, verbs denoting 'change of state' can have two 'karma'
The object which is being changed
The state after change
Thus,
raMga coloured
k1 k2-1 k2-2
usane ghara piilaa
he house yellow
52. Conjuncts Need special treatment in a dependency
representation
(maiM baazaara gayaa)1 Ora (ve loga ghara para ruke)2
'I' 'market' 'went' 'and' 'those' people 'home' 'at 'stayed'
I went to the market and those people stayed at home
What is the head of a co-ordinate structure ?
How to represent the equal status of 1 and 2 above ?
53. Conjuncts
Take Conjunct as the 'head'
Label the relation as 'ccof'
Ora and
ccof ccof
gayA went ruke stay
k1 k2 k1 k7p
mEM bAzAra loga ghara
I market people home
A subordinating conjunct will have a single child node
54. Some Problem Cases Certain complex sentences pose problems
For example :
agara tuma aate to hama vahaaM jaate
if you come then we there go
Had you come, we would have gone there
Counterfactual
agara and to two connectives
How to represent the dependencies ?
55. Main Clause Subordinate Clause jaate go+?
? ? K1 k7p
agara to hama vahaaM
ccof
aate
k1
tuma
This representation fails to capture the relation between agara-to
56. Representation-Currently Followed to then
ccof
jaate go+?
vmod k1 k7p
agara hama vahaaM
ccof we there
aate 'come'
k1
tuma 'you'
57. Alternative Proposal agara-to
pof pof
agara to
ccof ccof
aate jaate
k1 k1 k7p
tuma hama vahaaM
Treat agara-to as a complex conjunct
58. Ellipsis How to show dependencies when the head is missing ?
bacce baDe ho gaye hEM kisI kI bAta nahIM sunate
The children have grown up, they don't listen to anyone
No explicit conjunct !!
Insert a NULL element to show the dependencies
NULL_CCP
ccof ccof
bade_ho_gaye nahIM_sunate
Insert a NULL node only if it is essential to represent the dependencies
.
60. Some English Examples English is :
A configurational language
Relatively fixed word order
Relations are not realised in affixes
Subject and object are positional
Subject is sacrosanct
61. Passive Rama ate a banana
eat <fs tam=PAST>
k1 k2
Rama banana
A banana was eaten by Rama
eat <fs tam=was_en>
k2 k1
banana Rama
Extend the notion of vibhakti to English subject, object positions
62. Interrogatives Did Rama eat a banana ?
A 'Yes-no' interrogative
Structurally,
Interrogative is realised through word order change
Subject Auxiliary inversion
No interrogative morpheme
63. Interrogative Contd. Proposed solution:
eat < fs stype=interrogative__yes-no>
fragof k1 k2
Did Rama banana
Position gives the cues for the constraints
64. Interrogatives Contd. What did Rama eat ?
Eat < fs stype=interrogative__wh>
k2 fragof k1
What did Rama
Question element 'what' and
Auxiliary position provide the syntactic cues
65. Control Verbs John persuaded Harry to leave
persuade
k1 k2 rt (?)?
John Harry leave
The object of persuade corefers to the 'missing' 'karta' of 'leave'
John promised Harry to leave
promise
k1 k4 k2
John Harry leave
The subject of promise corefers to the 'missing' 'karta' of 'leave'
66. Verbs such as 'want' John wanted Harry to leave
want
k1 k2
John leave
k1
Harry
'want' is a transitive verb and can take 'a clause' as its 'karma'
67. Empty 'it' It is raining in Delhi
rain <fs stype=expletive__it>
k7p
Delhi
Possible representation
Empty 'it' can be captured in the feature structure
68. Conclusion Paninian Grammatical Formalism offers a depenency based approach for sentence parsing which suits better morphologically richer languages with relatively free word order such as Indian languages.