250 likes | 462 Views
Consortium Project on Development of Dravidian WordNet : An Integrated WordNet for Telugu, Tamil, Kannada and Malayalam. Objective. Develop an integrated WordNet in four major Dravidian languages, viz. Tamil, Telugu, Kannada and Malayalam Linked with Hindi and English WordNets.
E N D
Consortium Project on Development of Dravidian WordNet: An Integrated WordNet for Telugu, Tamil, Kannada and Malayalam
Objective • Develop an integrated WordNet in four major Dravidian languages, viz. Tamil, Telugu, Kannada and Malayalam • Linked with Hindi and English WordNets 2 PRSG Meeting
Consortium Members • Consortium Leader • Prof. Pushpak Bhattacharya, IIT Bombay • Consortium Members • Dr. S. Baskaran, Tamil University (Tamil) • Prof. K.P.Soman, Amrita ViswaVidyapeetham(Malayalam) • Prof. C.S.Ramachandra, University of Mysore (Kannada) • Dr. S. Arulmozi, Dravidian University (Co-Consortium Leader & Telugu) 2 PRSG Meeting
Project Details • Total Outlay of the Project: • 150.43 lakhs • Date of Commencement: • 26 Dec 2011 • Duration of the Project: • 24 months 2 PRSG Meeting
Project Deliverables • The integrated Dravidian WordNet will be linked with Hindi and English WordNets, with which the users will be able to • Look up their language specific words to obtain lexico-semantic relations like synonymy, hypernymy, meronymy etc. • Query for cross-lingual lexical information • Design and implement complex natural language applications like machine translation and cross-lingual search 2 PRSG Meeting
Organization and Distribution of Tasks • IIT-B • Overall Coordination of the project • providing guidance on the architecture and technology • making available existing tools and interfaces • Computational tasks; algorithms on WordNets 2 PRSG Meeting
Organization & Distribution of Tasks • Other Partners • 20000 synsetscreation • Validation of synsets • Adaptation of semantic relations and validation (each in Tamil, Telugu, Malayalam and Kannada) 2 PRSG Meeting
Tamil WordNet • Commencement Date: 24 April 2012 • Principal Investigator: Dr.S.Baskaran • Senior Linguist • G. Vasuki, M.A. M.Phil (Ling.) • Computer Scientist • G.Biju, MCA, M.Phil • Lexicographers • D. Yoga, M.A. M.Phil (Ling), M.A. (Tamil) • M. Ramasundari, M.A. M.Phil, Ph.D (Ling.) • D. Vinodha, M.A.(Hindi), Dip. In Translation • K. Bakkiyaraj, M.A. M.Phil (Ling.) 2 PRSG Meeting
Malayalam WordNet • Commencement Date: 24 April 2012 • Principal Investigator: Prof.K.P.Soman • Senior Linguist • N. Rajendran, M.A. Ph.D (Ling.) • Computer Scientist • K.Krishnakumar, MA, M.Phil, Ph.D (Ling.) • Lexicographers • S. Veera Alagiri, M.A. M.Phil, Ph.D (Ling) • Jyothi Ratnam, M.A. (Hindi) 2 PRSG Meeting
Telugu WordNet • Commencement Date: 2 July 2012 • Principal Investigator:Dr.S.Arulmozi • Co-PI: Dr.M.C.KesavaMurty • Senior Linguist • Dr.S.ChandraKiran, M.A. M.Phil (Tel.) Ph.D (Comp.Lit.) • Computer Scientist • T. Swathi, MCA • Lexicographers • S. Sravanti, M.A. (Telugu) • K. Sukanya, M.A. (Telugu) • K. Sampoorna, M.A. (Telugu) • N.Silparani, M.A. (Telugu) 2 PRSG Meeting
Kannada WordNet • Commencement Date: 23 July 2012 • Principal Investigator: Prof. C.S.Ramachandra • Co-PI: Prof. G.Hemanthakumar • Senior Linguist • Dr.B.P.Hemananda, M.A. Ph.D (Ling.) • Lexicographers • Chaya Devi, M.A. Linguistics • R M Ramya, M.A. Kannada 2 PRSG Meeting
Status of synset creation 2 PRSG Meeting
Total Synsets Developed Includes Pan-Indian, Universal, Remaining Synsets 2 PRSG Meeting
Status on Tasks • Synset Creation – • Pan-Indian, Universal – Completed • Nouns – 40% completed • Verbs – 70 % completed • Adjectives – completed • Adverbs – 70% completed • Language & Culture Specific synsets – Initiated • Named Entity – to start • Web tool – Telugu is completed, others are in line. 2 PRSG Meeting
Manpower Trained 2 PRSG Meeting
Equipment Purchased 2 PRSG Meeting
Financial Details 2 PRSG Meeting
Institute-wise Project Budget 2 PRSG Meeting
Head-wise Fund Distribution 2 PRSG Meeting
Amount Received & Expenditure(upto 28 Feb 2013) Project commenced after 5 months of administrative approval 2 PRSG Meeting
Man-power Details 2 PRSG Meeting
Papers Published • `Tamil WordNet’, Proceedings of the Fifth Global WordNet Conference, IIT-Bombay, 31 Jan-4 Feb 2010 (S.Rajendran) • `Building a WordNet’ for Dravidian Languages, Proceedings of the Fifth Global WordNet Conference, IIT-Bombay, 31 Jan-4 Feb 2010 (S.Rajendran, S.Gopakumar, V.Dhanalakshmi) • `Representation of Kinship in WordNet’, Proceedings of the 9th International Tamil Internet Conference, Coimbatore, 23-27 June 2010 (S.Arulmozi) • `Polysemy in Tamil and other Indian Languages’, Proceedings of the Fifth Global WordNet Conference, IIT-Bombay, 31 Jan-4 Feb 2010 (S.Arulmozi & PanchananMohanty) • `Telugu WordNet’, Proceedings of the Fifth Global WordNet Conference, IIT-Bombay, 31 Jan-4 Feb 2010 (S.Arulmozi) • `Augmenting IndoWordNet with Context’ Proceedings of the ICON 2010 (S.Rajendran & S.Arulmozi) 2 PRSG Meeting
Workshop conducted • First Dravidian WordNet Workshop • 16-17 March, 2012 • Amrita Vishwa Vidyapeetham • Second Dravidian WordNet Workshop • 5-6 October, 2012 • Dravidian University 2 PRSG Meeting
Action Plan • Hosting Web version • Completion of synset creation • Internal validation of synsets 2 PRSG Meeting
Thank you. 2 PRSG Meeting