Consortium Project on Development of Dravidian WordNet: An Integrated WordNet for Telugu, Tamil, Kannada and Malayalam
Objective
• Develop an integrated WordNet in four major Dravidian languages, viz. Tamil, Telugu, Kannada and Malayalam
• Link it with the Hindi and English WordNets
2 PRSG Meeting
Consortium Members
• Consortium Leader
  • Prof. Pushpak Bhattacharyya, IIT Bombay
• Consortium Members
  • Dr. S. Baskaran, Tamil University (Tamil)
  • Prof. K. P. Soman, Amrita Vishwa Vidyapeetham (Malayalam)
  • Prof. C. S. Ramachandra, University of Mysore (Kannada)
  • Dr. S. Arulmozi, Dravidian University (Co-Consortium Leader & Telugu)
Project Details
• Total Outlay of the Project: 150.43 lakhs
• Date of Commencement: 26 Dec 2011
• Duration of the Project: 24 months
Project Deliverables
The integrated Dravidian WordNet will be linked with the Hindi and English WordNets, so that users will be able to:
• Look up language-specific words to obtain lexico-semantic relations such as synonymy, hypernymy and meronymy
• Query for cross-lingual lexical information
• Design and implement complex natural language applications such as machine translation and cross-lingual search
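The lookups listed under Project Deliverables can be pictured with a small sketch. It assumes, purely for illustration, that synsets for the same concept share one ID across languages; the class names `Synset` and `LinkedWordNet`, the IDs, and the romanized toy entries are hypothetical and are not taken from the project's actual data or tools.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional, Tuple


@dataclass
class Synset:
    synset_id: int                     # hypothetical concept ID shared across languages
    lang: str                          # language code, e.g. "en", "ta", "te"
    words: List[str]                   # synonymous words expressing the concept
    hypernym_id: Optional[int] = None  # ID of the more general concept, if any


class LinkedWordNet:
    """Toy multilingual WordNet: synsets are linked by a shared synset_id."""

    def __init__(self) -> None:
        self._synsets: Dict[Tuple[str, int], Synset] = {}
        self._index: Dict[Tuple[str, str], List[int]] = {}

    def add(self, synset: Synset) -> None:
        self._synsets[(synset.lang, synset.synset_id)] = synset
        for word in synset.words:
            self._index.setdefault((synset.lang, word), []).append(synset.synset_id)

    def _collect(self, keys: List[Tuple[str, Optional[int]]], skip: str = "") -> List[str]:
        # Gather words from the synsets found under the given keys, without duplicates.
        result: List[str] = []
        for key in keys:
            found = self._synsets.get(key)
            if found is None:
                continue
            for w in found.words:
                if w != skip and w not in result:
                    result.append(w)
        return result

    def synonyms(self, lang: str, word: str) -> List[str]:
        ids = self._index.get((lang, word), [])
        return self._collect([(lang, sid) for sid in ids], skip=word)

    def hypernyms(self, lang: str, word: str) -> List[str]:
        ids = self._index.get((lang, word), [])
        parents = [self._synsets[(lang, sid)].hypernym_id for sid in ids]
        return self._collect([(lang, pid) for pid in parents])

    def translate(self, word: str, src: str, tgt: str) -> List[str]:
        # Cross-lingual lookup: follow the shared ID into the target language.
        ids = self._index.get((src, word), [])
        return self._collect([(tgt, sid) for sid in ids])


def build_toy_wordnet() -> LinkedWordNet:
    wn = LinkedWordNet()
    # Illustrative entries only (romanized); IDs 1 and 2 are made up.
    wn.add(Synset(1, "en", ["plant", "flora"]))
    wn.add(Synset(2, "en", ["tree"], hypernym_id=1))
    wn.add(Synset(1, "ta", ["thaavaram"]))
    wn.add(Synset(2, "ta", ["maram"], hypernym_id=1))
    wn.add(Synset(2, "te", ["chettu"], hypernym_id=1))
    return wn


wn = build_toy_wordnet()
print(wn.synonyms("en", "plant"))
print(wn.hypernyms("ta", "maram"))
print(wn.translate("maram", "ta", "te"))
```

Under this assumption, a cross-lingual query is just a second lookup with the same ID in another language, which is why a missing target-language synset yields an empty result rather than an error.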
Organization and Distribution of Tasks
• IIT-B
  • Overall coordination of the project
  • Providing guidance on the architecture and technology
  • Making available existing tools and interfaces
  • Computational tasks; algorithms on WordNets
• Other Partners (each in Tamil, Telugu, Malayalam and Kannada)
  • Creation of 20,000 synsets
  • Validation of synsets
  • Adaptation of semantic relations and validation
Tamil WordNet
• Commencement Date: 24 April 2012
• Principal Investigator: Dr. S. Baskaran
• Senior Linguist
  • G. Vasuki, M.A., M.Phil (Ling.)
• Computer Scientist
  • G. Biju, MCA, M.Phil
• Lexicographers
  • D. Yoga, M.A., M.Phil (Ling.), M.A. (Tamil)
  • M. Ramasundari, M.A., M.Phil, Ph.D (Ling.)
  • D. Vinodha, M.A. (Hindi), Dip. in Translation
  • K. Bakkiyaraj, M.A., M.Phil (Ling.)
Malayalam WordNet
• Commencement Date: 24 April 2012
• Principal Investigator: Prof. K. P. Soman
• Senior Linguist
  • N. Rajendran, M.A., Ph.D (Ling.)
• Computer Scientist
  • K. Krishnakumar, M.A., M.Phil, Ph.D (Ling.)
• Lexicographers
  • S. Veera Alagiri, M.A., M.Phil, Ph.D (Ling.)
  • Jyothi Ratnam, M.A. (Hindi)
Telugu WordNet
• Commencement Date: 2 July 2012
• Principal Investigator: Dr. S. Arulmozi
• Co-PI: Dr. M. C. Kesava Murty
• Senior Linguist
  • Dr. S. Chandra Kiran, M.A., M.Phil (Tel.), Ph.D (Comp. Lit.)
• Computer Scientist
  • T. Swathi, MCA
• Lexicographers
  • S. Sravanti, M.A. (Telugu)
  • K. Sukanya, M.A. (Telugu)
  • K. Sampoorna, M.A. (Telugu)
  • N. Silparani, M.A. (Telugu)
Kannada WordNet
• Commencement Date: 23 July 2012
• Principal Investigator: Prof. C. S. Ramachandra
• Co-PI: Prof. G. Hemanthakumar
• Senior Linguist
  • Dr. B. P. Hemananda, M.A., Ph.D (Ling.)
• Lexicographers
  • Chaya Devi, M.A. (Linguistics)
  • R. M. Ramya, M.A. (Kannada)
Status of Synset Creation
Total Synsets Developed (includes Pan-Indian, Universal and remaining synsets)
Status on Tasks
• Synset Creation
  • Pan-Indian, Universal: completed
  • Nouns: 40% completed
  • Verbs: 70% completed
  • Adjectives: completed
  • Adverbs: 70% completed
• Language- and culture-specific synsets: initiated
• Named entities: yet to start
• Web tool: completed for Telugu; the other languages are in line
Manpower Trained
Equipment Purchased
Financial Details
Institute-wise Project Budget
Head-wise Fund Distribution
Amount Received & Expenditure (up to 28 Feb 2013)
• The project commenced 5 months after administrative approval
Manpower Details
Papers Published
• ‘Tamil WordNet’, Proceedings of the Fifth Global WordNet Conference, IIT-Bombay, 31 Jan-4 Feb 2010 (S. Rajendran)
• ‘Building a WordNet for Dravidian Languages’, Proceedings of the Fifth Global WordNet Conference, IIT-Bombay, 31 Jan-4 Feb 2010 (S. Rajendran, S. Gopakumar, V. Dhanalakshmi)
• ‘Representation of Kinship in WordNet’, Proceedings of the 9th International Tamil Internet Conference, Coimbatore, 23-27 June 2010 (S. Arulmozi)
• ‘Polysemy in Tamil and other Indian Languages’, Proceedings of the Fifth Global WordNet Conference, IIT-Bombay, 31 Jan-4 Feb 2010 (S. Arulmozi & Panchanan Mohanty)
• ‘Telugu WordNet’, Proceedings of the Fifth Global WordNet Conference, IIT-Bombay, 31 Jan-4 Feb 2010 (S. Arulmozi)
• ‘Augmenting IndoWordNet with Context’, Proceedings of ICON 2010 (S. Rajendran & S. Arulmozi)
Workshops Conducted
• First Dravidian WordNet Workshop
  • 16-17 March 2012
  • Amrita Vishwa Vidyapeetham
• Second Dravidian WordNet Workshop
  • 5-6 October 2012
  • Dravidian University
Action Plan
• Hosting the web version
• Completion of synset creation
• Internal validation of synsets
Thank you.