KCIS Language Resources
The main goal of the Indian Languages Treebanking Project is to develop treebanks for several resource-poor Indian languages. As part of this project, treebanks have been developed for Bengali, Kannada, Hindi, Malayalam and Marathi. The annotated data is drawn from different domains, such as news articles, general-domain text and tourism. The treebanks have multi-layered representations encompassing both syntactic (morphological information, POS tags, chunks) and semantic (dependency labels) information. Dependency annotation is based on the Paninian Grammar Framework.
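To make the layered representation concrete, here is a minimal Python sketch of how one annotated sentence could be held in memory. It is an illustration only, not the project's file format or tooling: the class names, field names, and the `dependents` helper are hypothetical, the romanized Hindi sentence and its tag values are example annotations chosen for this sketch, and it assumes that dependency relations are recorded between chunks.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class Token:
    """One word with its POS tag and morphological features (hypothetical layout)."""
    form: str
    pos: str
    morph: dict = field(default_factory=dict)


@dataclass
class Chunk:
    """A chunk of tokens plus its dependency link to a parent chunk."""
    chunk_id: str
    tag: str
    tokens: list
    head_id: Optional[str] = None    # parent chunk id; None marks the root
    dep_label: Optional[str] = None  # dependency label on the link to the parent


def dependents(chunks, head_id):
    """Return the chunks directly attached to the chunk with the given id."""
    return [c for c in chunks if c.head_id == head_id]


# Illustrative annotation of "Ram ne aam khaya" ("Ram ate a mango").
# Tag and label values are examples for this sketch, not project data.
sentence = [
    Chunk("NP", "NP",
          [Token("Ram", "NNP", {"gender": "m", "number": "sg"}),
           Token("ne", "PSP")],
          head_id="VGF", dep_label="k1"),
    Chunk("NP2", "NP",
          [Token("aam", "NN", {"gender": "m", "number": "sg"})],
          head_id="VGF", dep_label="k2"),
    Chunk("VGF", "VGF",
          [Token("khaya", "VM", {"gender": "m", "number": "sg"})]),
]

for chunk in dependents(sentence, "VGF"):
    words = " ".join(t.form for t in chunk.tokens)
    print(f"{chunk.dep_label}: {words}")
```

In this sketch each layer stays separate: morphology and POS tags live on the tokens, chunk tags on the chunks, and dependency labels on the links between chunks, so any one layer can be read without the others.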
The project is a collaborative effort of five organizations:
(1) C-DIT, Thiruvananthapuram [Malayalam], (2) Manipal Institute of Technology, Manipal [Kannada], (3) Jadavpur University, Kolkata [Bengali], (4) Indian Institute of Technology, Bombay [Marathi] and (5) International Institute of Information Technology (IIIT), Hyderabad [Hindi (Lead Institute)].
This project is funded by DeitY, Government of India.
Please log in or register to download the treebanks for the different languages.