2009 Reports
CATiB: The Columbia Arabic Treebank
The Columbia Arabic Treebank (CATiB) is a resource for Arabic parsing. CATiB contrasts with previous efforts on Arabic treebanking and treebanking of morphologically rich languages in that it encodes less linguistic information in the interest of speedier annotation of large amounts of text. This paper describes CATiB's representation and annotation procedure, and reports on achieved inter-annotator agreement and annotation speed.
Subjects
Files
- CCLS-09-01.pdf application/pdf 289 KB Download File
More About This Work
- Academic Units
- Center for Computational Learning Systems
- Publisher
- Center for Computational Learning Systems, Columbia University
- Series
- CCLS Technical Report, CCLS-09-01
- Published Here
- November 22, 2010