2006 Presentations (Communicative Events)
Unsupervised Induction of Modern Standard Arabic Verb Classes
We exploit the resources in the Arabic Treebank (ATB) for the novel task of automatically creating lexical semantic verb classes for Modern Standard Arabic (MSA). Verbs are clustered into groups that share semantic elements of meaning as they exhibit similar syntactic behavior. The results of the clustering experiments are compared with a gold standard set of classes, which is approximated by using the noisy English translations provided in the ATB to create Levin-like classes for MSA. The quality of the clusters is found to be sensitive to the inclusion of information about lexical heads of the constituents in the syntactic frames, as well as parameters of the clustering algorithm. The best set of parameters yields an Fβ=1 score of 0.501, compared to a random baseline with an Fβ=1 score of 0.37.
Subjects
Files
- snider_diab_06b.pdf application/pdf 186 KB Download File
More About This Work
- Academic Units
- Computer Science
- Publisher
- Proceedings of Interspeech
- Published Here
- July 5, 2013