2014 Reports
Conventional Orthography for Dialectal Arabic (CODA): Principles and Guidelines -- Egyptian Arabic - Version 0.7 - March 2012
This document introduces CODA (Conventional Orthography for Dialectal Arabic) and presents specifications and detailed guidelines for Egyptian Arabic CODA. CODA addresses the problem of inconsistent orthographic choices in raw (naturally occurring) written dialectal Arabic text. The specifications are a succinct summary, while the guidelines contain details and examples. The document has three parts, ordered from the most general to the most specific. In Part 1, we define CODA and present its general goals, principles, and considerations in a dialect-independent manner. In Part 2, we present a high-level CODA specification for Egyptian Arabic (EGY). In Part 3, we present detailed guidelines for EGY CODA.
Files
- CCLS-14-01.pdf (application/pdf, 2.92 MB)
More About This Work
- Academic Units: Center for Computational Learning Systems
- Publisher: Center for Computational Learning Systems, Columbia University
- Series: CCLS Technical Report, CCLS-14-01
- Published Here: September 26, 2014