1995 Presentations (Communicative Events)
Coerced Markov Models For Cross-Lingual Lexical-Tag Relations
We introduce the Coerced Markov Model (CMM) to model the relationship between the lexical sequence of a source language and the tag sequence of a target language, with the objective of constraining search in statistical transfer-based machine translation systems. CMMs differ from Hidden Markov Models in that state sequence assignments can take on values coerced from external sources. Given a Chinese sentence, a CMM can be used to predict the corresponding English tag sequence, thus constraining the English lexical sequence produced by a translation model. The CMM can also be used to score competing translation hypotheses in N-best models. Three fundamental problems in CMM design are discussed; their solutions lead to the training and testing stages of the CMM.
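The idea of state assignments being coerced from an external source can be illustrated with a small sketch. The code below is not the paper's formulation: it is an ordinary HMM Viterbi decoder, modified so that any position listed in a hypothetical `coerced` mapping is fixed to an externally supplied state. The function name, the two-tag state set, the observation symbols, and all probabilities are assumptions made up for illustration.

```python
import math


def constrained_viterbi(obs, states, start_p, trans_p, emit_p, coerced=None):
    """Return (best state path, log-probability) for an observation sequence.

    `coerced` maps a position index to a state that an external source
    dictates; only that state is considered at that position.
    """
    coerced = coerced or {}

    def allowed(t):
        # A coerced position admits exactly one state; others admit all.
        return [coerced[t]] if t in coerced else states

    def lp(p):
        return math.log(p) if p > 0 else float("-inf")

    # best[t][s] = (log-prob of the best path ending in s at time t, previous state)
    best = [{s: (lp(start_p[s]) + lp(emit_p[s].get(obs[0], 0.0)), None)
             for s in allowed(0)}]
    for t in range(1, len(obs)):
        column = {}
        for s in allowed(t):
            score, prev = max(
                (best[t - 1][r][0] + lp(trans_p[r][s]), r) for r in best[t - 1]
            )
            column[s] = (score + lp(emit_p[s].get(obs[t], 0.0)), prev)
        best.append(column)

    # Backtrace from the best final state.
    last = max(best[-1], key=lambda s: best[-1][s][0])
    path = [last]
    for t in range(len(obs) - 1, 0, -1):
        path.append(best[t][path[-1]][1])
    path.reverse()
    return path, best[-1][last][0]


# Toy model (all values invented for illustration).
states = ["N", "V"]
start_p = {"N": 0.6, "V": 0.4}
trans_p = {"N": {"N": 0.3, "V": 0.7}, "V": {"N": 0.8, "V": 0.2}}
emit_p = {"N": {"a": 0.5, "b": 0.1, "c": 0.4},
          "V": {"a": 0.1, "b": 0.6, "c": 0.3}}
obs = ["a", "b", "c"]

free_path, free_score = constrained_viterbi(obs, states, start_p, trans_p, emit_p)
# Force position 1 to tag "N", as if an external source had supplied it.
forced_path, forced_score = constrained_viterbi(
    obs, states, start_p, trans_p, emit_p, coerced={1: "N"}
)
```

In this toy setting, fixing a position narrows the search to paths consistent with the external information, which is the sense in which a predicted tag sequence could constrain a downstream translation model. The returned log-probability could likewise serve as a score for ranking competing hypotheses, though how the paper does this is not specified in the abstract.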
More About This Work
- Academic Units: Computer Science
- Publisher: Proceedings of the Sixth International Conference on Theoretical and Methodological Issues in Machine Translation
- Published Here: April 26, 2013