Academic Commons

Presentations (Communicative Events)

Extracting paraphrases from a parallel corpus

McKeown, Kathleen; Barzilay, Regina

While paraphrasing is critical for both interpretation and generation of natural language, current systems use manual or semi-automatic methods to collect paraphrases. We present an unsupervised learning algorithm for identifying paraphrases from a corpus of multiple English translations of the same source text. Our approach yields phrasal and single-word lexical paraphrases as well as syntactic paraphrases.

More About This Work

Academic Units: Computer Science
Publisher: Proceedings of the 39th Annual Meeting of the Association for Computational Linguistics (ACL-EACL 2001)
Published Here: May 3, 2013