Academic Commons

Presentations (Communicative Events)

Who, What, When, Where, Why? Comparing Multiple Approaches to the Cross-Lingual 5W Task

Parton, Kristen; McKeown, Kathleen; Coyne, Robert Eric; Diab, Mona T.; Grishman, Ralph; Hakkani-Tür, Dilek; Harper, Mary; Ji, Heng; Ma, Wei Yun; Meyers, Adam; Rosenthal, Sara; Sun, Ang; Tur, Gokhan; Xu, Wei; Yaman, Sibel

Cross-lingual tasks are especially difficult due to the compounding effect of errors in language processing and errors in machine translation (MT). In this paper, we present an error analysis of a new cross-lingual task: the 5W task, a sentence-level understanding task which seeks to return the English 5W's (Who, What, When, Where and Why) corresponding to a Chinese sentence. We analyze systems that we developed, identifying specific problems in language processing and MT that cause errors. The best cross-lingual 5W system was still 19% worse than the best monolingual 5W system, which shows that MT significantly degrades sentence-level understanding. Neither source-language nor target-language analysis was able to circumvent problems in MT, although each approach had advantages relative to the other. A detailed error analysis across multiple systems suggests directions for future research on the problem.


  • thumnail for parton-cross-lingual-5w-task.pdf parton-cross-lingual-5w-task.pdf application/pdf 150 KB Download File

More About This Work

Academic Units
Computer Science
Published Here
August 5, 2013
Academic Commons provides global access to research and scholarship produced at Columbia University, Barnard College, Teachers College, Union Theological Seminary and Jewish Theological Seminary. Academic Commons is managed by the Columbia University Libraries.