2002 Presentations (Communicative Events)
Corpus-Trained Text Generation for Summarization
We explore how machine learning can be employed to learn rulesets for the traditional modules of content planning and surface realization. Our approach takes advantage of semantically annotated corpora to induce preferences for content planning and constraints on realizations of these plans. We applied this methodology to an annotated corpus of indicative summaries to derive constraint rules that can assist in generating summaries for new, unseen material.
Subjects
Files
- kan_mckeown_02.pdf application/pdf 89.7 KB Download File
More About This Work
- Academic Units
- Computer Science
- Publisher
- Proceedings of the Second International Natural Language Generation Conference (INLG 2002)
- Published Here
- May 10, 2013