Home

Improving the Coherence of Multi-document Summaries: A Corpus Study for Modeling the Syntactic Realization of Entities

Ani Nenkova; Kathleen McKeown

Title:
Improving the Coherence of Multi-document Summaries: A Corpus Study for Modeling the Syntactic Realization of Entities
Author(s):
Nenkova, Ani
McKeown, Kathleen
Date:
Type:
Technical reports
Department:
Computer Science
Permanent URL:
Series:
Columbia University Computer Science Technical Reports
Part Number:
CUCS-001-03
Publisher:
Department of Computer Science, Columbia University
Publisher Location:
New York
Abstract:
References included in multi-document summaries are often problematic. In this paper, we present a corpus study performed to derive statistical models for the syntactic realization of referential expressions. Our work shows how the syntactic realization of entities can influence the coherence of the text and provides a model for rewriting references in multi-document summaries to smooth disfluencies. It shows how the syntactic realization of entities can influence the coherence of the text and how rewrite change s can smooth the disfluencies. A large corpus study is conducted in order to derive initial models for syntactic realization.
Subject(s):
Computer science
Item views:
151
Metadata:
View

In Partnership with the Center for Digital Research and Scholarship at Columbia University Libraries/Information Services.