Academic Commons

Reports

Synthesizing composite topic structure trees for multiple domain specific documents

Kan, Min-Yen; McKeown, Kathleen; Klavans, Judith L.

Domain specific texts often have implicit rules on content and organization. We introduce a novel method for synthesizing this topical structure. The system uses corpus examples and recursively merges their topics to build a hierarchical tree. A subjective cross domain evaluation showed that the system performed well in combining related topics and in highlighting important ones.

Subjects

Files

More About This Work

Academic Units
Computer Science
Publisher
Department of Computer Science, Columbia University
Series
Columbia University Computer Science Technical Reports, CUCS-003-01
Published Here
April 22, 2011