Home

Synthesizing composite topic structure trees for multiple domain specific documents

Min-Yen Kan; Kathleen McKeown; Judith L. Klavans

Title:
Synthesizing composite topic structure trees for multiple domain specific documents
Author(s):
Kan, Min-Yen
McKeown, Kathleen
Klavans, Judith L.
Date:
Type:
Technical reports
Department:
Computer Science
Permanent URL:
Series:
Columbia University Computer Science Technical Reports
Part Number:
CUCS-003-01
Publisher:
Department of Computer Science, Columbia University
Publisher Location:
New York
Abstract:
Domain specific texts often have implicit rules on content and organization. We introduce a novel method for synthesizing this topical structure. The system uses corpus examples and recursively merges their topics to build a hierarchical tree. A subjective cross domain evaluation showed that the system performed well in combining related topics and in highlighting important ones.
Subject(s):
Computer science
Item views:
152
Metadata:
text | xml

In Partnership with the Center for Digital Research and Scholarship at Columbia University Libraries/Information Services | Terms of Use