Academic Commons

Presentations (Communicative Events)

Methods for precise named entity matching in digital collections

Klavans, Judith L.; Davis, Peter T.; Elson, David K.

In this paper, we describe an interactive system, built within the context of CLiMB project, which permits a user to locate the occurrences of named entities within a given text. The named entity tool was developed to identify references to a single art object (e.g. a particular building) with high precision in text related to images of that object in a digital collection. We start with an authoritative list of art objects, and seek to match variants of these named entities in related text. Our approach is to "decay" entities into progressively more general variants while retaining high precision. As variants become more general, and thus more ambiguous, we propose methods to disambiguate intermediate results. Our results will be used to select records into which automatically generated metadata will be loaded.

Files

More About This Work

Academic Units
Computer Science
Publisher
JCDL '03 Proceedings of the 3rd ACM/IEEE-CS joint conference on Digital libraries
Published Here
May 10, 2013