Theses Master's

Using Natural Language Processing to Organize and Analyze Oral History Projects

Pandza, Christopher

In this paper, I discuss how Natural Language Processing (NLP) tools, particularly large language models (LLMs) and topic modeling, can be employed to organize and analyze large-scale oral history projects. To illustrate their utility, I demonstrate two use cases with the Ellis Island Oral History Project: metadata mining and topic assignment. Subsequently, I use the outputs from these exercises to explore two research questions.

Keywords: natural language processing, artificial intelligence, archives, immigration


  • thumnail for Pandza_OHMA_Thesis_20230510.pdf Pandza_OHMA_Thesis_20230510.pdf application/pdf 1.11 MB Download File

More About This Work

Academic Units
Oral History Master of Arts
Thesis Advisors
McAllister, William
M.A., Columbia University
Published Here
October 19, 2023