Presentations (Communicative Events)

Making a Scene: Alignment of Complete Sets of Clips Based on Pairwise Audio Match

Ellis, Daniel P. W.; Naaman, Mor; Su, Kai; Gurjar, Avadhut; Patel, Mohsin

As the amount of social video content captured at physical-world events, and shared online, is rapidly increasing, there is a growing need for robust methods for organization and presentation of the captured content. In this work, we significantly extend prior work that examined automatic detection of videos from events that were captured at the same time, i.e. "overlapping". We go beyond finding pairwise matches between video clips and describe the construction of scenes, or sets of multiple overlapping videos, each scene presenting a coherent moment in the event. We test multiple strategies for scene construction, using a greedy algorithm to create a mapping of videos into scenes, and a clustering refinement step to increase the precision of each scene. We evaluate the strategies in multiple settings and show that a greedy and clustering approach results in best possible balance between recall and precision for all settings.


Also Published In

ACM International Conference on Multimedia Retrieval 2012

More About This Work

Academic Units
Electrical Engineering
Published Here
April 22, 2013