Cutting the Frame: An In-Depth Look at the Hitchcock Computer Vision Dataset

Nabeel Siddiqui

doi:10.5334/johd.163

Journal of Open Humanities Data (Jan 2024)

Cutting the Frame: An In-Depth Look at the Hitchcock Computer Vision Dataset

Nabeel Siddiqui

Affiliations

Nabeel Siddiqui: ORCiD; Communications Department, Susquehanna University, Selinsgrove, PA

DOI: https://doi.org/10.5334/johd.163
Journal volume & issue: Vol. 10
pp. 5 – 5

Abstract

Read online

This paper presents a comprehensive dataset comprising annotations generated by the Google Vision API for approximately 105,000 frames extracted from 15 Alfred Hitchcock films. These annotations include information about object detection, facial recognition, web-entity analysis, and explicit content filtering. With potential applications in the digital humanities and film studies, this dataset enables researchers to not only explore and evaluate cinematic content but also the ways that it resurfaces in various cultural contexts online. The paper provides a detailed account of the dataset creation process, which involved the decryption of the DVD, frame extraction, and costs of annotations. Additionally, the paper outlines future research possibilities based on the dataset. These include statistical analysis of frame content and labels to identify patterns and trends, comparisons of different computer vision algorithms to assess their accuracy and effectiveness, and the utilization of bipartite networks to explore mise-en-scene in films.

Published in Journal of Open Humanities Data

ISSN: 2059-481X (Online)
Publisher: Ubiquity Press
Country of publisher: United Kingdom
LCC subjects: General Works: History of scholarship and learning. The humanities; Language and Literature
Website: https://openhumanitiesdata.metajnl.com/

About the journal

Abstract

Keywords