The Shakespeare’s World Crowdsourced Transcription Project Datasets

Victoria Van Hyning; ZhiCheng Wang

doi:10.5334/johd.237

Journal of Open Humanities Data (Nov 2024)

The Shakespeare’s World Crowdsourced Transcription Project Datasets

Victoria Van Hyning,
ZhiCheng Wang

Affiliations

Victoria Van Hyning: ORCiD; College of Information, University of Maryland, College Park
ZhiCheng Wang: ORCiD; College of Information, University of Maryland, College Park

DOI: https://doi.org/10.5334/johd.237
Journal volume & issue: Vol. 10
pp. 52 – 52

Abstract

Read online

The Shakespeare’s World Datasets derive from a crowdsourced transcription project hosted on the Zooniverse platform between 2015 and 2019 (Van Hyning et al, 2015–2019). Volunteers transcribed 14,330 digitized early modern manuscripts from the Folger Shakespeare Library. 3,926 registered volunteers contributed, and over 94,570 anonymous sessions representing an unknown number of individuals, were recorded. This paper presents a cleaned dataset of individual volunteer transcriptions (IVT), containing all 203,389 valid classifications, along with three supplementary social datasets for a discussion forum and blog. These datasets provide insights into the transcription process, and volunteer and project owner interactions during the project’s lifespan. The datasets have significant reuse potential in digital humanities, historical linguistics, and handwritten text recognition.

Published in Journal of Open Humanities Data

ISSN: 2059-481X (Online)
Publisher: Ubiquity Press
Country of publisher: United Kingdom
LCC subjects: General Works: History of scholarship and learning. The humanities; Language and Literature
Website: https://openhumanitiesdata.metajnl.com/

About the journal

Abstract

Keywords