Journal of Open Humanities Data (Apr 2024)

Language of Mechanisation Crowdsourcing Datasets from the Living with Machines Project

  • Mia Ridge,
  • Nilo Pedrazzini,
  • Miguel Vieira,
  • Arianna Ciula,
  • Barbara McGillivray

DOI
https://doi.org/10.5334/johd.195
Journal volume & issue
Vol. 10
pp. 33 – 33

Abstract

Read online

We present the ‘Language of Mechanisation’ datasets with examples of re-use in visualisations and analysis. These reusable CSV files, published on the British Library’s Research Repository, contain automatically-transcribed text from 19th century British newspaper articles. Volunteers on the Zooniverse crowdsourcing platform took part in tasks that asked ‘How did the word x change over time and place?’ They annotated articles with pre-selected meanings (senses) for the words coach, car, trolley and bike. The datasets can support scholarship on a range of historical and linguistic research areas, including research on crowdsourcing and online volunteering behaviours, data processing and data visualisations methodologies.

Keywords