Playing with machines: Using machine learning to understand automated copyright enforcement at scale

Joanne E Gray; Nicolas P Suzor

doi:10.1177/2053951720919963

Big Data & Society (Apr 2020)

Playing with machines: Using machine learning to understand automated copyright enforcement at scale

Joanne E Gray,
Nicolas P Suzor

Affiliations

Joanne E Gray
Nicolas P Suzor

DOI: https://doi.org/10.1177/2053951720919963
Journal volume & issue: Vol. 7

Abstract

Read online

This article presents the results of methodological experimentation that utilises machine learning to investigate automated copyright enforcement on YouTube. Using a dataset of 76.7 million YouTube videos, we explore how digital and computational methods can be leveraged to better understand content moderation and copyright enforcement at a large scale.We used the BERT language model to train a machine learning classifier to identify videos in categories that reflect ongoing controversies in copyright takedowns. We use this to explore, in a granular way, how copyright is enforced on YouTube, using both statistical methods and qualitative analysis of our categorised dataset. We provide a large-scale systematic analysis of removals rates from Content ID’s automated detection system and the largely automated, text search based, Digital Millennium Copyright Act notice and takedown system. These are complex systems that are often difficult to analyse, and YouTube only makes available data at high levels of abstraction. Our analysis provides a comparison of different types of automation in content moderation, and we show how these different systems play out across different categories of content. We hope that this work provides a methodological base for continued experimentation with the use of digital and computational methods to enable large-scale analysis of the operation of automated systems.

Published in Big Data & Society

ISSN: 2053-9517 (Online)
Publisher: SAGE Publishing
Country of publisher: United States
LCC subjects: General Works
Website: https://journals.sagepub.com/home/bds

About the journal