Zeitschrift für digitale Geisteswissenschaften (Jul 2016)

Automatic Writer Identification in Historical Documents: A Case Study

  • Vincent Christlein,
  • Markus Diem,
  • Florian Kleber,
  • Günter Mühlberger,
  • Verena Schwägerl-Melchior,
  • Esther van Gelder,
  • Andreas Maier

DOI
https://doi.org/10.17175/2016_002
Journal volume & issue
no. 02

Abstract

Read online

In recent years, Automatic Writer Identification (AWI) has received a lot of attention in the document analysis community. However, most research has been conducted on contemporary benchmark sets. These datasets typically do not contain any noise or artefacts caused by the conversion methodology. This article analyses how current state-of-the-art methods in writer identification perform on historical documents. In contrast to contemporary documents, historical data often contain artefacts such as holes, rips, or water stains which make reliable identification error-prone. Experiments were conducted on two large letter collections with known authenticity and promising results of 82% and 89% TOP-1 accuracy were achieved.

Keywords