Journal of Imaging (Jun 2022)

Document Liveness Challenge Dataset (DLC-2021)

  • Dmitry V. Polevoy,
  • Irina V. Sigareva,
  • Daria M. Ershova,
  • Vladimir V. Arlazarov,
  • Dmitry P. Nikolaev,
  • Zuheng Ming,
  • Muhammad Muzzamil Luqman,
  • Jean-Christophe Burie

DOI
https://doi.org/10.3390/jimaging8070181
Journal volume & issue
Vol. 8, no. 7
p. 181

Abstract

Read online

Various government and commercial services, including, but not limited to, e-government, fintech, banking, and sharing economy services, widely use smartphones to simplify service access and user authorization. Many organizations involved in these areas use identity document analysis systems in order to improve user personal-data-input processes. The tasks of such systems are not only ID document data recognition and extraction but also fraud prevention by detecting document forgery or by checking whether the document is genuine. Modern systems of this kind are often expected to operate in unconstrained environments. A significant amount of research has been published on the topic of mobile ID document analysis, but the main difficulty for such research is the lack of public datasets due to the fact that the subject is protected by security requirements. In this paper, we present the DLC-2021 dataset, which consists of 1424 video clips captured in a wide range of real-world conditions, focused on tasks relating to ID document forensics. The novelty of the dataset is that it contains shots from video with color laminated mock ID documents, color unlaminated copies, grayscale unlaminated copies, and screen recaptures of the documents. The proposed dataset complies with the GDPR because it contains images of synthetic IDs with generated owner photos and artificial personal information. For the presented dataset, benchmark baselines are provided for tasks such as screen recapture detection and glare detection. The data presented are openly available in Zenodo.

Keywords