NetLay: Layout Classification Dataset for Enhancing Layout Analysis

Gogawale, Sharva; Bambaci, Luigi; Kurar-Barakat, Berat; Vasyutinsky Shapira, Daria; Stökl Ben Ezra, Daniel; Dershowitz, Nachum

doi:10.30687/mag/2724-3923/2024/02/003

magazén (Dec 2024)

Affiliations

Gogawale, Sharva: Tel Aviv University, Israel
Bambaci, Luigi: École Pratique des Hautes Études (EPHE), France
Kurar-Barakat, Berat: Tel Aviv University, Israel
Vasyutinsky Shapira, Daria: Tel Aviv University, Israel
Stökl Ben Ezra, Daniel: École Pratique des Hautes Études (EPHE), France
Dershowitz, Nachum: Tel Aviv University, Israel

DOI: https://doi.org/10.30687/mag/2724-3923/2024/02/003
Journal volume & issue: Vol. 5, no. 2

Abstract

Within the domain of historical document image analysis, the process of identifying the spatial structure of a document image is an essential step in many document processing tasks, such as optical character recognition and information extraction. Advancements in layout analysis promise to enhance efficiency and accuracy using specialized models tailored to distinct layouts. We introduce NetLay, a new dataset for benchmarking layout classification algorithms for historical works. It consists of over 1,300 images of pages of printed Hebrew (or Hebrew‑character) books in a variety of styles, categorized into four different classes based on their layout (the number of text columns and regions). Ground truth was crafted manually at the page level. Furthermore, we conduct an in‑depth performance evaluation of various layout classification algorithms, which are based on deep‑learning models that learn to extract spatial features from images. We evaluate our algorithms on NetLay and achieve state‑of‑the‑art results on the task of layout classification for historical books.

Keywords: Convolutional neural networks; Deep learning; Historical document analysis; Layout analysis; Layout classification; Multi‑label classification

Published in magazén

ISSN: 2724-3923 (Online)
Publisher: Fondazione Università Ca’ Foscari
Country of publisher: Italy
LCC subjects: General Works: History of scholarship and learning. The humanities
Website: https://edizionicafoscari.unive.it/en/edizioni4/riviste/magazen/
