IEEE Access (Jan 2021)

ELECTRA for Neural Coreference Resolution in Italian

  • Raffaele Guarasci,
  • Aniello Minutolo,
  • Emanuele Damiano,
  • Giuseppe De Pietro,
  • Hamido Fujita,
  • Massimo Esposito

DOI
https://doi.org/10.1109/ACCESS.2021.3105278
Journal volume & issue
Vol. 9
pp. 115643 – 115654

Abstract

Read online

In recent years, the impact of Neural Language Models has changed every field of Natural Language Processing. In this scenario, coreference resolution has been among the least considered task, especially in language other than English. This work proposes a coreference resolution system for Italian, based on a neural end-to-end architecture integrating ELECTRA language model and trained on OntoCorefIT, a novel Italian dataset built starting from OntoNotes. Even if some approaches for Italian have been proposed in the last decade, to the best of our knowledge, this is the first neural coreference resolver aimed specifically to Italian. The performance of the system is evaluated with respect to three different metrics and also assessed by replacing ELECTRA with the widely-used BERT language model, since its usage has proven to be effective in the coreference resolution task in English. A qualitative analysis has also been conducted, showing how different grammatical categories affect performance in an inflectional and morphological-rich language like Italian. The overall results have shown the effectiveness of the proposed solution, providing a baseline for future developments of this line of research in Italian.

Keywords