Future Internet (Jun 2023)

Synonyms, Antonyms and Factual Knowledge in BERT Heads

  • Lorenzo Serina
  • Luca Putelli
  • Alfonso Emilio Gerevini
  • Ivan Serina

DOI: https://doi.org/10.3390/fi15070230
Journal volume & issue: Vol. 15, no. 7, p. 230

Abstract

In recent years, many studies have been devoted to discovering the inner workings of Transformer-based models such as BERT, attempting, for instance, to identify what information is contained within them. However, little is known about how these models store this information in their millions of parameters and about which parts of the architecture are the most important. In this work, we propose an approach to identifying the self-attention mechanisms, called heads, that contain semantic and real-world factual knowledge in BERT. Our approach relies on a metric computed from attention weights and exploits a standard clustering algorithm to extract the most relevant connections between tokens in a head. In our experimental analysis, we focus on how heads can connect synonyms, antonyms and several types of factual knowledge regarding subjects such as geography and medicine.
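The paper's exact metric and clustering procedure are given in the full text; as a rough illustration of the general recipe only, the Python sketch below (assuming Hugging Face transformers and scikit-learn) extracts the attention matrix of one BERT head and uses standard k-means to separate its strongest token-to-token connections from background attention. The "focus" metric and the layer/head indices are illustrative assumptions, not the authors' definitions.

    # Illustrative sketch, not the authors' method: inspect one BERT head's
    # attention weights and cluster its token-token connections.
    import torch
    from transformers import BertTokenizer, BertModel
    from sklearn.cluster import KMeans

    tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
    model = BertModel.from_pretrained("bert-base-uncased", output_attentions=True)
    model.eval()

    sentence = "Hot and cold are antonyms, while big and large are synonyms."
    inputs = tokenizer(sentence, return_tensors="pt")
    with torch.no_grad():
        outputs = model(**inputs)

    # outputs.attentions: tuple of 12 layers, each of shape (batch, heads, seq, seq)
    layer, head = 7, 9  # hypothetical indices, chosen only for illustration
    att = outputs.attentions[layer][0, head]  # (seq_len, seq_len)
    tokens = tokenizer.convert_ids_to_tokens(inputs["input_ids"][0])

    # One possible head-level metric (an assumption): the mean of each token's
    # strongest outgoing attention weight; high values suggest sharp connections.
    focus = att.max(dim=-1).values.mean().item()
    print(f"layer {layer}, head {head}: focus metric = {focus:.3f}")

    # Cluster the (query, key) weights with k-means (k=2) to split the
    # strongest connections in this head from background attention.
    weights = att.flatten().unsqueeze(-1).numpy()  # (seq_len^2, 1)
    km = KMeans(n_clusters=2, n_init=10, random_state=0).fit(weights)
    strong = km.cluster_centers_.argmax()  # cluster with the higher mean weight

    seq_len = att.shape[0]
    for idx in (km.labels_ == strong).nonzero()[0]:
        i, j = divmod(int(idx), seq_len)
        print(f"{tokens[i]:>12} -> {tokens[j]:<12} weight={att[i, j].item():.3f}")

In a sketch like this, pairs such as "hot -> cold" landing in the high-weight cluster would be the kind of evidence that a head encodes an antonym relation; the paper applies this style of analysis systematically across heads and relation types.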

Keywords