Transparency Helps Reveal When Language Models Learn Meaning

Zhaofeng Wu; William Merrill; Hao Peng; Iz Beltagy; Noah A. Smith

doi:10.1162/tacl_a_00565

Transactions of the Association for Computational Linguistics (Jan 2023)

Transparency Helps Reveal When Language Models Learn Meaning

Zhaofeng Wu,
William Merrill,
Hao Peng,
Iz Beltagy,
Noah A. Smith

Affiliations

Zhaofeng Wu: MIT. [email protected]
William Merrill: New York University. [email protected]
Hao Peng: Allen Institute for Artificial Intelligence. [email protected]
Iz Beltagy: Allen Institute for Artificial Intelligence. [email protected]
Noah A. Smith: Allen Institute for Artificial Intelligence. [email protected]

DOI: https://doi.org/10.1162/tacl_a_00565
Journal volume & issue: Vol. 11
pp. 617 – 634

Abstract

Read online

AbstractMany current NLP systems are built from language models trained to optimize unsupervised objectives on large amounts of raw text. Under what conditions might such a procedure acquire meaning? Our systematic experiments with synthetic data reveal that, with languages where all expressions have context-independent denotations (i.e., languages with strong transparency), both autoregressive and masked language models successfully learn to emulate semantic relations between expressions. However, when denotations are changed to be context-dependent with the language otherwise unmodified, this ability degrades. Turning to natural language, our experiments with a specific phenomenon—referential opacity—add to the growing body of evidence that current language models do not represent natural language semantics well. We show this failure relates to the context-dependent nature of natural language form-meaning mappings.

Published in Transactions of the Association for Computational Linguistics

ISSN: 2307-387X (Online)
Publisher: The MIT Press
Country of publisher: United States
LCC subjects: Language and Literature: Philology. Linguistics: Computational linguistics. Natural language processing
Website: https://direct.mit.edu/tacl

About the journal