Gromov–Wasserstein unsupervised alignment reveals structural correspondences between the color similarity structures of humans and large language models

Genji Kawakita; Ariel Zeleznikow-Johnston; Naotsugu Tsuchiya; Masafumi Oizumi

doi:10.1038/s41598-024-65604-1

Scientific Reports (Jul 2024)

Gromov–Wasserstein unsupervised alignment reveals structural correspondences between the color similarity structures of humans and large language models

Genji Kawakita,
Ariel Zeleznikow-Johnston,
Naotsugu Tsuchiya,
Masafumi Oizumi

Affiliations

Genji Kawakita: Department of Bioengineering, Imperial College London
Ariel Zeleznikow-Johnston: School of Psychological Sciences, Monash University
Naotsugu Tsuchiya: School of Psychological Sciences, Monash University
Masafumi Oizumi: Graduate School of Arts and Science, The University of Tokyo

DOI: https://doi.org/10.1038/s41598-024-65604-1
Journal volume & issue: Vol. 14, no. 1
pp. 1 – 10

Abstract

Read online

Abstract Large Language Models (LLMs), such as the General Pre-trained Transformer (GPT), have shown remarkable performance in various cognitive tasks. However, it remains unclear whether these models have the ability to accurately infer human perceptual representations. Previous research has addressed this question by quantifying correlations between similarity response patterns of humans and LLMs. Correlation provides a measure of similarity, but it relies pre-defined item labels and does not distinguish category- and item- level similarity, falling short of characterizing detailed structural correspondence between humans and LLMs. To assess their structural equivalence in more detail, we propose the use of an unsupervised alignment method based on Gromov–Wasserstein optimal transport (GWOT). GWOT allows for the comparison of similarity structures without relying on pre-defined label correspondences and can reveal fine-grained structural similarities and differences that may not be detected by simple correlation analysis. Using a large dataset of similarity judgments of 93 colors, we compared the color similarity structures of humans (color-neurotypical and color-atypical participants) and two GPT models (GPT-3.5 and GPT-4). Our results show that the similarity structure of color-neurotypical participants can be remarkably well aligned with that of GPT-4 and, to a lesser extent, to that of GPT-3.5. These results contribute to the methodological advancements of comparing LLMs with human perception, and highlight the potential of unsupervised alignment methods to reveal detailed structural correspondences.

Published in Scientific Reports

ISSN: 2045-2322 (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Medicine; Science
Website: https://www.nature.com/srep/

About the journal

Abstract

Keywords