Large language models can outperform humans in social situational judgments

Justin M. Mittelstädt; Julia Maier; Panja Goerke; Frank Zinn; Michael Hermes

doi:10.1038/s41598-024-79048-0

Scientific Reports (Nov 2024)

Large language models can outperform humans in social situational judgments

Justin M. Mittelstädt,
Julia Maier,
Panja Goerke,
Frank Zinn,
Michael Hermes

Affiliations

Justin M. Mittelstädt: Department of Aviation and Space Psychology, German Aerospace Center, Institute of Aerospace Medicine
Julia Maier: Department of Aviation and Space Psychology, German Aerospace Center, Institute of Aerospace Medicine
Panja Goerke: Department of Aviation and Space Psychology, German Aerospace Center, Institute of Aerospace Medicine
Frank Zinn: Department of Aviation and Space Psychology, German Aerospace Center, Institute of Aerospace Medicine
Michael Hermes: Department of Aviation and Space Psychology, German Aerospace Center, Institute of Aerospace Medicine

DOI: https://doi.org/10.1038/s41598-024-79048-0
Journal volume & issue: Vol. 14, no. 1
pp. 1 – 10

Abstract

Read online

Abstract Large language models (LLM) have been a catalyst for the public interest in artificial intelligence (AI). These technologies perform some knowledge-based tasks better and faster than human beings. However, whether AIs can correctly assess social situations and devise socially appropriate behavior, is still unclear. We conducted an established Situational Judgment Test (SJT) with five different chatbots and compared their results with responses of human participants (N = 276). Claude, Copilot and you.com’s smart assistant performed significantly better than humans in proposing suitable behaviors in social situations. Moreover, their effectiveness rating of different behavior options aligned well with expert ratings. These results indicate that LLMs are capable of producing adept social judgments. While this constitutes an important requirement for the use as virtual social assistants, challenges and risks are still associated with their wide-spread use in social contexts.

Published in Scientific Reports

ISSN: 2045-2322 (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Medicine; Science
Website: https://www.nature.com/srep/

About the journal