On the limits of LLM surprisal as a functional explanation of the N400 and P600

Krieger, Benedict; Brouwer, Harm; Aurnhammer, Christoph; Crocker, Matthew W.

Bitte benutzen Sie diese Referenz, um auf diese Ressource zu verweisen: doi:10.22028/D291-46367

Titel:	On the limits of LLM surprisal as a functional explanation of the N400 and P600
VerfasserIn:	Krieger, Benedict Brouwer, Harm Aurnhammer, Christoph Crocker, Matthew W.
Sprache:	Englisch
Titel:	Brain Research
Bandnummer:	1865
Verlag/Plattform:	Elsevier
Erscheinungsjahr:	2025
Freie Schlagwörter:	Large language models N400 P600 Event-related potentials Human language comprehension Psycholinguistics
DDC-Sachgruppe:	400 Sprache, Linguistik
Dokumenttyp:	Journalartikel / Zeitschriftenartikel
Abstract:	Expectations about upcoming words play a central role in language comprehension, with expected words being processed more easily than less expected ones. Surprisal theory formalizes this relationship by positing that cognitive effort is proportional to a word’s negative log-probability in context, as determined by distributional, linguistic, and world knowledge constraints. The emergence of large language models (LLMs) demonstrating the capacity to compute richly contextualized surprisal estimates, has motivated their consideration as models of comprehension. We assess here the relationship of LLM surprisal with two key neural correlates of comprehension – the N400 and the P600 – which differ in sensitivity to semantic association and contextual expectancy. While prior work has focused on the N400, we propose that the P600 may offer a better index of surprisal, as it is unaffected by association while still patterning continuously with expectancy. Using regression-based ERPs (rERPs), we examine data from three German factorial studies to evaluate the extent to which LLM surprisal can account for ERP differences. Our results show that LLM surprisal captures neither component consistently. We find that it is contaminated by simple association, particularly in smaller LLMs. As a result, LLM surprisal can partially account for association-driven N400 effects, but not for the full attenuation of N400 effects. Correspondingly, this property of LLMs compromises their ability to model the P600, which is sensitive to expectancy but not to association.
DOI der Erstveröffentlichung:	10.1016/j.brainres.2025.149841
URL der Erstveröffentlichung:	https://doi.org/10.1016/j.brainres.2025.149841
Link zu diesem Datensatz:	urn:nbn:de:bsz:291--ds-463673 hdl:20.500.11880/40628 http://dx.doi.org/10.22028/D291-46367
ISSN:	0006-8993
Datum des Eintrags:	30-Sep-2025
Fakultät:	P - Philosophische Fakultät
Fachrichtung:	P - Sprachwissenschaft und Sprachtechnologie
Professur:	P - Prof. Dr. Matthew W. Crocker
Sammlung:	SciDok - Der Wissenschaftsserver der Universität des Saarlandes

Dateien zu diesem Datensatz:

Datei	Beschreibung	Größe	Format
1-s2.0-S0006899325004020-main.pdf		7,36 MB	Adobe PDF	Öffnen/Anzeigen

Export: BibTex Statistik anzeigen

Diese Ressource wurde unter folgender Copyright-Bestimmung veröffentlicht: Lizenz von Creative Commons