PLoS ONE (Jan 2017)

Using stochastic language models (SLM) to map lexical, syntactic, and phonological information processing in the brain.

  • Alessandro Lopopolo,
  • Stefan L Frank,
  • Antal van den Bosch,
  • Roel M Willems

DOI
https://doi.org/10.1371/journal.pone.0177794
Journal volume & issue
Vol. 12, no. 5
p. e0177794

Abstract

Read online

Language comprehension involves the simultaneous processing of information at the phonological, syntactic, and lexical level. We track these three distinct streams of information in the brain by using stochastic measures derived from computational language models to detect neural correlates of phoneme, part-of-speech, and word processing in an fMRI experiment. Probabilistic language models have proven to be useful tools for studying how language is processed as a sequence of symbols unfolding in time. Conditional probabilities between sequences of words are at the basis of probabilistic measures such as surprisal and perplexity which have been successfully used as predictors of several behavioural and neural correlates of sentence processing. Here we computed perplexity from sequences of words and their parts of speech, and their phonemic transcriptions. Brain activity time-locked to each word is regressed on the three model-derived measures. We observe that the brain keeps track of the statistical structure of lexical, syntactic and phonological information in distinct areas.