Frontiers in Psychology (Jun 2014)

Distinct cortical locations for integration of audiovisual speech and the McGurk effect

  • Laura C. Erickson,
  • Laura C. Erickson,
  • Brandon A. Zielinski,
  • Brandon A. Zielinski,
  • Jennifer E.V. Zielinski,
  • Guoying eLiu,
  • Guoying eLiu,
  • Peter E. Turkeltaub,
  • Peter E. Turkeltaub,
  • Amber M. Leaver,
  • Amber M. Leaver,
  • Josef P. Rauschecker,
  • Josef P. Rauschecker

DOI
https://doi.org/10.3389/fpsyg.2014.00534
Journal volume & issue
Vol. 5

Abstract

Read online

Audiovisual (AV) speech integration is often studied using the McGurk effect, where the combination of specific incongruent auditory and visual speech cues produces the perception of a third illusory speech percept. Recently, several studies have implicated the posterior superior temporal sulcus (pSTS) in the McGurk effect; however, the exact roles of the pSTS and other brain areas in correcting differing AV sensory inputs remain unclear. Using functional magnetic resonance imaging (fMRI) in ten participants, we aimed to isolate brain areas specifically involved in processing congruent AV speech and the McGurk effect. Speech stimuli were composed of sounds and/or videos of consonant-vowel tokens resulting in four stimulus classes: congruent AV speech (AVCong), incongruent AV speech resulting in the McGurk effect (AVMcGurk), acoustic-only speech (AO), and visual-only speech (VO). In group- and single-subject-analyses, left pSTS exhibited significantly greater fMRI signal for congruent AV speech (i.e., AVCong trials) than for both AO and VO trials. Right superior temporal gyrus, medial prefrontal cortex, and cerebellum were also identified. For McGurk speech (i.e., AVMcGurk trials), two clusters in the left posterior superior temporal gyrus (pSTG), just posterior to Heschl’s gyrus or on its border, exhibited greater fMRI signal than both AO and VO trials. We propose that while some brain areas, such as left pSTS, may be more critical for the integration of AV speech, other areas, such as left pSTG, may generate the corrected or merged percept arising from conflicting auditory and visual cues (i.e., as in the McGurk effect). These findings are consistent with the concept that posterior superior temporal areas represent part of a dorsal auditory stream, which is involved in multisensory integration, sensorimotor control, and optimal state estimation (Rauschecker and Scott, 2009).

Keywords