The Cross-Modal Suppressive Role of Visual Context on Speech Intelligibility: An ERP Study

Stanley Shen; Jess R. Kerlin; Heather Bortfeld; Antoine J. Shahin

doi:10.3390/brainsci10110810

Brain Sciences (Nov 2020)

The Cross-Modal Suppressive Role of Visual Context on Speech Intelligibility: An ERP Study

Stanley Shen,
Jess R. Kerlin,
Heather Bortfeld,
Antoine J. Shahin

Affiliations

Stanley Shen: Center for Mind and Brain, University of California Davis, Davis, CA 95618, USA
Jess R. Kerlin: Center for Mind and Brain, University of California Davis, Davis, CA 95618, USA
Heather Bortfeld: Department of Psychology, University of California Merced, Merced, CA 95343, USA
Antoine J. Shahin: Center for Mind and Brain, University of California Davis, Davis, CA 95618, USA

DOI: https://doi.org/10.3390/brainsci10110810
Journal volume & issue: Vol. 10, no. 11
p. 810

Abstract

Read online

The efficacy of audiovisual (AV) integration is reflected in the degree of cross-modal suppression of the auditory event-related potentials (ERPs, P1-N1-P2), while stronger semantic encoding is reflected in enhanced late ERP negativities (e.g., N450). We hypothesized that increasing visual stimulus reliability should lead to more robust AV-integration and enhanced semantic prediction, reflected in suppression of auditory ERPs and enhanced N450, respectively. EEG was acquired while individuals watched and listened to clear and blurred videos of a speaker uttering intact or highly-intelligible degraded (vocoded) words and made binary judgments about word meaning (animate or inanimate). We found that intact speech evoked larger negativity between 280–527-ms than vocoded speech, suggestive of more robust semantic prediction for the intact signal. For visual reliability, we found that greater cross-modal ERP suppression occurred for clear than blurred videos prior to sound onset and for the P2 ERP. Additionally, the later semantic-related negativity tended to be larger for clear than blurred videos. These results suggest that the cross-modal effect is largely confined to suppression of early auditory networks with weak effect on networks associated with semantic prediction. However, the semantic-related visual effect on the late negativity may have been tempered by the vocoded signal’s high-reliability.

Published in Brain Sciences

ISSN: 2076-3425 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Medicine: Internal medicine: Neurosciences. Biological psychiatry. Neuropsychiatry
Website: https://www.mdpi.com/journal/brainsci/

About the journal

Abstract

Keywords