A Pseudo-Value Approach to Analyze the Semantic Similarity of the Speech of Children With and Without Autism Spectrum Disorder

Joel R. Adams; Alexandra C. Salem; Heather MacFarlane; Rosemary Ingham; Steven D. Bedrick; Eric Fombonne; Jill K. Dolata; Alison Presmanes Hill; Jan van Santen

doi:10.3389/fpsyg.2021.668344

Frontiers in Psychology (Jul 2021)

Affiliations

Joel R. Adams: Center for Spoken Language Understanding, Oregon Health & Science University, Portland, OR, United States
Alexandra C. Salem: Center for Spoken Language Understanding, Oregon Health & Science University, Portland, OR, United States
Alexandra C. Salem: Department of Psychiatry, Oregon Health & Science University, Portland, OR, United States
Heather MacFarlane: Center for Spoken Language Understanding, Oregon Health & Science University, Portland, OR, United States
Heather MacFarlane: Department of Psychiatry, Oregon Health & Science University, Portland, OR, United States
Rosemary Ingham: Center for Spoken Language Understanding, Oregon Health & Science University, Portland, OR, United States
Steven D. Bedrick: Center for Spoken Language Understanding, Oregon Health & Science University, Portland, OR, United States
Eric Fombonne: Department of Psychiatry, Oregon Health & Science University, Portland, OR, United States
Eric Fombonne: Institute on Development and Disability, Oregon Health & Science University, Portland, OR, United States
Jill K. Dolata: Center for Spoken Language Understanding, Oregon Health & Science University, Portland, OR, United States
Jill K. Dolata: Institute on Development and Disability, Oregon Health & Science University, Portland, OR, United States
Alison Presmanes Hill: Center for Spoken Language Understanding, Oregon Health & Science University, Portland, OR, United States
Jan van Santen: BioSpeech Inc., Portland, OR, United States

DOI: https://doi.org/10.3389/fpsyg.2021.668344
Journal volume & issue: Vol. 12

Abstract

Conversational impairments are well known among people with autism spectrum disorder (ASD), but their measurement requires time-consuming manual annotation of language samples. Natural language processing (NLP) has shown promise in identifying semantic difficulties when compared to clinician-annotated reference transcripts. Our goal was to develop a novel measure of lexico-semantic similarity, based on recent work in NLP and recent applications of pseudo-value analysis, that could be applied to transcripts of children’s conversational language without recourse to a ground-truth reference document. We hypothesized that (a) semantic coherence, as measured by this method, would discriminate between children with and without ASD and (b) more variability would be found in the group with ASD. We used data from 70 males aged 4 to 8 years, either with ASD (N = 38) or typically developing (TD; N = 32), enrolled in a language study. Participants were administered a battery of standardized diagnostic tests, including the Autism Diagnostic Observation Schedule (ADOS). ADOS sessions were recorded and transcribed, and we analyzed children’s language output during the conversation/interview ADOS tasks. Transcripts were converted to vectors via a word2vec model trained on the Google News Corpus. Pairwise similarity across all subjects and a sample grand mean were calculated. Using a leave-one-out algorithm, a pseudo-value (detailed in the full article) representing each subject’s contribution to the grand mean was generated. Means of pseudo-values were compared between the two groups. Analyses were co-varied for nonverbal IQ, mean length of utterance, and number of distinct word roots (NDR). Statistically significant differences were observed in means of pseudo-values between the TD and ASD groups (p = 0.007). TD subjects had higher pseudo-value scores, suggesting that similarity scores of TD subjects were closer to the overall group mean. Variance of pseudo-values was greater in the ASD group. Nonverbal IQ, mean length of utterance, and NDR did not account for between-group differences. The findings suggest that our pseudo-value-based method can be effectively used to identify specific semantic difficulties that characterize children with ASD without requiring a reference transcript.
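
The leave-one-out step described in the abstract can be illustrated with a minimal sketch. The abstract does not state the exact formula, so this assumes the standard jackknife pseudo-value, n * theta - (n - 1) * theta_without_i, where theta is the grand mean of pairwise cosine similarities. The function names and the toy vectors are hypothetical; in the study each vector would come from a word2vec representation of a transcript, and how word vectors were combined into a transcript vector is not specified here.

```python
import math
from itertools import combinations


def cosine(u, v):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def grand_mean(vectors, skip=None):
    """Mean cosine similarity over all subject pairs, optionally leaving one subject out."""
    kept = [i for i in range(len(vectors)) if i != skip]
    sims = [cosine(vectors[i], vectors[j]) for i, j in combinations(kept, 2)]
    return sum(sims) / len(sims)


def pseudo_values(vectors):
    """Jackknife pseudo-value per subject (assumed form; needs at least 3 subjects)."""
    n = len(vectors)
    theta = grand_mean(vectors)
    return [n * theta - (n - 1) * grand_mean(vectors, skip=i) for i in range(n)]


# Hypothetical toy data: three subjects with identical vectors and one orthogonal outlier.
toy_vectors = [[1.0, 0.0], [1.0, 0.0], [1.0, 0.0], [0.0, 1.0]]
scores = pseudo_values(toy_vectors)
```

In this toy case the outlying subject receives the lowest pseudo-value, because removing it raises the mean pairwise similarity of the remaining group. The mean of the pseudo-values equals the grand mean itself, which is what allows group means of pseudo-values to be compared with ordinary statistical tests.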

Published in Frontiers in Psychology

ISSN: 1664-1078 (Online)
Publisher: Frontiers Media S.A.
Country of publisher: Switzerland
LCC subjects: Philosophy. Psychology. Religion: Psychology
Website: https://www.frontiersin.org/journals/psychology
