Model selection to achieve reproducible associations between resting state EEG features and autism

William E. Carson; Samantha Major; Harshitha Akkineni; Hannah Fung; Elias Peters; Kimberly L. H. Carpenter; Geraldine Dawson; David E. Carlson

doi:10.1038/s41598-024-76659-5

Scientific Reports (Oct 2024)

Model selection to achieve reproducible associations between resting state EEG features and autism

William E. Carson,
Samantha Major,
Harshitha Akkineni,
Hannah Fung,
Elias Peters,
Kimberly L. H. Carpenter,
Geraldine Dawson,
David E. Carlson

Affiliations

William E. Carson: Department of Biomedical Engineering, Duke University
Samantha Major: Duke Center for Autism and Brain Development, Duke University
Harshitha Akkineni: Duke Center for Autism and Brain Development, Duke University
Hannah Fung: Duke Center for Autism and Brain Development, Duke University
Elias Peters: Duke Center for Autism and Brain Development, Duke University
Kimberly L. H. Carpenter: Duke Center for Autism and Brain Development, Duke University
Geraldine Dawson: Duke Center for Autism and Brain Development, Duke University
David E. Carlson: Department of Civil and Environmental Engineering, Duke University

DOI: https://doi.org/10.1038/s41598-024-76659-5
Journal volume & issue: Vol. 14, no. 1
pp. 1 – 17

Abstract

Read online

Abstract A concern in the field of autism electroencephalography (EEG) biomarker discovery is their lack of reproducibility. In the present study, we considered the problem of learning reproducible associations between multiple features of resting state (RS) neural activity and autism, using EEG data collected during a RS paradigm from 36 to 96 month-old children diagnosed with autism (N = 224) and neurotypical children (N = 69). Specifically, EEG spectral power and functional connectivity features were used as inputs to a regularized generalized linear model trained to predict diagnostic group (autism versus neurotypical). To evaluate our model, we proposed a procedure that quantified both the predictive generalization and reproducibility of learned associations produced by the model. When prioritizing both model predictive performance and reproducibility of associations, a highly reproducible profile of associations emerged. This profile revealed a distinct pattern of increased gamma power and connectivity in occipital and posterior midline regions associated with an autism diagnosis. Conversely, model selection based on predictive performance alone resulted in non-robust associations. Finally, we built a custom machine learning model that further empirically improved robustness of learned associations. Our results highlight the need for model selection criteria that maximize the scientific utility provided by reproducibility instead of predictive performance.

Published in Scientific Reports

ISSN: 2045-2322 (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Medicine; Science
Website: https://www.nature.com/srep/

About the journal

Abstract

Keywords