Neuroimaging feature extraction using a neural network classifier for imaging genetics

Cédric Beaulac; Sidi Wu; Erin Gibson; Michelle F. Miranda; Jiguo Cao; Leno Rocha; Mirza Faisal Beg; Farouk S. Nathoo

doi:10.1186/s12859-023-05394-x

BMC Bioinformatics (Jun 2023)

Neuroimaging feature extraction using a neural network classifier for imaging genetics

Cédric Beaulac,
Sidi Wu,
Erin Gibson,
Michelle F. Miranda,
Jiguo Cao,
Leno Rocha,
Mirza Faisal Beg,
Farouk S. Nathoo

Affiliations

Cédric Beaulac: School of Engineering Science, Simon Fraser University
Sidi Wu: Department of Statistics and Actuarial Sciences, Simon Fraser University
Erin Gibson: School of Engineering Science, Simon Fraser University
Michelle F. Miranda: Department of Mathematics and Statistics, University of Victoria
Jiguo Cao: Department of Statistics and Actuarial Sciences, Simon Fraser University
Leno Rocha: Department of Mathematics and Statistics, University of Victoria
Mirza Faisal Beg: School of Engineering Science, Simon Fraser University
Farouk S. Nathoo: Department of Mathematics and Statistics, University of Victoria

DOI: https://doi.org/10.1186/s12859-023-05394-x
Journal volume & issue: Vol. 24, no. 1
pp. 1 – 18

Abstract

Read online

Abstract Background Dealing with the high dimension of both neuroimaging data and genetic data is a difficult problem in the association of genetic data to neuroimaging. In this article, we tackle the latter problem with an eye toward developing solutions that are relevant for disease prediction. Supported by a vast literature on the predictive power of neural networks, our proposed solution uses neural networks to extract from neuroimaging data features that are relevant for predicting Alzheimer’s Disease (AD) for subsequent relation to genetics. The neuroimaging-genetic pipeline we propose is comprised of image processing, neuroimaging feature extraction and genetic association steps. We present a neural network classifier for extracting neuroimaging features that are related with the disease. The proposed method is data-driven and requires no expert advice or a priori selection of regions of interest. We further propose a multivariate regression with priors specified in the Bayesian framework that allows for group sparsity at multiple levels including SNPs and genes. Results We find the features extracted with our proposed method are better predictors of AD than features used previously in the literature suggesting that single nucleotide polymorphisms (SNPs) related to the features extracted by our proposed method are also more relevant for AD. Our neuroimaging-genetic pipeline lead to the identification of some overlapping and more importantly some different SNPs when compared to those identified with previously used features. Conclusions The pipeline we propose combines machine learning and statistical methods to benefit from the strong predictive performance of blackbox models to extract relevant features while preserving the interpretation provided by Bayesian models for genetic association. Finally, we argue in favour of using automatic feature extraction, such as the method we propose, in addition to ROI or voxelwise analysis to find potentially novel disease-relevant SNPs that may not be detected when using ROIs or voxels alone.

Published in BMC Bioinformatics

ISSN: 1471-2105 (Online)
Publisher: BMC
Country of publisher: United Kingdom
LCC subjects: Medicine: Medicine (General): Computer applications to medicine. Medical informatics; Science: Biology (General)
Website: http://www.biomedcentral.com/bmcbioinformatics/

About the journal

Abstract

Keywords