Classifying sex with volume-matched brain MRI

Matthis Ebel; Martin Domin; Nicola Neumann; Carsten Oliver Schmidt; Martin Lotze; Mario Stanke

Neuroimage: Reports (Sep 2023)

Classifying sex with volume-matched brain MRI

Matthis Ebel,
Martin Domin,
Nicola Neumann,
Carsten Oliver Schmidt,
Martin Lotze,
Mario Stanke

Affiliations

Matthis Ebel: University of Greifswald, Institute of Mathematics and Computer Science, Greifswald, 17489, Germany
Martin Domin: University Medicine Greifswald, Functional Imaging, Institute of Diagnostic Radiology and Neuroradiology, Greifswald, 17489, Germany
Nicola Neumann: University Medicine Greifswald, Functional Imaging, Institute of Diagnostic Radiology and Neuroradiology, Greifswald, 17489, Germany
Carsten Oliver Schmidt: University Medicine Greifswald, Institute for Community Medicine, Greifswald, 17475, Germany
Martin Lotze: University Medicine Greifswald, Functional Imaging, Institute of Diagnostic Radiology and Neuroradiology, Greifswald, 17489, Germany
Mario Stanke: University of Greifswald, Institute of Mathematics and Computer Science, Greifswald, 17489, Germany; Corresponding author.

Journal volume & issue: Vol. 3, no. 3
p. 100181

Abstract

Read online

Sex differences in the size of specific brain structures have been extensively studied, but careful and reproducible statistical hypothesis testing to identify them produced overall small effect sizes and differences in brains of males and females. On the other hand, multivariate statistical or machine learning methods that analyze MR images of the whole brain have reported respectable accuracies for the task of distinguishing brains of males from brains of females. However, most existing studies lacked a careful control for brain volume differences between sexes and, if done, their accuracy often declined to 70% or below. This raises questions about the relevance of accuracies achieved without careful control of overall volume.We examined how accurately sex can be classified from gray matter properties of the human brain when matching on overall brain volume. We tested, how robust machine learning classifiers are when predicting cross-cohort, i.e. when they are used on a different cohort than they were trained on. Furthermore, we studied how their accuracy depends on the size of the training set and attempted to identify brain regions relevant for successful classification. MRI data was used from two population-based data sets of 3298 mostly older adults from the Study of Health in Pomerania (SHIP) and 399 mostly younger adults from the Human Connectome Project (HCP), respectively. We benchmarked two multivariate methods, logistic regression and a 3D convolutional neural network.We show that male and female brains of the same intracranial volume can be distinguished with >92% accuracy with logistic regression on a dataset of 1166 matched individuals. The same model also reached 85% accuracy on a different cohort without retraining. The accuracy for both methods increased with the training cohort size up to and beyond 3000 individuals, suggesting that classifiers trained on smaller cohorts likely have an accuracy disadvantage. We found no single outstanding brain region necessary for successful classification, but important features appear rather distributed across the brain.

Published in Neuroimage: Reports

ISSN: 2666-9560 (Online)
Publisher: Elsevier
Country of publisher: United States
LCC subjects: Medicine: Internal medicine: Neurosciences. Biological psychiatry. Neuropsychiatry
Website: https://www.journals.elsevier.com/neuroimage-reports

About the journal

Abstract

Keywords