A characteristic cerebellar biosignature for bipolar disorder, identified with fully automatic machine learning

Georgios V. Thomaidis; Konstantinos Papadimitriou; Sotirios Michos; Evangelos Chartampilas; Ioannis Tsamardinos

IBRO Neuroscience Reports (Dec 2023)

Affiliations

Georgios V. Thomaidis (corresponding author): Greek National Health System, Psychiatric Department, Katerini General Hospital, Katerini, Greece
Konstantinos Papadimitriou: Greek National Health System, G. Papanikolaou General Hospital, Organizational Unit - Psychiatric Hospital of Thessaloniki, Thessaloniki, Greece
Sotirios Michos: Independent Researcher, Thessaloniki, Greece
Evangelos Chartampilas: Laboratory of Radiology, AHEPA General Hospital, University of Thessaloniki, Thessaloniki, Greece
Ioannis Tsamardinos: Department of Computer Science, University of Crete, Heraklion, Greece

Journal volume & issue: Vol. 15, pp. 77–89

Abstract

Background: Differences in transcriptomic profiles between patients with bipolar disorder and healthy controls can be identified using machine learning and can provide information about the potential role of the cerebellum in the pathogenesis of bipolar disorder. To this end, user-friendly, fully automated machine learning algorithms can achieve very high classification scores and identify disease-related predictive biosignatures in short time frames, even on small datasets.

Method: A fully automated machine learning platform, which selects the most suitable algorithm and the relevant set of hyper-parameter values, was applied to a preprocessed transcriptomics dataset in order to produce a model for biosignature selection and to classify subjects into patients and controls. The parent GEO datasets were originally produced from cerebellar and parietal lobe tissue of deceased bipolar patients and healthy controls, using the Affymetrix Human Gene 1.0 ST Array.

Results: Patients and controls were classified into two separate groups, with no close-to-the-boundary cases. The classification was based on a cerebellar transcriptomic biosignature of 25 features (genes), with an Area Under Curve of 0.929 and an Average Precision of 0.955. The biosignature includes genes previously linked to bipolar disorder, depression, psychosis or epilepsy, as well as genes not previously linked to any psychiatric disease. Kyoto Encyclopedia of Genes and Genomes (KEGG) analysis revealed that 4 of the identified features participate in 6 pathways that have also been associated with bipolar disorder.

Conclusion: Automated machine learning (AutoML) accurately identified 25 genes that can jointly, in a multivariate fashion, separate bipolar patients from healthy controls with high predictive power. The discovered features lead to new biological insights. Machine Learning (ML) analysis considers the features in combination (in contrast to standard differential expression analysis), removing both irrelevant and redundant markers and thus focusing the biological interpretation.
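
For readers unfamiliar with the two scores reported in the abstract, the following is a minimal sketch of how Area Under Curve and Average Precision can be computed from a classifier's ranked outputs. It is not the authors' platform or pipeline: the function names are hypothetical, and the labels and scores are made-up toy values rather than data from the study.

```python
def roc_auc(labels, scores):
    """Area under the ROC curve: the probability that a randomly chosen
    positive (patient) is scored above a randomly chosen negative (control).
    Tied scores count as half a win."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def average_precision(labels, scores):
    """Mean of the precision values measured at the rank of each positive.
    Assumes at least one positive and no tied scores."""
    ranked = sorted(zip(scores, labels), key=lambda t: -t[0])
    hits, total = 0, 0.0
    for rank, (_, y) in enumerate(ranked, start=1):
        if y == 1:
            hits += 1
            total += hits / rank  # precision at this rank
    return total / hits


# Toy example (illustrative only): 1 = patient, 0 = control.
labels = [1, 1, 0, 1, 0, 0]
scores = [0.9, 0.8, 0.7, 0.6, 0.3, 0.1]

print(round(roc_auc(labels, scores), 3))            # 0.889
print(round(average_precision(labels, scores), 3))  # 0.917
```

Both scores equal 1.0 when every patient is ranked above every control; in the toy data, one control (score 0.7) outranks one patient (score 0.6), which lowers both values.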

Published in IBRO Neuroscience Reports

ISSN: 2667-2421 (Online)
Publisher: Elsevier
Country of publisher: United Kingdom
LCC subjects: Medicine: Internal medicine: Neurosciences. Biological psychiatry. Neuropsychiatry
Website: https://www.journals.elsevier.com/ibro-neuroscience-reports
