Identification and prediction of Parkinson’s disease subtypes and progression using machine learning in two cohorts

Anant Dadu; Vipul Satone; Rachneet Kaur; Sayed Hadi Hashemi; Hampton Leonard; Hirotaka Iwaki; Mary B. Makarious; Kimberley J. Billingsley; Sara Bandres‐Ciga; Lana J. Sargent; Alastair J. Noyce; Ali Daneshmand; Cornelis Blauwendraat; Ken Marek; Sonja W. Scholz; Andrew B. Singleton; Mike A. Nalls; Roy H. Campbell; Faraz Faghri

doi:10.1038/s41531-022-00439-z

npj Parkinson's Disease (Dec 2022)

Identification and prediction of Parkinson’s disease subtypes and progression using machine learning in two cohorts

Anant Dadu,
Vipul Satone,
Rachneet Kaur,
Sayed Hadi Hashemi,
Hampton Leonard,
Hirotaka Iwaki,
Mary B. Makarious,
Kimberley J. Billingsley,
Sara Bandres‐Ciga,
Lana J. Sargent,
Alastair J. Noyce,
Ali Daneshmand,
Cornelis Blauwendraat,
Ken Marek,
Sonja W. Scholz,
Andrew B. Singleton,
Mike A. Nalls,
Roy H. Campbell,
Faraz Faghri

Affiliations

Anant Dadu: Department of Computer Science, University of Illinois at Urbana-Champaign
Vipul Satone: Department of Industrial and Enterprise Systems Engineering, University of Illinois at Urbana-Champaign
Rachneet Kaur: Department of Industrial and Enterprise Systems Engineering, University of Illinois at Urbana-Champaign
Sayed Hadi Hashemi: Department of Computer Science, University of Illinois at Urbana-Champaign
Hampton Leonard: Center for Alzheimer’s and Related Dementias (CARD), National Institute on Aging and National Institute of Neurological Disorders and Stroke, National Institutes of Health
Hirotaka Iwaki: Center for Alzheimer’s and Related Dementias (CARD), National Institute on Aging and National Institute of Neurological Disorders and Stroke, National Institutes of Health
Mary B. Makarious: Laboratory of Neurogenetics, National Institute on Aging, National Institutes of Health
Kimberley J. Billingsley: Laboratory of Neurogenetics, National Institute on Aging, National Institutes of Health
Sara Bandres‐Ciga: Center for Alzheimer’s and Related Dementias (CARD), National Institute on Aging and National Institute of Neurological Disorders and Stroke, National Institutes of Health
Lana J. Sargent: Center for Alzheimer’s and Related Dementias (CARD), National Institute on Aging and National Institute of Neurological Disorders and Stroke, National Institutes of Health
Alastair J. Noyce: UCL Movement Disorders Centre, University College London
Ali Daneshmand: Department of Neurology, Boston Medical Center, Boston University School of Medicine
Cornelis Blauwendraat: Center for Alzheimer’s and Related Dementias (CARD), National Institute on Aging and National Institute of Neurological Disorders and Stroke, National Institutes of Health
Ken Marek: InviCRO LLC
Sonja W. Scholz: Neurodegenerative Diseases Research Unit, National Institute of Neurological Disorders and Stroke, National Institutes of Health
Andrew B. Singleton: Center for Alzheimer’s and Related Dementias (CARD), National Institute on Aging and National Institute of Neurological Disorders and Stroke, National Institutes of Health
Mike A. Nalls: Center for Alzheimer’s and Related Dementias (CARD), National Institute on Aging and National Institute of Neurological Disorders and Stroke, National Institutes of Health
Roy H. Campbell: Department of Computer Science, University of Illinois at Urbana-Champaign
Faraz Faghri: Center for Alzheimer’s and Related Dementias (CARD), National Institute on Aging and National Institute of Neurological Disorders and Stroke, National Institutes of Health

DOI: https://doi.org/10.1038/s41531-022-00439-z
Journal volume & issue: Vol. 8, no. 1
pp. 1 – 12

Abstract

Read online

Abstract The clinical manifestations of Parkinson’s disease (PD) are characterized by heterogeneity in age at onset, disease duration, rate of progression, and the constellation of motor versus non-motor features. There is an unmet need for the characterization of distinct disease subtypes as well as improved, individualized predictions of the disease course. We used unsupervised and supervised machine learning methods on comprehensive, longitudinal clinical data from the Parkinson’s Disease Progression Marker Initiative (n = 294 cases) to identify patient subtypes and to predict disease progression. The resulting models were validated in an independent, clinically well-characterized cohort from the Parkinson’s Disease Biomarker Program (n = 263 cases). Our analysis distinguished three distinct disease subtypes with highly predictable progression rates, corresponding to slow, moderate, and fast disease progression. We achieved highly accurate projections of disease progression 5 years after initial diagnosis with an average area under the curve (AUC) of 0.92 (95% CI: 0.95 ± 0.01) for the slower progressing group (PDvec1), 0.87 ± 0.03 for moderate progressors, and 0.95 ± 0.02 for the fast-progressing group (PDvec3). We identified serum neurofilament light as a significant indicator of fast disease progression among other key biomarkers of interest. We replicated these findings in an independent cohort, released the analytical code, and developed models in an open science manner. Our data-driven study provides insights to deconstruct PD heterogeneity. This approach could have immediate implications for clinical trials by improving the detection of significant clinical outcomes. We anticipate that machine learning models will improve patient counseling, clinical trial design, and ultimately individualized patient care.

Published in npj Parkinson's Disease

ISSN: 2373-8057 (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Medicine: Internal medicine: Neurosciences. Biological psychiatry. Neuropsychiatry: Neurology. Diseases of the nervous system
Website: https://www.nature.com/npjparkd/

About the journal