pySuStaIn: A Python implementation of the Subtype and Stage Inference algorithm
Leon M. Aksman,
Peter A. Wijeratne,
Neil P. Oxtoby,
Arman Eshaghi,
Cameron Shand,
Andre Altmann,
Daniel C. Alexander,
Alexandra L. Young
Affiliations
Leon M. Aksman
Stevens Neuroimaging and Informatics Institute, Keck School of Medicine, University of Southern California, United States of America; Centre for Medical Image Computing, Departments of Computer Science and Medical Physics, University College London, United Kingdom; Corresponding author at: Stevens Neuroimaging and Informatics Institute, Keck School of Medicine, University of Southern California, United States of America.
Peter A. Wijeratne
Centre for Medical Image Computing, Departments of Computer Science and Medical Physics, University College London, United Kingdom
Neil P. Oxtoby
Centre for Medical Image Computing, Departments of Computer Science and Medical Physics, University College London, United Kingdom
Arman Eshaghi
Centre for Medical Image Computing, Departments of Computer Science and Medical Physics, University College London, United Kingdom; Queen Square Multiple Sclerosis Centre, Department of Neuroinflammation, UCL Queen Square Institute of Neurology, Faculty of Brain Sciences, University College London, United Kingdom
Cameron Shand
Centre for Medical Image Computing, Departments of Computer Science and Medical Physics, University College London, United Kingdom
Andre Altmann
Centre for Medical Image Computing, Departments of Computer Science and Medical Physics, University College London, United Kingdom
Daniel C. Alexander
Centre for Medical Image Computing, Departments of Computer Science and Medical Physics, University College London, United Kingdom
Alexandra L. Young
Department of Neuroimaging, Institute of Psychiatry, Psychology and Neuroscience, King’s College London, United Kingdom
Progressive disorders are highly heterogeneous. Symptom-based clinical classification of these disorders may not reflect the underlying pathobiology. Data-driven subtyping and staging of patients has the potential to disentangle the complex spatiotemporal patterns of disease progression. Tools that enable this are in high demand from clinical and treatment-development communities. Here we describe the pySuStaIn software package, a Python-based implementation of the Subtype and Stage Inference (SuStaIn) algorithm. SuStaIn unravels the complexity of heterogeneous diseases by inferring multiple disease progression patterns (subtypes) and individual severity (stages) from cross-sectional data. The primary aims of pySuStaIn are to enable widespread application and translation of SuStaIn via an accessible Python package that supports simple extension and generalization to novel modeling situations within a single, consistent architecture.