Health Data Science (Jan 2022)
Stratification of Patients with Diabetes Using Continuous Glucose Monitoring Profiles and Machine Learning
Abstract
Background. Continuous glucose monitoring (CGM) offers an opportunity for patients with diabetes to modify their lifestyle to better manage their condition and for clinicians to provide personalized healthcare and lifestyle advice. However, analytic tools are needed to standardize and analyze the rich data that emerge from CGM devices. This would allow glucotypes of patients to be identified to aid clinical decision-making. Methods. In this paper, we develop an analysis pipeline for CGM data and apply it to 148 diabetic patients with a total of 8632 days of follow up. The pipeline projects CGM data to a lower-dimensional space of features representing centrality, spread, size, and duration of glycemic excursions and the circadian cycle. We then use principal components analysis and k-means to cluster patients’ records into one of four glucotypes and analyze cluster membership using multinomial logistic regression. Results. Glucotypes differ in the degree of control, amount of time spent in range, and on the presence and timing of hyper- and hypoglycemia. Patients on the program had statistically significant improvements in their glucose levels. Conclusions. This pipeline provides a fast automatic function to label raw CGM data without manual input.