npj Women's Health (Nov 2024)
General feature selection technique supporting sex-debiasing in chronic illness algorithms validated using wearable device data
Abstract
Abstract In tasks involving human health condition data, feature selection is heavily affected by data types, the complexity of the condition manifestation, and the variability in physiological presentation. One type of variability often overlooked or oversimplified is the effect of biological sex. As females have been chronically underrepresented in clinical research, we know less about how conditions manifest in females. Innovations in wearable technology have enabled individuals to generate high temporal resolution data for extended periods of time. With millions of days of data now available, additional feature selection pipelines should be developed to systematically identify sex-dependent variability in data, along with the effects of how many per-person data are included in analysis. Here we present a set of statistical approaches as a technique for identifying sex-dependent physiological and behavioral manifestations of complex diseases starting from longitudinal data, which are evaluated on diabetes, hypertension, and their comorbidity.