A Machine Learning Algorithm for Quantitatively Diagnosing Oxidative Stress Risks in Healthy Adult Individuals Based on Health Space Methodology: A Proof-of-Concept Study Using Korean Cross-Sectional Cohort Data
Youjin Kim,
Yunsoo Kim,
Jiyoung Hwang,
Tim J. van den Broek,
Bumjo Oh,
Ji Yeon Kim,
Suzan Wopereis,
Jildau Bouwman,
Oran Kwon
Affiliations
Youjin Kim
Department of Nutritional Science and Food Management, Ewha Womans University, 52 Ewhayeodae-gil, Seodeamun-gu, Seoul 03760, Korea
Yunsoo Kim
Department of Nutritional Science and Food Management, Ewha Womans University, 52 Ewhayeodae-gil, Seodeamun-gu, Seoul 03760, Korea
Jiyoung Hwang
Department of Nutritional Science and Food Management, Graduate Program in System Health Science and Engineering, Ewha Womans University, 52 Ewhayeodae-gil, Seodeamun-gu, Seoul 03760, Korea
Tim J. van den Broek
Netherlands Organization for Applied Scientific Research (TNO), Department of Microbiology and Systems Biology, Utrechtseweg 48, 3704 HE Zeist, The Netherlands
Bumjo Oh
Boramae Medical Center, Department of Family Medicine, Seoul Metropolitan Government-Seoul National University, 20 Boramae-ro 5-gil, Dongjak-gu, Seoul 07061, Korea
Ji Yeon Kim
Department of Food Science and Technology, Seoul National University of Science and Technology, 232 Gongneung-ro, Nowon-gu, Seoul 01811, Korea
Suzan Wopereis
Netherlands Organization for Applied Scientific Research (TNO), Department of Microbiology and Systems Biology, Utrechtseweg 48, 3704 HE Zeist, The Netherlands
Jildau Bouwman
Netherlands Organization for Applied Scientific Research (TNO), Department of Microbiology and Systems Biology, Utrechtseweg 48, 3704 HE Zeist, The Netherlands
Oran Kwon
Department of Nutritional Science and Food Management, Ewha Womans University, 52 Ewhayeodae-gil, Seodeamun-gu, Seoul 03760, Korea
Oxidative stress aggravates the progression of lifestyle-related chronic diseases. However, knowledge and practices that enable quantifying oxidative stress are still lacking. Here, we performed a proof-of-concept study to predict the oxidative stress status in a healthy population using retrospective cohort data from Boramae medical center in Korea (n = 1328). To obtain binary performance measures, we selected healthy controls versus oxidative disease cases based on the “health space” statistical methodology. We then developed a machine learning algorithm for discrimination of oxidative stress status using least absolute shrinkage and selection operator (LASSO)/elastic net regression with 10-fold cross-validation. A proposed fine-tune model included 16 features out of the full spectrum of diverse and complex data. The predictive performance was externally evaluated by generating receiver operating characteristic curves with area under the curve of 0.949 (CI 0.925 to 0.974), sensitivity of 0.923 (CI 0.879 to 0.967), and specificity of 0.855 (CI 0.795 to 0.915). Moreover, the discrimination power was confirmed by applying the proposed diagnostic model to the full dataset consisting of subjects with various degrees of oxidative stress. The results provide a feasible approach for stratifying the oxidative stress risks in the healthy population and selecting appropriate strategies for individual subjects toward implementing data-driven precision nutrition.