Clinical, Cosmetic and Investigational Dermatology (Mar 2022)

A Genome-Wide Association Study and Machine-Learning Algorithm Analysis on the Prediction of Facial Phenotypes by Genotypes in Korean Women

  • Yoo HY,
  • Lee KC,
  • Woo JE,
  • Park SH,
  • Lee S,
  • Joo J,
  • Bae JS,
  • Kwon HJ,
  • Park BJ

Journal volume & issue
Vol. Volume 15
pp. 433 – 445

Abstract

Read online

Hye-Young Yoo,1,* Ki-Chan Lee,2,* Ji-Eun Woo,1 Sung-Ha Park,1 Sunghoon Lee,2 Joungsu Joo,2 Jin-Sik Bae,2 Hyuk-Jung Kwon,2 Byoung-Jun Park1 1Skin & Natural Products Lab, Kolmar Korea Co., Ltd., Seoul, 06800, Republic of Korea; 2R&D Department, Eone Diagnomics Genome Center Co., Ltd, Songdo Incheon, 22014, Republic of Korea*These authors contributed equally to this workCorrespondence: Byoung-Jun Park, Skin & Natural Products Lab, Kolmar Korea Co., Ltd., Seoul, 06800, Republic of Korea, Tel +82 1044025395, Fax +82 229759414, Email [email protected], Hyuk-Jung Kwon, R&D Department, Eone Diagnomics Genome Center Co., Ltd, Songdo Incheon, 22014, Republic of Korea, Tel +82 1081204403, Fax +82 327132107, Email [email protected]: Changes in facial appearance are affected by various intrinsic and extrinsic factors, which vary from person to person. Therefore, each person needs to determine their skin condition accurately to care for their skin accordingly. Recently, genetic identification by skin-related phenotypes has become possible using genome-wide association studies (GWAS) and machine-learning algorithms. However, because most GWAS have focused on populations with American or European skin pigmentation, large-scale GWAS are needed for Asian populations. This study aimed to evaluate the correlation of facial phenotypes with candidate single-nucleotide polymorphisms (SNPs) to predict phenotype from genotype using machine learning.Materials and Methods: A total of 749 Korean women aged 30– 50 years were enrolled in this study and evaluated for five facial phenotypes (melanin, gloss, hydration, wrinkle, and elasticity). To find highly related SNPs with each phenotype, GWAS analysis was used. In addition, phenotype prediction was performed using three machine-learning algorithms (linear, ridge, and linear support vector regressions) using five-fold cross-validation.Results: Using GWAS analysis, we found 46 novel highly associated SNPs (p < 1× 10− 05): 3, 20, 12, 6, and 5 SNPs for melanin, gloss, hydration, wrinkle, and elasticity, respectively. On comparing the performance of each model based on phenotypes using five-fold cross-validation, the ridge regression model showed the highest accuracy (r2 = 0.6422– 0.7266) in all skin traits. Therefore, the optimal solution for personal skin diagnosis using GWAS was with the ridge regression model.Conclusion: The proposed facial phenotype prediction model in this study provided the optimal solution for accurately predicting the skin condition of an individual by identifying genotype information of target characteristics and machine-learning methods. This model has potential utility for the development of customized cosmetics.Keywords: customized cosmetics, single-nucleotide polymorphism, genome-wide association study, machine-learning algorithm, microarray

Keywords