BMC Bioinformatics (Nov 2023)

Mdwgan-gp: data augmentation for gene expression data based on multiple discriminator WGAN-GP

  • Rongyuan Li,
  • Jingli Wu,
  • Gaoshi Li,
  • Jiafei Liu,
  • Junbo Xuan,
  • Qi Zhu

DOI
https://doi.org/10.1186/s12859-023-05558-9
Journal volume & issue
Vol. 24, no. 1
pp. 1 – 20

Abstract

Read online

Abstract Background Although gene expression data play significant roles in biological and medical studies, their applications are hampered due to the difficulty and high expenses of gathering them through biological experiments. It is an urgent problem to generate high quality gene expression data with computational methods. WGAN-GP, a generative adversarial network-based method, has been successfully applied in augmenting gene expression data. However, mode collapse or over-fitting may take place for small training samples due to just one discriminator is adopted in the method. Results In this study, an improved data augmentation approach MDWGAN-GP, a generative adversarial network model with multiple discriminators, is proposed. In addition, a novel method is devised for enriching training samples based on linear graph convolutional network. Extensive experiments were implemented on real biological data. Conclusions The experimental results have demonstrated that compared with other state-of-the-art methods, the MDWGAN-GP method can produce higher quality generated gene expression data in most cases.

Keywords