Frontiers in Genetics (Jan 2023)

SADLN: Self-attention based deep learning network of integrating multi-omics data for cancer subtype recognition

  • Qiuwen Sun,
  • Lei Cheng,
  • Ao Meng,
  • Shuguang Ge,
  • Jie Chen,
  • Longzhen Zhang,
  • Ping Gong

DOI
https://doi.org/10.3389/fgene.2022.1032768
Journal volume & issue
Vol. 13

Abstract

Read online

Integrating multi-omics data for cancer subtype recognition is an important task in bioinformatics. Recently, deep learning has been applied to recognize the subtype of cancers. However, existing studies almost integrate the multi-omics data simply by concatenation as the single data and then learn a latent low-dimensional representation through a deep learning model, which did not consider the distribution differently of omics data. Moreover, these methods ignore the relationship of samples. To tackle these problems, we proposed SADLN: A self-attention based deep learning network of integrating multi-omics data for cancer subtype recognition. SADLN combined encoder, self-attention, decoder, and discriminator into a unified framework, which can not only integrate multi-omics data but also adaptively model the sample’s relationship for learning an accurately latent low-dimensional representation. With the integrated representation learned from the network, SADLN used Gaussian Mixture Model to identify cancer subtypes. Experiments on ten cancer datasets of TCGA demonstrated the advantages of SADLN compared to ten methods. The Self-Attention Based Deep Learning Network (SADLN) is an effective method of integrating multi-omics data for cancer subtype recognition.

Keywords