BMC Bioinformatics (Jul 2024)

A permutable MLP-like architecture for disease prediction from gut metagenomic data

  • Cong Jiang,
  • Jian Yang,
  • Xiaogang Peng,
  • Xiaozheng Li

DOI
https://doi.org/10.1186/s12859-024-05856-w
Journal volume & issue
Vol. 25, no. 1
pp. 1 – 11

Abstract

Read online

Abstract Metagenomic data plays a crucial role in analyzing the relationship between microbes and diseases. However, the limited number of samples, high dimensionality, and sparsity of metagenomic data pose significant challenges for the application of deep learning in data classification and prediction. Previous studies have shown that utilizing the phylogenetic tree structure to transform metagenomic abundance data into a 2D matrix input for convolutional neural networks (CNNs) improves classification performance. Inspired by the success of a Permutable MLP-like architecture in visual recognition, we propose Metagenomic Permutator (MetaP), which applied the Permutable MLP-like network structure to capture the phylogenetic information of microbes within the 2D matrix formed by phylogenetic tree. Our experiments demonstrate that our model achieved competitive performance compared to other deep neural networks and traditional machine learning, and has good prospects for multi-classification and large sample sizes. Furthermore, we utilize the SHAP (SHapley Additive exPlanations) method to interpret our model predictions, identifying the microbial features that are associated with diseases.

Keywords