Evaluation of gut microbiota predictive potential associated with phenotypic characteristics to identify multifactorial diseases
Danielle Cristina Fonseca,
Ilanna Marques Gomes da Rocha,
Bianca Depieri Balmant,
Leticia Callado,
Ana Paula Aguiar Prudêncio,
Juliana Tepedino Martins Alves,
Raquel Susana Torrinhas,
Gabriel da Rocha Fernandes,
Dan Linetzky Waitzberg
Affiliations
Danielle Cristina Fonseca
Laboratory of Nutrition and Metabolic Surgery of the Digestive System, LIM 35, Department of Gastroenterology, Hospital das Clínicas HCFMUSP, Faculdade de Medicina, Universidade de São Paulo, São Paulo, Brazil
Ilanna Marques Gomes da Rocha
Laboratory of Nutrition and Metabolic Surgery of the Digestive System, LIM 35, Department of Gastroenterology, Hospital das Clínicas HCFMUSP, Faculdade de Medicina, Universidade de São Paulo, São Paulo, Brazil
Bianca Depieri Balmant
Laboratory of Nutrition and Metabolic Surgery of the Digestive System, LIM 35, Department of Gastroenterology, Hospital das Clínicas HCFMUSP, Faculdade de Medicina, Universidade de São Paulo, São Paulo, Brazil
Leticia Callado
Laboratory of Nutrition and Metabolic Surgery of the Digestive System, LIM 35, Department of Gastroenterology, Hospital das Clínicas HCFMUSP, Faculdade de Medicina, Universidade de São Paulo, São Paulo, Brazil
Ana Paula Aguiar Prudêncio
Laboratory of Nutrition and Metabolic Surgery of the Digestive System, LIM 35, Department of Gastroenterology, Hospital das Clínicas HCFMUSP, Faculdade de Medicina, Universidade de São Paulo, São Paulo, Brazil
Juliana Tepedino Martins Alves
Laboratory of Nutrition and Metabolic Surgery of the Digestive System, LIM 35, Department of Gastroenterology, Hospital das Clínicas HCFMUSP, Faculdade de Medicina, Universidade de São Paulo, São Paulo, Brazil
Raquel Susana Torrinhas
Laboratory of Nutrition and Metabolic Surgery of the Digestive System, LIM 35, Department of Gastroenterology, Hospital das Clínicas HCFMUSP, Faculdade de Medicina, Universidade de São Paulo, São Paulo, Brazil
Gabriel da Rocha Fernandes
Biosystems Informatics and Genomics Group, Instituto René Rachou - Fiocruz Minas, Belo Horizonte, Brazil
Dan Linetzky Waitzberg
Laboratory of Nutrition and Metabolic Surgery of the Digestive System, LIM 35, Department of Gastroenterology, Hospital das Clínicas HCFMUSP, Faculdade de Medicina, Universidade de São Paulo, São Paulo, Brazil
ABSTRACTGut microbiota has been implicated in various clinical conditions, yet the substantial heterogeneity in gut microbiota research results necessitates a more sophisticated approach than merely identifying statistically different microbial taxa between healthy and unhealthy individuals. Our study seeks to not only select microbial taxa but also explore their synergy with phenotypic host variables to develop novel predictive models for specific clinical conditions. Design: We assessed 50 healthy and 152 unhealthy individuals for phenotypic variables (PV) and gut microbiota (GM) composition by 16S rRNA gene sequencing. The entire modeling process was conducted in the R environment using the Random Forest algorithm. Model performance was assessed through ROC curve construction. Results: We evaluated 52 bacterial taxa and pre-selected PV (p < 0.05) for their contribution to the final models. Across all diseases, the models achieved their best performance when GM and PV data were integrated. Notably, the integrated predictive models demonstrated exceptional performance for rheumatoid arthritis (AUC = 88.03%), type 2 diabetes (AUC = 96.96%), systemic lupus erythematosus (AUC = 98.4%), and type 1 diabetes (AUC = 86.19%). Conclusion: Our findings underscore that the selection of bacterial taxa based solely on differences in relative abundance between groups is insufficient to serve as clinical markers. Machine learning techniques are essential for mitigating the considerable variability observed within gut microbiota. In our study, the use of microbial taxa alone exhibited limited predictive power for health outcomes, while the integration of phenotypic variables into predictive models substantially enhanced their predictive capabilities.