A decision tree to improve identification of pathogenic mutations in clinical practice

Priscilla Machado do Nascimento; Inácio Gomes Medeiros; Raul Maia Falcão; Beatriz Stransky; Jorge Estefano Santana de Souza

doi:10.1186/s12911-020-1060-0

BMC Medical Informatics and Decision Making (Mar 2020)

Affiliations

Priscilla Machado do Nascimento: Bioinformatics Postgraduate Program, Metrópole Digital Institute, Federal University of Rio Grande do Norte
Inácio Gomes Medeiros: Bioinformatics Postgraduate Program, Metrópole Digital Institute, Federal University of Rio Grande do Norte
Raul Maia Falcão: Bioinformatics Postgraduate Program, Metrópole Digital Institute, Federal University of Rio Grande do Norte
Beatriz Stransky: Biomedical Engineering Department, Center of Technology, Federal University of Rio Grande do Norte
Jorge Estefano Santana de Souza: Bioinformatics Postgraduate Program, Metrópole Digital Institute, Federal University of Rio Grande do Norte

Journal volume & issue: Vol. 20, no. 1
pp. 1 – 11

Abstract

Background: A variant of unknown significance (VUS) is a variant form of a gene that has been identified through genetic testing but whose significance to the function of the organism is not known. A current challenge in precision medicine is to identify precisely which of the mutations detected by sequencing are relevant to the treatment or diagnosis of a disease. The average accuracy of pathogenicity predictors is 85%. However, the predictors disagree substantially with one another about mutational impact and pathogenicity. Manual verification is therefore necessary to confirm the real effect of a mutation in each individual case.

Methods: In this work, we use variable categorization and selection to build a decision tree model, and we then measure its accuracy and compare it with four known mutation predictors and seventeen supervised machine-learning (ML) algorithms.

Results: The proposed tree reached the highest precision among all tested variables: 91% for True Neutrals, 8% for False Neutrals, 9% for False Pathogenic, and 92% for True Pathogenic.

Conclusions: The decision tree demonstrated high classification precision with cancer data, producing consistently relevant forecasts for the sample tests with an accuracy close to the best achieved by supervised ML algorithms. In addition, the decision tree is easier for non-IT experts to apply in clinical practice. From the perspective of the cancer research community, this approach can be applied as an alternative for determining the potential pathogenicity of VUS.
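As a rough illustration of the kind of evaluation the abstract describes, the Python sketch below applies a small hand-written decision tree to categorized variables and tabulates the four outcome rates. Everything in it is an assumption made for illustration: the variable names (`predictor_votes`, `population_frequency`), the split thresholds, and the toy records are hypothetical and are not taken from the paper. The rates are normalized by true class, which is one reading of the reported figures under which each pair sums to 100% (91% + 9% and 92% + 8%); the abstract itself does not state the normalization.

```python
from collections import Counter

LABELS = ("neutral", "pathogenic")


def toy_tree(variant):
    """Hypothetical two-level decision tree over categorized variables."""
    # Assumed rule 1: most predictors agree the variant is damaging.
    if variant["predictor_votes"] >= 3:
        return "pathogenic"
    # Assumed rule 2: split vote, but the variant is rare in the population.
    if variant["predictor_votes"] == 2 and variant["population_frequency"] == "rare":
        return "pathogenic"
    return "neutral"


def rates_by_true_class(truth, predicted):
    """Percentage of each true class that was assigned to each predicted class."""
    counts = Counter(zip(truth, predicted))
    rates = {}
    for true_label in LABELS:
        total = sum(counts[(true_label, p)] for p in LABELS)
        for pred_label in LABELS:
            hits = counts[(true_label, pred_label)]
            rates[(true_label, pred_label)] = 100.0 * hits / total if total else 0.0
    return rates


# Toy records, invented for this example only.
variants = [
    {"predictor_votes": 0, "population_frequency": "common", "label": "neutral"},
    {"predictor_votes": 1, "population_frequency": "rare", "label": "neutral"},
    {"predictor_votes": 2, "population_frequency": "common", "label": "neutral"},
    {"predictor_votes": 3, "population_frequency": "common", "label": "neutral"},
    {"predictor_votes": 4, "population_frequency": "rare", "label": "pathogenic"},
    {"predictor_votes": 3, "population_frequency": "rare", "label": "pathogenic"},
    {"predictor_votes": 2, "population_frequency": "rare", "label": "pathogenic"},
    {"predictor_votes": 1, "population_frequency": "rare", "label": "pathogenic"},
]

truth = [v["label"] for v in variants]
predicted = [toy_tree(v) for v in variants]
rates = rates_by_true_class(truth, predicted)

print("True Neutrals:    %.0f%%" % rates[("neutral", "neutral")])
print("False Pathogenic: %.0f%%" % rates[("neutral", "pathogenic")])
print("False Neutrals:   %.0f%%" % rates[("pathogenic", "neutral")])
print("True Pathogenic:  %.0f%%" % rates[("pathogenic", "pathogenic")])
```

A tree written as plain conditional rules like this can be read and followed by hand, which is in the spirit of the abstract's point about use by non-IT experts; the actual tree in the paper may look quite different.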

Published in BMC Medical Informatics and Decision Making

ISSN: 1472-6947 (Online)
Publisher: BMC
Country of publisher: United Kingdom
LCC subjects: Medicine: Medicine (General): Computer applications to medicine. Medical informatics
Website: http://bmcmedinformdecismak.biomedcentral.com
