Information (Dec 2022)

Explainabilty Comparison between Random Forests and Neural Networks—Case Study of Amino Acid Volume Prediction

  • Roberta De Fazio,
  • Rosy Di Giovannantonio,
  • Emanuele Bellini,
  • Stefano Marrone

DOI
https://doi.org/10.3390/info14010021
Journal volume & issue
Vol. 14, no. 1
p. 21

Abstract

Read online

As explainability seems to be the driver for a wiser adoption of Artificial Intelligence in healthcare and in critical applications, in general, a comprehensive study of this field is far from being completed. On one hand, a final definition and theoretical measurements of explainability have not been assessed, yet, on the other hand, some tools and frameworks for the practical evaluation of this feature are now present. This paper aims to present a concrete experience in using some of these explainability-related techniques in the problem of predicting the size of amino acids in real-world protein structures. In particular, the feature importance calculation embedded in Random Forest (RF) training is compared with the results of the Eli-5 tool applied to the Neural Network (NN) model. Both the predictors are trained on the same dataset, which is extracted from Protein Data Bank (PDB), considering 446 myoglobins structures and process it with several tools to implement a geometrical model and perform analyses on it. The comparison between the two models draws different conclusions about the residues’ geometry and their biological properties.

Keywords