PLoS ONE (Jan 2023)

Application of transfer learning to predict drug-induced human in vivo gene expression changes using rat in vitro and in vivo data.

  • Shauna D O'Donovan,
  • Rachel Cavill,
  • Florian Wimmenauer,
  • Alexander Lukas,
  • Tobias Stumm,
  • Evgueni Smirnov,
  • Michael Lenz,
  • Gokhan Ertaylan,
  • Danyel G J Jennen,
  • Natal A W van Riel,
  • Kurt Driessens,
  • Ralf L M Peeters,
  • Theo M C M de Kok

DOI
https://doi.org/10.1371/journal.pone.0292030
Journal volume & issue
Vol. 18, no. 11
p. e0292030

Abstract

Read online

The liver is the primary site for the metabolism and detoxification of many compounds, including pharmaceuticals. Consequently, it is also the primary location for many adverse reactions. As the liver is not readily accessible for sampling in humans; rodent or cell line models are often used to evaluate potential toxic effects of a novel compound or candidate drug. However, relating the results of animal and in vitro studies to relevant clinical outcomes for the human in vivo situation still proves challenging. In this study, we incorporate principles of transfer learning within a deep artificial neural network allowing us to leverage the relative abundance of rat in vitro and in vivo exposure data from the Open TG-GATEs data set to train a model to predict the expected pattern of human in vivo gene expression following an exposure given measured human in vitro gene expression. We show that domain adaptation has been successfully achieved, with the rat and human in vitro data no longer being separable in the common latent space generated by the network. The network produces physiologically plausible predictions of human in vivo gene expression pattern following an exposure to a previously unseen compound. Moreover, we show the integration of the human in vitro data in the training of the domain adaptation network significantly improves the temporal accuracy of the predicted rat in vivo gene expression pattern following an exposure to a previously unseen compound. In this way, we demonstrate the improvements in prediction accuracy that can be achieved by combining data from distinct domains.