PLoS Computational Biology (Dec 2018)

Predicting protein targets for drug-like compounds using transcriptomics.

  • Nicolas A Pabon,
  • Yan Xia,
  • Samuel K Estabrooks,
  • Zhaofeng Ye,
  • Amanda K Herbrand,
  • Evelyn Süß,
  • Ricardo M Biondi,
  • Victoria A Assimon,
  • Jason E Gestwicki,
  • Jeffrey L Brodsky,
  • Carlos J Camacho,
  • Ziv Bar-Joseph

DOI
https://doi.org/10.1371/journal.pcbi.1006651
Journal volume & issue
Vol. 14, no. 12
p. e1006651

Abstract

Read online

An expanded chemical space is essential for improved identification of small molecules for emerging therapeutic targets. However, the identification of targets for novel compounds is biased towards the synthesis of known scaffolds that bind familiar protein families, limiting the exploration of chemical space. To change this paradigm, we validated a new pipeline that identifies small molecule-protein interactions and works even for compounds lacking similarity to known drugs. Based on differential mRNA profiles in multiple cell types exposed to drugs and in which gene knockdowns (KD) were conducted, we showed that drugs induce gene regulatory networks that correlate with those produced after silencing protein-coding genes. Next, we applied supervised machine learning to exploit drug-KD signature correlations and enriched our predictions using an orthogonal structure-based screen. As a proof-of-principle for this regimen, top-10/top-100 target prediction accuracies of 26% and 41%, respectively, were achieved on a validation of set 152 FDA-approved drugs and 3104 potential targets. We then predicted targets for 1680 compounds and validated chemical interactors with four targets that have proven difficult to chemically modulate, including non-covalent inhibitors of HRAS and KRAS. Importantly, drug-target interactions manifest as gene expression correlations between drug treatment and both target gene KD and KD of genes that act up- or down-stream of the target, even for relatively weak binders. These correlations provide new insights on the cellular response of disrupting protein interactions and highlight the complex genetic phenotypes of drug treatment. With further refinement, our pipeline may accelerate the identification and development of novel chemical classes by screening compound-target interactions.