Knowledge synthesis of 100 million biomedical documents augments the deep expression profiling of coronavirus receptors

AJ Venkatakrishnan; Arjun Puranik; Akash Anand; David Zemmour; Xiang Yao; Xiaoying Wu; Ramakrishna Chilaka; Dariusz K Murakowski; Kristopher Standish; Bharathwaj Raghunathan; Tyler Wagner; Enrique Garcia-Rivera; Hugo Solomon; Abhinav Garg; Rakesh Barve; Anuli Anyanwu-Ofili; Najat Khan; Venky Soundararajan

doi:10.7554/eLife.58040

eLife (May 2020)

Knowledge synthesis of 100 million biomedical documents augments the deep expression profiling of coronavirus receptors

AJ Venkatakrishnan,
Arjun Puranik,
Akash Anand,
David Zemmour,
Xiang Yao,
Xiaoying Wu,
Ramakrishna Chilaka,
Dariusz K Murakowski,
Kristopher Standish,
Bharathwaj Raghunathan,
Tyler Wagner,
Enrique Garcia-Rivera,
Hugo Solomon,
Abhinav Garg,
Rakesh Barve,
Anuli Anyanwu-Ofili,
Najat Khan,
Venky Soundararajan

Affiliations

AJ Venkatakrishnan: ORCiD; nference, Cambridge, United States
Arjun Puranik: nference, Cambridge, United States
Akash Anand: nference Labs, Bengaluru, India
David Zemmour: nference, Cambridge, United States
Xiang Yao: Janssen pharmaceutical companies of Johnson & Johnson (J&J), Spring House, United States
Xiaoying Wu: Janssen pharmaceutical companies of Johnson & Johnson (J&J), Spring House, United States
Ramakrishna Chilaka: nference Labs, Bengaluru, India
Dariusz K Murakowski: ORCiD; nference, Cambridge, United States
Kristopher Standish: Janssen pharmaceutical companies of Johnson & Johnson (J&J), Spring House, United States
Bharathwaj Raghunathan: nference, Toronto, Canada
Tyler Wagner: nference, Cambridge, United States
Enrique Garcia-Rivera: nference, Cambridge, United States
Hugo Solomon: nference, Cambridge, United States
Abhinav Garg: nference Labs, Bengaluru, India
Rakesh Barve: nference Labs, Bengaluru, India
Anuli Anyanwu-Ofili: Janssen pharmaceutical companies of Johnson & Johnson (J&J), Spring House, United States
Najat Khan: Janssen pharmaceutical companies of Johnson & Johnson (J&J), Spring House, United States
Venky Soundararajan: ORCiD; nference, Cambridge, United States

DOI: https://doi.org/10.7554/eLife.58040
Journal volume & issue: Vol. 9

Abstract

Read online

The COVID-19 pandemic demands assimilation of all biomedical knowledge to decode mechanisms of pathogenesis. Despite the recent renaissance in neural networks, a platform for the real-time synthesis of the exponentially growing biomedical literature and deep omics insights is unavailable. Here, we present the nferX platform for dynamic inference from over 45 quadrillion possible conceptual associations from unstructured text, and triangulation with insights from single-cell RNA-sequencing, bulk RNA-seq and proteomics from diverse tissue types. A hypothesis-free profiling of ACE2 suggests tongue keratinocytes, olfactory epithelial cells, airway club cells and respiratory ciliated cells as potential reservoirs of the SARS-CoV-2 receptor. We find the gut as the putative hotspot of COVID-19, where a maturation correlated transcriptional signature is shared in small intestine enterocytes among coronavirus receptors (ACE2, DPP4, ANPEP). A holistic data science platform triangulating insights from structured and unstructured data holds potential for accelerating the generation of impactful biological insights and hypotheses.

Published in eLife

ISSN: 2050-084X (Online)
Publisher: eLife Sciences Publications Ltd
Country of publisher: United Kingdom
LCC subjects: Medicine; Science: Biology (General)
Website: https://elifesciences.org

About the journal

Abstract

Keywords