Causal interactions from proteomic profiles: Molecular data meet pathway knowledge
Özgün Babur,
Augustin Luna,
Anil Korkut,
Funda Durupinar,
Metin Can Siper,
Ugur Dogrusoz,
Alvaro Sebastian Vaca Jacome,
Ryan Peckner,
Karen E. Christianson,
Jacob D. Jaffe,
Paul T. Spellman,
Joseph E. Aslan,
Chris Sander,
Emek Demir
Affiliations
Özgün Babur
Computer Science Department, University of Massachusetts Boston, 100 William T. Morrissey Boulevard, Boston, MA 02125, USA; Corresponding author
Augustin Luna
cBio Center for Computational and Systems Biology, Dana-Farber Cancer Institute and Department of Cell Biology, Harvard Medical School, Boston, MA 02215, USA
Anil Korkut
Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA
Funda Durupinar
Computer Science Department, University of Massachusetts Boston, 100 William T. Morrissey Boulevard, Boston, MA 02125, USA
Metin Can Siper
Computational Biology Program, Oregon Health and Science University, 3181 SW Sam Jackson Park Road, Portland, OR 97239, USA
Ugur Dogrusoz
Computer Engineering Department, Bilkent University, Ankara 06800, Turkey
Alvaro Sebastian Vaca Jacome
The Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
Ryan Peckner
The Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA; Cogen Therapeutics, Cambridge, MA 02139, USA
Karen E. Christianson
The Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
Jacob D. Jaffe
The Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
Paul T. Spellman
Computational Biology Program, Oregon Health and Science University, 3181 SW Sam Jackson Park Road, Portland, OR 97239, USA; Department of Molecular and Medical Genetics, Oregon Health and Science University, 3181 SW Sam Jackson Park Road, Portland, OR 97239, USA
Joseph E. Aslan
Knight Cardiovascular Institute, Oregon Health and Science University, 3181 SW Sam Jackson Park Road, Portland, OR 97239, USA
Chris Sander
cBio Center for Computational and Systems Biology, Dana-Farber Cancer Institute and Department of Cell Biology, Harvard Medical School, Boston, MA 02215, USA
Emek Demir
Computational Biology Program, Oregon Health and Science University, 3181 SW Sam Jackson Park Road, Portland, OR 97239, USA; Department of Molecular and Medical Genetics, Oregon Health and Science University, 3181 SW Sam Jackson Park Road, Portland, OR 97239, USA; Pacific Northwest National Laboratories, 902 Battelle Boulevard, Richland, WA 99354, USA
Summary: We present a computational method to infer causal mechanisms in cell biology by analyzing changes in high-throughput proteomic profiles on the background of prior knowledge captured in biochemical reaction knowledge bases. The method mimics a biologist's traditional approach of explaining changes in data using prior knowledge but does this at the scale of hundreds of thousands of reactions. This is a specific example of how to automate scientific reasoning processes and illustrates the power of mapping from experimental data to prior knowledge via logic programming. The identified mechanisms can explain how experimental and physiological perturbations, propagating in a network of reactions, affect cellular responses and their phenotypic consequences. Causal pathway analysis is a powerful and flexible discovery tool for a wide range of cellular profiling data types and biological questions. The automated causation inference tool, as well as the source code, are freely available at http://causalpath.org. The bigger picture: Molecular profiling of biological organisms provides us with a great amount of information on cellular differences, but converting it to mechanistic insights is still a very challenging task. A prominent approach is to integrate new measurements with the mechanistic knowledge described in the scientific literature and build a model that is supported by both. Although this can be done in many ways, an adept approach will use the literature knowledge in detail and follow high standards of logical reasoning while integrating the known and the new. This article describes an approach that utilizes the details in human biological pathways to identify pairs of changes with a likely cause-effect relation within. The approach automatically converts comparative proteomic and other molecular profiles into hypotheses of differentially active mechanistic relations that explain how the profiles came to be.