Profiling the inflammatory bowel diseases using genetics, serum biomarkers, and smoking information
Ruize Liu,
Dalin Li,
Talin Haritunians,
Yunfeng Ruan,
Mark J. Daly,
Hailiang Huang,
Dermot P.B. McGovern
Affiliations
Ruize Liu
Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA 02114, USA; Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
Dalin Li
F. Widjaja Family Foundation Inflammatory Bowel and Immunobiology Research Institute, Cedars-Sinai Medical Center, Los Angeles, CA 90048, USA
Talin Haritunians
F. Widjaja Family Foundation Inflammatory Bowel and Immunobiology Research Institute, Cedars-Sinai Medical Center, Los Angeles, CA 90048, USA
Yunfeng Ruan
Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA 02114, USA; Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
Mark J. Daly
Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA 02114, USA; Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
Hailiang Huang
Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA 02114, USA; Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA; Corresponding author
Dermot P.B. McGovern
F. Widjaja Family Foundation Inflammatory Bowel and Immunobiology Research Institute, Cedars-Sinai Medical Center, Los Angeles, CA 90048, USA; Corresponding author
Summary: Crohn's disease (CD) and ulcerative colitis (UC) are two etiologically related yet distinctive subtypes of the inflammatory bowel diseases (IBD). Differentiating CD from UC can be challenging using conventional clinical approaches in a subset of patients. We designed and evaluated a novel molecular-based prediction model aggregating genetics, serum biomarkers, and tobacco smoking information to assist the diagnosis of CD and UC in over 30,000 samples. A joint model combining genetics, serum biomarkers and smoking explains 46% (42–50%, 95% CI) of phenotypic variation. Despite modest overlaps with serum biomarkers, genetics makes unique contributions to distinguishing IBD subtypes. Smoking status only explains 1% (0–6%, 95% CI) of the phenotypic variance suggesting it may not be an effective biomarker. This study reveals that molecular-based models combining genetics, serum biomarkers, and smoking information could complement current diagnostic strategies and help classify patients based on biologic state rather than imperfect clinical parameters.