Separating the signal from the noise in metagenomic cell-free DNA sequencing
Philip Burnham,
Nardhy Gomez-Lopez,
Michael Heyang,
Alexandre Pellan Cheng,
Joan Sesing Lenz,
Darshana M. Dadhania,
John Richard Lee,
Manikkam Suthanthiran,
Roberto Romero,
Iwijn De Vlaminck
Affiliations
Philip Burnham
Meinig School of Biomedical Engineering, Cornell University
Nardhy Gomez-Lopez
Perinatology Research Branch, Division of Obstetrics and Maternal-Fetal Medicine, Division of Intramural Research, Eunice Kennedy Shriver National Institute of Child Health and Human Development, National Institutes of Health, U.S. Department of Health and Human Services (NICHD/NIH/DHHS)
Michael Heyang
Meinig School of Biomedical Engineering, Cornell University
Alexandre Pellan Cheng
Meinig School of Biomedical Engineering, Cornell University
Joan Sesing Lenz
Meinig School of Biomedical Engineering, Cornell University
Darshana M. Dadhania
Department of Transplantation Medicine, New York Presbyterian Hospital–Weill Cornell Medical Center
John Richard Lee
Department of Transplantation Medicine, New York Presbyterian Hospital–Weill Cornell Medical Center
Manikkam Suthanthiran
Department of Transplantation Medicine, New York Presbyterian Hospital–Weill Cornell Medical Center
Roberto Romero
Perinatology Research Branch, Division of Obstetrics and Maternal-Fetal Medicine, Division of Intramural Research, Eunice Kennedy Shriver National Institute of Child Health and Human Development, National Institutes of Health, U.S. Department of Health and Human Services (NICHD/NIH/DHHS)
Iwijn De Vlaminck
Meinig School of Biomedical Engineering, Cornell University
Abstract Background Cell-free DNA (cfDNA) in blood, urine, and other biofluids provides a unique window into human health. A proportion of cfDNA is derived from bacteria and viruses, creating opportunities for the diagnosis of infection via metagenomic sequencing. The total biomass of microbial-derived cfDNA in clinical isolates is low, which makes metagenomic cfDNA sequencing susceptible to contamination and alignment noise. Results Here, we report low biomass background correction (LBBC), a bioinformatics noise filtering tool informed by the uniformity of the coverage of microbial genomes and the batch variation in the absolute abundance of microbial cfDNA. We demonstrate that LBBC leads to a dramatic reduction in false positive rate while minimally affecting the true positive rate for a cfDNA test to screen for urinary tract infection. We next performed high-throughput sequencing of cfDNA in amniotic fluid collected from term uncomplicated pregnancies or those complicated with clinical chorioamnionitis with and without intra-amniotic infection. Conclusions The data provide unique insight into the properties of fetal and maternal cfDNA in amniotic fluid, demonstrate the utility of cfDNA to screen for intra-amniotic infection, support the view that the amniotic fluid is sterile during normal pregnancy, and reveal cases of intra-amniotic inflammation without infection at term. Video abstract.