Genome Biology (Nov 2022)

metaMIC: reference-free misassembly identification and correction of de novo metagenomic assemblies

  • Senying Lai,
  • Shaojun Pan,
  • Chuqing Sun,
  • Luis Pedro Coelho,
  • Wei-Hua Chen,
  • Xing-Ming Zhao

DOI
https://doi.org/10.1186/s13059-022-02810-y
Journal volume & issue
Vol. 23, no. 1
pp. 1 – 21

Abstract

Evaluating the quality of metagenomic assemblies is important for constructing reliable metagenome-assembled genomes and downstream analyses. Here, we present metaMIC (https://github.com/ZhaoXM-Lab/metaMIC), a machine learning-based tool for identifying and correcting misassemblies in metagenomic assemblies. Benchmarking results on both simulated and real datasets demonstrate that metaMIC outperforms existing tools when identifying misassembled contigs. Furthermore, metaMIC is able to localize the misassembly breakpoints, and the correction of misassemblies by splitting at misassembly breakpoints can improve downstream scaffolding and binning results.

Keywords