CNV-P: a machine-learning framework for predicting high confident copy number variations

Taifu Wang; Jinghua Sun; Xiuqing Zhang; Wen-Jing Wang; Qing Zhou

doi:10.7717/peerj.12564

PeerJ (Dec 2021)

CNV-P: a machine-learning framework for predicting high confident copy number variations

Taifu Wang,
Jinghua Sun,
Xiuqing Zhang,
Wen-Jing Wang,
Qing Zhou

Affiliations

Taifu Wang: BGI-Shenzhen, Shenzhen, China
Jinghua Sun: BGI-Shenzhen, Shenzhen, China
Xiuqing Zhang: BGI-Shenzhen, Shenzhen, China
Wen-Jing Wang: BGI-Shenzhen, Shenzhen, China
Qing Zhou: BGI-Shenzhen, Shenzhen, China

DOI: https://doi.org/10.7717/peerj.12564
Journal volume & issue: Vol. 9
p. e12564

Abstract

Read online Read online

Background Copy-number variants (CNVs) have been recognized as one of the major causes of genetic disorders. Reliable detection of CNVs from genome sequencing data has been a strong demand for disease research. However, current software for detecting CNVs has high false-positive rates, which needs further improvement. Methods Here, we proposed a novel and post-processing approach for CNVs prediction (CNV-P), a machine-learning framework that could efficiently remove false-positive fragments from results of CNVs detecting tools. A series of CNVs signals such as read depth (RD), split reads (SR) and read pair (RP) around the putative CNV fragments were defined as features to train a classifier. Results The prediction results on several real biological datasets showed that our models could accurately classify the CNVs at over 90% precision rate and 85% recall rate, which greatly improves the performance of state-of-the-art algorithms. Furthermore, our results indicate that CNV-P is robust to different sizes of CNVs and the platforms of sequencing. Conclusions Our framework for classifying high-confident CNVs could improve both basic research and clinical diagnosis of genetic diseases.

Published in PeerJ

ISSN: 2167-8359 (Online)
Publisher: PeerJ Inc.
Country of publisher: United States
LCC subjects: Medicine; Science: Biology (General)
Website: https://peerj.com/

About the journal

Abstract

Keywords