Lessons from discovery of true ADAR RNA editing sites in a human cell line

Fang Wang; Huifen Cao; Qiu Xia; Ziheng Liu; Ming Wang; Fan Gao; Dongyang Xu; Bolin Deng; Yong Diao; Philipp Kapranov

doi:10.1186/s12915-023-01651-w

BMC Biology (Jul 2023)

Lessons from discovery of true ADAR RNA editing sites in a human cell line

Fang Wang,
Huifen Cao,
Qiu Xia,
Ziheng Liu,
Ming Wang,
Fan Gao,
Dongyang Xu,
Bolin Deng,
Yong Diao,
Philipp Kapranov

Affiliations

Fang Wang: Institute of Genomics, School of Medicine, Huaqiao University
Huifen Cao: Institute of Genomics, School of Medicine, Huaqiao University
Qiu Xia: Institute of Genomics, School of Medicine, Huaqiao University
Ziheng Liu: Institute of Genomics, School of Medicine, Huaqiao University
Ming Wang: Institute of Genomics, School of Medicine, Huaqiao University
Fan Gao: Institute of Genomics, School of Medicine, Huaqiao University
Dongyang Xu: Institute of Genomics, School of Medicine, Huaqiao University
Bolin Deng: Institute of Genomics, School of Medicine, Huaqiao University
Yong Diao: Institute of Genomics, School of Medicine, Huaqiao University
Philipp Kapranov: Institute of Genomics, School of Medicine, Huaqiao University

DOI: https://doi.org/10.1186/s12915-023-01651-w
Journal volume & issue: Vol. 21, no. 1
pp. 1 – 19

Abstract

Read online

Abstract Background Conversion or editing of adenosine (A) into inosine (I) catalyzed by specialized cellular enzymes represents one of the most common post-transcriptional RNA modifications with emerging connection to disease. A-to-I conversions can happen at specific sites and lead to increase in proteome diversity and changes in RNA stability, splicing, and regulation. Such sites can be detected as adenine-to-guanine sequence changes by next-generation RNA sequencing which resulted in millions reported sites from multiple genome-wide surveys. Nonetheless, the lack of extensive independent validation in such endeavors, which is critical considering the relatively high error rate of next-generation sequencing, leads to lingering questions about the validity of the current compendiums of the editing sites and conclusions based on them. Results Strikingly, we found that the current analytical methods suffer from very high false positive rates and that a significant fraction of sites in the public databases cannot be validated. In this work, we present potential solutions to these problems and provide a comprehensive and extensively validated list of A-to-I editing sites in a human cancer cell line. Our findings demonstrate that most of true A-to-I editing sites in a human cancer cell line are located in the non-coding transcripts, the so-called RNA 'dark matter'. On the other hand, many ADAR editing events occurring in exons of human protein-coding mRNAs, including those that can recode the transcriptome, represent false positives and need to be interpreted with caution. Nonetheless, yet undiscovered authentic ADAR sites that increase the diversity of human proteome exist and warrant further identification. Conclusions Accurate identification of human ADAR sites remains a challenging problem, particularly for the sites in exons of protein-coding mRNAs. As a result, genome-wide surveys of ADAR editome must still be accompanied by extensive Sanger validation efforts. However, given the vast number of unknown human ADAR sites, there is a need for further developments of the analytical techniques, potentially those that are based on deep learning solutions, in order to provide a quick and reliable identification of the editome in any sample.

Published in BMC Biology

ISSN: 1741-7007 (Online)
Publisher: BMC
Country of publisher: United Kingdom
LCC subjects: Science: Biology (General)
Website: http://www.biomedcentral.com/bmcbiol/

About the journal

Abstract

Keywords