Long-read cDNA sequencing identifies functional pseudogenes in the human transcriptome

Robin-Lee Troskie; Yohaann Jafrani; Tim R. Mercer; Adam D. Ewing; Geoffrey J. Faulkner; Seth W. Cheetham

doi:10.1186/s13059-021-02369-0

Genome Biology (May 2021)

Long-read cDNA sequencing identifies functional pseudogenes in the human transcriptome

Robin-Lee Troskie,
Yohaann Jafrani,
Tim R. Mercer,
Adam D. Ewing,
Geoffrey J. Faulkner,
Seth W. Cheetham

Affiliations

Robin-Lee Troskie: Mater Research Institute-University of Queensland
Yohaann Jafrani: Mater Research Institute-University of Queensland
Tim R. Mercer: Australian Institute for Bioengineering and Nanotechnology, University of Queensland
Adam D. Ewing: Mater Research Institute-University of Queensland
Geoffrey J. Faulkner: Mater Research Institute-University of Queensland
Seth W. Cheetham: Mater Research Institute-University of Queensland

DOI: https://doi.org/10.1186/s13059-021-02369-0
Journal volume & issue: Vol. 22, no. 1
pp. 1 – 15

Abstract

Read online

Abstract Pseudogenes are gene copies presumed to mainly be functionless relics of evolution due to acquired deleterious mutations or transcriptional silencing. Using deep full-length PacBio cDNA sequencing of normal human tissues and cancer cell lines, we identify here hundreds of novel transcribed pseudogenes expressed in tissue-specific patterns. Some pseudogene transcripts have intact open reading frames and are translated in cultured cells, representing unannotated protein-coding genes. To assess the biological impact of noncoding pseudogenes, we CRISPR-Cas9 delete the nucleus-enriched pseudogene PDCL3P4 and observe hundreds of perturbed genes. This study highlights pseudogenes as a complex and dynamic component of the human transcriptional landscape.

Published in Genome Biology

ISSN: 1474-760X (Online)
Publisher: BMC
Country of publisher: United Kingdom
LCC subjects: Science: Biology (General): Genetics
Website: https://genomebiology.biomedcentral.com/

About the journal

Abstract

Keywords