Genome-wide Analysis of Drosophila Circular RNAs Reveals Their Structural and Sequence Properties and Age-Dependent Neural Accumulation
Jakub O. Westholm,
Pedro Miura,
Sara Olson,
Sol Shenker,
Brian Joseph,
Piero Sanfilippo,
Susan E. Celniker,
Brenton R. Graveley,
Eric C. Lai
Affiliations
Jakub O. Westholm
Department of Developmental Biology, Sloan-Kettering Institute, 1275 York Avenue, Box 252, New York, NY 10065, USA
Pedro Miura
Department of Developmental Biology, Sloan-Kettering Institute, 1275 York Avenue, Box 252, New York, NY 10065, USA; Department of Biology, University of Nevada, Reno, Nevada 89557, USA
Sara Olson
Department of Genetics and Developmental Biology, Institute for Systems Genomics, University of Connecticut Health Center, 400 Farmington Avenue, Farmington, CT 06032, USA
Sol Shenker
Department of Developmental Biology, Sloan-Kettering Institute, 1275 York Avenue, Box 252, New York, NY 10065, USA; Tri-Institutional Program in Computational Biology and Medicine, Weill Cornell Medical College, New York, NY 10065, USA
Brian Joseph
Department of Developmental Biology, Sloan-Kettering Institute, 1275 York Avenue, Box 252, New York, NY 10065, USA; Gerstner Sloan-Kettering Graduate Program of Biomedical Sciences, 417 East 68th Street, New York, NY 10065, USA
Piero Sanfilippo
Department of Developmental Biology, Sloan-Kettering Institute, 1275 York Avenue, Box 252, New York, NY 10065, USA; Gerstner Sloan-Kettering Graduate Program of Biomedical Sciences, 417 East 68th Street, New York, NY 10065, USA
Susan E. Celniker
Department of Genome Dynamics, Lawrence Berkeley National Laboratory, Berkeley, CA 94701, USA
Brenton R. Graveley
Department of Genetics and Developmental Biology, Institute for Systems Genomics, University of Connecticut Health Center, 400 Farmington Avenue, Farmington, CT 06032, USA
Eric C. Lai
Department of Developmental Biology, Sloan-Kettering Institute, 1275 York Avenue, Box 252, New York, NY 10065, USA; Corresponding author
Summary: Circularization was recently recognized to broadly expand transcriptome complexity. Here, we exploit massive Drosophila total RNA-sequencing data, >5 billion paired-end reads from >100 libraries covering diverse developmental stages, tissues, and cultured cells, to rigorously annotate >2,500 fruit fly circular RNAs. These mostly derive from back-splicing of protein-coding genes and lack poly(A) tails, and the circularization of hundreds of genes is conserved across multiple Drosophila species. We elucidate structural and sequence properties of Drosophila circular RNAs, which exhibit commonalities and distinctions from mammalian circles. Notably, Drosophila circular RNAs harbor >1,000 well-conserved canonical miRNA seed matches, especially within coding regions, and coding conserved miRNA sites reside preferentially within circularized exons. Finally, we analyze the developmental and tissue specificity of circular RNAs and note their preferred derivation from neural genes and enhanced accumulation in neural tissues. Interestingly, circular isoforms increase substantially relative to linear isoforms during CNS aging and constitute an aging biomarker. : Westholm et al. annotate Drosophila circular RNAs from a massive collection of total RNA-seq data, providing insights into their biogenesis and function. In particular, circularizing exons are predominantly associated with long flanking introns, are preferred locations of conserved coding miRNA sites, and accumulate to highest levels in the aging CNS.