Identification of long non-coding RNA in the horse transcriptome

E. Y. Scott; T. Mansour; R. R. Bellone; C. T. Brown; M. J. Mienaltowski; M. C. Penedo; P. J. Ross; S. J. Valberg; J. D. Murray; C. J. Finno

doi:10.1186/s12864-017-3884-2

BMC Genomics (Jul 2017)

Identification of long non-coding RNA in the horse transcriptome

E. Y. Scott,
T. Mansour,
R. R. Bellone,
C. T. Brown,
M. J. Mienaltowski,
M. C. Penedo,
P. J. Ross,
S. J. Valberg,
J. D. Murray,
C. J. Finno

Affiliations

E. Y. Scott: Department of Animal Science, University of California
T. Mansour: Department of Population Health and Reproduction, University of California
R. R. Bellone: Department of Population Health and Reproduction, University of California
C. T. Brown: Department of Population Health and Reproduction, University of California
M. J. Mienaltowski: Department of Animal Science, University of California
M. C. Penedo: Veterinary Genetics Laboratory, University of California
P. J. Ross: Department of Animal Science, University of California
S. J. Valberg: Large Animal Clinical Sciences, Michigan State University, College of Veterinary Medicine
J. D. Murray: Department of Animal Science, University of California
C. J. Finno: Department of Population Health and Reproduction, University of California

DOI: https://doi.org/10.1186/s12864-017-3884-2
Journal volume & issue: Vol. 18, no. 1
pp. 1 – 11

Abstract

Read online

Abstract Background Efforts to resolve the transcribed sequences in the equine genome have focused on protein-coding RNA. The transcription of the intergenic regions, although detected via total RNA sequencing (RNA-seq), has yet to be characterized in the horse. The most recent equine transcriptome based on RNA-seq from several tissues was a prime opportunity to obtain a concurrent long non-coding RNA (lncRNA) database. Results This lncRNA database has a breadth of eight tissues and a depth of over 20 million reads for select tissues, providing the deepest and most expansive equine lncRNA database. Utilizing the intergenic reads and three categories of novel genes from a previously published equine transcriptome pipeline, we better describe these groups by annotating the lncRNA candidates. These lncRNA candidates were filtered using an approach adapted from human lncRNA annotation, which removes transcripts based on size, expression, protein-coding capability and distance to the start or stop of annotated protein-coding transcripts. Conclusion Our equine lncRNA database has 20,800 transcripts that demonstrate characteristics unique to lncRNA including low expression, low exon diversity and low levels of sequence conservation. These candidate lncRNA will serve as a baseline lncRNA annotation and begin to describe the RNA-seq reads assigned to the intergenic space in the horse.

Published in BMC Genomics

ISSN: 1471-2164 (Online)
Publisher: BMC
Country of publisher: United Kingdom
LCC subjects: Technology: Chemical technology: Biotechnology; Science: Biology (General): Genetics
Website: http://bmcgenomics.biomedcentral.com

About the journal

Abstract

Keywords