Long read isoform sequencing reveals hidden transcriptional complexity between cattle subspecies

Yan Ren; Elizabeth Tseng; Timothy P. L. Smith; Stefan Hiendleder; John L. Williams; Wai Yee Low

doi:10.1186/s12864-023-09212-9

BMC Genomics (Mar 2023)

Affiliations

Yan Ren: The Davies Research Centre, School of Animal and Veterinary Sciences, University of Adelaide
Elizabeth Tseng: Pacific Biosciences
Timothy P. L. Smith: U.S. Meat Animal Research Center, USDA-ARS, Clay Center
Stefan Hiendleder: The Davies Research Centre, School of Animal and Veterinary Sciences, University of Adelaide
John L. Williams: The Davies Research Centre, School of Animal and Veterinary Sciences, University of Adelaide
Wai Yee Low: The Davies Research Centre, School of Animal and Veterinary Sciences, University of Adelaide

DOI: https://doi.org/10.1186/s12864-023-09212-9
Journal volume & issue: Vol. 24, no. 1
pp. 1 – 15

Abstract

The Iso-Seq method of full-length cDNA sequencing is suitable for quantifying differentially expressed genes (DEGs), differentially expressed transcripts (DETs) and differential transcript usage (DTU). However, the higher cost of Iso-Seq relative to RNA-seq has limited comparisons of the two methods. Transcript abundance estimated from RNA-seq and deep Iso-Seq data for fetal liver from two cattle subspecies was compared to evaluate concordance. Inter-sample correlation of gene- and transcript-level abundance was higher within a technology than between technologies. Identification of DEGs between the cattle subspecies depended on the sequencing method: only 44 genes were identified by both methods, including 6 novel genes annotated by Iso-Seq. The difference between Iso-Seq and RNA-seq results was pronounced at the transcript level, where Iso-Seq revealed several orders of magnitude more differences in transcript abundance and usage between subspecies. Factors influencing DEG identification included size selection during Iso-Seq library preparation, average transcript abundance, multi-mapping of RNA-seq reads to the reference genome, and overlapping gene coordinates. Some DEGs called by RNA-seq alone appear to be sequence duplication artifacts. Among the 44 DEGs identified by both technologies, some play a role in the immune system, thyroid function and cell growth. Iso-Seq revealed hidden transcriptional complexity in DEGs, DETs and DTU genes between cattle subspecies that was previously missed by RNA-seq.

Published in BMC Genomics

ISSN: 1471-2164 (Online)
Publisher: BMC
Country of publisher: United Kingdom
LCC subjects: Technology: Chemical technology: Biotechnology; Science: Biology (General): Genetics
Website: http://bmcgenomics.biomedcentral.com
