Frontiers in Molecular Biosciences (Jun 2022)

Efficient Detection of the Alternative Spliced Human Proteome Using Translatome Sequencing

  • Chun Wu,
  • Xiaolong Lu,
  • Shaohua Lu,
  • Shaohua Lu,
  • Hongwei Wang,
  • Dehua Li,
  • Jing Zhao,
  • Jingjie Jin,
  • Zhenghua Sun,
  • Qing-Yu He,
  • Yang Chen,
  • Gong Zhang

DOI
https://doi.org/10.3389/fmolb.2022.895746
Journal volume & issue
Vol. 9

Abstract

Read online

Alternative splicing (AS) isoforms create numerous proteoforms, expanding the complexity of the genome. Highly similar sequences, incomplete reference databases and the insufficient sequence coverage of mass spectrometry limit the identification of AS proteoforms. Here, we demonstrated full-length translating mRNAs (ribosome nascent-chain complex-bound mRNAs, RNC-mRNAs) sequencing (RNC-seq) strategy to sequence the entire translating mRNA using next-generation sequencing, including short-read and long-read technologies, to construct a protein database containing all translating AS isoforms. Taking the advantage of read length, short-read RNC-seq identified up to 15,289 genes and 15,906 AS isoforms in a single human cell line, much more than the Ribo-seq. The single-molecule long-read RNC-seq supplemented 4,429 annotated AS isoforms that were not identified by short-read datasets, and 4,525 novel AS isoforms that were not included in the public databases. Using such RNC-seq-guided database, we identified 6,766 annotated protein isoforms and 50 novel protein isoforms in mass spectrometry datasets. These results demonstrated the potential of full-length RNC-seq in investigating the proteome of AS isoforms.

Keywords