BMC Biology (Jan 2022)

Genome-wide annotation of protein-coding genes in pig

  • Max Karlsson,
  • Evelina Sjöstedt,
  • Per Oksvold,
  • Åsa Sivertsson,
  • Jinrong Huang,
  • María Bueno Álvez,
  • Muhammad Arif,
  • Xiangyu Li,
  • Lin Lin,
  • Jiaying Yu,
  • Tao Ma,
  • Fengping Xu,
  • Peng Han,
  • Hui Jiang,
  • Adil Mardinoglu,
  • Cheng Zhang,
  • Kalle von Feilitzen,
  • Xun Xu,
  • Jian Wang,
  • Huanming Yang,
  • Lars Bolund,
  • Wen Zhong,
  • Linn Fagerberg,
  • Cecilia Lindskog,
  • Fredrik Pontén,
  • Jan Mulder,
  • Yonglun Luo,
  • Mathias Uhlen

DOI
https://doi.org/10.1186/s12915-022-01229-y
Journal volume & issue
Vol. 20, no. 1
pp. 1 – 18

Abstract

Read online

Abstract Background There is a need for functional genome-wide annotation of the protein-coding genes to get a deeper understanding of mammalian biology. Here, a new annotation strategy is introduced based on dimensionality reduction and density-based clustering of whole-body co-expression patterns. This strategy has been used to explore the gene expression landscape in pig, and we present a whole-body map of all protein-coding genes in all major pig tissues and organs. Results An open-access pig expression map ( www.rnaatlas.org ) is presented based on the expression of 350 samples across 98 well-defined pig tissues divided into 44 tissue groups. A new UMAP-based classification scheme is introduced, in which all protein-coding genes are stratified into tissue expression clusters based on body-wide expression profiles. The distribution and tissue specificity of all 22,342 protein-coding pig genes are presented. Conclusions Here, we present a new genome-wide annotation strategy based on dimensionality reduction and density-based clustering. A genome-wide resource of the transcriptome map across all major tissues and organs in pig is presented, and the data is available as an open-access resource ( www.rnaatlas.org ), including a comparison to the expression of human orthologs.

Keywords