GigaByte (Feb 2024)

SAW: an efficient and accurate data analysis workflow for Stereo-seq spatial transcriptomics

  • Chun Gong ,
  • Shengkang Li ,
  • Leying Wang ,
  • Fuxiang Zhao ,
  • Shuangsang Fang ,
  • Dong Yuan ,
  • Zijian Zhao ,
  • Qiqi He ,
  • Mei Li ,
  • Weiqing Liu ,
  • Zhaoxun Li ,
  • Hongqing Xie ,
  • Sha Liao ,
  • Ao Chen ,
  • Yong Zhang ,
  • Yuxiang Li ,
  • Xun Xu

DOI
https://doi.org/10.46471/gigabyte.111

Abstract

Read online

The basic analysis steps of spatial transcriptomics require obtaining gene expression information from both space and cells. The existing tools for these analyses incur performance issues when dealing with large datasets. These issues involve computationally intensive spatial localization, RNA genome alignment, and excessive memory usage in large chip scenarios. These problems affect the applicability and efficiency of the analysis. Here, a high-performance and accurate spatial transcriptomics data analysis workflow, called Stereo-seq Analysis Workflow (SAW), was developed for the Stereo-seq technology developed at BGI. SAW includes mRNA spatial position reconstruction, genome alignment, gene expression matrix generation, and clustering. The workflow outputs files in a universal format for subsequent personalized analysis. The execution time for the entire analysis is ∼148 min with 1 GB reads 1 × 1 cm chip test data, 1.8 times faster than with an unoptimized workflow.