Cell Reports: Methods (Feb 2023)

BOMA, a machine-learning framework for comparative gene expression analysis across brains and organoids

  • Chenfeng He,
  • Noah Cohen Kalafut,
  • Soraya O. Sandoval,
  • Ryan Risgaard,
  • Carissa L. Sirois,
  • Chen Yang,
  • Saniya Khullar,
  • Marin Suzuki,
  • Xiang Huang,
  • Qiang Chang,
  • Xinyu Zhao,
  • Andre M.M. Sousa,
  • Daifeng Wang

Journal volume & issue
Vol. 3, no. 2
p. 100409

Abstract

Read online

Summary: Our machine-learning framework, brain and organoid manifold alignment (BOMA), first performs a global alignment of developmental gene expression data between brains and organoids. It then applies manifold learning to locally refine the alignment, revealing conserved and specific developmental trajectories across brains and organoids. Using BOMA, we found that human cortical organoids better align with certain brain cortical regions than with other non-cortical regions, implying organoid-preserved developmental gene expression programs specific to brain regions. Additionally, our alignment of non-human primate and human brains reveals highly conserved gene expression around birth. Also, we integrated and analyzed developmental single-cell RNA sequencing (scRNA-seq) data of human brains and organoids, showing conserved and specific cell trajectories and clusters. Further identification of expressed genes of such clusters and enrichment analyses reveal brain- or organoid-specific developmental functions and pathways. Finally, we experimentally validated important specific expressed genes through the use of immunofluorescence. BOMA is open-source available as a web tool for community use. Motivation: Organoids have become valuable models for understanding cellular and molecular mechanisms in human development, including development of brains. However, whether developmental gene expression programs are preserved between human organoids and brains, especially in specific cell types, remains unclear. Importantly, there is a lack of effective computational approaches for comparative data analyses between organoids and developing human brains. To address this, we developed a machine-learning framework for comparative gene expression analysis of brains and organoids to identify conserved and specific developmental trajectories as well as developmentally expressed genes and functions, especially at cellular resolution.

Keywords