Journal of Statistical Software (Jul 2019)

slfm: An R Package to Evaluate Coherent Patterns in Microarray Data via Factor Analysis

  • João Daniel N. Duarte,
  • Vinícius D. Mayrink

DOI
https://doi.org/10.18637/jss.v090.i09
Journal volume & issue
Vol. 90, no. 1
pp. 1 – 22

Abstract

Read online

The development of simulation-based methods, such as Markov chain Monte Carlo (MCMC), has contributed to an increased interest in the Bayesian framework as an alternative to deal with factor models. Many studies have used Bayesian factor analysis to explore gene expression data. We are particularly interested in the application of a sparse latent factor model (SLFM) based on sparsity priors (mixtures) to assess the significance of factors. The SLFM measures how strong the observed coherent expression pattern is in the data, which is an important source of information to evaluate gene activity. In the literature, this type of model has shown better results than other approaches intended for identification of patterns and metagene groups related to the underlying biology. However, a full Bayesian factor model relying on MCMC algorithms has an expensive computational cost, which makes it unattractive for general users. In this paper, we present the package slfm which uses C++ implementation via Rcpp to improve the computational performance of the SLFM within the widely used statistical tool R. We investigate real and simulated microarray data related to breast cancer.

Keywords