Algorithms for Molecular Biology (Jul 2023)

A topology-marginal composite likelihood via a generalized phylogenetic pruning algorithm

  • Seong-Hwan Jun,
  • Hassan Nasif,
  • Chris Jennings-Shaffer,
  • David H Rich,
  • Anna Kooperberg,
  • Mathieu Fourment,
  • Cheng Zhang,
  • Marc A Suchard,
  • Frederick A Matsen

DOI
https://doi.org/10.1186/s13015-023-00235-1
Journal volume & issue
Vol. 18, no. 1
pp. 1 – 18

Abstract

Read online

Abstract Bayesian phylogenetics is a computationally challenging inferential problem. Classical methods are based on random-walk Markov chain Monte Carlo (MCMC), where random proposals are made on the tree parameter and the continuous parameters simultaneously. Variational phylogenetics is a promising alternative to MCMC, in which one fits an approximating distribution to the unnormalized phylogenetic posterior. Previous work fit this variational approximation using stochastic gradient descent, which is the canonical way of fitting general variational approximations. However, phylogenetic trees are special structures, giving opportunities for efficient computation. In this paper we describe a new algorithm that directly generalizes the Felsenstein pruning algorithm (a.k.a. sum-product algorithm) to compute a composite-like likelihood by marginalizing out ancestral states and subtrees simultaneously. We show the utility of this algorithm by rapidly making point estimates for branch lengths of a multi-tree phylogenetic model. These estimates accord with a long MCMC run and with estimates obtained using a variational method, but are much faster to obtain. Thus, although generalized pruning does not lead to a variational algorithm as such, we believe that it will form a useful starting point for variational inference.