PLoS ONE (Jan 2016)

Prior Design for Dependent Dirichlet Processes: An Application to Marathon Modeling.

  • Melanie F Pradier,
  • Francisco J R Ruiz,
  • Fernando Perez-Cruz

DOI
https://doi.org/10.1371/journal.pone.0147402
Journal volume & issue
Vol. 11, no. 1
p. e0147402

Abstract

Read online

This paper presents a novel application of Bayesian nonparametrics (BNP) for marathon data modeling. We make use of two well-known BNP priors, the single-p dependent Dirichlet process and the hierarchical Dirichlet process, in order to address two different problems. First, we study the impact of age, gender and environment on the runners' performance. We derive a fair grading method that allows direct comparison of runners regardless of their age and gender. Unlike current grading systems, our approach is based not only on top world records, but on the performances of all runners. The presented methodology for comparison of densities can be adopted in many other applications straightforwardly, providing an interesting perspective to build dependent Dirichlet processes. Second, we analyze the running patterns of the marathoners in time, obtaining information that can be valuable for training purposes. We also show that these running patterns can be used to predict finishing time given intermediate interval measurements. We apply our models to New York City, Boston and London marathons.