Mathematical Biosciences and Engineering (Apr 2021)

A distributed quantile estimation algorithm of heavy-tailed distribution with massive datasets

  • Xiaoyue Xie,
  • Jian Shi

DOI
https://doi.org/10.3934/mbe.2021011
Journal volume & issue
Vol. 18, no. 1
pp. 214 – 230

Abstract

Read online

Quantile estimation with big data is still a challenging problem in statistics. In this paper we introduce a distributed algorithm for estimating high quantiles of heavy-tailed distributions with massive datasets. The key idea of the algorithm is to apply the alternating direction method of multipliers in parameter estimation of the generalized pareto distribution in a distributed structure and compute high quantiles based on parameter estimation by the Peak Over Threshold method. This paper proves that the proposed algorithm converges to a stationary solution when the step size is properly chosen. The numerical study and real data analysis also shows that the algorithm is feasible and efficient for estimating high quantiles of heavy-tailed distribution with massive datasets and there is a clear-cut winner for the extreme quantiles.

Keywords