Modern Stochastics: Theory and Applications (Jan 2018)

A moment-distance hybrid method for estimating a mixture of two symmetric densities

  • David Källberg,
  • Yuri Belyaev,
  • Patrik Rydén

DOI
https://doi.org/10.15559/17-VMSTA93
Journal volume & issue
Vol. 5, no. 1, pp. 1–36

Abstract

When clustering high-dimensional data, variable selection is commonly applied to obtain an accurate grouping of the samples. For two-class problems, this selection may be carried out by fitting a mixture distribution to each variable. We propose a hybrid method for estimating a parametric mixture of two symmetric densities. The estimator combines the method of moments with the minimum distance approach. An evaluation study, including both extensive simulations and gene expression data from acute leukemia patients, shows that the hybrid method outperforms a maximum-likelihood estimator in model-based clustering. The hybrid estimator is flexible and also performs well under imprecise model assumptions, suggesting that it is robust and suited to real problems.

Keywords