Journal of Biostatistics and Epidemiology (Dec 2018)
Improved confidence interval estimation of the population standard deviation using ranked set sampling: A simulation study
Abstract
Background & Aim: The sample standard deviation S is the common point estimator of σ, but S is sensitive to the presence of outliers and may not be an efficient estimator of σ in skewed and leptokurtic distributions. Although S has good efficiency in platykurtic and moderately leptokurtic distributions, its classical inferential methods may perform poorly in non-normal distributions. The classical confidence interval for σ relies on the assumption of normality of the distribution. In this paper, a performance comparison of six confidence interval estimates of σ is performed under ten distributions that vary in skewness and kurtosis. Methods and Material: A Monte Carlo simulation study is conducted under the following distributions: normal, two contaminated normal, t, Gamma, Uniform, Beta, Laplace, exponential and χ2 with specific parameters. Confidence interval estimates obtained using the more powerful ranked set sampling (RSS) are compared with the traditional simple random sampling (SRS) technique. Performance of the confidence intervals is assessed based on width and coverage probabilities. A real data example representing birth weight of 189 newborns is used for assessment. Results: It is not surprising that for normal data most of the intervals were close tot he nominal value especially using RSS. Simulation results indicated generally better performance of RSS in terms of coverage probability and smaller interval width as sample size increases, especially for contaminated and heavy-tailed skewed distributions. Conclusion: Simulation results revealed that the use of RSS improved greatly the coverage probability. Also, it was found that the interval labeled (III) due to Bonett (2006) had the best performance in terms of coverage probability over the wide range of distributions investigated in this paper and would be recommended for use by practitioners. There may be a need to develop nonparametric intervals that is robust against outliers and heavy-tailed distributions.