Stats (Nov 2022)

On the Sampling Size for Inverse Sampling

  • Daniele Cuntrera,
  • Vincenzo Falco,
  • Ornella Giambalvo

DOI
https://doi.org/10.3390/stats5040067
Journal volume & issue
Vol. 5, no. 4
pp. 1130 – 1144

Abstract

Read online

In the Big Data era, sampling remains a central theme. This paper investigates the characteristics of inverse sampling on two different datasets (real and simulated) to determine when big data become too small for inverse sampling to be used and to examine the impact of the sampling rate of the subsamples. We find that the method, using the appropriate subsample size for both the mean and proportion parameters, performs well with a smaller dataset than big data through the simulation study and real-data application. Different settings related to the selection bias severity are considered during the simulation study and real application.

Keywords