PLoS ONE (Jan 2019)

Sample size issues in multilevel logistic regression models.

  • Amjad Ali,
  • Sabz Ali,
  • Sajjad Ahmad Khan,
  • Dost Muhammad Khan,
  • Kamran Abbas,
  • Alamgir Khalil,
  • Sadaf Manzoor,
  • Umair Khalil

DOI
https://doi.org/10.1371/journal.pone.0225427
Journal volume & issue
Vol. 14, no. 11
p. e0225427

Abstract

Educational researchers, psychologists, and social, epidemiological, and medical scientists often deal with multilevel data. Sometimes the response variable in multilevel data is categorical and needs to be analyzed with multilevel logistic regression models. The main aim of this paper is to provide guidelines for analysts on selecting an appropriate sample size when fitting multilevel logistic regression models under different threshold parameters and different estimation methods. Simulation studies were performed to obtain the optimum sample size for the Penalized Quasi-likelihood (PQL) and Maximum Likelihood (ML) methods of estimation. Our results suggest that the ML method performs better than the PQL method and requires a relatively small sample under the chosen conditions. To achieve sufficient accuracy of the fixed and random effects under the ML method, we established the "50/50" and "120/50" rules, respectively. On the basis of our findings, the "50/60" and "120/70" rules are recommended under the PQL method of estimation.
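
As a rough illustration of the kind of data such a simulation study draws, the following minimal sketch generates one two-level binary data set from a random-intercept logistic model. It is not the authors' simulation design. It assumes that a rule such as "50/50" means 50 groups with 50 units per group, which the abstract does not state explicitly. The function name, the single covariate, and the parameter values (beta0, beta1, sigma_u) are hypothetical and chosen only for illustration.

```python
import numpy as np


def simulate_random_intercept_logit(n_groups, group_size,
                                    beta0=0.0, beta1=1.0,
                                    sigma_u=1.0, seed=0):
    """Draw one two-level binary data set (hypothetical helper).

    Model: logit(P(y_ij = 1)) = beta0 + beta1 * x_ij + u_j,
    with u_j ~ Normal(0, sigma_u^2) shared by all units in group j.
    """
    rng = np.random.default_rng(seed)
    n_total = n_groups * group_size

    # Group-level random intercepts, one per group.
    u = rng.normal(0.0, sigma_u, size=n_groups)

    # Group index for every unit: 0,0,...,1,1,...
    group = np.repeat(np.arange(n_groups), group_size)

    # A single unit-level covariate (assumed standard normal).
    x = rng.normal(0.0, 1.0, size=n_total)

    # Linear predictor and inverse-logit transform.
    eta = beta0 + beta1 * x + u[group]
    p = 1.0 / (1.0 + np.exp(-eta))

    # Bernoulli responses.
    y = rng.binomial(1, p)
    return group, x, y


# Assumed reading of the "50/50" rule: 50 groups, 50 units per group.
group, x, y = simulate_random_intercept_logit(n_groups=50, group_size=50)
```

In a full simulation study, data sets like this would be generated repeatedly for each combination of number of groups and group size, a multilevel logistic model would be fitted to each with the estimation method under study, and the bias and coverage of the fixed-effect and variance estimates would be summarized across replications.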