Model-based standardization using multiple imputation

Antonio Remiro-Azócar; Anna Heath; Gianluca Baio

doi:10.1186/s12874-024-02157-x

BMC Medical Research Methodology (Feb 2024)

Model-based standardization using multiple imputation

Antonio Remiro-Azócar,
Anna Heath,
Gianluca Baio

Affiliations

Antonio Remiro-Azócar: Statistics and Data Insights, Bayer plc
Anna Heath: Child Health Evaluative Sciences, The Hospital for Sick Children
Gianluca Baio: Department of Statistical Science, University College London

DOI: https://doi.org/10.1186/s12874-024-02157-x
Journal volume & issue: Vol. 24, no. 1
pp. 1 – 15

Abstract

Read online

Abstract Background When studying the association between treatment and a clinical outcome, a parametric multivariable model of the conditional outcome expectation is often used to adjust for covariates. The treatment coefficient of the outcome model targets a conditional treatment effect. Model-based standardization is typically applied to average the model predictions over the target covariate distribution, and generate a covariate-adjusted estimate of the marginal treatment effect. Methods The standard approach to model-based standardization involves maximum-likelihood estimation and use of the non-parametric bootstrap. We introduce a novel, general-purpose, model-based standardization method based on multiple imputation that is easily applicable when the outcome model is a generalized linear model. We term our proposed approach multiple imputation marginalization (MIM). MIM consists of two main stages: the generation of synthetic datasets and their analysis. MIM accommodates a Bayesian statistical framework, which naturally allows for the principled propagation of uncertainty, integrates the analysis into a probabilistic framework, and allows for the incorporation of prior evidence. Results We conduct a simulation study to benchmark the finite-sample performance of MIM in conjunction with a parametric outcome model. The simulations provide proof-of-principle in scenarios with binary outcomes, continuous-valued covariates, a logistic outcome model and the marginal log odds ratio as the target effect measure. When parametric modeling assumptions hold, MIM yields unbiased estimation in the target covariate distribution, valid coverage rates, and similar precision and efficiency than the standard approach to model-based standardization. Conclusion We demonstrate that multiple imputation can be used to marginalize over a target covariate distribution, providing appropriate inference with a correctly specified parametric outcome model and offering statistical performance comparable to that of the standard approach to model-based standardization.

Published in BMC Medical Research Methodology

ISSN: 1471-2288 (Online)
Publisher: BMC
Country of publisher: United Kingdom
LCC subjects: Medicine: Medicine (General)
Website: http://bmcmedresmethodol.biomedcentral.com

About the journal

Abstract

Keywords