G-computation, propensity score-based methods, and targeted maximum likelihood estimator for causal inference with different covariates sets: a comparative simulation study

Arthur Chatton; Florent Le Borgne; Clémence Leyrat; Florence Gillaizeau; Chloé Rousseau; Laetitia Barbin; David Laplaud; Maxime Léger; Bruno Giraudeau; Yohann Foucher

doi:10.1038/s41598-020-65917-x

Scientific Reports (Jun 2020)

G-computation, propensity score-based methods, and targeted maximum likelihood estimator for causal inference with different covariates sets: a comparative simulation study

Arthur Chatton,
Florent Le Borgne,
Clémence Leyrat,
Florence Gillaizeau,
Chloé Rousseau,
Laetitia Barbin,
David Laplaud,
Maxime Léger,
Bruno Giraudeau,
Yohann Foucher

Affiliations

Arthur Chatton: INSERM UMR 1246 - SPHERE, Université de Nantes, Université de Tours
Florent Le Borgne: INSERM UMR 1246 - SPHERE, Université de Nantes, Université de Tours
Clémence Leyrat: INSERM UMR 1246 - SPHERE, Université de Nantes, Université de Tours
Florence Gillaizeau: INSERM UMR 1246 - SPHERE, Université de Nantes, Université de Tours
Chloé Rousseau: INSERM UMR 1246 - SPHERE, Université de Nantes, Université de Tours
Laetitia Barbin: Centre Hospitalier Universitaire de Nantes
David Laplaud: Centre Hospitalier Universitaire de Nantes
Maxime Léger: INSERM UMR 1246 - SPHERE, Université de Nantes, Université de Tours
Bruno Giraudeau: INSERM UMR 1246 - SPHERE, Université de Nantes, Université de Tours
Yohann Foucher: INSERM UMR 1246 - SPHERE, Université de Nantes, Université de Tours

DOI: https://doi.org/10.1038/s41598-020-65917-x
Journal volume & issue: Vol. 10, no. 1
pp. 1 – 13

Abstract

Read online

Abstract Controlling for confounding bias is crucial in causal inference. Distinct methods are currently employed to mitigate the effects of confounding bias. Each requires the introduction of a set of covariates, which remains difficult to choose, especially regarding the different methods. We conduct a simulation study to compare the relative performance results obtained by using four different sets of covariates (those causing the outcome, those causing the treatment allocation, those causing both the outcome and the treatment allocation, and all the covariates) and four methods: g-computation, inverse probability of treatment weighting, full matching and targeted maximum likelihood estimator. Our simulations are in the context of a binary treatment, a binary outcome and baseline confounders. The simulations suggest that considering all the covariates causing the outcome led to the lowest bias and variance, particularly for g-computation. The consideration of all the covariates did not decrease the bias but significantly reduced the power. We apply these methods to two real-world examples that have clinical relevance, thereby illustrating the real-world importance of using these methods. We propose an R package RISCA to encourage the use of g-computation in causal inference.

Published in Scientific Reports

ISSN: 2045-2322 (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Medicine; Science
Website: https://www.nature.com/srep/

About the journal