BMC Medical Research Methodology (Jul 2018)

Estimating causal effects of time-dependent exposures on a binary endpoint in a high-dimensional setting

  • Vahé Asvatourian,
  • Clélia Coutzac,
  • Nathalie Chaput,
  • Caroline Robert,
  • Stefan Michiels,
  • Emilie Lanoy

DOI
https://doi.org/10.1186/s12874-018-0527-5
Journal volume & issue
Vol. 18, no. 1
pp. 1 – 12

Abstract

Read online

Abstract Background Recently, the intervention calculus when the DAG is absent (IDA) method was developed to estimate lower bounds of causal effects from observational high-dimensional data. Originally it was introduced to assess the effect of baseline biomarkers which do not vary over time. However, in many clinical settings, measurements of biomarkers are repeated at fixed time points during treatment and, therefore, this method needs to be extended. The purpose of this paper is to extend the first step of the IDA, the Peter Clarks (PC)-algorithm, to a time-dependent exposure in the context of a binary outcome. Methods We generalised the so-called “PC-algorithm” to take into account the chronological order of repeated measurements of the exposure and proposed to apply the IDA with our new version, the chronologically ordered PC-algorithm (COPC-algorithm). The extension includes Firth’s correction. A simulation study has been performed before applying the method for estimating causal effects of time-dependent immunological biomarkers on toxicity, death and progression in patients with metastatic melanoma. Results The simulation study showed that the completed partially directed acyclic graphs (CPDAGs) obtained using COPC-algorithm were structurally closer to the true CPDAG than CPDAGs obtained using PC-algorithm. Also, causal effects were more accurate when they were estimated based on CPDAGs obtained using COPC-algorithm. Moreover, CPDAGs obtained by COPC-algorithm allowed removing non-chronological arrows with a variable measured at a time t pointing to a variable measured at a time t´ where t´ < t. Bidirected edges were less present in CPDAGs obtained with the COPC-algorithm, supporting the fact that there was less variability in causal effects estimated from these CPDAGs. In the example, a threshold of the per-comparison error rate of 0.5% led to the selection of an interpretable set of biomarkers. Conclusions The COPC-algorithm provided CPDAGs that keep the chronological structure present in the data and thus allowed to estimate lower bounds of the causal effect of time-dependent immunological biomarkers on early toxicity, premature death and progression.

Keywords