Comparison of model-building strategies for excess hazard regression models in the context of cancer epidemiology

Camille Maringe; Aurélien Belot; Francisco Javier Rubio; Bernard Rachet

doi:10.1186/s12874-019-0830-9

BMC Medical Research Methodology (Nov 2019)

Comparison of model-building strategies for excess hazard regression models in the context of cancer epidemiology

Camille Maringe,
Aurélien Belot,
Francisco Javier Rubio,
Bernard Rachet

Affiliations

Camille Maringe: Cancer Survival Group, Faculty of Epidemiology and Population Health, London School of Hygiene and Tropical Medicine
Aurélien Belot: Cancer Survival Group, Faculty of Epidemiology and Population Health, London School of Hygiene and Tropical Medicine
Francisco Javier Rubio: Department of Mathematics, King’s college
Bernard Rachet: Cancer Survival Group, Faculty of Epidemiology and Population Health, London School of Hygiene and Tropical Medicine

DOI: https://doi.org/10.1186/s12874-019-0830-9
Journal volume & issue: Vol. 19, no. 1
pp. 1 – 18

Abstract

Read online

Abstract Background Large and complex population-based cancer data are becoming broadly available, thanks to purposeful linkage between cancer registry data and health electronic records. Aiming at understanding the explanatory power of factors on cancer survival, the modelling and selection of variables need to be understood and exploited properly for improving model-based estimates of cancer survival. Method We assess the performances of well-known model selection strategies developed by Royston and Sauerbrei and Wynant and Abrahamowicz that we adapt to the relative survival data setting and to test for interaction terms. Results We apply these to all male patients diagnosed with lung cancer in England in 2012 (N = 15,688), and followed-up until 31/12/2015. We model the effects of age at diagnosis, tumour stage, deprivation, comorbidity and emergency presentation, as well as interactions between age and all of the above. Given the size of the dataset, all model selection strategies favoured virtually the same model, except for a non-linear effect of age at diagnosis selected by the backward-based selection strategies (versus a linear effect selected otherwise). Conclusion The results from extensive simulations evaluating varying model complexity and sample sizes provide guidelines on a model selection strategy in the context of excess hazard modelling.

Published in BMC Medical Research Methodology

ISSN: 1471-2288 (Online)
Publisher: BMC
Country of publisher: United Kingdom
LCC subjects: Medicine: Medicine (General)
Website: http://bmcmedresmethodol.biomedcentral.com

About the journal

Abstract

Keywords