Annals of Medicine (Jan 2021)

Effects of data preprocessing on results of the epidemiological analysis of coronary heart disease and behaviour-related risk factors

  • Ari Voutilainen,
  • Christina Brester,
  • Mikko Kolehmainen,
  • Tomi-Pekka Tuomainen

DOI
https://doi.org/10.1080/07853890.2021.1921838
Journal volume & issue
Vol. 53, no. 1
pp. 890 – 899

Abstract

Read online

AbstractBackground We carried out this study to demonstrate the effects of outcome sensitivity, participant exclusions, and covariate manipulations on results of the epidemiological analysis of coronary heart disease (CHD) and its behaviour-related risk factors.Material and methods Our study population consisted of 1592 54-year-old men, who participated in the Kuopio Ischaemic Heart Disease Risk Factor (KIHD) Study. We used the Cox proportional-hazards model to predict the hazard of CHD and applied different sets of outcomes concerning outcome sensitivity and data preprocessing procedures regarding participant exclusions and covariate manipulations.Results The mean follow-up time was 23 years, and 730 men received the CHD diagnosis. Cox regressions based on data with no participant exclusions most often discovered statistically significant associations. Loose inclusion criteria for study participants with any CVD during the follow-up and strict exclusion criteria for participants with no CVD were best in discovering the associations between risk factors and CHD. Outcome sensitivity affected the associations, whereas the covariate type, continuous or categorical, did not.Conclusions This study suggests that excluding study participants who are not disease-free at baseline is probably unnecessary for epidemiological analyses. Epidemiological research reports should present results based on no data exclusions together with results based on reasoned exclusions.

Keywords