Injury Epidemiology (Oct 2024)

Using data fusion with multiple imputation to correct for misclassification in self-reported exposure: a case-control study of cannabis use and homicide victimization

  • Seonghun Lee,
  • Guohua Li,
  • Stanford Chihuri,
  • Yuanzhi Yu,
  • Qixuan Chen

DOI
https://doi.org/10.1186/s40621-024-00545-x
Journal volume & issue
Vol. 11, no. 1
pp. 1 – 10

Abstract

Read online

Abstract Background Cannabis use has been causally linked to violent behaviors in experimental and case studies, but its association with homicide victimization has not been rigorously assessed through epidemiologic research. Methods We performed a case-control analysis using two national data systems. Cases were homicide victims from the National Violent Death Reporting System (NVDRS), and controls were participants from the National Survey on Drug Use and Health (NSDUH). While the NVDRS contained toxicological testing data on cannabis use, the NSDUH only collected self-reported data, and thus the potential misclassification in the self-reported data needed to be corrected. We took a data fusion approach by concatenating the NSDUH with a third data system, the National Roadside Survey of Alcohol and Drug Use by Drivers (NRS), which collected toxicological testing and self-reported data on cannabis use for drivers. The data fusion approach provided multiple imputations (MIs) of toxicological testing results on cannabis use for the participants in the NSDUH, which were then used in the case-control analysis. Bootstrap was used to obtain valid statistical inference. Results The analyses revealed that cannabis use was associated with 3.55-fold (95% CI: 2.75–4.35) increased odds of homicide victimization. Alcohol use, being Black, male, aged 21–34 years, and having less than a high school education were also significantly associated with increased odds of homicide victimization. Conclusions Cannabis use is a major risk factor for homicide victimization. The data fusion with MI method is useful in integrative data analysis for harmonizing measures between different data sources.

Keywords