Arthritis Research & Therapy (May 2017)

Models solely using claims-based administrative data are poor predictors of rheumatoid arthritis disease activity

  • Brian C. Sauer,
  • Chia-Chen Teng,
  • Neil A. Accortt,
  • Zachary Burningham,
  • David Collier,
  • Mona Trivedi,
  • Grant W. Cannon

DOI
https://doi.org/10.1186/s13075-017-1294-0
Journal volume & issue
Vol. 19, no. 1
pp. 1 – 13

Abstract

Read online

Abstract Background This study developed and validated a claims-based statistical model to predict rheumatoid arthritis (RA) disease activity, measured by the 28-joint count Disease Activity Score (DAS28). Method Veterans enrolled in the Veterans Affairs Rheumatoid Arthritis (VARA) registry with one year of data available for review before being assessed by the DAS28, were studied. Three models were developed based on initial selection of variables for analyses. The first model was based on clinically defined variables, the second leveraged grouping systems for high dimensional data and the third approach prescreened all possible predictors based on a significant bivariate association with the DAS28. The least absolute shrinkage and selection operator (LASSO) with fivefold cross-validation was used for variable selection and model development. Models were also compared for patients with 5.1) activity. Results There were 1582 Veterans who fulfilled inclusion criteria. The adjusted r-square for the three models tested ranged from 0.221 to 0.223. The models performed slightly better for patients with <5 years of RA disease than for patients with ≥5 years of RA disease. Correct classification of DAS28 categories ranged from 39.9% to 40.5% for the three models. Conclusion The multiple models tested showed weak overall predictive accuracy in measuring DAS28. The models performed poorly at predicting patients with remission and high disease activity. Future research should investigate components of disease activity measures directly from medical records and incorporate additional laboratory and other clinical data.

Keywords