PLoS ONE (Jan 2023)

Capturing drug use patterns at a glance: An n-ary word sufficient statistic for repeated univariate categorical values.

  • Gabriel J Odom,
  • Laura Brandt,
  • Clinton Castro,
  • Sean X Luo,
  • Daniel J Feaster,
  • Raymond R Balise,
  • CTN-0094 Team

DOI
https://doi.org/10.1371/journal.pone.0291248
Journal volume & issue
Vol. 18, no. 9
p. e0291248

Abstract

Read online

IntroductionThe efficacy of treatments for substance use disorders (SUD) is tested in clinical trials in which participants typically provide urine samples to detect whether the person has used certain substances via urine drug screenings (UDS). UDS data form the foundation of treatment outcome assessment in the vast majority of SUD clinical trials. However, existing methods to calculate treatment outcomes are not standardized, impeding comparability between studies and prohibiting reproducibility of results.MethodsWe extended the concept of a binary UDS variable to multiple categories: "+" [positive for substance(s) of interest], "-" [negative for substance(s)], "o" [patient failed to provide sample], "*" [inconclusive or mixed results], and "_" [no specimens required per study design]. This construct can be used to create a standardized and sufficient representation of UDS datastreams and sufficiently collapses longitudinal records into a single, compact "word", which preserves all information contained in the original data.ResultsWe developed the R software package CTNote (available on CRAN) as a tool to enable computers to parse these "words". The software package contains five groups of routines: detect a substance use pattern, account for a specific trial protocol, handle missing UDS data, measure the longest period of consecutive behavior, and count substance use events. Executing permutations of these routines result in algorithms which can define SUD clinical trial endpoints. As examples, we provide three algorithms to define primary endpoints from seminal SUD clinical trials.DiscussionRepresenting substance use patterns as a "word" allows researchers and clinicians an "at a glance" assessment of participants' responses to treatment over time. Further, machine readable use pattern summaries are a standardized method to calculate treatment outcomes and are therefore useful to all future SUD clinical trials. We discuss some caveats when applying this data summarization technique in practice and areas of future study.