Emerald Open Research (Nov 2021)

A data transformation process for using Benford’s Law with bounded data [version 1; peer review: 2 approved]

  • Daniel McCarville

Journal volume & issue
Vol. 3

Abstract

Read online

Benford’s Law is an empirical observation about the frequency of digits in a variety of naturally occurring data sets. Auditors and forensic scientists have used Benford’s Law to detect erroneous data in accounting and legal usage. One well-known limitation is that Benford’s Law fails when data have clear minimum and maximum values. Many kinds of education data, including assessment scores, typically include hard maximums and therefore do not meet the parametric assumptions of Benford’s Law. This paper implements a transformation procedure which allows for assessment data to be compared to Benford’s Law. As a case study, a data quality assessment of oral language scores from the Early Childhood Longitudinal Study, Kindergarten (ECLS-K) study is used and higher risk data segments detected. The same method could be used to evaluate other concerns, such as test fraud, or other bounded datasets.

Keywords