MethodsX (Jan 2021)

A Benford's law based method for fraud detection using R Library

  • Caio da Silva Azevedo,
  • Rodrigo Franco Gonçalves,
  • Vagner Luiz Gava,
  • Mauro de Mesquita Spinola

Journal volume & issue
Vol. 8
p. 101575

Abstract

Read online

ABSTRACT: Benford Law (BL) states that the occurrence of significant digits in many natural and human phenomena data sets are not uniformly scattered, as one could naively expect, but follow a logarithmic-type distribution. Here, we present a method that consists of the use of BL analysis over first and first-two digits, three statistical conformity tests – Z-statistics, Mean Absolute Deviation (MAD) and Chi-square (χ2) as well as the summation test which looks for excessively large numbers, having fraud detection as one of its application. We developed the method for fraud detection in the case of the Brazilian Bolsa Familia welfare program. In this case, we submitted four periods of Brazilian welfare program payments to the method with a dataset of 13,442,529 records. We provide a practical implementation of the method based on open-source R library released on a public repository. Furthermore, code implementation of the algorithm as well as datasets are freely available. Advantages of the algorithm are listed below:• The method was developed based on open source libraries• The technique is simple, rapid and ease of use• Easily applicable to other social welfare program auditing

Keywords