Issue 9, 2005

Benford's Law and the screening of analytical data: the case of pollutant concentrations in ambient air

Abstract

The need to ensure the robustness of very large data sets produced by analytical measurement processes is increasing. This requires data-screening techniques that can identify formatting or transcription errors in large data sets that have undergone multiple data-handling and manipulation procedures. The empirical observation that the digits 1 to 9 are not equally likely to appear as the initial digit in multi-digit numbers is known as Benford's Law, and may provide a solution to this requirement. Several sets of data on the measured concentrations of pollutants in ambient air in the UK in 2004 have been analysed for their initial digit frequencies, in order to assess the potential of Benford's Law as a data-screening and authenticity-checking tool for these types of analytical data sets. Benford's Law has been shown to be a robust top-level data-screening tool, provided that the numerical range of the data set being considered is four orders of magnitude or greater. It has also been shown that small changes in the deviation of a data set from Benford's Law may indicate the introduction of errors during data processing. In this way, Benford's Law provides a sensitive technique for identifying data mishandling in large data sets.
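As an illustration of the kind of screening described in the abstract, the following is a minimal Python sketch that compares the observed initial digit frequencies of a data set with those expected under Benford's Law, in its usual form P(d) = log10(1 + 1/d) for d = 1 to 9. The function names are hypothetical, and the deviation measure used here (a sum of squared differences between observed and expected frequencies) is an assumption chosen for simplicity; the abstract does not state which measure the paper uses.

```python
import math
from collections import Counter


def benford_expected():
    """Expected initial digit probabilities, P(d) = log10(1 + 1/d), d = 1..9."""
    return {d: math.log10(1 + 1 / d) for d in range(1, 10)}


def first_digit(x):
    """Return the first significant digit of a finite, nonzero number."""
    # Scientific notation puts the leading digit first, e.g. '3.4...e-02'.
    return int(f"{abs(x):.15e}"[0])


def first_digit_frequencies(values):
    """Observed relative frequency of each initial digit; zeros are skipped."""
    digits = [first_digit(v) for v in values if v != 0]
    if not digits:
        raise ValueError("no nonzero values to analyse")
    counts = Counter(digits)
    return {d: counts.get(d, 0) / len(digits) for d in range(1, 10)}


def benford_deviation(values):
    """Sum of squared differences between observed and expected frequencies.

    This particular measure is an assumption made for illustration only.
    """
    observed = first_digit_frequencies(values)
    expected = benford_expected()
    return sum((observed[d] - expected[d]) ** 2 for d in range(1, 10))


if __name__ == "__main__":
    # Synthetic data spread evenly on a log scale over four orders of magnitude.
    wide_range = [10 ** (k / 1000) for k in range(4000)]
    # Synthetic data in which every initial digit is equally likely.
    uniform_digits = list(range(1, 10)) * 100
    print(benford_deviation(wide_range))
    print(benford_deviation(uniform_digits))
```

In a screening workflow of this kind, the deviation could be computed before and after each data-handling step, with a change between steps prompting closer inspection of that step. The synthetic inputs above are not measurement data and serve only to exercise the functions.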


Article information

Article type: Paper
Submitted: 30 Mar 2005
Accepted: 29 Jun 2005
First published: 26 Jul 2005

Analyst, 2005, 130, 1280-1285


R. J. C. Brown, Analyst, 2005, 130, 1280 DOI: 10.1039/B504462F
