Michael
Thompson
School of Science, Birkbeck University of London, Malet Street, London, WC1E 7HX, UK. E-mail: m.thompson@bbk.ac.uk
First published on 17th May 2010
In proficiency tests the participants' results are usually converted into scores. In some schemes the participants are required to report the sum of the concentrations of a number of analytes and this total also is converted into a score. In such instances the scoring procedure for the total should be mathematically consistent with that for the separate analytes. When these analytes are determined from a single test portion, however, the errors in the results from a participant are likely to be correlated because some stages of the analysis are common to all of the analytes. For a consistent outcome, the scoring method must take account of such correlation.
In a round of a proficiency test, most scheme providers convert a participant's analytical result x into a z-score given by z = (x − xA)/σp, where the assigned value xA is the provider's best estimate of the true value. The assigned value is usually taken as the consensus of all of the participants' results, often a robust mean. The standard deviation for proficiency σp in a round—informally called the ‘target value’—is preferably a prescribed uncertainty that characterises fitness for purpose in the appropriate analytical sector.1 It is important to emphasise, for the purposes of the following discussion, that such a target value is not designed to characterise the participants' results but to specify in advance the required performance—it is independent of the participants' results. This σp value is predetermined by the scheme provider and available to the participants at the time of analysis.
In some instances, however, legislation or custom requires laboratories to report the total concentration of a group of analytes, “total aflatoxins” for example, as well as the individual concentrations. For the purposes of proficiency testing the total concentration should then be scored as well. However, the target value for the total concentration should be mathematically consistent with the target values used for the individual analytes. Were that not the case, a participant could conceivably receive ‘satisfactory’ z-scores for all of the individual analytes but an ‘unsatisfactory’ z-score for the total concentration. (‘Satisfactory’ and ‘unsatisfactory’ are used non-pejoratively as informal labels for the conditions |z| < 2 and |z| > 3 respectively.)
A consistent target value for the total σp(T) must take account of the covariances2 covp(i,j) between the analytesi and j, so that
![]() | (1) |
Here, however, the covariances (like the individual σp(i)values) have a special prescriptive meaning so cannot be estimated directly from the participants' results. They have to be evaluated from the prescribed individual target values and the observed correlation coefficients r(i, j) between analytesi and j over all participants, via the identity covp(i, j) = r(i, j)σp(i)σp(j). Then the consistent target value for the total concentration is
![]() | (2) |
In the absence of a knowledge of R, a possible ‘cautious’ approach would be to assume that the analytes are perfectly correlated, that is, with all r(i, j) = 1. Eqn (1) and (2) then reduce to σ′p(T) = ∑iiiσσpp(ii). A contrasting expedient, here called ‘naive’, would be to ignore any covariance by assuming that all of the correlation coefficients are zero-valued. The target value for the total concentration is then given by . The discrepancy between these extremes can be substantial, approaching a factor of
when the σp values are comparable in magnitude. A poor choice between these extremes could therefore have a potentially serious effect on z-scores for the total concentration.
B1 | B2 | G1 | G2 | |
B1 | 1.00 | 0.67 | 0.38 | 0.30 |
B2 | 0.67 | 1.00 | 0.45 | 0.76 |
G1 | 0.38 | 0.45 | 1.00 | 0.18 |
G2 | 0.30 | 0.76 | 0.18 | 1.00 |
The three alternative target values for the total aflatoxins were calculated as:
• consistent σp(T) = 2.09;
• ‘cautious’ σ′p(T) = 2.68;
• ‘naive’ σ″p(T) = 1.42.
The corresponding z-scores for total aflatoxins are shown in Fig. 1. There are substantial differences among the three interpretations. The ‘naive’ method for calculating σp(T) identified 12 laboratories with |z| > 3 (i.e., requiring investigative and/or remedial action on the part of the participant) but the ‘cautious’ method identified only four. The consistent method necessarily gave an outcome between these extremes. (Note: in FAPAS the value of σp(T) used falls close to ‘cautious’ and thus avoids scores of unduly large magnitude.)
![]() | ||
Fig. 1 z-Scores for total aflatoxins calculated by using different estimates of the target value. Each point shows the result from a single participant. Two extreme outliers are not shown. |
In these contexts it is important also to be aware that the correlations among a particular suite of analytes may vary considerably from round to round. This is largely determined by the concentrations of the analytes: near detection limits substantial random contributions will almost eliminate any correlations that might predominate at higher concentrations. Because of this circumstance the use of a single ‘global’ estimate of R for successive rounds would probably be inappropriate.
This journal is © The Royal Society of Chemistry 2010 |