Clinical Chemistry AACC Online Job Center
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


Clinical Chemistry 50: 891-900, 2004. First published March 9, 2004; 10.1373/clinchem.2003.027953
This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Data Supplement
Right arrow All Versions of this Article:
clinchem.2003.027953v1
50/5/891    most recent
Right arrow Submit an electronic Letter to
the Editor about this paper
Right arrow Alert me when this article is cited
Right arrow Alert me when eLetters are posted
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via ISI Web of Science (8)
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Lahti, A.
Right arrow Articles by Solberg, H. E.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Lahti, A.
Right arrow Articles by Solberg, H. E.
Related Collections
Right arrow Laboratory Management
Right arrow General Clinical Chemistry
Right arrow Evidence Based Laboratory Medicine and Test Utilization
(Clinical Chemistry. 2004;50:891-900.)
© 2004 American Association for Clinical Chemistry, Inc.


Laboratory Management

Partitioning of Nongaussian-Distributed Biochemical Reference Data into Subgroups

Ari Lahti1,a, Per Hyltoft Petersen2,3, James C. Boyd4, Pål Rustad5, Petter Laake6 and Helge Erik Solberg1

1 Department of Clinical Chemistry, Rikshospitalet University Hospital of Oslo, Oslo, Norway. 2 Department of Clinical Biochemistry, Odense University Hospital, Odense, Denmark. 3 NOKLUS, Norwegian Centre for External Quality Assurance of Primary Care Laboratories, Division for General Practice, University of Bergen, Bergen, Norway. 4 Department of Pathology, University of Virginia Health System, Charlottesville, VA. 5 Fürst Medical Laboratory, Oslo, Norway. 6 Section of Medical Statistics, University of Oslo, Oslo, Norway.

aAddress correspondence to this author at: Department of Clinical Chemistry, Rikshospitalet University Hospital of Oslo, N-0027 Oslo, Norway. Fax 47-2307-1080; e-mail ari.lahti{at}rikshospitalet.no.


   Abstract
Top
Abstract
Introduction
Theoretical Considerations
Discussion
Conclusions
References
 
Background: The aim of this study was to develop new methods for partitioning biochemical reference data, covering in particular nongaussian distributions.

Methods: We recently proposed partitioning criteria for gaussian distributions. These criteria relate to proportions of the subgroups outside each of the reference limits of the combined distribution (proportion criteria) and to distances between the subgroup distributions as correlates of these proportions (distance criteria). However, distance criteria do not seem to be ideal for nongaussian distributions because a generally valid relationship between proportions and distances cannot be established for these.

Results: Proportion criteria appear preferable to distance criteria for two additional reasons: (a) The prevalences of the subgroup populations may have a considerable effect on stratification, but these are hard to account for by using distance criteria. Two methods to handle prevalences are described, the root method and the multiplication method. (b) Tied reference values, another complication of the partitioning problem, could also be hard to take care of using distance criteria. Some solutions to the problems caused by tied reference values are suggested.

Conclusions: Partitioning of biochemical reference data should preferably be based on proportion criteria; this is particularly true for nongaussian distributions. Both of the described complications of the partitioning problem, the prevalences of the subgroups and tied reference values, are hard to deal with using distance criteria, but the proposed methods make it possible to account for them when proportion criteria are applied.


   Introduction
Top
Abstract
Introduction
Theoretical Considerations
Discussion
Conclusions
References
 
We have recently proposed a new method for partitioning biochemical reference data into subgroups (1)(2). This method was developed with gaussian distributions as a starting point because the statistical properties of those distributions are well known. Hence, a partitioning method based on gaussian distributions can be made mathematically accurate. Although the distributions of biochemical reference data are sometimes gaussian or can be converted to gaussian distributions by use of appropriate mathematical transformations, this is often not the case. Generally valid partitioning criteria that could be used for all types of distributions are therefore required, but such criteria do not seem to exist in the literature.

The partitioning criteria presented by Harris and Boyd (3)(4) were developed using gaussian distributions, and although practitioners seem frequently to apply them to nongaussian distributions as well, their applicability to these is questionable. In contrast to what many users of these criteria seem to believe, the basic idea of the Harris–Boyd method is not to perform statistical significance tests for distances between means, although a modified normal deviate test is part of this method. Rather, Harris and Boyd aimed at correlating these distances to proportions of the subgroup distributions outside the reference limits of the combined distribution. This correlation, established for gaussian distributions by Harris and Boyd, is not automatically valid for nongaussian distributions, however, because the proportions obtained for nongaussian distributions at particular distances between means can be quite different from those obtained for gaussian distributions at the same distances. The conclusions on partitioning made by applying the same criteria to nongaussian distributions can therefore be erroneous.

Because nongaussian distributions do not have a standard shape, generally valid mathematical expressions to correlate distances between them to proportions of the subgroup distributions outside the common reference limits cannot be derived. Hence, we focus here on measuring the proportions directly. We present numerical methods to solve the partitioning problem that are applicable to any kind of two distributions and that also account for the prevalences of the subgroups. We also discuss the importance of multiple or "tied" reference values and suggest solutions for this complication of the partitioning problem.


   Theoretical Considerations
Top
Abstract
Introduction
Theoretical Considerations
Discussion
Conclusions
References
 
partitioning criteria for nongaussian distributions
For the two nongaussian subgroup distributions a and b in Fig. 1 , their lower reference limits (thin vertical bars in Fig. 1 ) and that of the combined distribution (thick vertical bar) lie at the 2.5 percentile of the respective distributions, assuming the conventional definition of a reference interval as covering the central 95% of a reference distribution (5). Both the partitioning method suggested by Harris and Boyd (3)(4) and that suggested by us (1)(2) are essentially based on observing the proportions of the subgroup distributions outside the common reference limits (pa and pb in Fig. 1 ). These proportions should be kept reasonably close to 2.5% to avoid major changes in the sensitivity and specificity of the laboratory test in the subgroups from those values obtained by use of subgroup-specific reference limits. In the case of gaussian distributions it is possible to mathematically determine the common reference limits (although the combined distribution of two gaussian distributions is nongaussian), enabling explicit calculation of the proportions (1). Once such a method was established, it also became possible to correlate the proportions to the distance between the reference limits of the subgroup distributions and to define critical values for this distance as correlates of the critical values for the proportions, to be used as partitioning criteria for gaussian subgroup distributions (1)(2).



View larger version (27K):
[in this window]
[in a new window]
 
Figure 1. Two nongaussian subgroup distributions.

The lower ends of two nongaussian subgroup distributions a and b are shown. The lower reference limits, or the 2.5 percentiles of the respective distributions, are indicated by thin vertical bars, and the lower reference limit of the combined distribution (not shown) is depicted by a thick vertical bar. The ratio of prevalences (Fr) is assumed to be equal to 1.0, and each of the reference values in both subgroups is assumed to be unique. Under these conditions, the shaded areas on both sides of the common reference limit are kept equal to each other as the distance D between the reference limits of the subgroup distributions is changed because only in that way will the proportion of the combined distribution outside the common reference limit remain constant, or 2.5%. The localization of the common reference limit is thereby unique, and it corresponds to unique values of the proportions pa and pb (percentages of the respective distributions outside the common reference limit) for each value of the distance D.

As was stated above, in general situations similar mathematical expressions, correlating distances and proportions to each other, cannot be derived for nongaussian distributions. Hence, their partitioning must be based on proportion criteria because the distance criteria are just more or less accurate correlates of these, whereas control of the proportions is the primary concern. Therefore, direct estimation of the proportions appears necessary when partitioning nongaussian distributions.

Because proportion criteria have previously not been expressed or used explicitly, we had to prepare a suggestion for such criteria (1). A natural starting point for our suggestion was that Harris and Boyd (3) considered proportions exceeding 4.0% to be high enough to imply partitioning. This value is not presented as an explicit partitioning criterion in their method, however, but because they used it to derive their criteria, that particular proportion has probably been widely used by clinical chemists as an implicit partitioning criterion, considering the fact that the Harris–Boyd method lies behind the NCCLS recommendations on partitioning (6). Although the correlation between distances and proportions seems to be rather poor in this method (1), it is hoped that the value of 4.0% does not lie far from an ideal proportion criterion for partitioning. Because the range of possible values for this criterion is limited by 5.0% from above in a standard situation (1)(2), every 0.1% matters when such an ideal proportion is specified.

In a subsequent publication, Harris and Boyd (4) were of the opinion that their original distance criterion might be too permissive of partitioning. This suggests that if the ideal proportion criterion deviates from the value of 4.0%, that value should be increased rather than decreased from the originally proposed one. We observed (1) that for gaussian distributions, 4.1% corresponded to an approximate critical distance of 0.75 s (standard deviations) for partitioning, an easy-to-remember distance criterion together with our suggestion of 0.25 s as a criterion for combining of the subgroups. These two arguments made us choose the value of 4.1% as our primary proportion criterion. This criterion is, at present, just a suggestion that needs to be tested extensively in practical work to evaluate its true properties as a partitioning criterion.

Instead of setting only one critical proportion, categorically implying partitioning, we proposed (1)(2) the following three-stage classification: In addition to the value of 4.1%, favoring calculation of subgroup-specific reference intervals if it was exceeded, we suggested that the value of 3.2% could be appropriate as a critical proportion for combining the subgroups, justifying use of common reference limits for both of them if it was not exceeded. These values concern the larger one of the subgroup proportions outside a common reference limit (pa in Fig. 1Up ). Because the sum of the larger and the smaller proportion at each end of the distributions is equal to 5.0% in a standard situation (1)(2), these suggestions simultaneously implied that the critical partitioning and combining proportions for the other one of the subgroup distributions (pb in Fig. 1Up ) should be 0.9% and 1.8%, respectively.

We recommended partitioning if any of the four proportions (two at the lower and two at the upper end of the distributions) outside the common reference limits was >=4.1% or <=0.9% and combining if all of these proportions lay between 1.8% and 3.2%. Other types of outcomes were considered as marginal, or as nonconclusive if taken alone. We further recommended that other than purely statistical considerations, e.g., clinical judgment and data from the literature, should be involved in the decision-making on partitioning in these cases. The three-stage classification makes application of the method more flexible than if only two classes of proportions were in use, those implying partitioning and those implying combining of the subgroups. This flexibility is, in our opinion, necessary as long as one has only limited experience on the application of the proposed criteria to real-life partitioning problems.

accounting for the prevalences of the subgroups
Importance of the prevalences for the partitioning problem.
To apply the proportion criteria to nongaussian distributions, one has first to identify their common reference limits. This is ordinarily done by pooling together the two subgroups and determining the common reference limits as the nonparametric 2.5 and 97.5 percentiles of these combined data.

However, to obtain correct conclusions on partitioning, the ratio between the numbers of reference values in the subgroups should reflect the ratio between the prevalences of the subgroup populations in the reference population (2). This is often not the case. In real-life reference interval studies, the ratios between the numbers of reference values in the subgroups can in fact be almost anything compared with the respective ratios between prevalences. Typically, one aims at collecting equal numbers of reference values in each subgroup (e.g., age group), ignoring and neglecting the underlying ratios of prevalences.

Samples used in reference interval studies should be representative of the subgroup populations from which they are extracted in every respect, including the proportional sizes, whenever decisions involving both subgroups, such as those on partitioning, need to be made. To illustrate how easy it is to draw false conclusions, we provide a concrete example from the ongoing Nordic Reference Interval Project (7)(8). This project is a joint reference interval project of the five Nordic countries (Denmark, Finland, Iceland, Norway, and Sweden) that aims at establishing common reference intervals for 25 selected clinical laboratory tests in these countries. The numbers of participating laboratories and recruited reference persons varied among the countries and did not reflect their population sizes. As an example, more data were obtained from Norway, which has a population of 4.5 million, than from Sweden, which has a population of 8.9 million. In making comparisons between countries and examining, for example, whether Norway and Sweden possibly need country-specific reference intervals for a specific biochemical marker, it is important to account for their population sizes in the calculations instead of simply putting together the collected data and making the conclusions on partitioning from these.

The distributions obtained from Norway and Sweden for serum calcium are shown schematically in Fig. 2 . The empirical distributions were in reality not strictly gaussian, but we plotted them as gaussian to obtain smooth and illustrative curves. The two lowermost curves (SwedenProject and Norway) depict the distribution for Sweden as scaled by 0.65 with respect to that for Norway, reflecting the ratio between the numbers of reference values obtained for serum calcium from these two countries (532 and 819, respectively). The lower common reference limit for these distributions would lead to the conclusion that Norway and Sweden should use country-specific reference intervals for serum calcium because a proportion of 4.2% was obtained for Sweden, whereas the critical proportion for partitioning was 4.1%. The two uppermost curves (SwedenPopulation and Norway) depict the distribution for Sweden as scaled by 1.98 with respect to that for Norway, reflecting the ratio between the population sizes of these countries. Using these distributions in the calculations, we obtained a proportion of only 3.5% for Sweden. The conclusion in this case would be that Norway and Sweden should be able to use the same reference interval for serum calcium because there are few nonstatistical reasons that would require the opposite conclusion to be made (the value of 3.5% is "marginal" as interpreted in terms of our three-stage classification of the proportions, and our recommendation was that one should at such marginal proportions consider nonstatistical arguments as a support for decision-making).



View larger version (15K):
[in this window]
[in a new window]
 
Figure 2. Example illustrating the effect of prevalences on partitioning.

Reference distributions obtained from Norway and Sweden for serum calcium in the Nordic Reference Interval Project (7)(8) are shown schematically. The two lowermost curves (SwedenProject and Norway) depict the distribution for Sweden as scaled by 0.65 with respect to that for Norway, reflecting the quotient between the numbers of reference values obtained for serum calcium from these two countries in the project, whereas the two uppermost curves (SwedenPopulation and Norway) depict the distribution for Sweden as scaled by 1.98 with respect to that for Norway, reflecting the quotient between the population sizes of these countries. The lower reference limit for each country is indicated by a thin vertical bar (observe that the reference limit for the distribution in Sweden is the same for both ways to scale this distribution). The dashed thick bar is the lower reference limit of the combined distribution (not shown) obtained by use of the two lowermost distributions to calculate it, and the solid thick bar is this limit obtained by use of the two uppermost distributions to calculate it. These two common reference limits lead to opposite conclusions on partitioning, the dashed bar suggesting that Sweden and Norway should use country-specific reference intervals for serum calcium because the proportion of the distribution of Sweden outside the common reference limit (pSweden,Project) would be 4.2%, and the solid bar suggesting that these countries could use the same reference interval for serum calcium because this proportion (pSweden,Population) would be 3.5% in that case.

In this example we found a substantially smaller proportion (a decrease of 0.7% is substantial indeed on a scale from 2.5% to 5.0%) for Sweden if the population sizes, i.e., the "prevalences" of Swedes and Norwegians, are taken into account, because the weight of the distribution of Sweden compared with that of Norway is much larger than when the prevalences are not considered. The larger weight of the distribution of Sweden displaces the common reference limit toward the reference limit of that distribution, leading to the decrease in the proportion. It should be noted that when the weight of a subgroup is changed, only the common reference limits are changed; the reference limits of the subgroups remain unchanged.

Numerical methods to calculate the proportions of the subgroup distributions outside the common reference limits when the prevalences are accounted for.
From the example presented above, it is clear that there is a danger of drawing incorrect conclusions on partitioning if no adjustments are made to account for inconsistency between the ratio of the numbers of reference values in the subgroups (Nr) and the ratio between their prevalences (Fr). The simplest way to obtain common reference limits that are correctly adjusted to the prevalences would be to make these two ratios equal by eliminating an appropriate number of reference values from one of the subgroups. A solution that involves such a waste of original data would obviously not be ideal, however. In the following paragraphs, two numerical methods are proposed that make use of all the data available. One of these, the "root method", requires no manipulation of the original data, whereas the other one, the "multiplication method", is based on multiplying the subgroup distributions by appropriate factors.

Root method. Assume that the reference values are unique and consider first the case with Fr being equal to 1.0. The sum of the proportions pa and pb (Fig. 1Up ) will then be equal to 5.0%, or 0.05 (2). We denote as xpii a reference value of distribution i corresponding to proportion pi. Reference values, their ranks in the sorted subgroups, and the proportions calculated using these ranks can be treated as continuous real number variables if linear interpolation is applied between the sorted data points. The traditional rank-based method to calculate proportions, recommended by both the IFCC (5) and the NCCLS(6), is as follows:

(1)
where p is a desired proportion, r is the rank corresponding to that proportion, and n is the number of reference values in the distribution. There are other methods in the literature (9), both rank-based and others, for calculation of proportions.

To determine the proportions of two subgroups outside a common reference limit, one should scan the reference values of the sorted combined data vector, interpolating between data points to achieve a desired precision, until a reference value is found that gives values for the proportions such that they sum to 0.05:

(2)
Rearrangement of Eq. 2Up gives:

(3)
Because both the proportions and the common reference limit can be determined by solving Eq. 3Up , we call this the root method. Observe that apart from the solution itself, in the general cases also covering nongaussian distributions, no predefined relationship exists between pa and pb at various reference values. To solve Eq. 3Up , one must convert each reference value x to a rank in both subgroup distributions independently by observing its position in the sorted subgroup data vectors and calculate the respective proportions using expressions like that given above (Eq. 1Up ). Instead of Eq. 3Up , which is seemingly a function of only one variable (x), a more realistic idea of the problem is therefore obtained by considering the following two-variable function:

(4)
The expression {phi}1(pa, pb) in Eq. 4Up can be interpreted as the difference between the actual value for the proportion pb (i.e., this value at the particular reference value—original or interpolated—that is being examined) and the value of pb at the common reference limit (i.e., this value when the reference value being examined is the solution, or the common reference limit). Because this expression will be zero only at the common reference limit, the proportions and the common reference limit can be found by identifying such a reference value that makes {phi}1(pa, pb) equal to zero. As a function of two independent variables, {phi}1(pa, pb) describes a plane and has a zero line rather than a single zero point as its solution, however.

This situation is illustrated in Fig. 3 . The intersection line between the plane {phi}1(pa, pb) and the zero plane {phi}0(pa, pb) is this zero line that is not the solution for a specific pair of subgroup distributions but rather gives the solutions for all such pairs. To find the unique solution for a specific pair on the zero line, one should consider the individual path corresponding to it on the plane {phi}1(pa, pb); the black curve on this plane in Fig. 3 shows an example of such paths. All of these paths start at pa = 0.025 and end at pb = 0.025, whereas the value of the other proportion at both ends can vary between cases, and depict an irregular curve between these endpoints depending on the shapes of the subgroup distributions. Each path describes a strictly increasing function of both pa and pb because the gradient of the plane {phi}1(pa, pb) is (1, 1), and it can also be shown that between the endpoints there exists a zero point (the black circle at the intersection of the path and the zero plane in Fig. 3 ). Conclusively, there is a unique solution for the equation {phi}1(pa, pb) = 0 between the reference limits of the subgroup distributions. A slightly more formal presentation of these considerations is given in the Appendix, which is published in an electronic supplement to this report (http://www.clinchem.org/content/vol50/issue5/). One important condition for this conclusion is, however, that the reference values are unique. This condition is discussed below.



View larger version (28K):
[in this window]
[in a new window]
 
Figure 3. Determination of the proportions pa and pb in the case of unique reference values.

Difference {phi}1(pa, pb) between the proportion of distribution b outside an actual reference value and this proportion if the actual reference value were the common reference limit, plotted as a function of the proportions pa and pb (gridded plane). The intersecting plane is the zero plane {phi}0(pa, pb), and the intersection line between the two planes gives the values of the proportions at which the value of {phi}1(pa, pb) is equal to zero. Each pair of subgroup distributions describes an individual path on the gridded plane that cuts the zero line at one point, indicated by •.

Hence, to find the unique solution for {phi}1(pa, pb) = 0, one can restrict the search to the range of reference values lying between the reference limits of the subgroup distributions. Because this range is usually not very large, sophisticated root-finding methods are not necessary. For example, one could scan the reference values of distribution a until two consecutive reference values are found that give opposite signs for {phi}1(pa, pb) (Fig. 4 ). The interspace between such reference values could subsequently be scanned to a desired precision by interpolation.



View larger version (13K):
[in this window]
[in a new window]
 
Figure 4. Determination of a common reference limit by the root method.

Lower ends of two subgroup distributions a and b are shown schematically as axes having a common scale and representing the values for the laboratory test being examined. A few reference values, assumed to be unique, from each distribution are indicated as ticks, some of them denoted by a letter, with the rank a parenthesized index. The lower reference limits of the subgroup distributions are shown as thin vertical bars, and the lower common reference limit is shown as a thick vertical bar between the distributions. The arrows indicate the localizations of some of the reference values of distribution a, lying between the reference limits of the subgroup distributions, in distribution b. Because the reference values of distribution a do not necessarily coincide with those of distribution b, noninteger ranks are calculated for the reference values of distribution a in distribution b, when needed. The method, described in the text, to determine the common reference limit is based on identification of the unique zero point for the difference between the actual value of the proportion pb and the value of this proportion obtained at the common reference limit. An interval between two consecutive reference values of distribution a [a(–) and a(+)] that includes the common reference limit is first determined, and the common reference limit is identified in this interval by use of a numerical brute force method.

The method just described is readily extendable to the more general cases with unequal prevalences. In such cases the sum of the proportions pa and pb will deviate from 5.0%, depending on the value of Fr (2). If Fr is calculated separately for the lower and the upper end of the subgroup distributions, the proportions and the common reference limit for each end can be determined by solving the following equation in exactly the same way that Eq. 4Up was solved above:

(5)
This equation was derived in Appendix 2 of our previous report (2). Note that Eq. 5Up is reduced to Eq. 4Up if Fr = 1.0.

Multiplication method. A more straightforward method to take account of the prevalences would be multiplication of the distributions by appropriate factors to make Nr and Fr equal to each other. To express the relationship between these two ratios, we use the following parameter:

(6)
where the indices i and j refer to either of the two subgroup distributions. This parameter might be called "effective ratio of prevalences" because it expresses the net effect of prevalences that needs to be accounted for when the actual numbers of reference values are taken into consideration. Rearrangement of the first three terms of Eq. 6Up gives:

(7)
This rearrangement suggests that to make Nr equal to Fr, one of the distributions should be multiplied by Freff. However, to avoid noninteger numbers of reference values, the multiplied number of the number of copies for each reference value should be an integer, and because Freff most often does not fulfill that condition, both distributions will usually need to be multiplied by some factors.

To illustrate this, assume that Freff is equal to 1.27. Unity would undoubtedly be too rough as an approximation for 1.27. Rounding to one decimal, 1.3 is obtained, but this value is useless as a multiplier if we prefer to operate with integer numbers of copies for each reference value. To obtain such numbers in the multiplied distribution i, 1.3 could be multiplied further by 10 to make it 13. If the factor of distribution i is multiplied by such an additional factor, however, distribution j must also be multiplied by 10 to keep the (adjusted) ratio between the numbers ni and nj unchanged. Accordingly, distribution i could be multiplied by 13 and distribution j by 10 in this example. If the sizes of the original subgroups were of the order, e.g., 1000, virtual data vectors having a size of the order 10 000 would be obtained. Rounding to one decimal could also approximate too much, and even larger virtual data vectors may need to be used in the calculations. Hence, implementations of the multiplication method that would treat the data vectors as concrete objects, e.g., on a spreadsheet, are not feasible. Once the common reference limits have been established from the (virtual) multiplied data vectors, prevalence-adjusted values for the proportions can be readily obtained by applying those limits to the original subgroups.

coping with tied reference values
It was shown above that the proportions pa and pb obtained at the common reference limit are unique, but this conclusion was said to be valid only under the assumption that the reference values also are unique. We examine next what happens if this assumption is not true. Consider once more the path depicted on the plane {phi}1(pa, pb) in Fig. 3Up . The difference given in Eq. 4Up , described by this path, is strictly increasing with respect to both pa and pb because if the reference values are unique, their ranks in both subgroups increase constantly as one scans the range of the sorted reference values lying between the reference limits of the subgroups; the proportions derived from these ranks by use of expressions such at that given in Eq. 1Up also increase constantly. Consider what would happen if there were several copies of a reference value, or tied reference values, in one of the subgroups, such as subgroup b. While scanning through such an array of tied reference values, the rank and the proportion would be increased for subgroup b but would remain unchanged for subgroup a because there were no intervening reference values in the data vector of that subgroup. This is illustrated by line segment 1, running parallel to the axis of proportion pb, of the path plotted in Fig. 5A .



View larger version (20K):
[in this window]
[in a new window]
 
Figure 5. Determination of the proportions pa and pb in the presence of tied reference values.

(A), the plane {phi}1(pa, pb) from Eq. 4Up is plotted as in Fig. 3Up , but the zero plane has been removed to show the details of one path, different from that in Fig. 3Up , describing the behavior of the proportions pa and pb for a particular pair of subgroup distributions, both having tied reference values. The intersection line between the two planes is indicated by a dashed line. Line segment 1, running parallel to the axis for proportion pb, depicts the path as obtained from an array of tied reference values in subgroup b because the rank, and thus the proportion, is increased only within this subgroup as long as one is scanning that array. A similar array in subgroup a leads to line segment 2. In this case the proportion pb remains unchanged, whereas the proportion pa is increased. Boxes 3 and 4 illustrate the path as obtained when both subgroups have copies of a particular reference value. Box 4 lies at the area where the path crosses the zero line, making unique values for the proportions at the common reference limit hard to establish (compare with the situation in Fig. 3Up ). The conventional way to define ranks for tied observations is to set the average of their ranks as the rank for each of them. This would lead to the solution denoted by P1, lying at the midpoint of box 4. This solution has drawbacks, however, that are discussed in the text. The solution P2 divides the ranges of pa and pb that lie within box 4, using the same ratio as the common reference limit divides the array of tied reference values in the combined distribution (1:3 in this example; see panel B), and the solution P3 is the midpoint of the segment of the zero line that lies within box 4. (B), schematic, plotted using only approximately correct scaling, showing how the path in panel A could be generated. The plots of the distributions cover the range of reference values lying in the interval between the lower reference limits of distributions a and b. The numbers on the plot of the combined distribution correspond to the numbers on the path in A. The arrow shows the localization of the lower common reference limit, which divides the array of seven tied reference values roughly according to a ratio of 1:3. The same ratio could be used to define a solution for the proportions within box 4 (solution P2 in A).

Line segment 2 in Fig. 5AUp , running parallel to the axis of proportion pa, represents a similar array of tied reference values in subgroup a. If both subgroups have copies of a specific reference value, the path would have segments of line running parallel to both axes simultaneously (boxes 3 and 4 in Fig. 5AUp ). If such a box appears at the area where the path crosses the zero line, as is the case with box 4, the solution of the partitioning problem would not be unique, but rather a choice would have to be made between all of the proportions that lie within that box. This seems to introduce a considerable additional component of uncertainty to the values of pa and pb at the common reference limit and thereby to the conclusion on partitioning, as compared with the case of unique reference values illustrated in Fig. 3Up .

Another view of the situation described in Fig. 5AUp is shown in Fig. 5BUp , which shows the lower ends of distributions a and b and that of the combined distribution in detail, illustrating how the path in Fig. 5AUp could have been generated from real reference data. The numbers 1–4 in panels A and B correspond roughly to each other. If the common reference limit is localized within an array of tied reference values, as shown by the arrow in Fig. 5BUp , and both subgroups have copies of that reference value, there is apparently no way to obtain unique values for the proportions pa and pb.

The conventional method in nonparametric statistics to treat tied observations is to set the rank of each of them to the average of the ranks that these observations would have if they were not tied observations. This method, illustrated by the midpoint (P1) of box 4 (Fig. 5AUp ), is not ideal in our case. Consider an example with two similar subgroups, both having a large number of copies of the same reference value representing, e.g., 10% of the data, at the beginning of the distribution. If the average of the ranks of these tied values were in such a special situation used to define the proportions of the subgroups outside the lower common reference limit, one would obtain the proportion of 5% for each subgroup. These proportions would imply partitioning, but such a conclusion from the lower ends of those distributions would obviously be wrong if the lower ends are in reality identical. To handle situations like this, the proportions could be determined by dividing the arrays of tied reference values in the subgroups; use of the same ratio as the common reference limit divides that array in the combined distribution. Such a solution, depicted as point P2 in box 4 (Fig. 5AUp ), would in the example just considered lead to a proportion of 2.5% for both subgroups, which would correspond much better to our intuition of not partitioning them.

Both of the solutions P1 and P2 have the drawback of not lying on the segment cut off by box 4 from the zero line (Fig. 5AUp ), whereas an ideal solution should perhaps lie on this segment. The only meaningful solution on this segment seems to be its midpoint, indicated by point P3 in box 4. We suggest also considering this point when looking for a reasonable, unique solution to substitute the latitude of choice represented by the entire box.

calculation of confidence limits for the proportions
Nonparametric confidence intervals for percentiles could be estimated according to the rank-based method published by Reed et al. (10) and recommended by both the IFCC(5) and NCCLS (6). Confidence intervals for the proportions cannot, however, be calculated by applying this method to the subgroup distributions because such confidence intervals would reflect the local properties of these distributions rather than give the desired coverage probability, typically 0.90, for a range of these proportions as representing the spectrum of probable outcomes of the procedure used to calculate them. Instead, one should convert the confidence limits of the common reference limit to proportions within each subgroup distribution and consider such converted proportions as confidence limits for these proportions. Note carefully that such confidence intervals may have coverage probabilities in the subgroup distributions that differ from the nominal confidence level.

Linnet (11) has shown that the bootstrap resampling method usually gives more accurate percentiles than single nonparametric calculations, both for gaussian distributions and for a variety of skewed distributions that are representative for distributions of biochemical reference data. Linnet’s results further suggest that the bootstrap method is more effective than single calculations to estimate confidence intervals for percentiles of these distributions, provided that that the sample sizes are not very small (<100). If a distribution of bootstrap results for a percentile or a proportion has not been characterized, it is probably prudent to calculate the confidence limits as nonparametric percentiles of that distribution rather than to use gaussian approximations.

Two strategies are possible for estimating the confidence intervals for the proportions using the bootstrap method:

We feel that the latter of these two strategies is in better agreement with the spirit of resampling and therefore preferable. However, at "large" sample sizes and iteration numbers both strategies would probably lead to similar results.


   Discussion
Top
Abstract
Introduction
Theoretical Considerations
Discussion
Conclusions
References
 
We suggest direct estimation of the proportions of the subgroup distributions outside the common reference limits and application of the proportions criteria (1)(2) as the method of choice to make decisions on partitioning, particularly in the case of nongaussian distributions. Apart from the effect of prevalences on partitioning (2), we recognized tied reference values as a previously neglected complication of the partitioning problem that is hard to account for by use of distance criteria. The distance criteria are based on an implicit assumption of the reference values being unique because unique values for the proportions as correlates of the distances, an essential requirement when these criteria were developed (1)(2)(3)(4), would otherwise not exist. However, if there are tied reference values in both subgroups at the common reference limit, as in the example illustrated by box 4 in Fig. 5AUp , the distance between the means of the subgroup distributions [the Harris–Boyd method (3)(4)] or that between the reference limits of these distributions [the distance criteria suggested by us (1)(2)] would not correspond to a unique proportion in each subgroup, but to ranges of proportions similar to those indicated by the sides of this box.

To correct for prevalences, two methods are proposed: the root method and the multiplication method. Although these two methods appear rather different in technical terms, the root method being based on iterative solution of a simple equation and the multiplication method on expansions of the original data vectors, both methods actually perform the same adjustment for the prevalences. The multiplication method makes this adjustment in a straightforward way, by multiplying the subgroup distributions to enable establishment of the prevalence-corrected common reference limits as percentiles of the combined distribution of such multiplied subgroups. In the root method, no changes to the original data vectors are required. Instead of modifying the subgroups, this method focuses on the relationship between the subgroup proportions at the common reference limit. To specify that limit, it makes adjustments of both subgroup proportions in parallel, and because only proportions determined from the subgroup data vectors are used during the iteration, the numbers of reference values in them do not affect the localization of the common reference limit.

The advantage of the root method is that no manipulation of the original data is needed, but it has one drawback. If a common reference limit lies outside one of the endpoints of the subgroups, the difference in Eq. 5Up changes sign before the first reference value of that subgroup is reached when the range of reference values lying between the reference limits of the subgroups is scanned. Such a situation is technically hard to handle because rank-based, nonparametric treatment of distributions starts at the first reference value. This is not a serious problem when a single nonparametric calculation is performed because the exact localization of the common reference limit is seldom needed to make a decision on partitioning in that case; the subgroups should most often be partitioned if the proportion of one of the subgroups corresponds to a rank that is smaller than 1. As an example, if the number of reference values in a subgroup is 120, traditionally considered the minimum sample size for reference interval studies (5), the first reference value would correspond to a proportion of 0.8%, as calculated from Eq. 1Up , but this value lies below the critical proportion for partitioning, 0.9%, suggested by us (1)(2).

If the bootstrap method is used, exact values for the proportions should preferably be obtained at every iteration cycle because otherwise the final results could not be calculated as statistics of the bootstrap distribution. Note that a situation with a common reference limit lying outside one of the endpoints of the subgroups may be a rare occurrence of a bootstrap iteration process, corresponding perhaps to some very unlikely combinations among the stochastically constructed bootstrap data vectors, whereas the vast majority of the iterations could give unproblematic common reference limits that lie between the endpoints of both subgroups. Although such occurrences may be rare, they could corrupt the bootstrap calculation and make it hard to obtain reliable results. This may be avoided by the use of curve-fitting methods to delineate the subgroup distributions beyond their endpoints and also to obtain estimates for the proportions from these areas. Because the rank-based nonparametric method applies linear approximation between observations, it is perhaps not necessary to describe the behavior of the subgroup distributions beyond the endpoints in detail to obtain reasonable estimates for the reference values corresponding to the value of zero of the probability density. In other words, if bootstrap calculations are performed and the root method is applied, the conventional rank-based nonparametric approach should be extended from covering the range between the 1st and the nth reference values to the range between the extrapolated 0th and n + 1th reference values, assuming that Eq. 1Up is used to calculate proportions.

Despite this drawback of the root method, it appears technically more elegant than the multiplication method. Because the multipliers frequently need to be rounded, one should make sure that the decimals included in them suffice to obtain the proportions with a high enough precision (0.05% could be an appropriate precision). Our experiments made on data of the Nordic Reference Interval Project (7)(8) suggest that multipliers of the order 1000–10 000 may be needed to reach that level of consistency between runs performed with increasing orders of magnitude for the multipliers.

Tied reference values are a complication that can impair the precision of the proportions considerably. In the Nordic Reference Interval Project (7)(8), some of the laboratory tests had arrays of tied reference values with a length extending up to 1.0% of the subgroups. Box 4 in Fig. 5AUp is by no means exaggerated in its dimensions. Observe that the problem occurs only when there are tied values at the common reference limit in both subgroups simultaneously (at least one in one of the subgroups and more than one in the other). If all of the tied reference values originate from one of the subgroups, the path in Fig. 5AUp would cross the zero line (the dashed line) at one point, giving a unique solution. This complication is fortunately not very common, but when it occurs it can be an important source of error, especially if it remains unrecognized by the investigator.

Presenting reference values in more decimals than the analytical quality warrants is, in our opinion, not a useful way to treat the uncertainty inherent to arrays of tied reference values. Excessive decimals do not express true accuracy; they are just random noise, irrespective of whether they were generated automatically by the analytical instrument or added intentionally to the reference values by the investigator. The excessive decimals would seemingly eliminate the tied reference values by sorting these in a specified order, but such a randomly generated order can hardly be considered a solid basis for reliable decisions on partitioning.

To understand how such random orders may lead to variation of the proportions and thereby also to different conclusions, consider Fig. 5BUp , in which the four lowermost of the seven tied reference values at the common reference limit are depicted as if they were coming from distribution a and the three uppermost as if they were coming from distribution b. In reality, putting aside the excessive decimals, these reference values are equal, and there is no way they can be ascribed to either of the subgroups. The situation in Fig. 5BUp is just one of the many possible arrangements that excessive decimals could lead to by ordering, with questionable justification, these seven reference values. One can imagine from Fig. 5BUp what kind of proportions this specific arrangement would give, and if this particular randomly generated set of excessive decimals were accepted, one could consider the obtained proportions as a unique solution of the partitioning problem. However, another set of excessive decimals could have ordered the array of the seven tied reference values in the combined distribution quite differently, leading, e.g., to a situation in which the three reference values coming from subgroup b would lie lowermost in that array. Because the position of the common reference limit, indicated by the arrow in Fig. 5BUp , remains unchanged, the conversion of that limit to proportions in the subgroups would give proportions different from those in the situation just considered, and one would obtain another, seemingly unique solution. Clearly, neither of these seemingly unique solutions, corresponding to different sets of excessive decimals, should be considered as a true solution of the partitioning problem.

If this randomization procedure were repeated a large number of times in a simulation study, the observation could be made that, on average, the common reference limit will divide the arrays of tied reference values in each subgroup in the same ratio that it divides this array in the combined distribution (the solution represented by point P2 in box 4 of Fig. 5AUp ). This is another argument supporting such a solution as opposed to the conventional one represented by the midpoint (P1) of box 4. In the section on coping with tied reference values we presented a partitioning problem that the conventional solution failed to solve but which the solution represented by point P2 solved successfully. However, as discussed above, both solutions have the weakness of lying only accidentally on the zero line. Our third suggestion, represented by point P3, is admittedly heuristic, but it has the advantage of making use of the information given by the zero line to restrict the space of possible solutions within the arrays of tied reference values.

This study was focused on the rank-based method recommended by the IFCC for nongaussian distributions, attempting to improve that method by supplementing established techniques with procedures that take account of the prevalences and tied observations. As an alternative approach to the rank-based method, curve-fitting procedures could be applied to model nongaussian distributions. In that approach, tied reference values could be handled by an appropriate smoothing algorithm and the prevalence effect by continuous analogs for the root and multiplication methods.

Focusing on proportions corresponding to peripherally located percentiles of reference distributions, the new method for partitioning appears particularly susceptible to the lower statistical quality of such percentiles compared with that of means. However, the Harris–Boyd method (3)(4) also refers indirectly to the same proportions in the end, using differences between means and ratios between standard deviations as mediating parameters. Direct estimation of a parameter must be more accurate than indirect estimation, however, because the inaccuracy of the mediating parameters, added to the total uncertainty, will then be avoided. Interestingly, the issue of statistical quality of partitioning methods seems never to have been investigated in the laboratory medical literature. According to the NCCLS guidelines (6), 120 reference values in each subgroup could suffice, but this recommendation is not expressed in terms of uncertainty of the proportions or that concerning suggestions on partitioning. The traditional minimum sample size of 120 for establishment of reference limits in the case of nongaussian distributions (5) seems to have been assumed more or less heuristically as also appropriate for partitioning calculations, but that number may turn out to have been highly optimistic as an estimate once the statistical quality of partitioning methods is examined in future studies.

To illustrate the theoretical considerations presented in this report, an associated practical study was prepared (12), which apart from offering extensive practical examples on the suggested improvements for the rank-based method, also performs a comparison between the new method for partitioning and that of Harris and Boyd (3)(4), using reference data collected in the Nordic Reference Interval Project (7)(8).


   Conclusions
Top
Abstract
Introduction
Theoretical Considerations
Discussion
Conclusions
References
 
A. We recommend the following proportion criteria to make decisions on partitioning: If at least one of the four proportions of the subgroup distributions cut off by the reference limits of the combined distribution exceeds 4.1% or lies below 0.9%, subgroup-specific reference intervals should be calculated instead of using the common reference limits for the combined subgroups.

B. Proportion criteria are preferable to distance criteria, notably when partitioning nongaussian distributions, for three reasons:

C. Two methods to account for the prevalences were described, to be applied when using proportion criteria:

D. Tied reference values become a problem if there are copies of them in both subgroups at the common reference limit. The conventional solution used in nonparametric statistics of setting the rank of each of the tied observations to the average of these seems unsatisfactory for partitioning purposes, and better solutions were suggested.


   Acknowledgments
 
We gratefully acknowledge support of this study by the Department of Clinical Chemistry at Rikshospitalet University Hospital of Oslo and the Nordic Reference Interval Project.


   References
Top
Abstract
Introduction
Theoretical Considerations
Discussion
Conclusions
References
 

  1. Lahti A, Hyltoft Petersen P, Boyd JC, Fraser CG, Jørgensen N. Objective criteria for partitioning gaussian-distributed reference values into subgroups. Clin Chem 2002;48:338-352.[Abstract/Free Full Text]
  2. Lahti A, Hyltoft Petersen P, Boyd JC. Impact of subgroup prevalences on partitioning gaussian-distributed reference values. Clin Chem 2002;48:1987-1999.[Abstract/Free Full Text]
  3. Harris EK, Boyd JC. On dividing reference data into subgroups to produce separate reference ranges. Clin Chem 1990;36:265-270.[Abstract/Free Full Text]
  4. Harris EK, Boyd JC. Statistical bases of reference values in laboratory medicine 1995:77-90 Marcel Dekker New York. .
  5. Solberg HE. Approved recommendation on the theory of reference values (1987). Part 5. Statistical treatment of collected reference values. Determination of reference limits. J Clin Chem Clin Biochem 1987;25:645-656.[ISI]
  6. . National Committee for Clinical Laboratory Standards. How to define and determine reference intervals in the clinical laboratory; approved guideline, 2nd ed. NCCLS Document C28-A2 2000 NCCLS Wayne, PA. .
  7. Nordic Reference Interval Project (NORIP).http://www.furst.no/norip/ (Accessed July 28, 2003)..
  8. Rustad P. Reference intervals for 25 of the most frequently used properties in clinical chemistry. Proposal by Nordic Reference Interval Project (NORIP). Klin Biokem Norden 2003;2:10-17.
  9. Schulz EK, Willard KE, Rich SS, Connelly DP, Critchfield GC. Improved reference-interval estimation. Clin Chem 1985;31:1974-1978.[Abstract]
  10. Reed AH, Henry RJ, Mason WB. Influence of statistical method used on the resulting estimate of normal range. Clin Chem 1971;17:275-284.[Abstract]
  11. Linnet K. Nonparametric estimation of reference intervals by simple and bootstrap-based procedures. Clin Chem 2000;46:867-869.[Free Full Text]
  12. Lahti A. Are the common reference intervals truly common? Case studies on stratifying biochemical reference data by countries using two partitioning methods. Scand J Clin Lab Invest; in press..



The following articles in journals at HighWire Press have cited this article:


Home page
Clin. Chem.Home page
E. Grossi, R. Colombo, S. Cavuto, and C. Franzini
The REALAB Project: A New Method for the Formulation of Reference Intervals Based on Current Data
Clin. Chem., July 1, 2005; 51(7): 1232 - 1240.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Data Supplement
Right arrow All Versions of this Article:
clinchem.2003.027953v1
50/5/891    most recent
Right arrow Submit an electronic Letter to
the Editor about this paper
Right arrow Alert me when this article is cited
Right arrow Alert me when eLetters are posted
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via ISI Web of Science (8)
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Lahti, A.
Right arrow Articles by Solberg, H. E.
Right arrow Search for Related Content
PubMed
Right arrow