In this study, we evaluated three different classes of methods for determining visual field progression in glaucoma. The AGIS and CIGTS methods are ordinal scoring methods based respectively on point-wise total deviation and their probabilities. Although the GCP analysis is commercially available with the Statpac program of the Humphrey Field Analyzer (Carl Zeiss Meditech) and is widely used, there is no consensus on the magnitude of the change required to constitute progression. Similarly, although PLRA methods have been used for many years now, there are no accepted criteria on either the number of test locations that show significant decline in sensitivity, the significance level, or the magnitude of change (decibels per year) at each location.
As a result, the performance of the GCP and PLRA methods depends critically on the criteria used and the number of repeated tests required to confirm progression. This is an inherent limitation of studies such as these that seek to evaluate the relative performance of these techniques. It can be argued for example that selecting a more liberal criterion with PLRA, such as a significant decline in sensitivity at two locations, or at P < 0.05, with no minimum requirement for slope change or number of occasions that significance has to be confirmed would have led to both a higher number of progressing fields and a shorter time to detection. It is also likely that there would have been a larger number of falsely progressing cases with such liberal criteria. Because we also evaluated specificity, we were positioned to identify those criteria that were clearly too liberal.
Computer simulation techniques have been used extensively to evaluate the performance of thresholding algorithms
24 25 26 in addition to methods for the evaluation of progression.
11 12 14 The advantages of these approaches are numerous, including control of variability parameters, the frequency of tests, and the length of follow-up. By using the same initial and final visual fields, we were able to evaluate the specificity of the methods as they related to the number of progressing cases under the same variability conditions
(Fig. 2) . An evaluation of specificity from real longitudinal data may not be meaningful, as visual field progression cannot be ruled out, even if other clinical markers of disease progression appear to be stable. We have shown that the type of computer simulation used in this study shows close agreement to real visual field data.
13 The advantage of this approach is that we did not impose a definition of what constitutes visual field progression, which would invariably result in an evaluation that is criterion dependent. The disadvantage, however, is that cases identified as progressing likely contain both true- and false-positive results. Hence although we were able to estimate specificity, because of the lack of an external standard for visual field progression, we could not evaluate sensitivity. By examining the changes in specificity and the number of fields identified as progressing under different variability conditions
(Fig. 2) , one can evaluate the effect of variability on performance.
The AGIS method was the most conservative and identified the least number of eyes as progressing. Like the AGIS method, the CIGTS method also had high specificity under moderate- and high-variability conditions, but the CIGTS method classified twice as many eyes as progressing as did the AGIS method and detected confirmed progression 1.4 to 2.3 years earlier than the AGIS method. Similar differences in progression rates between these two methods have been reported by Katz et al.,
9 ; however, no differences were found in the time to progression. The fact that for similar specificity, the CIGTS method detected both more cases of progression and earlier time to progression than the AGIS method suggests that the latter may be overly conservative. In comparison to the GCP and PLRA methods, however, the CIGTS method detected progression in substantially fewer eyes. The follow-up time to confirmed progression with the CIGTS method was on average approximately 1.5 years less than with the two PLRA methods, but was comparable to that of the GCP methods.
Because of the higher progression rates with the GCP and PLRA methods, it is likely they detect smaller changes than either the AGIS or CIGTS methods. The GCP (2 × 4) and GCP (8, 2 × 4) methods had the highest variation in the number of progressing cases, the time to detect progression and specificity suggesting a considerable vulnerability of these methods to threshold variability. Under the high-variability condition, the GCP (2 × 4) and GCP (8, 2 × 4) methods resulted in the highest number of progressing cases, but also lowest specificity. These findings suggest that these methods may result in an excessive frequency of progression in patients with moderately advanced field losses that exhibit the largest degree of measurable variability.
21 27 28 In comparison, under the low-variability condition, the GCP (2 × 4) and GCP (8, 2 × 4) methods have considerably higher specificity at a cost of a modest reduction in the number of progressing cases. It is interesting to note that although the GCP (3 × 4) method on average detected fewer cases of progression and at a later time compared with the other GCP methods, its performance was remarkably resistant to variability. These findings indicate that it is not the GCP analysis itself that results in performance variation, but probably the number confirmatory results that are necessary to declare progression.
The PLRA methods generally had high specificity. The PLRA3 criterion produced 100% specificity under both variability conditions. The PLRA methods also declared a relatively large number of fields as progressing. The drawback of these methods is that they required the longest time to detect progression (5.5–6.5 years). The criterion for progression at a single location was decay of 1.0 dB/y or more with
P < 0.01. The same criterion has recently been used by Membrey et al.,
29 but
P < 0.05 and 0.10 have also been reported.
8 The probability level directly affects the number of fields needed before progression can be confirmed. Using our criteria, the earliest confirmed progression was detectable at only at the 5.5-year visit (i.e., at the 12th semiannual visual field). These findings are in accord with the results of a clinical study by Katz et al.
30 who showed that seven fields over a 6-year period are insufficient for PLRA to detect sensitivity decay of less than 1.0 dB/y. Furthermore in another study, detection of 1.0-dB/y sensitivity loss in an individual test point with
P < 0.001 has been calculated to require, depending on the degree of long-term variability, from 10 to 14.5 years of follow-up, with semiannual testing of visual fields.
31 Finally, in a computer simulation study, a minimum of eight annual tests have been shown to be required before a 1.0-dB/y point-wise sensitivity loss at
P < 0.05 and specificity ≥75% can be detected.
11 Such long follow-up times before confirmed progression can be detected may be unacceptable for both clinical and research purposes. However, when number of fields available and the follow-up increases, PLRA may be valuable particularly because the method can be highly specific. Furthermore, PLRA allows measurement of the rate of change of threshold or threshold deviation, which may be may be important in clinical management.
The ideal method for analyzing visual field change should be sensitive, detect progression with few examinations, maintain high specificity, and be resistant to fluctuation. Our results show that none of the methods investigated could achieve all these attributes. Methods that yielded a high number of progressing cases were often less specific and were influenced by fluctuation. In contrast, those methods that yielded high specificity often required very long follow-up times. Furthermore, in agreement with a previous study,
9 there was poor concordance among the methods with respect to the patients identified as progressing. Although we have investigated with computer simulation techniques a range of methods to analyze progression, there are many possible variations in criteria for progression within a class of method. For example, the degree of decibel loss per year and probability level with the PLRA methods and other combinations of progressing points and confirmations with the GCP methods, may yield more favorable results. Our study confirms that detecting visual field progression reliably requires a considerable number of examinations and length of follow-up. For many patients, these conditions may not be acceptable to the clinician. It is not clear whether these findings are due to the nature of conventional perimetry itself or the methods of analysis. Future research should focus on both alternative methods of data analysis and other modalities of detecting glaucomatous progression.
The authors thank Alex Bates and Allen Butler for providing computer programming expertise and Paul Artes for constructive comments.