We measured the specificity and sensitivity of the ETDRS chart to various degrees of simulated acuity change. When combined with the Bland-Altman approach to determine a setting-specific change criterion of 0.11 logMAR, the ETDRS performed well at detecting changes of 0.2 logMAR (estimated sensitivity of 100%, 95% confidence interval [CI] 93%–100%; estimated specificity 96%, 95% CI 86%–99.5%). However, for an acuity change of 0.10 logMAR, test sensitivity was poor (38%, 95% CI 25%–53%). Furthermore, this study used normal subjects wearing full refractive correction, and all the measurements were taken by a single examiner in a short time under identical conditions, using an interpolated scoring method. These are all factors that may have helped to limit the degree of TRV in this study, and we may therefore expect the test to perform less well in day-to-day use. Subsequent to the main analysis, the data were rescored using the line-assignment method favored in routine clinical practice (defined as the logMAR value of the smallest letter size at which at least three of five letters were correctly named). Acuities scored this way were subject to greater TRV (95% range ±0.17 logMAR) than those scored with the interpolated method (95% range ±0.11 logMAR). Recalculating sensitivity and specificity for the line-assignment data, using 0.17 logMAR as the change criterion resulted in poorer test performance (estimated sensitivity for a change of 0.2 logMAR of 84%, 95% CI 71%–93%; estimated specificity 94%, 95% CI 83%–99%) than for the interpolated method. It should also be noted that the use of change criteria from published studies rather than an internally derived criterion had, in some instances, a substantial impact on the estimated sensitivity and specificity of the test procedure.
Our conclusions appear contrary to the those of published studies that have measured 95% ranges for TRV for the ETDRS chart of ±0.10 logMAR or less.
3 4 5 6 These studies have suggested that the ETDRS test can reliably detect changes of 0.10 logMAR or greater. We believe that their interpretation is unduly optimistic. Take the example of a setting in which the 95% TRV range is ±0.10 logMAR, and a change criterion of 0.10 logMAR is therefore set. For individuals with a real change of 0.10 logMAR, the distribution of measured changes, assuming normality, is shown in
Figure 1 . It is clear from the data that the change criterion of 0.10 logMAR can be expected to identify 50% of these individuals as having experienced a change. More generally, the sensitivity of the procedure to detect changes of a magnitude similar to the change criterion is approximately 50%.
In interpreting the results of this study, it should be borne in mind that, according to Bayes’ theorem, the predictive value of a diagnostic test depends the characteristics (sensitivity and specificity) of the test and on the incidence of change in the population to which the test is applied.
13 If the incidence of true change is very low, then even a test with high sensitivity and specificity may result in a high proportion of false-positive test results. For example, if a test with 95% sensitivity and 95% specificity is applied to a population in which the proportion of tested individuals with change is 5%, the positive predictive value (PPV) is 0.50. In other words, only half of the positive results are genuine changes. If the proportion of tested individuals with changes increases to 50%, the same test has a PPV of 0.95—that is, 95% of positive results are true positives.