Figures 4 and 5show the performance of the test and retest strategies when the entire visual field sensitivity was either reduced
(Fig. 4)or elevated
(Fig. 5)by 3 dB. The figures are plotted in the same format as
Figure 3 .
The retest strategy most biased by the original test result was Z-Cont; thus, this procedure should be most affected by a mismatch between the initial result and the sensitivity at retest. Z-Cont is represented by the diamonds in
Figures 3to
5 . When there was no change in visual field sensitivity from test to retest
(Fig. 3)Z-Cont performed well. However, this was not the case when there was a shift in the overall sensitivity of the visual field. When the field was uniformly decreased by 3 dB
(Fig. 4) , Z-Cont was slow to terminate for the no-error and FN conditions. Z-Cont terminated more quickly when FP or unreliable responses were made; however, the magnitude of the error was greater than the other retest procedures. When the whole field increased in sensitivity (by 3 dB, see
Fig. 5 ), Z-Cont was on average slower than the test procedures (FT and ZEST) and had a similar error distribution profile. Hence,
Figures 4 and 5demonstrate that Z-Cont was not a successful retest strategy when there was a change across the whole visual field.
Figures 4 and 5demonstrate that Z-Gauss (up triangles) performed better than Z-Cont and, in general, displayed better performance (similar or reduced error and faster test time) than simply repeating the test strategy ZEST. Z-Gauss performed similarly to REMU when there was a whole-field improvement in visual field sensitivity
(Fig. 5) , but was substantially slower to terminate when there was a reduction in sensitivity
(Fig. 4) . The asymmetry in the performance of Z-Gauss was due to the truncation of the
pdf at the top end of the dynamic range (40 dB). Statistical comparison of the presentations required to terminate demonstrated that REMU terminated faster on average than Z-Gauss, and that Z-Gauss terminated faster than ZEST in all error conditions, for both whole-field increases and decreases in sensitivity (ANOVA, Holm-Sidak post hoc testing;
P < 0.05). Although statistically significant, it is important to consider whether the magnitude of the difference is likely to be clinically significant. When averaged across all error conditions, when the whole field decreased by 3 dB, ZEST required approximately seven presentations to terminate, Z-Gauss approximately six, and REMU approximately five. When sensitivity increased by 3 dB, ZEST required approximately 6.5 presentations on average, Z-Gauss approximately 4.5, and REMU approximately four. For comparison, FT terminated using between five and six presentations for both the whole-field increase and decrease in sensitivity. For most error conditions, the MAE of REMU and Z-Gauss was not different (
P > 0.05). REMU was more accurate on average than Z-Gauss in the presence of FN errors or unreliable performance if there was a whole-field decrease in sensitivity (
P < 0.05, Holm-Sidak post hoc testing). The magnitude of the difference was unlikely to be of clinical significance.