To assess variability with the new and standard perimetric stimuli, homogeneous and heterogeneous fluctuations were calculated. Heterogeneous fluctuation reflects variability of the individual threshold estimates and was calculated for every location as the absolute value of the difference in the log contrast sensitivities for the first and second visits. Homogeneous fluctuation reflects an overall change in sensitivity across two visits and was computed for each subject as the mean of differences in log contrast sensitivities between the first and second tests across all the locations. Homogeneous fluctuation sets an upper limit to the effect of change in the subject’s criterion from one day to another, and hence sets a lower limit for heterogeneous fluctuation.
We asked several questions in the study:
Data were gathered from young control subjects to evaluate the effect of the number of reversals on test-retest variability. An
F-test and two linear regressions were used, and statistical significance was set at
P < 0.0125. The effect of the number of reversals on variability for Gabor patches with eight- and four-reversal staircases was assessed by using an
F-test on heterogeneous fluctuation. To assess the dependence of variability on sensitivity, linear regression was performed on heterogeneous fluctuation versus sensitivity for the two staircases, and a
z-score analysis was used to compare the slopes.
Homogeneous fluctuation was plotted against mean contrast sensitivity for each subject. The Bland-Altman analysis of agreement
24 was used to compare learning effects for CSP and CAP.
To compare the variability of responses of patients with glaucoma to the Gabor and size III stimuli, we applied an
F-test, linear regression, and slopes comparison; statistical significance was set at
P < 0.0125. An
F-test was performed on homogeneous fluctuation for the two devices. We assessed the relation between sensitivity and variability by performing linear regression on heterogeneous fluctuation, and the slopes were compared in
z-score analyses.
Bland-Altman analysis of agreement was used to estimate whether the depth of defect obtained from testing patients on the VSG system is similar to that obtained from testing on the HFA. The two tests could not be compared on a point-by-point basis because different locations were tested on the two devices. The superior and inferior nasal quadrants were extensively tested on both of the tests and were used for this analysis. The sensitivity was converted into linear units and was averaged across all the points in the given quadrant for each patient for each test day, and this average was expressed in log units. Confidence limits were used to determine the range of fluctuations expected solely from test-retest variability. These were computed as 1.96 times the vector sum of standard deviations of homogeneous fluctuation for each test separately. To compensate for the difference in dynamic range on two devices, for CAP we assigned the values of depth of defect that were deeper than −1.34 log units the value of −1.34 log units. This value represented the deepest possible defect that could be measured with CSP on this particular group of subjects.
Bland-Altman analysis was used to determine the relations between contrast sensitivity and rim area. Contrast sensitivity from the two test dates was averaged across the nasal hemifields for each device and for every patient. Possible bias due to the different units used on different devices was minimized by expressing results as percentages of the mean normal.
The effect of eccentricity on contrast sensitivity in normal eyes was assessed using data from young and old control subjects tested with Gabor patches
(Fig. 2)and size III stimuli with four-reversal staircases. Linear regression was performed on mean contrast sensitivity versus eccentricity, and
z-scores were used to compare the slopes. Statistical significance was set at
P < 0.05.