Abstract
Purpose:
We evaluated the usefulness of gaze tracking (GT) results as an index of visual field reliability in glaucoma.
Methods:
The study population consisted of 631 eyes of 400 patients with open angle glaucoma in an institutional practice, with 10 visual fields (VFs). For the observational procedure, visual fixation was assessed using the gaze fixation chart at the bottom of the VF (Humphrey Field Analyzer, 30-2 SITA standard) printout. Average frequency of eye movement between 1° and 2° (move1–2), 3° and 5° (move3–5), and greater than or equal to 6° (move≥6) were calculated. In addition, average tracking failure frequency (TFF) and average blinking frequency (BF) were calculated. The relationship between mean deviation (MD), fixation losses (FLs), false-positives (FPs), false-negatives (FNs), move1–2, move3–5, move≥6, TFF, BF, and pattern standard deviation (PSD) were evaluated using linear modeling. Main outcome measures included parameters related to over- or underestimation of MD values.
Results:
Patients' mean MD progression rate was −0.23 dB/y. The best model to predict MD values included FL rate, FP rate, move3–5, move≥6, TFF, BF, and PSD as dependent variables with coefficients of 0.90, 9.2, −0.57, −0.52, −2.2, −1.1, and −0.56, respectively (P < 0.001).
Conclusions:
High FL and FP rates tend to raise MD values. By contrast, high values of move3–5, move≥6, TFF, BF, and PSD tend to lower MD values. Thus, GT parameters can be used as new indices of VF reliability through the prediction of over- or underestimation of VF results.
Assessing the reliability of visual field (VF) results is very important at clinical settings, because the time it takes to detect progression is largely influenced by the variability of VFs,
1,2 which impedes clinicians when making medical and surgical treatment decisions. In the Humphrey Field Analyzer (HFA; Carl Zeiss Meditec, Dublin, CA, USA), several methods have been used to estimate the reliability of VF tests. Fixation loss (FL) is recorded when a stimulus projected onto the area of the blind spot is perceived, and it indicates test reliability and vision fixation. Elevated FLs can mask the presence of early scotoma.
3,4 False-positive (FP) rate is estimated by the number of positive answers that occur during a “listen time,” which starts shortly after the end of the response window and ends 180 ms after the onset of the next stimulus.
5 False-negatives (FNs) mainly occur when a patient fails to respond to a stimulus that is more intense than that to which the patient had responded previously. A high rate of FP answers is thought to indicate “trigger-happy” patients and a high rate of FN responses is thought to represent inattention during an examination.
6–8
While some past studies have reported on the usefulness of these indices,
9,10 more recent studies have pointed out their limitations; FLs also can result from the mislocalization of the blind spot
11 and fixational instability can be found even in well trained observers.
4,12 A high FN rate is reported to be associated with the amount of field loss as well as threshold reproducibility.
13
Gaze tracking (GT) is a record of eye movement monitored during the actual sensitivity measurement.
14 Its use in clinical practice has been somewhat limited, since results are merely represented as a printed line diagram at the bottom of the VF printout, and, as a result, can only be evaluated subjectively by clinicians. Nonetheless, it has been reported that GT is useful for evaluating the quality of fixation, particularly when VF defects surround the blind spot,
15 and indeed, we have recently reported the usefulness of GT parameters for VF reliability as measured by test–retest reproducibility.
16 In the study, GT results were evaluated objectively and quantitatively, and GT parameters were closely related to test–retest reproducibility; the FN rate also was significantly related to test–retest reproducibility, but the FP rate and FL rate were not. Nevertheless, the results do not deny the usefulness of the FP and FL indices, because they may be related to over- or underestimation of VF sensitivity. Indeed, Junoy Montolio et al.
17 investigated the residuals from a mean deviation (MD) trend analysis and reported that high FP rates increase MD values.
5,17 Thus, the objective of the current study was to investigate the usefulness of GT parameters, in addition to classic reliability indices, in the over- and underestimations of VF results.
In the current study GT results from 30-2 HFA VFs were evaluated quantitatively and objectively. The influence of GT results, FLs, FPs, FNs, and PSD on MD values was then investigated. It was suggested that high FL and FP rates tend to raise MD values. Misfixation of more than 3° during the sensitivity measurement, as represented by move3–5 and move>6, was significantly related to low MD values. Misfixation of less than 3° did not have a significant effect on MD values.
With regard to standard reliability indices, FL measures visual fixation during VF tests and is recorded when a patient responds to a stimulus projected onto the blind spot. However, FL also can arise when a patient is “trigger-happy,” similar to FP, as pointed out in a previous report.
7 A high FL rate can occur when a patient traces the target stimulus rather than the fixation target. In addition, pseudo-FL can occur when the blind spot is not in the expected position,
11 due to change in head tilt and eye rotation.
7 Furthermore, in our previous study, it was suggested that FL was not related to test–retest reproducibility of 24-2 and 10-2 VFs.
16 Thus, the usefulness of FL as a reliability indicator is somewhat limited, especially when compared to GT parameters, which are derived from real-time eye movements during the sensitivity measurement.
In the current results, FN rate was not selected as a predictor for MD values. It has been reported that FNs increase with the progression of glaucoma
13 and FN rate is no longer used as an official criterion of reliable VFs.
14 However, this does not deny the usefulness of FNs to assess test reliability. Indeed, we have shown that the FN rate is useful for estimating test–retest reproducibility
16 and, hence, it is not recommended to ignore FN results when interpreting VFs in the clinical setting. Furthermore, Bengtsson
19 investigated the relationship between VF reproducibility and FLs, FPs, and FNs, and found that only FNs were associated with reproducibility.
Our previous study suggested that FP rate is not a significant predictor of test–retest reproducibility.
16 In the SITA algorithm, FP rates are calculated differently from those in the Full-Threshold test, in which classic catch trials are used. In the SITA algorithm, any response before the minimum response time (approximately 180 ms), which also is adjusted according to the patient's individual mean response time, is considered an FP error.
5 This may suggest that all actual “FP” responses after the minimum response time are ignored in the FP calculation. Still, the current results suggest that a high FP rate raises MD values (the “trigger-happy” patients), which is in agreement with a previous report.
17
In contrast with the standard reliability indices, GT parameters directly measure eye position during threshold measurements. Among the GT parameters analyzed in the current study, move
3–5, move
>6, TFF, and BF were significantly associated with low MD values, probably because the patient was not well-fixated and could not see the target stimulus during blinking, as suggested in a previous report.
20 On the other hand, move
1–2 was not significantly related with MD values. This is unsurprising given the 6° spacing of VF test points in the 30-2 VF. Moreover, a previous study has reported that eye movement of less than 3° are commonly observed in VF tests, even in well-trained healthy observers.
4,21
The relationship between PSD and MD can be explained using a quadratic linear regression model,
16,22 where PSD decreases in the moderate to advanced stages of the disease.
7 PSD was included as a predictor of MD in the best linear model in the current study. This may be because the progression of MD, in our study patients, was slow in general (−0.23 dB/y on average) and so the relationship between PSD and MD was linear in this narrow range of progression.
One caveat of the current investigation is the limited information derived from the GT record. Gaze tracking parameters were merely extracted as the average frequency throughout the VF measurement. A more detailed investigation could be carried out if the “real-time” GT tracking results were available to researchers, thus making it possible to analyze fixation status at each sensitivity measurement. A further caveat is that GT results can be related to dry eye,
23 hence, further investigation is needed to shed light on this issue. In the current study, GT data were exported as JPEG images from the Beeline data filing system and various GT parameters were simply calculated by reading the JPEG image. Thus, GT parameters can be obtained on a personal computer; simple software could be built to give clinicians access to this GT information to estimate the reliability of patients' VFs at clinical settings.
In conclusion, we analyzed the influence of eye movements derived from the GT record on HFA VF tests. Gaze tracking parameters are significantly related to the underestimation of sensitivity in 30-2 VF tests.
Supported in part by Grant 26462679 (RA) from the Ministry of Education, Culture, Sports, Science, and Technology of Japan and Japan Science and Technology Agency (JST) CREST (RA, HM).
Disclosure: Y. Ishiyama, None; H. Murata, None; R. Asaoka, None