We used the FT algorithm
22 as the base procedure for our work. It is a staircase procedure that modifies stimulus luminance in steps of 4 dB until the first response reversal occurs and subsequently in steps of 2 dB. The HFA implementation of FT terminates after two response reversals and takes the stimulus luminance of the last-seen presentation as the final sensitivity estimate for a given location. In our implementation of FT, if the first estimate was more than 4 dB away from the starting point of the staircase, then the procedure was repeated using the first sensitivity estimate as the starting point, and the result of this second staircase was taken as the final estimate of sensitivity.
We computed the error distribution of FT as follows. Given the FoS curve (psychometric function) of a patient for a particular location in the visual field and the starting point of the FT procedure, we could compute the probability of obtaining any particular measurement for that location, assuming that the patient was performing the task correctly. We used a technique that we have used previously, in which paths in the binary decision tree for the FT procedure with a given start guess are labeled with probabilities from an assumed patient's FoS curve.
23 The leaves of the tree represent possible outcomes from the procedure with an associated probability. An example of two probability distributions derived in this study is shown in
Figure 1.
Both panels use a FoS curve described by Abott's formula.
24 where
fp is the false-positive rate defining the lower asymptote of Ψ;
fn is the false-negative rate defining the upper asymptote of Ψ;
s is the standard deviation of a cumulative Gaussian defining the spread of Ψ;
t is the threshold, or translation of Ψ, along the abscissa; and
G(x,
t,
s) is the value at
x of a cumulative Gaussian distribution with mean
t and SD
s. For these experiments, we assumed that the patient would make few errors, and so we set the false-positive and -negative rates to 1%. With reducing visual field sensitivity, the psychometric function slope flattens for size III SAP targets.
14,15 Our model accounts for this dependence of variability on sensitivity by varying the standard deviation of the Gaussian with sensitivity by using
s = exp(−0.066 ×
t + 2.81) as previously reported for clinical data
14 capped at a maximum of 6 dB. In the two examples shown in
Figure 1, we assumed that all starting points for the FT procedure are equally likely and summed and normalized the 41 possible distributions (start points of 0,1… 40 dB) into the ones shown.
We computed these distributions for values of
t from 0 to 40 dB, then for each distribution we fit a Gaussian:
G(0… 40,
mt ,
st ) using the nonlinear minimization (nlm) function in R to minimize the
L 1 norm) (see
Fig. 1 for examples where
t = 4 and
t = 10 dB). Because we assumed a high variability in patient response (flat FoS curve) when true sensitivity was low, there was a “floor effect” where FT returned 0 dB very often. This effect can be seen in
Figure 1, left, where, although the true sensitivity is 4 dB, 0 dB occurs nearly as often as 3 dB. This situation arises when a patient does not see 0 dB twice; then, FT terminates and returns a sensitivity of 0 dB, resulting in multimodal distributions for low true sensitivities, as in
Figure 1, left, making the Gaussian fit poor. This phenomenon has been widely reported in clinical studies of the test–retest distribution of SAP thresholds,
1,2,16 where the distribution of retest values is skewed to lower sensitivities, particularly for low test sensitivities. We note in passing that if we sample test and retest values from our fitted Gaussian distributions (raising any observed negative values to 0), we get very similar skew distributions, and so the use of symmetrical Gaussians to model measured–given–true sensitivities is not inconsistent with the observed asymmetrical test–retest distributions reported in the literature. Examples of such distributions are shown in the Results section.
Another idiosyncrasy of the FT algorithm is that it underestimates sensitivities by approximately 1 dB on average because the estimate returned is the last seen stimulus.
1,25 We “recenter” the distributions by setting the mean to the true sensitivity value of the patient. Thus, the distribution of possible outcomes from FT for true sensitivity
t is described by
G(0… 40,
t,
st ).
By choosing a true sensitivity value and then sampling from G(0… 40,
t,
st ), we get a measured sensitivity value that is typical of current SAP procedures. As the purpose of this work was to investigate the benefits of improving SAP variability, we simulated improved procedures by systematically reducing
st in steps of 10%, sampling from
G(0… 40,
t, 90% ×
st ),
G(0… 40,
t, 80% ×
st ), and so on. We also examined the ability to detect progression if the FoS slopes were consistent across the range of available sensitivity estimates—that is, if there was no flattening of psychometric function slope with decreasing sensitivity. Avoiding a flattening of FoS with sensitivity is the goal of several research groups exploring alternate stimulus types to current size III SAP targets.
15,18