February 2002
Volume 43, Issue 2
Free
Clinical and Epidemiologic Research  |   February 2002
Development of Efficient Threshold Strategies for Frequency Doubling Technology Perimetry Using Computer Simulation
Author Affiliations
  • Andrew Turpin
    From Discoveries in Sight, Devers Eye Institute, Legacy Clinical Research and Technology Center, Portland, Oregon; and the
  • Allison M. McKendrick
    From Discoveries in Sight, Devers Eye Institute, Legacy Clinical Research and Technology Center, Portland, Oregon; and the
  • Chris A. Johnson
    From Discoveries in Sight, Devers Eye Institute, Legacy Clinical Research and Technology Center, Portland, Oregon; and the
  • Algis J. Vingrys
    Department of Optometry and Vision Sciences, University of Melbourne, Victoria, Australia.
Investigative Ophthalmology & Visual Science February 2002, Vol.43, 322-331. doi:
  • Views
  • PDF
  • Share
  • Tools
    • Alerts
      ×
      This feature is available to authenticated users only.
      Sign In or Create an Account ×
    • Get Citation

      Andrew Turpin, Allison M. McKendrick, Chris A. Johnson, Algis J. Vingrys; Development of Efficient Threshold Strategies for Frequency Doubling Technology Perimetry Using Computer Simulation. Invest. Ophthalmol. Vis. Sci. 2002;43(2):322-331.

      Download citation file:


      © ARVO (1962-2015); The Authors (2016-present)

      ×
  • Supplements
Abstract

purpose. To develop new test procedures for frequency-doubling technology (FDT) perimetry that improve performance beyond those currently used.

methods. Two novel threshold estimation procedures were evaluated: a rapid, efficient binary search technique (REBS) and a maximum-likelihood estimation (ZEST) procedure. A computerized visual field simulation model was developed to determine the accuracy and efficiency of these procedures. This model was constructed using previously derived characteristics of FDT perimetry from both normal observers (n = 506) and those with glaucomatous visual field loss (n = 352). The computer simulation program was used to determine the best parameters for the two new procedures and the effect of variability and response errors on algorithm performance. Comparisons were made to the performance of the modified binary search (MOBS) procedure used in the current commercial implementation of the FDT perimeter.

results. Both the optimized REBS and ZEST procedures approximately halved the time required for FDT threshold testing without loss of accuracy or reproducibility.

conclusions. With suitable parameter choices, comparable performance was achieved using either ZEST or REBS. Simulation results indicate that accurate thresholds can be measured with an optimized ZEST or REBS procedure in approximately half the time required by traditional estimation methods.

Frequency-doubling technology (FDT) perimetry is a new method for assessing visual field loss. 1 2 The frequency-doubling effect occurs when a low-spatial-frequency (<4 cyc/deg) grating is counterphased at a high temporal rate (>15 Hz), resulting in the grating’s appearing to have twice its original spatial frequency. 3 4 The physiological substrate for the frequency-doubling percept has been postulated to be a subset of magnocellular ganglion cells (M-cells) with nonlinear response properties, 2 approximately 15% of the M-cell population (2% of all ganglion cells). 5 FDT perimetry generates a frequency-doubling stimulus, and contrast thresholds are measured for detection of the FDT stimulus. FDT perimetry is a reliable method of detecting glaucomatous visual field loss, 6 and its reproducibility compares favorably with conventional perimetry. 1 7  
An advantage of FDT perimetry over conventional perimetry is decreased test time, partly due to fewer test locations. The commercially available FDT perimeter (Welch Allyn Inc., Skaneateles Falls, NY; Humphrey Systems, Dublin, CA) tests 17 locations: 4 in each quadrant and 1 central for the C-20 full-threshold test and 2 additional locations along the nasal meridian (19 total) for the N-30 full-threshold test. Full-threshold testing measures contrast sensitivity using a modified binary search (MOBS) procedure 8 with an average test time of approximately 5 minutes. This compares favorably with the average test time of 15 minutes for a conventional 24-2 full-threshold or 8 minutes for a Swedish interactive threshold algorithm (SITA) standard strategy (Humphrey Systems). 9  
The purpose of this study was to develop new threshold procedures for FDT perimetry, to improve efficiency and maintain or improve accuracy. We explored the utility of maximum-likelihood procedures, specifically ZEST, 10 11 12 13 because similar procedures (SITA) have been applied successfully to conventional perimetry. 14 15 16 We also optimized the MOBS procedure for use with FDT perimetry to produce a rapid, efficient binary search (REBS). 
Computer simulation was used to evaluate ZEST and REBS. Simulation allows thousands of threshold estimates to be collected rapidly and has been used previously to evaluate clinical test algorithms. 11 16 17 We used simulation to optimize the input parameters and termination criteria of the algorithms and to evaluate the relative importance of these parameters. We compared performance of these algorithms with the MOBS procedure used in the proprietary FDT perimeter. More than 1000 test procedures were assessed. A limited number of these procedures are presented in this report. The full results are available at http://www.computing.edu.au/∼andrew/Barramundi/fdp.html. 
Methods
Overview of the Computer Simulation
For a single subject, at any location within the visual field, the simulation reads an input threshold and then applies a test procedure (ZEST, REBS, or MOBS). In the simplest mode, the simulation assumes that any stimulus presented at a lower contrast (higher decibel) than the input threshold cannot be seen (“no” response). Similarly, any stimulus presented at a higher contrast (lower decibel) than the input threshold receives a “yes” response. If the stimulus is presented at a level equal to the input threshold, then a “yes” or “no” response is chosen with 50% probability. Using these responses, the test procedure is run to completion, and the resultant threshold is output. Comparing the resultant threshold to the input threshold measures test accuracy, whereas the number of presentations required is used as a measure of efficiency. 
Using the simulation as just described confers no advantage over mathematical analysis of the test procedures. An advantage of simulation, however, is that sources of measurement error can be introduced and their effects on individual test procedures studied. We incorporated three types of measurement error into the simulation. Threshold variability was simulated by repeated sampling of a Gaussian distribution whose mean was the input threshold. Previous simulation studies have found little difference in performance between Gaussian and empiric distributions. 17 18 The SD of the Gaussian was varied as 0, 1, and 2 dB to simulate patient variability. 7 Figure 1 shows an example of an input threshold of 14 dB and a Gaussian distribution with an SD of 1 dB. The threshold used to determine a response is 14 dB with probability 0.40, 13 or 15 dB with probability 0.24, and so on. False-positive and false-negative rates were incorporated as a probability that the subject would respond yes or no, respectively, regardless of the stimulus presented. False-positive and -negative rates of 0%, 10%, and 30% were used within the simulation. 
Input Data for the Simulation
The input to our simulation consisted of FDT perimetry data collected from 506 normal eyes and 352 glaucomatous eyes, using the C-20 test. Written informed consent was previously obtained from all subjects, in accordance with the Declaration of Helsinki. Seventeen thresholds were measured in each eye: one determined from a central 5° patch and 16 from a 4 × 4 grid of 10° patches. Thresholds from all eccentricities were pooled, so that there were 506 × 17 = 8602 normal thresholds, and 5984 thresholds from patients with glaucoma. For each experimental condition, the simulation was run 8602 times to determine output for the normal thresholds and 5984 times to determine output for the glaucomatous thresholds. Attributes of the two groups are summarized in Table 1 . This data were also used to derive the ZEST procedures, as detailed in the following section. 
Maximum-Likelihood Estimation Procedure
The ZEST procedure, 11 a variant of QUEST, 13 is based on a maximum-likelihood threshold determination. For each stimulus location, an initial probability density function (pdf) is defined for all possible thresholds, which states for each possible threshold (0–20 dB for FDT perimetry) the probability that any patient will have a threshold at that location. The first stimulus is presented with a contrast equal to the mean of this pdf. The subject’s response to that stimulus is used to modify the pdf for the next presentation. 
The top panel in Figure 2a shows an initial pdf that assumes the most likely threshold for the patient is 14 dB (probability 0.13), whereas the patient is very unlikely to have a threshold of 2, 3, 4, 19, or 20 dB (P = 0.001). The mean of this pdf is 12 dB, which is presented to the subject. If the subject responds no, then the pdf is modified to give more probability to lower decibel levels (Fig. 2a , bottom). The mean of this new pdf determines the next stimulus presentation at this location (9 dB in this case). If the subject responds yes, then the pdf is modified to give a pdf as shown in the bottom panel of Figure 2b , with more weight on higher decibel levels. A stimulus of 15 dB, the mean of this new pdf, is presented next. 
The rule for generating the new pdf is to multiply the old pdf by a likelihood function, which represents the likelihood that a subject will see a stimulus. When the stimulus is at the subject’s true threshold, the maximum-likelihood function reflects what is called the psychometric function or frequency-of-seeing curve. The middle panel of Figure 2a shows one of the 12 likelihood functions used in this study. It assumes that the chance of seeing a stimulus at threshold is 50% (dotted lines) and that seeing any stimulus with much lower decibel levels is a 99% certainty, whereas seeing much higher decibel levels is only a 1% certainty. A stimulus 1 dB either side of threshold has a 75% and 25% chance of being seen. To derive a new pdf when the subject responds no, multiply the old pdf by the likelihood function, which weights lower decibel levels by 99% and higher decibel levels by 1%. When the subject responds yes, the maximum-likelihood function must be inverted before multiplying, so that high decibel levels are weighted at 99%, whereas the chance of the subject’s having a threshold at lower decibel levels is weighted only 1%. This procedure has been followed in Figures 2a and 2b 2c 2d 2e 2f 2g enerate the pdf in the bottom panel from the pdf in the top panel, with the addition of a normalizing step so that the probabilities in the pdf sum to 1. 
Once a new pdf is derived, the new mean is calculated, and a stimulus contrast equal to that mean is presented. The process is then repeated, either a fixed number of times, or until the SD of the pdf falls below a predetermined value. The subject’s threshold is the mean of the final pdf. 
The defining features of a ZEST procedure are the initial pdf, the likelihood function, and the termination rule. As Vingrys and Pianta 10 have noted, the initial pdf can be derived from demographic studies and can be biased by an examiner’s intuition about the subject’s likely thresholds. The likelihood function should reflect the variability inherent in the detection task as well as subjective false error rates. Finally, the termination rule should be chosen with realistic clinical outcomes in mind. One approach is to finish the procedure when the SD (ς) of the pdf declines below a fixed value, which assures a level of confidence about the determined threshold. For example, if ς is chosen to be 1 dB, then there is a 95% chance that the real threshold lies within a ±2-dB range of the measured threshold. One disadvantage of this approach is that unreliable observers may require many presentations. An alternative is to terminate the procedure after a fixed number of presentations, and the 95% confidence interval of the final estimate can be determined from the final pdf. 
To determine the effect of the starting pdf on FDT perimetry thresholds, four classes of pdf were tested in the simulation: uniform distributions (pdfu); pdfs equal to the normalized histogram of sensitivity from the normal eyes in Table 1 (pdfn); pdfs equal to the normalized histogram of sensitivity from the eyes with glaucomatous visual field loss in Table 1 (pdfg); and pdfs formed by combining the normal and glaucomatous pdfs (pdfc). The combined pdfs added the bottom 5% of pdfg to pdfn. 10 Before the addition, the bottom 5% of pdfg was reduced by a weighting factor so that it did not dominate the normal portion of the pdf. Twelve different weighting factors were evaluated. The results presented herein are for a weighting factor of 0.6. Each class of pdf contained a single pdf for each of the 17 visual field locations. There was little difference in the form of the 17 pdfs in each class; however, the means of the peripheral pdfs were generally lower than those for the central five locations. The resultant pdfc pdfs were trimodal (Fig. 2 , top panels), unlike the bimodal pdf of Vingrys and Pianta, 10 which was derived from advanced glaucomatous deficits. 
The following ZEST termination rules were evaluated: stopping after three or four presentations and stopping when the pdf achieved a SD of 0.5, 1, or 2 dB. A single-likelihood function was used for all ZEST procedures (Fig. 2 , middle panels). The ZEST procedures were performed on discrete functions, one for 0 dB, one for 1 dB, and so on. Similarly, the likelihood function was defined for whole-number thresholds in the range 0 to 20 dB. The mean of the pdf at each stage was rounded to the nearest decibel. This rounded mean was used to determine the alignment point for the likelihood function. Rounding introduces a maximal error of 0.5 dB when producing each new pdf. Given the 90% confidence interval for retest variability for normal observers for FDT perimetry is approximately 4 dB, 7 this rounding is unlikely to be clinically significant. 
Modified Binary Search
The optimal binary search technique 19 requires selecting the middle number of a range and then adjustment of the range according to the response. For example, to find a number between 0 and 100, the first selection would be 50. If 50 is too high, the target number falls in the range 1 to 49, and the next selection would be 25, and so on. Binary search is a special case of a maximum-likelihood procedure where the initial pdf considers all thresholds equally likely, and the likelihood function is equal to 100% for thresholds one side of the middle and 0% for other thresholds. 
The binary search assumes that the target number is known and does not change. In perimetry, the threshold varies throughout testing, according to the psychometric function, and subjects make mistakes. To use the efficiency of binary search and also allow for errors and variability, Tyrell and Owens 8 introduced MOBS. MOBS follows the binary-search strategy but also checks the range end points if two yes or no responses occur in succession. If the end point is not consistent—for example, the subject could not see the intensity at the bottom of the range—then the range is widened to the previous end point. A reversal occurs when the previous response differs from the current response. MOBS terminates when the range reaches a minimum width and a fixed number of reversals have occurred. 
Figure 3 shows examples of two MOBS procedures that terminate after two reversals, with a minimum range of 3 dB for a subject with a threshold of 8 dB. In Figure 3a the subject makes no mistakes, whereas in Figure 3b there is a mistake on the first presentation, responding yes to 10 dB. The white rectangle at each stage shows the possible range of thresholds, with the list of numbers to the left and right of the rectangle recording previous lower and upper end points. The subject’s response is shown between each rectangle. In Figure 3a , after three presentations of (A) 10 dB, (B) 5 dB, and (C) 7.5 dB, a threshold of 6.25 dB is estimated. In Figure 3b , after two “no” responses in a row to stimuli of intensity (B) 15 dB and (C) 12.5 dB, the lowest end point of the range is presented and checked. In this case the subject responds that he or she cannot see the lowest end point (10 dB), and the previous end point is restored (0 dB). The procedure then continues until the termination criterion is met after a response at presentation (F). Note that after presentation (E), two reversals have been achieved, but the range is still more than 3 dB, and therefore MOBS continues for one more presentation to determine a threshold of 8.75 dB. 
Eleven MOBS procedures were included in the simulation, each varying the termination criterion. The acronym MOBS will hereafter be used solely to refer to the procedure used in the commercial FDT perimetry machine, which terminates after four reversals with a maximum range of 3 dB. The acronym REBS hereinafter refers to procedures with different termination criteria. The number of reversals was 0, 2, or 4, with ranges of 1, 2, 3, or 20 dB. 
Analysis
Performance was compared using the mean and SD of error in threshold measurement across all test locations and number of presentations. Error was the difference between the estimated and input threshold. Results are given for ideal patients (no Gaussian variation in their input thresholds; 0% false-positive and false-negative rates), typical patients (Gaussian variation of thresholds with an SD of 1 dB; false-positive and false-negative rates of 10%), and unreliable patients (Gaussian variation of threshold with an SD of 2 dB; false-positive and false-negative rates of 30%). The error and efficiency for procedures intermediate to these fell within the boundaries of the represented data. 
Results
Maximum-Likelihood Estimation Procedure
The performance of ZEST is shown in Figures 4a b c d e f for the likelihood function shown in Figure 2a . Each figure shows the number of presentations required to estimate threshold (top panel), the mean error (middle panel), and the SD of error (bottom panel). Bars are labeled with an n, c, g, or u if they used pdfn, pdfg, pdfc, and pdfu respectively. The leftmost four bars are for ZEST procedures terminating with ς = 2 dB, the middle four bars terminating with ς = 1 dB. The final two bars represent the best of the procedures, stopping after three and four presentations, respectively. The best procedure was that with the lowest absolute value of mean error ± SD. 
Figure 4 shows that ZEST often underestimates threshold (error <0 dB) except when using pdfn or pdfc on unreliable glaucomatous subjects. In most cases, the magnitude of the mean error is unlikely to affect clinical outcomes, (between −1 and 1 dB, except when using pdfg on normal subjects or pdfn on subjects with glaucoma). 
Comparing the top panels of Figure 4 shows that between two and three extra presentations are required to reduce the SD of any of the pdfs from 2 to 1 dB. These extra presentations lead to a reduction in the SD of threshold measurement error by approximately 1 dB in ideal patients (bottom panels), and approximately 0.3 dB in unreliable patients. If ZEST is allowed to continue until the pdfs reach a SD of 0.5 dB (not shown in Fig. 4 ), 6 more presentations are required (total of 12 presentations) in unreliable patients, with only a 0.3 reduction in the spread of errors (SD of 3 dB). In patients with highly variable results, there is little gain in accuracy with continuation of testing beyond three to five presentations. The spread of errors is within reasonable limits (≤1.5 dB) terminating after three or four presentations (or ς = 1 dB) in ideal and typical patients. 
Examining the procedures terminating when ς = 1 dB, the top panels of Figure 4 show that ZEST procedures using pdfn were fastest in normal subjects, and the ZEST based on pdfg was fastest in ideal subjects with glaucoma. A surprising finding was that ZEST using pdfn was the fastest in unreliable subjects with glaucoma. The method based on the uniform pdf class (pdfu, fourth bar) was slower than the other three pdf classes, requiring two to three extra presentations, and showed greater errors than the method using pdfc. The combined pdf was approximately one presentation slower than the best method in each case. If it is known which population the patient is from, then a ZEST procedure using an appropriate pdf can achieve an accurate threshold using three to four presentations. A combined pdf requires approximately five presentations. 
The glaucomatous subject data combines performance for locations with both normal and depressed thresholds, because any single glaucoma patient is likely to have some normal visual field locations. To compare the performance of the ZEST procedures on damaged and normal locations, Figure 5 presents the performance of the ZEST procedures plotted against the patient’s threshold. The data shown are averaged over all patients in each group. Three pdfs are shown for each patient group: pdfn and pdfc are as described earlier, whereas pdfd, is the difference between pdfc and pdfn, thus reflecting a pdf for damaged locations only. 
The upper panels of Figure 5 show that ZEST using pdfd requires many presentations once thresholds exceed 5 dB, which is expected of a pdf customized for damaged locations. ZEST using pdfn uses less presentations on average at most threshold values than ZEST with pdfc in the unreliable patients, but at the cost of an increase in mean error. The lower panels of Figure 5 demonstrate that pdfc has the most consistent SD of error, with pdfn and pdfd showing large variability in error response for damaged locations and normal locations, respectively. 
Generally, ZEST with pdfc appears a good compromise between the two extremes, supporting our simulation results over the entire visual field. The number of presentations is consistent across all threshold values, particularly in patients with some erroneous responses. It sacrifices some speed to ZEST with pdfd in areas of damage (0–5 dB), and to ZEST with pdfn in the range 8 to 16 dB. Unless it is known in advance that a patient’s threshold will fall into one of these two ranges, there is no way of knowing which of these pdfs to use. The error behavior of pdfc is as expected, more variable than pdfn in the normal-threshold range, but less in the damaged range. Similarly the measurements made with pdfc are more erroneous in damaged locations than those made with pdfd, but provide better estimates than pdf in other threshold ranges. 
Modified Binary Search
Figure 6 shows the performance of the MOBS and REBS procedures. Each panel shows 10 REBS procedures, each with different termination criteria, and MOBS. The number of reversals used in the REBS procedure is indicated by r = 0 for no reversals, r = 2 for two reversals, and r = 4 for four reversals. The stopping range appears as the tick label in the x-axis of the bottom panel. MOBS is labeled M. 
Figure 6 demonstrates that, REBS and MOBS systematically underestimate thresholds, excepting for unreliable glaucomatous subjects. The final four bars in each panel show that altering the stopping distance using four reversals had little effect on performance. This is not surprising, because after four reversals it is unlikely that the range is greater than 2 dB, given that the starting range is 20 dB. 
Where subjects respond ideally (Figs. 6a 6d) , the mean error is small, with most of the means plus one SD in the range ±1 dB. Given that the clinical difference between these errors is likely to be insignificant, choosing the fastest procedure (0 reversals and a range of 3 dB, leftmost bar) seems appropriate. When unreliability is introduced (Figs. 6b 6c 6e 6f) , this binary search procedure has the greatest spread of error (Fig. 6c 6f bottom panels), and exceeds that of the methods using four reversals. 
Increasing reversals to four, rather than two, results in a small change in accuracy for all patient groups, but requires four to five additional presentations. If large errors (±6 dB) can be tolerated in unreliable patients, then the REBS procedure of choice is one that terminates with 0 reversals and a stopping distance of 3 dB. If more accuracy is required, but speed is necessary, then using two reversals with a stopping distance of either three or unlimited decibels appears most suitable. 
Comparing MOBS, REBS, and ZEST
We compared our simulated results of REBS terminating with two reversals and a range of 3 dB, the commercially available MOBS, and ZEST using pdfc and a stopping criterion of ς = 1. Figure 7 compares the number of presentations, the mean error, and the SD of the error in each of the six subject groups. 
Figure 7 confirms that all three techniques underestimate threshold, with most mean errors being less than 0. Although the difference between the error of the three procedures across each patient group is statistically significant (ANOVA, P < 0.001), the difference in mean error is minimal for clinical outcomes (≤1.0 dB) in all cases but ZEST in unreliable normal patients (−1.96 dB). Figure 7b shows that the new procedures are twice as fast as the existing MOBS, with ZEST being faster than REBS in unreliable patients. 
Discussion
ZEST makes assumptions about a patient’s threshold in the form of the initial pdf. If these assumptions are correct, then accurate thresholds can be established in only two or three presentations per location. If the assumptions are incorrect (but no worse than assuming a uniform distribution of thresholds), then five or six presentations are required to determine the threshold. Because REBS makes no advance assumptions about a patient’s thresholds, between four and six presentations per stimuli are required to establish accurate thresholds. The choice between the procedures depends on the validity of assumptions about a patient’s thresholds. 
A hybrid of the two approaches is possible. ZEST can use a similar end point checking scheme as MOBS and REBS, allowing it to recover from incorrect responses. This may be particularly helpful when a mistake is made early in the procedure, because ZEST has limited ability to recover from early errors. 10 Alternately, REBS can be modified to choose a stimulus other than the midpoint of the current range, guided by a pdf of likely thresholds within the range. 
Using a pdf including both normal and glaucomatous thresholds, pdfc, provided the best overall performance of the ZEST procedure. The possibility for further improvements in efficiency at follow-up visits may be realized by choosing the pdf for each location of the visual field on a point-wise basis, depending on the threshold at the last examination. Additional benefits in efficiency may also be possible through the incorporation of neighborhood logic to the thresholding algorithm, in that the threshold at any particular location is not independent of that of its immediate neighbors. We chose not to include such methodology in this study, because the benefits when testing only 17 locations are likely to be small. This may not be the case when greater numbers of locations are tested. 
As well as choosing a pdf for ZEST, a likelihood function should be chosen. Although we present data for only a single-likelihood function, we included 12 different likelihood functions within the simulation ranging from a relatively flat slope derived by scaling a white-on-white perimetry frequency-of-seeing curve, through to steeper curves such as that finally used. As expected, steeper likelihood functions resulted in decreased test times (for data see http://www.computing.edu.au/∼andrew/Barramundi/fdp.html). Our primary purpose was to develop a fast procedure while admitting a small and clinically acceptable loss in accuracy, and we therefore used steeper curves. Similarly, the end points of the curve were varied from 95%–5% through to 99%–1%, and again the simulation results showed that the increase in speed of convergence of the ZEST procedure when using 99%–1% was not at the expense of clinically acceptable loss in accuracy. 
Although we have attempted to include all factors within the simulation, there are several limitations that make it possible that human performance may differ from that predicted. First, the pdfs used in this study were generated from the same empiric patient thresholds that were input to the simulation. Consequently, the subjects tested were definitely represented within the pdfs—an ideal situation. The ZEST procedure may not perform as well as reported by the simulation when new subjects are tested. Our pdfs were based on a large sample of eyes (506 normal eyes and 352 glaucomatous eyes) and it seems unlikely that profound differences in performance will result when ZEST is used within a clinical setting. 
A further limitation is that throughout the simulation, error rates were set to be independent of threshold size—that is, every input threshold was tested with ideal, typical, and unreliable error conditions. Clinical variability increases with defect depth. 7 The variability of FDT perimetry thresholds increases only modestly (20%–30%), 20 in contrast to the large changes observed in achromatic perimetry (300%–400%). 7 In any patient, responses may vary from typical to unreliable at different locations within the visual field, depending on the deficit depth. In this case, the simulation predicts a different number of presentations and range of errors at these locations. A prediction of performance can be determined for each location from the data shown in Figure 5 , however overall performance will differ from that presented in Figures 3 and 4
Our results suggest that REBS can reduce the number of presentations by 45%, with an average error of less than 0.5 dB. In ideal and typical patients, ZEST matches the performance of REBS. In unreliable patients, ZEST can achieve even greater efficiency, saving 55% of presentations, but with an increase in average error to approximately 2 dB. The predicted reduced test time per presentation should result in more rapid testing of the established 17-location FDT perimetry pattern. Alternately, a greater number of test locations may be assessed within an acceptable test duration, thereby improving the spatial resolution of FDT perimetry. Clinical validation of these predictions forms the basis for ongoing study in our laboratory. 
 
Figure 1.
 
An example of the method by which threshold variability was simulated. The sample simulated patient had an input threshold of 14 dB; however, the threshold chosen by the simulation to determine the response was varied according to a Gaussian distribution with an SD of 1 dB. Dotted lines: the probabilities of the simulation’s choosing thresholds of 12, 13, 14, 15, or 16 dB.
Figure 1.
 
An example of the method by which threshold variability was simulated. The sample simulated patient had an input threshold of 14 dB; however, the threshold chosen by the simulation to determine the response was varied according to a Gaussian distribution with an SD of 1 dB. Dotted lines: the probabilities of the simulation’s choosing thresholds of 12, 13, 14, 15, or 16 dB.
Table 1.
 
Summary of the Age and Threshold Characteristics of the Patient Data Used as Input Thresholds for the Simulation and to Derive pdfs for the ZEST Procedures
Table 1.
 
Summary of the Age and Threshold Characteristics of the Patient Data Used as Input Thresholds for the Simulation and to Derive pdfs for the ZEST Procedures
Group n Age (y) Threshold (dB)
Minimum Maximum Mean
Normal 506 47 ± 16 0 20 13.52 ± 2.03
Glaucoma 352 67 ± 12 0 20 8.96 ± 4.15
Figure 2.
 
The first step of a ZEST procedure. Top: initial pdf; middle: likelihood functions with the 50% seeing point aligned with the mean of the pdf in the top panels. (a) The “no” likelihood function; (b) the“ yes” likelihood function. Bottom: the result of multiplying the initial pdf with the likelihood function to derive a new pdf. Vertical lines: The means.
Figure 2.
 
The first step of a ZEST procedure. Top: initial pdf; middle: likelihood functions with the 50% seeing point aligned with the mean of the pdf in the top panels. (a) The “no” likelihood function; (b) the“ yes” likelihood function. Bottom: the result of multiplying the initial pdf with the likelihood function to derive a new pdf. Vertical lines: The means.
Figure 3.
 
Example of a MOBS procedure. (a) A patient with perfect responses with a threshold of 8 dB; (b) the procedure when a mistake is made on the first presentation. White rectangles: the range of possible thresholds at each step of the procedure. The numbers at either end of the rectangle are a list of previous end points of the range. The number adjacent to the dotted line in the center of the rectangle is the contrast of the presented stimulus.
Figure 3.
 
Example of a MOBS procedure. (a) A patient with perfect responses with a threshold of 8 dB; (b) the procedure when a mistake is made on the first presentation. White rectangles: the range of possible thresholds at each step of the procedure. The numbers at either end of the rectangle are a list of previous end points of the range. The number adjacent to the dotted line in the center of the rectangle is the contrast of the presented stimulus.
Figure 4.
 
Performance of the ZEST procedures using pdfn (n), pdfg (g), pdfc (c), and pdfu (u). In each of the six subject groups: (top) mean number of presentations (± SD); (middle) mean error; (bottom) the SD of the mean error. The first four bars show outcomes for ZEST, terminating when ς = 2 dB; the second four bars terminate when ς = 1 dB; and the final two bars show the best performance when ZEST terminates after three and four presentations, respectively.
Figure 4.
 
Performance of the ZEST procedures using pdfn (n), pdfg (g), pdfc (c), and pdfu (u). In each of the six subject groups: (top) mean number of presentations (± SD); (middle) mean error; (bottom) the SD of the mean error. The first four bars show outcomes for ZEST, terminating when ς = 2 dB; the second four bars terminate when ς = 1 dB; and the final two bars show the best performance when ZEST terminates after three and four presentations, respectively.
Figure 5.
 
Performance of the ZEST procedures plotted against the input thresholds. The data shown are averaged over all patients in each group. Top: mean number of presentations (±SD); middle: the mean error; bottom: the SD of that error. Three pdfs are shown for each patient group: pdfn, pdfc, and pdfd.
Figure 5.
 
Performance of the ZEST procedures plotted against the input thresholds. The data shown are averaged over all patients in each group. Top: mean number of presentations (±SD); middle: the mean error; bottom: the SD of that error. Three pdfs are shown for each patient group: pdfn, pdfc, and pdfd.
Figure 6.
 
Comparison of REBS and MOBS procedures. For each of the six subject groups (af): (top) mean number of presentations (± SD); (middle) mean error; (bottom) SD of the mean error. For REBS, different stopping rules are shown along the x-axis as the number of reversals (r) to stop (r = 0, 2, and 4) and the maximum range (20, 3, 2, or 1 dB). MOBS outcomes are identified by M.
Figure 6.
 
Comparison of REBS and MOBS procedures. For each of the six subject groups (af): (top) mean number of presentations (± SD); (middle) mean error; (bottom) SD of the mean error. For REBS, different stopping rules are shown along the x-axis as the number of reversals (r) to stop (r = 0, 2, and 4) and the maximum range (20, 3, 2, or 1 dB). MOBS outcomes are identified by M.
Figure 7.
 
Comparison of the best-performing REBS and ZEST procedures with MOBS. The REBS procedure terminated after two reversals with a range of 3 dB. The ZEST procedure used pdfc and a stopping criterion ofς = 1. For each of the six subject groups: (top) mean number of presentations (± SD); (middle) mean error; (bottom) the SD of the mean error.
Figure 7.
 
Comparison of the best-performing REBS and ZEST procedures with MOBS. The REBS procedure terminated after two reversals with a range of 3 dB. The ZEST procedure used pdfc and a stopping criterion ofς = 1. For each of the six subject groups: (top) mean number of presentations (± SD); (middle) mean error; (bottom) the SD of the mean error.
Johnson CA, Samuels SJ. Screening for glaucomatous visual field loss with frequency doubling perimetry. Invest Ophthalmol Vis Sci. 1997;38:413–425. [PubMed]
Maddess T, Henry GH. Performance of nonlinear visual units in ocular hypertension and glaucoma. Clin Vis Sci. 1992;7:371–383.
Kelly DH. Frequency doubling in visual responses. J Opt Soc Am. 1966;56:1628–1633. [CrossRef]
Kelly DH. Nonlinear visual responses to flickering sinusoidal gratings. J Opt Soc Am. 1981;71:1051–1055. [CrossRef] [PubMed]
Blakemore C, Vital-Durand F. Organization and post-natal development of the monkey’s lateral geniculate nucleus. J Physiol. 1986;380:453–491. [CrossRef] [PubMed]
Cello KE, Nelson-Quigg JM, Johnson CA. Frequency doubling technology perimetry for detection of glaucomatous field loss. Am J Ophthalmol. 2000;129:314–322. [CrossRef] [PubMed]
Chauhan BC, Johnson CA. Test-retest variability of frequency-doubling perimetry and conventional perimetry in glaucoma patients and normal subjects. Invest Ophthalmol Vis Sci. 1999;40:648–656. [PubMed]
Tyrell RA, Owens DA. A rapid technique to assess the resting states of the eyes and other threshold phenomena: the Modified Binary Search (MOBS). Behav Res Methods Instr Computers. 1988;20:137–141. [CrossRef]
Wild JM, Pacey IE, O’Neill EC, Cunliffe IA. The SITA perimetric threshold algorithms in glaucoma. Invest Ophthalmol Vis Sci. 1999;40:1998–2009. [PubMed]
Vingrys AJ, Pianta M. A new look at threshold estimation algorithms for automated static perimetry. Optom Vis Sci. 1999;76:588–595. [CrossRef] [PubMed]
King-Smith PE, Grigsby SS, Vingrys AJ, Benes SC, Supowit A. Efficient and unbiased modifications of the Quest threshold method: theory, simulations, experimental evaluation and practical implementation. Vision Res. 1994;34:885–912. [CrossRef] [PubMed]
Green DM. A maximum likelihood method for estimating thresholds in a yes-no task. J Accoust Soc Am. 1993;93:2096–2105. [CrossRef]
Watson AB, Pelli DG. Quest: a Bayesian adaptive psychometric method. Percept Psychophys. 1983;33:113–120. [CrossRef] [PubMed]
Bengtsson B, Heijl A, Olsson J. Evaluation of a new threshold visual field strategy, SITA, in normal subjects. Acta Ophthalmol Scand. 1998;76:165–169. [CrossRef] [PubMed]
Bengtsson B, Heijl A. Evaluation of a new perimetric threshold strategy, SITA, in patients with manifest and suspect glaucoma. Acta Ophthalmol Scand. 1998;76:268–272. [CrossRef] [PubMed]
Bengtsson B, Olsson J, Heijl A, Rootzen H. A new generation of algorithms for computerized threshold perimetry. Acta Ophthalmol Scand. 1997;75:368–375. [PubMed]
Johnson CA, Chauhan BC, Shapiro LR. Properties of staircase procedures for estimating thresholds in automated perimetry. Invest Ophthalmol Vis Sci. 1992;33:2966–2974. [PubMed]
Chauhan BC, Johnson CA. Evaluating and optimizing test strategies in automated perimetry. J Glaucoma. 1994;3:S72–S81.
Knuth DE. The Art of Computer Programming. Sorting and Searching. 1998;3 Addison-Wesley Reading, MA.
Spry PGD, Johnson CA, McKendrick AM, Turpin A. Variability components of standard automated perimetry and frequency-doubling technology perimetry. Invest Ophthalmol Vis Sci. 2001;42:1404–1410. [PubMed]
Figure 1.
 
An example of the method by which threshold variability was simulated. The sample simulated patient had an input threshold of 14 dB; however, the threshold chosen by the simulation to determine the response was varied according to a Gaussian distribution with an SD of 1 dB. Dotted lines: the probabilities of the simulation’s choosing thresholds of 12, 13, 14, 15, or 16 dB.
Figure 1.
 
An example of the method by which threshold variability was simulated. The sample simulated patient had an input threshold of 14 dB; however, the threshold chosen by the simulation to determine the response was varied according to a Gaussian distribution with an SD of 1 dB. Dotted lines: the probabilities of the simulation’s choosing thresholds of 12, 13, 14, 15, or 16 dB.
Figure 2.
 
The first step of a ZEST procedure. Top: initial pdf; middle: likelihood functions with the 50% seeing point aligned with the mean of the pdf in the top panels. (a) The “no” likelihood function; (b) the“ yes” likelihood function. Bottom: the result of multiplying the initial pdf with the likelihood function to derive a new pdf. Vertical lines: The means.
Figure 2.
 
The first step of a ZEST procedure. Top: initial pdf; middle: likelihood functions with the 50% seeing point aligned with the mean of the pdf in the top panels. (a) The “no” likelihood function; (b) the“ yes” likelihood function. Bottom: the result of multiplying the initial pdf with the likelihood function to derive a new pdf. Vertical lines: The means.
Figure 3.
 
Example of a MOBS procedure. (a) A patient with perfect responses with a threshold of 8 dB; (b) the procedure when a mistake is made on the first presentation. White rectangles: the range of possible thresholds at each step of the procedure. The numbers at either end of the rectangle are a list of previous end points of the range. The number adjacent to the dotted line in the center of the rectangle is the contrast of the presented stimulus.
Figure 3.
 
Example of a MOBS procedure. (a) A patient with perfect responses with a threshold of 8 dB; (b) the procedure when a mistake is made on the first presentation. White rectangles: the range of possible thresholds at each step of the procedure. The numbers at either end of the rectangle are a list of previous end points of the range. The number adjacent to the dotted line in the center of the rectangle is the contrast of the presented stimulus.
Figure 4.
 
Performance of the ZEST procedures using pdfn (n), pdfg (g), pdfc (c), and pdfu (u). In each of the six subject groups: (top) mean number of presentations (± SD); (middle) mean error; (bottom) the SD of the mean error. The first four bars show outcomes for ZEST, terminating when ς = 2 dB; the second four bars terminate when ς = 1 dB; and the final two bars show the best performance when ZEST terminates after three and four presentations, respectively.
Figure 4.
 
Performance of the ZEST procedures using pdfn (n), pdfg (g), pdfc (c), and pdfu (u). In each of the six subject groups: (top) mean number of presentations (± SD); (middle) mean error; (bottom) the SD of the mean error. The first four bars show outcomes for ZEST, terminating when ς = 2 dB; the second four bars terminate when ς = 1 dB; and the final two bars show the best performance when ZEST terminates after three and four presentations, respectively.
Figure 5.
 
Performance of the ZEST procedures plotted against the input thresholds. The data shown are averaged over all patients in each group. Top: mean number of presentations (±SD); middle: the mean error; bottom: the SD of that error. Three pdfs are shown for each patient group: pdfn, pdfc, and pdfd.
Figure 5.
 
Performance of the ZEST procedures plotted against the input thresholds. The data shown are averaged over all patients in each group. Top: mean number of presentations (±SD); middle: the mean error; bottom: the SD of that error. Three pdfs are shown for each patient group: pdfn, pdfc, and pdfd.
Figure 6.
 
Comparison of REBS and MOBS procedures. For each of the six subject groups (af): (top) mean number of presentations (± SD); (middle) mean error; (bottom) SD of the mean error. For REBS, different stopping rules are shown along the x-axis as the number of reversals (r) to stop (r = 0, 2, and 4) and the maximum range (20, 3, 2, or 1 dB). MOBS outcomes are identified by M.
Figure 6.
 
Comparison of REBS and MOBS procedures. For each of the six subject groups (af): (top) mean number of presentations (± SD); (middle) mean error; (bottom) SD of the mean error. For REBS, different stopping rules are shown along the x-axis as the number of reversals (r) to stop (r = 0, 2, and 4) and the maximum range (20, 3, 2, or 1 dB). MOBS outcomes are identified by M.
Figure 7.
 
Comparison of the best-performing REBS and ZEST procedures with MOBS. The REBS procedure terminated after two reversals with a range of 3 dB. The ZEST procedure used pdfc and a stopping criterion ofς = 1. For each of the six subject groups: (top) mean number of presentations (± SD); (middle) mean error; (bottom) the SD of the mean error.
Figure 7.
 
Comparison of the best-performing REBS and ZEST procedures with MOBS. The REBS procedure terminated after two reversals with a range of 3 dB. The ZEST procedure used pdfc and a stopping criterion ofς = 1. For each of the six subject groups: (top) mean number of presentations (± SD); (middle) mean error; (bottom) the SD of the mean error.
Table 1.
 
Summary of the Age and Threshold Characteristics of the Patient Data Used as Input Thresholds for the Simulation and to Derive pdfs for the ZEST Procedures
Table 1.
 
Summary of the Age and Threshold Characteristics of the Patient Data Used as Input Thresholds for the Simulation and to Derive pdfs for the ZEST Procedures
Group n Age (y) Threshold (dB)
Minimum Maximum Mean
Normal 506 47 ± 16 0 20 13.52 ± 2.03
Glaucoma 352 67 ± 12 0 20 8.96 ± 4.15
×
×

This PDF is available to Subscribers Only

Sign in or purchase a subscription to access this content. ×

You must be signed into an individual account to use this feature.

×