Regression models (TLR and OLSLR) were fitted to each eye's series of VF sensitivity values (for each VF test point separately) against time (the patient's age at each VF test); for example, see
Figure 1. The two locations adjacent to the blind spot were excluded, giving 52 thresholds for each VF. Next, the least squares residuals (
Ê) were extracted:
and grouped (binned) according to
(measured-sensitivity) in the range [0 to 36] dB. Residuals were also binned according to the fitted-sensitivity value (rounded to the nearest whole decibel),
Ŷ, as this value estimates “true” sensitivity. These two binning methods ask subtly different questions, which are now outlined. Binning by measured-sensitivity establishes variability conditional on the measured-threshold, and asks the question,
given a measured-threshold what is the underlying range of values for the “true” value? This approach is akin to the method used by Wall et al.,
4 where retest thresholds were compared with baseline threshold; in this case, as the authors state, “the first test is not meant to be the true sensitivity, as it has its own variability.” On the other hand, binning by fitted-sensitivity investigates variability according to the estimated true value, and asks the question,
given a “true” threshold value what is the range of measured-thresholds expected for any given test? This approach mirrors the one used in Artes et al.,
9 where thresholds were compared with the mean of several retest thresholds. The difference in the two binning strategies is shown in
Figure 1. In this figure, residuals (indicated by the dashed lines) are associated with an OLSLR fitted-sensitivity bin of 31 dB; however, if the residuals are stratified by measured-threshold, they are pooled into the following bins: 30, 31, 31, 30, 33, 31, 33, and 28 dB. All statistical analyses were carried out in the open-source programming language, R.
19