Using the iterative procedure outlined above, the prediction errors associated with each linear regression method were compared for all patients. First, prediction errors were stratified according to the number of measurements (series length) inputted into each regression approach, to determine whether the number of measurements affected the accuracy and relative accuracy of the methods to predict the subsequent measurement.
Figure 2 indicates the results of this analysis as a series of box plots. As expected, for each regression approach, absolute prediction error decreases with the number of measurements. On average, BLR was significantly more accurate for series of up to 8 VF/CLST pairs (mean difference in RMSPEs, OLSLR minus BLR, was 0.14 dB;
n 1 =
n 2 = 179,
P < 0.001, Wilcoxon signed-rank test). OLSLR of VF data alone was more accurate than BLR for longer series (mean difference in RMSPEs, OLSLR minus BLR, was −0.06 dB;
n 1 =
n 2 = 179,
P > 0.1, Wilcoxon signed-rank test), although the difference in RMSPEs was not significant. While it is not recommended to deduce a trend from as few as three VFs, a Bayesian regression analysis including three MS/RA measurement pairs results in a median prediction error smaller than that from OLSLR of a series of five or six MS measurements; however, as seen in
Figure 2, the range of prediction error is greater for BLR of three MS/RA measurements than OLSLR of longer series of MS measurements. BLR of a series of four MS/RA pairs leads to a median absolute prediction error approximately equivalent to that from OLSLR of a series of seven MS measurements, as well as a narrower error range (see dashed lines in
Fig. 2).
Considering all prediction errors for each patient, the two-sided Wilcoxon matched pairs signed rank sum test indicated that the RMSPEs resulting from OLSLR were significantly greater than those from BLR (Wilcoxon signed-rank test:
V = 12,556,
n 1 =
n 2 = 179,
P <0.001). This suggests that, overall, BLR better predicts mean sensitivity and, therefore, more accurately estimates the rate of change in MS. Furthermore, the RMSPE for the BLR was smaller than that for the OLSLR in 134 out of 179 patients (74.9%).
Figure 3A is a scatterplot (with marginal histograms) of the difference in RMSPE (over all series lengths) for the two regression approaches (BLR minus OLSLR) against the OLSLR slope derived from each patient's full series of MS measurements. Negative values in the Y-axis indicate that the RMSPE for BLR is smaller than the corresponding error for OLSLR; hence, 74.9% of the points lie below the
y = 0 line. Across all patients, the mean difference between the RMSPEs for the two linear regression approaches was −0.13 dB (shown by the dashed line in
Fig. 3A), indicating that, on average, prediction error for OLSR was 0.13 dB greater than for BLR. Furthermore, the mean absolute difference in predicted MS using BLR as opposed to OLSLR was 0.74 dB.
For those patients for whom BLR performed less accurately than OLSLR, the OLSLR RMSPEs were significantly smaller than the corresponding OLSLR RMSPEs in the patients for whom BLR provided a more accurate prediction (Wilcoxon rank sum test: W = 1334, n 1 = n 2 = 45, P < 0.01), suggesting that VF variability was much lower in these patients. There was no statistical difference in the mean pixel height standard deviation between these two groups.
In the BLR model, the posterior precision equals the prior precision plus the precision of the likelihood. Hence, the Bayesian confidence intervals for slope (rate of progression) and prediction of MS are inherently smaller than those from OLSLR. This is illustrated in
Figure 4, which plots the magnitude of the 95% confidence interval for the slope (
Fig. 4A) and the prediction of MS (
Fig. 4B) for the BLR and OLSLR methods as a function of the series length.