The “true” rates of progression for each patient were calculated using ordinary least squares regression of mean deviation (MD) over time, the same method as that used in the De Moraes study. In our model we included the difference between MDs in the second and first VFs divided by the time interval separating them. The “true” rates of progression were then regressed against the baseline age of patients and their VF status across two visits as a basic model from which adjusted
R 2 values were generated. To facilitate comparisons, the study sample was split into a reference data set and a validation data set comprising exactly the same numbers of patients as included in the De Moraes study (i.e., 587 patients in the reference data set and 62 patients in the validation data set). To gain a distribution of values for the adjusted
R 2 statistic, the 875 patients were randomly sampled without replacement 100,000 times into reference and validation data sets, to attain 100,000 adjusted
R 2 values for each model (i.e., for the reference data set, 587 patients were selected at random from the 875 patients in our complete data set, and 62 patients were sampled from the remaining 288 patients, a process that was repeated 100,000 times). The distribution of adjusted
R 2 values for the 100,000 reference models can be seen in
Figure A. The median adjusted
R 2 is 0.10, whereas the reported
R 2 for the De Moraes calculator lies at the 87th percentile (0.13). However, given that the reported
R 2 could actually have taken any value between 0.125 and 0.135, the possibility of getting this statistic by chance in our data set could, in fact, be as high as 20%.
Figure B shows the adjusted
R 2 statistic yielded when the reference model is fitted to the validation data set. The median adjusted statistic here is 0.08, but the spread of this distribution should be noted; it was possible to simulate an adjusted
R 2 statistic as high as 0.59 (due to the small sample size). The probability of gaining a better statistic than that of the De Moraes model (
R 2 = 0.11) was close to 35%. We selected one reference model from our distribution in
Figure A with an
R 2 value similar in magnitude to that of the De Moraes rate calculator. The fit of this model can be seen in
Figure C, whereas
Figure D shows the effect of applying this model to a sample validation data set (once again sampled to match the
R 2 in the model reported by De Moraes et al.
1); the 95% limits of agreement are shown by the dotted lines in
Figure D, and are more reflective of the likely range of differences between the estimated and actual rates of progression than the 95% confidence interval for the average difference (indicated by the dashed lines) reported in the abstract of the De Moraes et al. study.
Figures C and
D clearly demonstrate the inadequacy of our model, designed to mirror that of the De Moraes study,
1 for predicting rates of VF loss in spite of statistical significance.