It has been reported that there is a greater likelihood of detecting glaucomatous progression by OCT than by automated perimetry.
9 However, there is limited information regarding the ability of OCT to detect progressive glaucomatous change. The present study found that using specific criteria, Stratus OCT detected progressive RNFL atrophy with high sensitivity and moderate specificity. This appears to be the first relatively large study to investigate the use of Stratus OCT for detecting glaucoma progression.
The reproducibility of Status OCT and several other findings of the present study are similar to those reported recently by Budenz et al.
18 They reported a test–retest variability of 6.7 μm for average RNFL thickness, as measured by the Fast RNFL program. The reproducibility was variable depending on the size and location of the area that was measured. Specifically, we found less test–retest variability for average thickness than for quadrants and less test–retest variability in quadrants than in their respective clock-hour sectors. In addition, for both studies, the measurement was least reproducible (lowest ICCs and highest COVs) in the nasal quadrant and in the respective clock-hour sectors.
We tested the sensitivity and specificity of Stratus OCT for detection of glaucoma progression using criteria corresponding to the upper 95% limit of test–retest variability defined at the 95% and 80% confidence level. These criteria used different thickness values for individual clock hours or quadrants. It seemed unreasonable to use identical criteria for all clock hours (e.g., >15 μm decrease from the baseline measurement) or quadrants owing to the considerable variation between clock hour measurements and quadrant measurements. This was also the case when the test–retest variability was converted to percentages (data not presented).
For the criteria we tested, the clock-hour criterion appeared to be the most sensitive. This is somewhat contrary to our expectations based on measurements of test–retest variability. The ability to detect changes due to disease depends largely on the test–retest variability of measurements. When variability is low, small changes can be detected with confidence. Our test–retest variability was lowest for average RNFL thickness followed by quadrant thickness, and so we expected that the criteria for average RNFL thickness would have the highest sensitivity, followed by quadrant criterion. However, this was the opposite of our observations. To explain this, we propose that glaucoma progression occurred focally, at least in subjects who had expansion of a localized RNFL defect, and this had little affect on average RNFL thickness and quadrant thickness. Indeed, the difference in average RNFL thickness between the baseline and follow-up examinations in patients with progressive RNFL atrophy (4.3 ± 6.5 μm) was far less than the 95% upper limit of test–retest variability defined at the 95% confidence level (7.9 μm). Even when tested with the test–retest variability defined at the 80% confidence level, the sensitivity was still approximately 40%.
We expected that the RNFL thickness would change by no more than the upper 95% limit of our test–retest variability defined at 95% confidence level 95% of the time (assuming there was no change in patient age or disease progression). This means that, with the criteria corresponding to the upper 95% limit of test–retest variability, there should be a specificity of 95%. However, this expectation is applicable only to cases where the specificity is tested once. In the present study, the specificity for criterion based on average RNFL thickness (which needs a single test) was 98.4%. However, for the criteria based on clock-hour and quadrant thickness, the specificities were 59.7% and 77.4%, respectively. This decrease in specificity is in agreement with our statistical considerations. If parameters for each sector are set at the 95% confidence level, 12 simultaneous tests for 12 different clock-hour measurements and four simultaneous tests for four quadrant measurements may have the confidence level of 54.0% (0.95
12 ) and 81.5% (0.95
4 ), respectively.
In clinical practice, the inherent limitations of simultaneous multiple testing can be overcome by repeat OCT testing. Given that the specificity of a diagnostic device is 60% for a single test, the specificity should increase to 84% (1–0.42) when the test is performed twice and to 93.6% (1–0.43) when it is performed three times. The specificity calculated from two consecutive examinations in the present study agrees with this expectation. Meanwhile, the sensitivity of OCT did not change considerably when calculated from two consecutive examinations and maintained acceptable sensitivity for the clock-hour criterion.
In the present study, test–retest variability as well as sensitivity and specificity were measured in patients with early-to-moderate glaucoma. It is possible that the test–retest variability determined in the present study would not be valid for evaluating the ability of OCT to detect progressive change in patients with advanced glaucoma who have a much thinner RNFL at baseline. Moreover, the test–retest variability in such patients may significantly differ from that observed in the present patients. Thus, further studies may be required to evaluate the ability of OCT to detect progressive glaucomatous change in patients with advanced damage.
In the present study, progressive change in glaucoma was defined primarily based on changes identified in RNFL photographs. It is common to define progression based on VF changes when studying glaucoma progression. However, VF data may fluctuate, leading to incorrect conclusions. In addition, detectable functional change does not occur simultaneously with structural changes in glaucoma.
16 17 18 Thus, it is inappropriate to consider VF change as the gold standard when investigating the ability of a diagnostic instrument to detect progressive structural change. In contrast, both OCT RNFL thickness analysis and RNFL photography evaluate the same anatomic structure, the RNFL. As such, it is reasonable to assume that any RNFL change should be detected by OCT if OCT can accurately measure RNFL thickness. Furthermore, by defining progression based on RNFL photography, we were able to evaluate the topographic relationship between the OCT-measured RNFL thickness change location and the progressive RNFL atrophy location based on red-free RNFL photography.
We should note that the video image acquisition of the Stratus OCT is recorded after the actual scan has been taken and may not represent the exact location of the acquired scan. If this disparity is substantial, it may lead to a weak topographical relationship between the RNFL photography and OCT. In the present study, we noted an excellent topographical correlation in subjects who showed progressive atrophy in the photography. This observation suggests that the disparity between the actual scan and the true location of acquired scan in OCT is likely to be minimal.
The present study included only eyes with localized defects and with an obvious border. Although localized RNFL defects are easily identified with red-free photography, it is sometimes difficult to define clearly the diffuse atrophy borders using RNFL photography,
19 which may lead to problems in measuring any defect expansion. As such, the present study excluded eyes with diffuse atrophy.
Owing to the design of our study, our results provide information regarding the ability of OCT to detect progressive RNFL atrophy only of patients who have localized RNFL defects. The performance of OCT for the detection of glaucoma progression of patients with diffuse RNFL damage remains to be determined.
Progression of the RNFL defect was defined as the widening of the preexisting defect or development of a new defect. However, it is possible that progressive changes occur in a different fashion. Remnant nerve fibers in a region previously defined as defective may be further damaged without any change in the defect border (i.e., deepening of the defect). It is difficult to accurately detect this type of progression, and the inclusion of such change in the definition of progression may provide a source of bias. Thus, the present study considered only defect widening or a new defect as the criteria for progression.
In eyes in which we detected progressive OCT change, we often observed the progressive change even in clock hour sectors where progression was not noted in the photography. It is possible that OCT-detected changes represent true RNFL loss that can only be detected using OCT. At present, it is impossible to test this, as there is no reference standard that can be used to definitively indicate progressive structural changes in glaucoma. A prospective longitudinal study could investigate this question.
In the present study, we included only the subjects who had high-quality OCT measurements (that is, good centration on the optic disc of the scan and signal strength ≥ 6). These inclusion criteria may have biased our results toward a better performance of OCT for detection of glaucoma progression than typically occurs in clinical practice.
In summary, the present study demonstrated that within the limits of retest variability, Stratus OCT can detect progressive RNFL atrophy with high sensitivity and moderate specificity in cases showing localized progressive loss of retinal nerve fibers in red-free photographs. The OCT-measured RNFL thickness change showed excellent topographic agreement with the progressive RNFL atrophy observed using RNFL photography. We also demonstrated that the specificity of Stratus OCT can be improved by repeated testing. Stratus OCT is a potentially useful method for detection of glaucoma progression.