Abstract
Purpose.:
To evaluate the intersession repeatability of retinal thickness measurements in patients with diabetic macular edema (DME) using the Heidelberg Spectralis optical coherence tomography (OCT) algorithm and a publicly available, three-dimensional graph search-based multilayer OCT segmentation algorithm, the Iowa Reference Algorithm.
Methods.:
Thirty eyes from 21 patients diagnosed with clinically significant DME were included and underwent consecutive, registered macula-centered spectral-domain optical coherence scans (Heidelberg Spectralis). The OCT scans were segmented into separate surfaces, and the average thickness between internal limiting membrane and outer retinal pigment epithelium complex surfaces was determined using the Iowa Reference Algorithm. Variability between paired scans was analyzed and compared with the retinal thickness obtained from the manufacturer-supplied Spectralis software.
Results.:
The coefficient of repeatability (variation) for central macular thickness using the Iowa Reference Algorithm was 5.26 μm (0.62% [95% confidence interval (CI), 0.43–0.71]), while for the Spectralis algorithm this was 6.84 μm (0.81% [95% CI, 0.55–0.92]). When the central 3 mm was analyzed, the coefficient of repeatability (variation) was 2.46 μm (0.31% [95% CI, 0.23–0.38]) for the Iowa Reference Algorithm and 4.23 μm (0.53% [95% CI, 0.39–0.65]) for the Spectralis software.
Conclusions.:
The Iowa Reference Algorithm and the Spectralis software provide excellent reproducibility between serial scans in patients with clinically significant DME. The publicly available Iowa Reference Algorithm may have lower between-measurement variation than the manufacturer-supplied Spectralis software for the central 3 mm subfield. These findings have significant implications for the management of patients with DME.
Thirty eyes of 21 patients with clinically significant DME (11 male and 10 females) with an average age of 59.9 ± 11.3 years were included in the study. The average central macular thickness (CMT), defined as the central 1 mm on the ETDRS grid, calculated by the Iowa Reference Algorithm, was 435.60 μm (95% CI, 186.60–684.59 μm); calculated by the Spectralis software it was 429.15 μm (95% CI, 180.31–677.99 μm), a difference that is not significant (
P = 0.85). See the
Table. Comparing the CMT between the two algorithms revealed a coefficient of repeatability of 25.02 μm and a coefficient of variation of 2.96% (95% CI, 2.29–3.86). The average central 3 mm thickness was 407.23 μm (95% CI, 265.90–548.56 μm) for the Iowa Reference Algorithm and 405.02 μm (95% CI, 274.12–535.91 μm) for the Heidelberg Spectralis algorithm, also a nonsignificant difference (
P = 0.902). Comparing the central 3 mm thickness between the two algorithms gave a coefficient of repeatability of 15.73 μm and a coefficient of variation of 1.86% (95% CI, 1.24–2.08).
Table Intersession Coefficient of Repeatability and Variation for the Iowa Reference Algorithm and the Heidelberg Spectralis Software
Table Intersession Coefficient of Repeatability and Variation for the Iowa Reference Algorithm and the Heidelberg Spectralis Software
| Mean Macular Thickness, μm (95% CI) | Coefficient of Variation, % (95% CI) | Coefficient of Repeatability, μm |
Iowa Reference Algorithm, central 1 mm | 435.60 (186.60–684.59) | 0.62 (0.43–0.71) | 5.26 |
Spectralis software, central 1 mm | 429.15 (180.31–677.99) | 0.81 (0.55–0.92) | 6.84 |
Iowa Reference Algorithm, central 3 mm | 407.23 (265.90–548.56) | 0.31 (0.23–0.38) | 2.46 |
Spectralis software, central 3 mm | 405.02 (274.12–535.91) | 0.53 (0.39–0.65) | 4.23 |
Iowa Reference Algorithm, central 1 mm > 400 μm | 534.81 (336.65–732.98) | 0.64 (0.43–0.71) | 6.70 |
Iowa Reference Algorithm, central 1 mm < 400 μm | 336.38 (246.25–426.50) | 0.49 (0.47–0.79) | 3.25 |
Spectralis software, central 1 mm > 400 μm | 529.80 (339.98–719.62) | 0.87 (0.66–1.11) | 9.0 |
Spectralis software, central 1 mm < 400 μm | 328.50 (234.45–422.55) | 0.55 (0.40–0.67) | 3.56 |
Both the Iowa Reference Algorithm and Spectralis software consistently segmented the boundaries of the retina layers well as evidenced in
Figure 1, providing excellent intersession repeatability (
Fig. 2). The intersessional coefficient of repeatability and variation for repeat scans of the CMT was 5.26 μm (0.62% [95% CI, 0.43–0.71]) for the Iowa Reference Algorithm and 6.84 μm (0.81% [95% CI, 0.55–0.92]) for the Heidelberg Spectralis software—slightly lower for the Iowa Reference Algorithm, but not significantly different as demonstrated by the overlapping 95% confidence intervals (
Table). When the central 3 mm was analyzed, the Iowa Reference Algorithm showed a significantly lower coefficient of repeatability and variation of 2.46 μm (0.31% [95% CI, 0.23–0.38]) compared with 4.23 μm (0.53% [95% CI, 0.39–0.65]) for the Heidelberg Spectralis software (
Table). Bland–Altman plots were calculated (
Fig. 3).
When the central 1 mm was compared, the patients with the highest intersession variability typically had large amounts of macular edema that were centered adjacent to the fovea, which placed the center of the OCT analysis on the edge of the edema. Therefore a small change in locating the center of the OCT analysis introduced differences in retinal thickness measurements. This was less pronounced when the central 3 mm was identified, because it encompassed larger areas of the retina. For the central 3 mm analysis, the higher variability of the Spectralis software was mostly due to two patients. Further analysis of these patients revealed that one had an error in registration within the Spectralis software, but not with the Iowa algorithm. The other patient had vitreomacular traction, which introduced segmentation error in the Spectralis software that was not encountered with the Iowa Reference Algorithm.
There was a trend for larger degrees of macular edema to have an increase in the variability between serial scans, although this was not found to be statistically significant. In patients with CMT greater than 400 μm, the coefficient of repeatability and variation was 9 μm (0.87%) compared with a coefficient of repeatability and variation of 3.56 μm (0.55%) for patients with CMT less than 400 μm when analyzed by the Heidelberg Spectralis software (
Table). This trend was less pronounced when analyzed by the Iowa Reference Algorithm; here the coefficient of repeatability and variation was 6.7 (0.64%) and 3.25 μm (0.49%) for greater than 400 μm and less than 400 μm, respectively (
Table).
The results of this pilot study show that the Spectralis manufacturer-supplied algorithm and Iowa Reference Algorithm segmentation algorithms may have a differential impact on the reproducibility of DME quantification by SD-OCT. There is no significant difference between the retinal thicknesses measured in these subjects with DME by the two algorithms overall, so they are essentially measuring the same entity. However, in this sample of subjects with DME, the reproducibility of the manufacturer-supplied algorithm for Spectralis and the reproducibility of the Iowa Reference Algorithm on the same OCT volumes were significantly different for the 3 mm but not the 1 mm central subfield (CMT). Both provide good reproducibility for the CMT when compared to prior studies of OCT reproducibility on DME using other commonly employed OCT machines, such as time-domain Stratus and spectral-domain Cirrus.
3,7 The coefficient of variation of the CMT was lower for both the Iowa algorithm (0.62%) and the Spectralis algorithm (0.81%) than the coefficient of variation previously found by Forooghian et al. for both Cirrus OCT (2.42%) and Stratus OCT (2.63%).
7 Wolf-Schnurrbusch et al. showed that Spectralis OCT had a lower coefficient of variation when compared to other OCT devices in normal eyes.
5
The coefficient of variation of the manufacturer-supplied Spectralis software for CMT is 6.84 μm (5.26 μm for the Iowa algorithm). This is the most useful number clinically from our study because it provides the threshold for which a change in CMT is statistically significant in a patient with DME, where changes below this number are likely to be lost in variability between measurements. This number can be used to evaluate disease progression or detect true change in response to therapeutic interventions in both clinical practice and clinical trials. Interestingly, there was a trend toward worse repeatability for patients with increased DME as measured by CMT; the coefficient of repeatability was 9 μm for patients with a CMT greater than 400 μm, compared with a coefficient of repeatability of 3.56 μm for patients with CMT less than 400 μm using the Spectralis software. In measuring the CMT, the largest variability in intersession repeat scans was caused by the center of the OCT analysis falling on the edge of macular edema. In these cases, a small change in where the center of the OCT analysis is performed creates a large difference in the retinal thickness measured. This likely explains the trend for an increase in variability in patients with larger degrees of DME. Therefore, one can take the amount of DME into account when interpreting the repeatability of subsequent scans with use of the Spectralis software.
The Iowa Reference Algorithm may have better reproducibility for analyzing Spectralis OCT volumes than the manufacturer's supplied algorithm. Specifically, the coefficient of repeatability and variation is slightly lower at 5.26 μm (0.62%) for the 1 mm central subfield, and 2.46 μm (0.31%) for the 3 mm central subfield, significantly lower than with the manufacturer's supplied Spectralis software. The most likely explanation for this higher robustness is that the Iowa Reference Algorithm uses all three-dimensional information when identifying and segmenting the retinal layers. Thus we conclude that incorporation of three-dimensional data provides useful information that is most likely lost in the currently available segmentation algorithms, which we assume are all two-dimensional although the manufacturer-supplied algorithms have not been made public. However, the caveat to our analysis is the inclusion of two patients that largely contributed to the increased variability for the central 3 mm analysis in the Spectralis software. In one patient there was an error in registration performed by the Spectralis software. The other patient had vitreomacular traction that introduced segmentation errors within the Spectralis software, but the OCT volume was correctly segmented with the Iowa Reference Algorithm. Because such patients are seen among those with typical DME, they were included in our study. Interestingly, these two patients did not have much of an effect on the central 1 mm analysis, where the patients with the highest variability had the peak of the macular edema located just adjacent to the fovea as described above.
It is remarkable that such excellent repeatability can be achieved given that the axial resolution of SD-OCT is on the order of 4 to 6 μm.
5 The Iowa Reference Algorithm can analyze the OCT volumes from all major SD-OCT devices. Potentially the measured differences that have been found between retinal layer thickness in different SD-OCT devices are related to the manufacturer-specific algorithms used in these devices.
5–7,10,18 Because our results show that the layer segmentation algorithm affects the between-measurement variability, a publicly available published algorithm such as the Iowa Reference Algorithm has the potential to eliminate cross-device variability.
A limitation of this pilot study is the small sample size. It is possible that a larger sample size would be able to detect a significant difference in reliability between the Iowa Reference Algorithm and the Spectralis software when comparing the CMT. A larger sample size may also demonstrate a significant difference in reliability when comparing larger and lesser degrees of macular edema. In addition, because of the small sample size, inclusion of the two aforementioned patients with the error in registration and vitreomacular traction had a large effect on repeatability. Larger numbers would aid in teasing out the reliability for measurements of the CMT and central 3 mm. As mentioned, we have not compared the variability of the manufacturer-supplied and the Iowa Reference Algorithm across multiple SD-OCT devices; we plan to do such a study.
In summary, both the Heidelberg Spectralis software and Iowa Reference Algorithm provide excellent reproducibility between serial scans in patients with clinically significant DME. The publicly available Iowa Reference Algorithm may have lower between-measurement variability than the manufacturer-supplied algorithm for the central 3 mm subfield. Lowering between-measurement variability is crucial for optimal management of patients with DME.
Supported in part by National Institutes of Health Grants R01 EY018853, R01 EY019112, and R01 EB004640.
Disclosure: E.H. Sohn, None; J.J. Chen, None; K. Lee, None; M. Niemeijer, None; M. Sonka, P; M.D. Abràmoff, P