We have reported on a use of deep learning to predict conversion to wet AMD using OCT imaging. The results show clear separation between the progressors and the nonprogressors, and the occlusion sensitivity analysis indicates that relevant features are brought to bear by the technique. We considered, as a comparison, a clinical study where retinal specialists would try, using their clinical experience, to divine which patients would convert to neovascular AMD. We put this question to three retinal specialists who indicated that, given the lack of validated biomarkers for this problem, they would not feel comfortable making this decision, even for a research study. In the following, we add context to the findings, discuss their clinical relevance, present some limitations of the study, and close with some conclusions.
One of the major challenges in the clinical management of patients with early/intermediate AMD is the assessment of risk of conversion, and any metrics supportive of this assessment are welcome. Structural OCT data have been used to create anatomical biomarkers such as thickness and volumetric measures, but despite being researched for several years, compelling indicators of conversion have yet to emerge. Instead, interest has turned to OCTA, where subclinical neovascularization is being observed and studies are being carried out on how to quantify these observations such that they can be deployed clinically. OCTA instrumentation is, however, less widely used, and longitudinal data are less readily available. In addition, OCTA data have greater dependence on variations in signal strength across different systems and are vulnerable to projection artifacts that make it difficult to assess flow as a reliable biomarker, especially in the case of neovascularization underneath the RPE (type 1).
19 With the advent of more advanced feature extractors and classifiers facilitated through deep learning, we have revisited and further mined the OCT data sets for signals that, akin to OCTA, might be supportive of the subclinical assessment of nonexudative neovascularization.
An immediate interpretation of the findings is that the neural network has discovered specific patterns indicative of pathologic change. OCT-based features identified in early CNV have been previously reported.
20–23 The analysis we report on, however, looks at data before any clinically observable signs of conversion, so consideration must be given to more subtle features, including textural changes that are perhaps occurring as a direct result of early physiological changes. Pathology detection using OCT texture analysis has itself been previously researched.
24 Such approaches failed to gain traction, but in the advent of better computational resources and the more sophisticated learning approaches, we envisage a resurgence in such work. The texture descriptors were examples of handcrafted features, a technique that has been superseded by the ability to instead learn the features through deep learning. Similarly, in the work from de Sisternes et al.,
8 Niu et al.,
9 and Schmidt-Erfurth et al.,
11 the features were manually crafted and, through extensive use of regression, applied to temporal data in their final models. By learning the features in a systematic way afforded by deep neural networks, more powerful and better regularized solutions are now possible. Very important to the method, however, is the preprocessing of the input data via a segmentation step that (1) gives us some invariance to instrumentation and (2) allows the network to concentrate on tissue of interest. This is somewhat akin to the recent work by De Fauw et al.
25 in which their classification scheme uses a separate segmentation step, here using a U-Net deep learning architecture,
26 and then classifying the homogenous tissue regions into referral classes using a second deep learning architecture, one that is very similar in composition to that used in this study. In our work, however, we do not disregard the image intensities and distributions as they are critical to our method in differentiating the classes.
Another interesting finding is the difference in the en face occlusion sensitivity maps between progressors and nonprogressors (
Fig. 12). Further investigation is needed, but this difference could potentially be due to the presence of more photoreceptors nasally or large arterioles nasally skewing the choroidal density.
This study is not without some limitations. Although this is a large and balanced data set, more data would help better support our conclusions. To address this, unbiased estimates of performance are reported, including the cross validation approach given in the method section, where care was taken to evenly balance the cohorts in the test and training sets, ensuring same subject data were not used across data sets. As a pilot study, however, the findings are compelling.
In addition, the current deep learning model is applied only on B-scans, and those results are aggregated to make a final prediction. This is different from the traditional machine learning approaches that use features derived from consideration of the entire volume. Future work will need to investigate the application of the deep learning approach directly on the full volumes as, potentially, a more natural way of finding patterns of subclinical CNV.
Another limitation could perhaps also be considered a strength of the method given the positive results and the indication that information in the choroid is of importance to the performance. This is namely the SD-OCT scanner used (Topcon 3d OCT); it has a light source of 840 nm, which offers limited depth penetration given its relatively short wavelength. Longer wavelengths are preferred for resolving detail in the choroid even if these lose some axial resolution. However, through simple review of the B-scans (see
Figs. 1,
2, for example), one can see clear choroidal signal in the OCT data. And conversely, this speaks to the strength of the method as even with this limited penetration, there is clearly information in the choroidal regions of the data that is being used to discriminate progressors from nonprogressors (
Fig. 11). We are currently collecting data to test the method using other devices, including swept-source OCT as well as depth-enhanced imaging, a spectral-domain approach that puts the focal plane (point of greatest signal) lower in the image.
This study is on a population of unilateral neovascular AMD eyes that have a high risk of conversion. Therefore, studying the nonprogressors and progressors in this enriched cohort allowed us to better target the pathologic area. As this is the case, however, it is not known how the models and results would generalize to patients with bilateral early/intermediate AMD, who constitute the majority of the at-risk population. Again, this is an interesting avenue of research that we would also like to look at in more detail.
To conclude, we report that a deep learning CNN with layer segmentation-based preprocessing shows strong predictive power with respect to the progression of early/intermediate AMD to advanced AMD. Such adjunct analysis could be useful in, for example, setting the frequency of patient visits and guiding interventions.