In this study, OAG patients with emmetropia and highly and physiologically myopia (high myopia) were recruited prospectively, as well as age-matched control subjects. Heidelberg Retina Tomograph measurement was carried out in these subjects and the AUCs in diagnosing glaucoma were compared with HRT raw parameters, FSM, RB, MRA, and the Random Forests method. Among the 84 HRT raw parameters, the largest AUC was obtained with different parameters in emmetropic and highly myopic groups and the AUCs in highly myopic eyes became smaller than those in emmetropic eyes. The AUC associated with the Random Forests method in the highly myopic group was smaller than that in the emmetropic group, however, they were significantly larger than those associated with other parameters, in both emmetropic and highly myopic groups. In addition, the AUCs associated with the Random Forests method was significantly larger than any other parameters at the specificity above 80% and 90%, in all of the emmetropic, highly myopic and combined emmetropic and highly myopic groups.
Among 84 HRT raw values, cup-to-disc area ratio (inferior-temporal sector) showed largest AUC in emmetropic eyes. This is in agreement with a previous study
42 and it would be speculated that this is because these areas correspond to VF area, which is preferentially affected in the early stage of glaucoma.
43,44 On the other hand, vertical cup-to-disk ratio (global) showed the largest AUC among HRT raw parameters, in the highly myopic group. Moreover, the two most contributing factors in the Random Forests diagnosing system were sectorial parameters in emmetropic eyes (cup-to-disc area ratio [inferior-temporal] and cup-to-disc area ratio [inferior-nasal]), whereas those in highly myopic eyes were global parameters (maximum contour depression and vertical cup-to-disc ratio). These findings may be compatible with the previous histologic report, in that highly myopic glaucomatous eyes tend to have more diffuse than localized change: elongated shape, shallow and concentric disc cupping, and low frequency of localized RNFL defects.
45
There are four artificial parameters equipped in the HRT. Frederick S. Mikelberg discriminant function is a linear discriminant function, which uses rim volume, cup shape measure, and height variation contour in the formula.
32 Reinhard O.W. Burk discriminant function uses rim area, height variation contour, cup shape measure, and RNFL thickness (HRT II Operating Instructions). Moorfields regression analysis is also a linear discriminant function, which accounts for the relationship between optic disc size and rim area or cup-to-disc area ratio,
10 and GPS is the result of analysis of automatic stereometric data of the ONH shape using machine learning classifier (relevance vector machine).
9 The sensitivity and specificity of these methods varies according to the study populations. In the original papers, the FSM discriminant function exhibited 0.85 sensitivity and 0.84 specificity
32 and MRA yielded 0.75 sensitivity and 0.98 specificity values.
10 The AUCs associated with FSM, RB, and MRA for discriminating glaucomatous and healthy eyes have been reported as 0.76 to 0.86 (FSM), 0.75 to 0.84 (RB), 0.70 to 0.93 (MRA), and 0.88 (GPS).
46–48 The AUCs of these parameters in the emmetropic population of the current study was larger than in these previous studies, despite relatively less damaged in the current study (MD of approximately between −5 and −6 dB in the previous studies). This result could be attributed to the more strict inclusion criteria for the refraction error; it was within ±1 D in the current study, whereas ±5 or 6 D
46,47,49 or mean ± SD equaling −0.01 ± 1.89 D
48 in the previous studies.
In the current study, AUC associated with GPS was similar to those with other three artificial parameters and it was significantly smaller than that with the Random Forests method. This is in agreement with previous reports. The usefulness of GPS is merely similar to,
49–53 or even worse than MRA.
54 In particular, previous reports have reported that GPS was not as useful as MRA and FSM in detecting early glaucoma in Japanese eyes.
55,56 Moreover, it is often clinically experienced that the GPS cannot be calculated, which limits the clinical usefulness of the GPS score. Indeed, in the current study sectorial GPS score was not calculated in six (5.4%) emmetropic eyes and three (3.1%) highly myopic eyes, while this phenomenon does not occur with the Random Forests method. There is an ethnical difference in optic disc appearance
57–64 and the machine learning classifier within the GPS analysis was neither trained in Japanese eyes and in highly myopic population; hence, this may be one of the reasons for the relatively lower performance of GPS than the Random Forests method.
Not surprisingly, the AUC values in the myopic population with the HRT parameters became smaller than those in the emmetropic eyes. Most importantly, the AUC was significantly improved by applying the Random Forests method in both emmetropic and highly myopic groups, as shown in the
Figure 2, and the AUC in highly myopic group reached a value of 0.93, which is approximately equivalent to those with HRT raw parameters, FSB, RB, and MRA in emmetropic eyes. It should be noted that the reference plane used in highly myopic eyes would not be identical to that in emmetropic eyes, which would have some impact on the measured HRT parameters in highly myopic eyes (hence, it would not be appropriate to compare the measured HRT parameters directly between emmetropic and highly myopic groups). Although this problem was thought to be at least partly resolved by providing the myopic refraction-matched control group, this could give another explanation to the poor diagnostic performance in the myopic group.
A possible caveat of the current study is the usage of HRT II, instead of III. There was a considerable change between the two versions; for instance, MRA was upgraded in the newest version of software (version 3.0), due to the increase of the number of the subjects included in the normative database.
65 Nonetheless the influence of this change on the diagnostic ability has been reported to be negligible,
49,66 and hence this will not largely affect the results in the current study.
There are other machine learning methods, such as support vector machines, boosting, and bagging classifiers that could also be used to diagnose glaucoma. Previous reports suggest that the Random Forests method outperforms most other methods
67–69 ; hence, the Random Forests algorithm was used in the current study.
In the current study, the analysis associated with the Random Forests method was carried out using the leave-one out cross validation in which the original dataset was divided into testing (one patient) and learning (all of other patients) datasets and the Random Forests diagnosis system was developed using the learning dataset. This process was repeated for the number of the subjects in the original dataset so that all of the patients were used as test data once. This is same to the situation of predicting the diagnosis of a new patient (testing dataset) using the Random Forests diagnosis system trained using previous (learning dataset) at a clinical setting. Thus, it was suggested that clinicians can gain the privilege of accurate prediction of the Random Forests diagnosis, when a clinical support tool in which the Random Forests diagnosis system is implemented becomes available at the clinical settings. Thus, given a new glaucoma patient's HRT data set, clinicians should be able to diagnose this patient with known sensitivity and specificity (
Figs. 1,
2a,
3) applying the Random Forests diagnosis system currently reported to everyday practice, assuming that a personal computer or other clinical support tools in which the Random Forests diagnosis system is implemented are available. All of the analyses in the current study were carried out using existing statistical software and packages, specifically the language R, which is an open source statistical program.
In conclusion, glaucomatous changes in ONH morphology were investigated based on the HRT parameters of the emmetropic and highly myopic eyes. With any of the parameters, the diagnostic ability was lower in myopic population than in emmetropic population, however, the diagnostic accuracy was improved by interpreting multiple HRT parameters with the Random Forests method, in highly myopic eyes in particular. In Asian countries, the prevalence of high myopia is high,
24,70,71 and high myopia is a definitive risk factor for having glaucoma.
15,72 The current method of using the Random Forests decision tree classifier may be useful in facilitating screening out glaucomatous eyes in these regions.