Abstract
purpose. To determine whether combining structural (optical coherence tomography, OCT) and functional (standard automated perimetry, SAP) measurements as input for machine learning classifiers (MLCs; relevance vector machine, RVM; and subspace mixture of Gaussians, SSMoG) improves diagnostic accuracy for detecting glaucomatous eyes compared with using each measurement method alone.
methods. Sixty-nine eyes of 69 healthy control subjects (average age, 62.0, SD 9.7 years; visual field mean deviation [MD], −0.70, SD 1.41 dB) and 156 eyes of 156 patients with glaucoma (average age, 66.4, SD 10.2 years; visual field MD, −3.12, SD 3.43 dB) were imaged with OCT (Stratus OCT, Carl Zeiss Meditec, Inc., Dublin, CA) and tested with SAP (Humphrey Field Analyzer II with Swedish Interactive Thresholding Algorithm, SITA; Carl Zeiss Meditec, Inc.) within 3 months of each other. RVM and SSMoG MLCs were trained and tested on OCT-determined RNFL thickness measurements from 32 sectors (∼11.25° each) obtained in the circumpapillary area under the instrument-defined measurement ellipse and SAP pattern deviation values from 52 points from the 24-2 grid, independently and in combination. Tenfold cross-validation was used to train and test classifiers on unique subsets of the full 225-eye data set, and areas under the receiver operating characteristic curve (AUROC) for the classification of eyes in the test set were generated. AUROC results from classifiers trained on OCT and SAP alone and those trained on OCT and SAP in combination were compared. In addition, these results were compared to currently available OCT measurements (mean retinal nerve fiber layer [RNFL] thickness, inferior RNFL thickness, and superior RNFL thickness) and SAP indices (MD and pattern standard deviation [PSD]).
results. The AUROCs for RVM trained on OCT parameters alone, SAP parameters alone and OCT and SAP parameters combined were 0.809, 0.815, and 0.845, respectively. The AUROCs for SSMoG trained on OCT parameters alone, SAP parameters alone, and OCT and SAP parameters combined were 0.817, 0.841, and 0.869, respectively. Combining techniques using both RVM and SSMoG significantly improved on MLC analysis of OCT, but not SAP, measurements alone. Classification performance using RVM and SSMoG was statistically similar.
conclusions. RVM and SSMoG Bayesian MLCs trained on OCT and SAP data can successfully discriminate between healthy and early glaucomatous eyes. Combining OCT and SAP measurements using RVM and SSMoG increased diagnostic performance marginally compared with MLC analysis of data obtained using each technology alone.
Since 1990 (Goldbaum MH, et al.
IOVS 1990;31:ARVO Abstract 503), machine learning classifier (MLC) techniques have been applied to optical imaging and visual function measurements to improve glaucoma detection, with results suggesting that these techniques are as good as or better than currently available analysis techniques at classifying eyes as glaucomatous or healthy.
1 2 3 4 Bowd et al.
5 recently showed, using a relatively new supervised (i.e., trained on labeled data, in this case, glaucoma versus healthy) Bayesian MLC called the relevance vector machine (RVM),
6 7 that RVM analysis of scanning laser polarimetry (SLP) data is able to classify eyes more successfully than are standard retinal nerve fiber layer (RNFL) thickness measurements such as superior and average RNFL thicknesses. In addition, RVM analysis performed as well as support vector machine (SVM)
8 9 analyses at this task. Unlike SVM, RVM incorporates probabilistic output (probability of class membership, e.g., probability of glaucoma defined as membership in the glaucoma training set) through Bayesian inference. Its decision function depends on fewer input variables than SVM, possibly allowing better classification estimates for small data sets with high dimensionality (i.e., a large number of input variables).
7 Probabilistic output allows an intuitive interpretation of classifier output and therefore is likely to be more informative for clinical use than is output converted to a nonlinear scale by using postprocessing, similar to SVM.
Findings in some recent studies have suggested that RNFL measurements obtained by optical coherence tomography (OCT) are more sensitive to glaucomatous damage than are SLP measurements.
10 In addition, existing studies have shown that structural and functional techniques for detecting glaucoma often identify different glaucoma patients when glaucoma severity is not too advanced,
10 11 and that combining structural and functional techniques can improve glaucoma detection.
12 13 14 Based on these results, we chose to assess the ability of RVM trained and tested on OCT RNFL thickness measurements to classify glaucomatous and healthy eyes. In addition, we combined OCT RNFL thickness information with visual sensitivity measurements measured using standard automated perimetry (SAP) to determine whether combining structural and functional information using MLCs improves glaucoma detection. The purpose of this study was first to determine whether combining reasonably easy-to-obtain measurements from glaucomatous eyes, by using complex MLC techniques, would improve diagnostic performance. If performance was improved, these results might prompt the automated combination and analysis of information from structural and functional diagnostic techniques to improve glaucoma detection in the clinic. Second, successful classification of eyes using RVM trained on OCT data would provide more support for MLC use in glaucoma diagnosis with optical imaging. Finally, we introduced a new Bayesian MLC, the subspace mixture of Gaussians (SSMoG), and determined its performance on the same data set.
This observational cross-sectional study included one randomly selected eye from each of 225 study participants (156 patients with glaucoma and 69 healthy control subjects) older than 40 years and enrolled in the University of California, San Diego, Diagnostic Innovations in Glaucoma Study (DIGS). Each study participant underwent a comprehensive ophthalmic evaluation, including review of medical history, best corrected visual acuity, slit lamp biomicroscopy, intraocular pressure measurement with Goldmann applanation tonometry, gonioscopy, dilated slit lamp fundus examination with a 78-D lens, simultaneous stereoscopic optic disc photography (TRC-SS; Topcon Instruments Corp. of America, Paramus, NJ), and SAP using the 24-2 Swedish Interactive Threshold Algorithm (SITA; Humphrey Field Analyzer II, Carl Zeiss Meditec, Inc., Dublin, CA). To be included in the study, participants had to have a best-corrected acuity better than or equal to 20/40, spherical refraction within ±5.0 D, cylinder correction within ±3.0 D, and open angles on gonioscopy. Eyes with coexisting retinal disease, uveitis, or nonglaucomatous optic neuropathy were excluded.
For labeling eyes during classifier training, glaucomatous eyes were defined as either eyes with glaucomatous appearance of the optic disc (i.e., glaucomatous optic neuropathy [GON]) based on masked assessment (described later) of simultaneous stereoscopic optic disc photographs and defined as the presence of neuroretinal rim thinning and/or diffuse or localized RNFL defects indicative of glaucoma or eyes with repeatable (two consecutive) SAP results outside normal limits by pattern standard deviation (PSD; P < 5%) or Glaucoma Hemifield Test. For GON criterion, photographs were evaluated by two trained observers who were masked to the date of the photographs and the identity and diagnosis of the study participants. Discrepancies between observers were resolved by adjudication by a third trained observer.
GON eyes with normal visual fields were included in the glaucoma group to assure the representation of early glaucoma in the sample. In addition, ROC curves for discriminating between healthy and glaucomatous eyes defined by field defects when using OCT often are so high that they do not allow improvement (in this case, achievable by the addition of visual field information).
For glaucomatous eyes classified based on visual fields, the first abnormal SAP was on or before the imaging date.
Average SAP mean deviation (MD) of the glaucomatous eyes was −3.12 dB (SD 3.43; range, −20.14 to +1.15). Seventy-five percent of the glaucomatous eyes had MD ≥ −4.08 dB, and 90% had MD ≥ −6.57, indicating primarily early glaucoma. The mean age of the patients with glaucoma at the time of VF testing was 66.4 years (SD 10.2; range, 41.3–87.0 years). Intraocular pressure was not part of the inclusion criteria for the glaucoma group.
Healthy eyes were defined as of healthy volunteers with normal-appearing optic discs by examination and no history of intraocular pressure >22 mm Hg. Average SAP MD of the healthy eyes was −0.70 dB (SD 1.41; range, −5.34 to +1.86) and was significantly different from that of glaucomatous eyes (t-test; P < 0.0001). The mean age of the healthy participants was 62.0 years (SD 9.7; range, 42.3–83.6) and was significantly younger than that of the patients with glaucoma (t-test, P = 0.005); however, the age range was similar between groups.
Healthy eyes were included based on examination results and IOP, not on optic disc appearance by detailed, consensus stereoscopic photograph evaluation or visual field results, because we did not want to bias results in favor of either optical imaging (i.e., OCT measurements) or visual function (SAP) measurements (although it is arguable that requiring a healthy examination could increase somewhat the specificity of OCT measurements). In addition, this sample is more representative of a screened population than is a population requiring healthy-appearing optic discs by very detailed photograph assessment and completely normal visual field test results.
This research adhered to the tenets of the Declaration of Helsinki and Health Insurance Portability and Accountability Act guidelines. The University of California, San Diego, Human Research Protection Program approved all methodology.
The RVM learning classifier is a Bayesian model that provides probabilistic predictions (e.g., probability of glaucoma based on the training examples) through Bayesian inference.
7 Its decision function depends on fewer sample vectors (i.e., is more sparse) than previously reported SVM, because SVM minimizes the training error under the constraint of maximum smoothness, requiring more decision points.
6 The benefit of a sparser classifier is that its results are more generalizable (i.e., it decreases overfitting). In classification, RVM outputs probabilities of class membership in contrast to SVM, which expresses a nonprobabilistic value that is then related to the decision threshold. RVMs point probabilistic output provides a conditional distribution that allows a more intuitive expression of uncertainty in the prediction.
17
The RVM was implemented using a commercial software algorithm (SparseBayes ver. 1.0 algorithm; Microsoft Research, Cambridge, UK, for MatLab; The MathWorks, Natick, MA). The kernel width was optimized by a grid search with width = √(2 · α · number of input variables), where α is 0.5, 0.8, 1.0, 1.2… 3.0. The α that offered the largest AUROC was chosen.
Tenfold cross-validation was used to train and test RVM and SSMoG classifiers, to avoid training and testing on the same data. First, glaucomatous and healthy eyes were randomly divided into 10 approximately equal, exhaustive, and mutually exclusive subsets. Next, classifiers trained on 9 subsets combined were subsequently tested on the 10th subset. This process was repeated 10 times, with each subset serving as the test set one time, so that each tested eye was never part of its training set and was tested only once. The test results from all eyes were then used to plot the bias-corrected ROC curve. Sensitivities at 75% and 90% specificities, arbitrarily chosen to represent moderate and high specificity, respectively, also were reported, although these values can be estimated from visual inspection of ROC curves.
Relevance Vector Machine.
Subspace Mixture of Gaussians.
In the present study, combining OCT and SAP measurements using MLCs resulted in a trend of increased AUROCs for discriminating between healthy and glaucomatous eyes, compared to using each measurement technique alone. Optimization of the classifiers permitted significant improvements of RVM for OCT+SAP thresholds, and of SSMoG for OCT+SAP TD, over the best MLC results of OCT data alone. Accuracy of MLC analyses of OCT and SAP parameters improved on standard (currently available) parameters for each technique. These results agree with those of several studies of OCT,
23 SAP,
2 CSLO,
1 and SLP
5 and indicate that RVM and the relatively new SSMoG analyses show promise for diagnostic classification of complex imaging and visual field data. In addition, the less computationally demanding SSMoG analysis performed similarly to RVM for classification. Both techniques provide intuitive probability of group membership as output, which is more desirable than results generated using SVMs and multilayer perceptrons, for instance.
Outputs from RVM and SSMoG indicated that most of the glaucomatous eyes were assigned a high probability of belonging to the glaucoma training group (the distribution was negatively skewed;
Figs. 3 4 ). The distributions of probabilities for healthy eyes, on the other hand, were less skewed, probably a result of the often-reported wide range of structural and functional measurements in healthy eyes and thus not surprising. We also demonstrated the fact that
P > 0.50 is not optimal for classifying eyes as glaucomatous when high specificity and informative LRs are desired.
Based on the results of other studies, we were somewhat surprised that our results indicated only modest improvements when structural and functional measures were combined. First, the Ocular Hypertension Treatment Study
24 and the European Glaucoma Prevention Study
25 recently suggested that a similar percentage of suspect eyes develop glaucoma when first conversion is defined based on optic disc abnormality or on visual field abnormality, suggesting to us that OCT and SAP would detect abnormalities in a significant percentage of different eyes with early-stage glaucoma. Next, other studies have suggested that combining imaging and visual field data results in improved diagnostic accuracy for glaucoma detection.
12 13 14 Our results indicate that there is some dependency between the structural and functional measurements. Nevertheless, the significant improvement in performance of optimized MLCs trained on combined data compared to single tests alone demonstrates the benefit of combining the tests.
An early study by Caprioli
12 combined optical imaging data (obtained by a Rodenstock Instruments [Munich, Germany] computerized image analysis system) and SAP data (obtained by Humphrey [Carl Zeiss Meditec, Inc.] or Octopus [Haag Streit, Köniz, Switzerland] perimetry) from 54 healthy and 185 glaucomatous eyes in a linear discriminant analysis model. Classification performance of the combined model (sensitivity = 0.90, specificity = 0.76, and accuracy = 0.87) was better than models incorporating only structural indices (sensitivity = 0.88, specificity = 0.35, and accuracy = 0.76) and only functional indices (sensitivity = 0.99, specificity = 0.06, and accuracy = 0.77). Using the same data set, Brigatti et al.
26 combined structural and functional measurements using multilayer perceptron neural networks and demonstrated similar results. Performance of the combined model (sensitivity = 0.90, specificity = 0.84, and accuracy = 0.88) was better than models incorporating only structural indices (sensitivity = 0.87, specificity = 0.56, and accuracy = 0.80) and only functional indices (sensitivity = 0.84, specificity = 0.86, and accuracy = 0.84).
More recently, a study from our group indicated that combining imaging (current generation OCT, CSLO, and SLP) and visual field measurements (frequency doubling technology perimetry, FDT) increased sensitivity for detecting eyes with glaucoma compared with using either technique in isolation, with no significant decrease in specificity. In this study of 101 eyes, adding FDT pattern SD measurements increased sensitivity as much 21% compared with using imaging alone.
In a study using MLCs to combine CSLO and perimetry results, Mardin et al.
13 combined CSLO-measured rim volume, rim area, and cup shape measurements (global and sectoral) with Octopus perimeter–measured MD; corrected loss variance (global and sectoral) by using several classification techniques (MLC and statistical); and compared the diagnostic accuracy of these techniques to the same techniques using CSLO and perimetry results alone in 88 healthy and 88 glaucomatous (average Octopus MD = 7.13 dB) eyes. The largest AUROC reported was 0.976 for combined CSLO and perimetry measurements with a “double-bagging” technique (optimal parameters are selected by using linear discriminant analysis and included in a bootstrap aggregate, or “bagging,” regression tree). This result was an improvement over double-bagging of CSLO measurements alone (AUROC = 0.855), but not over double-bagging of perimetry measurements alone (AUROC = 0.98). Results using other classifiers were similar. These results were probably due in part to inclusion criteria that required normal visual fields in healthy eyes and abnormal visual fields in glaucomatous eyes.
Finally, in a very recent study from our laboratory,
27 combining CSLO (Heidelberg Retina Tomograph, HRT II) and short-wavelength automated perimetry (SWAP) by using RVM resulted in improved accuracy compared with RVMs using each test alone for classifying healthy eyes (
n = 68) and those with glaucoma (
n = 144, average SAP MD = −5.1 dB) defined based on the same criteria used in the present study. In this study, AUROCs for the best-performing optimized RVM analyses of CSLO parameters, SWAP parameters and combined CSLO and SWAP parameters were 0.878, 0.780, and 0.925, respectively. These values tended to be higher than those reported in the present study.
Several other studies have investigated MLC analysis of OCT data. The first such study
28 demonstrated that multilayer perceptron classifiers trained on 25 available OCT RNFL and optic disc parameters were able to discriminate between healthy eyes (
n = 100) and those with early glaucoma (
n = 89, average SAP MD = −2.7 dB) with an AUROC of 0.821 (sensitivity = 0.51 at set 0.90 specificity). This value was increased to 0.874 (sensitivity = 0.67 at set 0.90 specificity) when principal components analysis was used to reduce the input data to three parameters (based on AUROC [
x] by number of features [
y] plot peaking). In another study of 71 healthy and 64 glaucomatous eyes (average SAP MD = −4.3) by the same group,
29 self-organizing maps and decision trees were used to establish rules for classifying eyes as healthy or glaucomatous based on 25 available OCT RNFL and optic disc measurements. A self-organizing map identified the two (of five) most informative parameter clusters (i.e., features; defined based on the largest mapped distance between healthy and glaucomatous eyes), and the three top-ranked parameters from each cluster were included in a decision tree that, after “pruning,” resulted in a three-branch tree that yielded 0.73 sensitivity, 0.92 specificity, and 0.83 accuracy. In a third study,
23 the ability of several statistical and MLCs were investigated for discriminating between 42 healthy and 47 glaucomatous eyes (glaucomatous eyes were approximately one-half early glaucoma with SAP MD ≥ −6 dB, average MD = −2.18; and approximately one-half advanced glaucoma with MD < −6 dB, average MD = −10.9 dB). An SVM trained on the eight (of 38) OCT parameters with highest correlation with visual fields most accurately classified eyes with AUROC of 0.981 and sensitivity of 0.92 at a fixed specificity of 0.95. Although the results of these studies are not directly comparable to those of our study because of differences in classifiers used, patient demographics, and glaucoma severity, they further stress the usefulness of MLCs for glaucoma diagnosis.
In the present study, we used backward elimination to decrease (i.e., optimize) the dimensionality of RVM data sets. This technique identifies individual parameters that most strongly contribute to the derived classification model. In previous similar studies we reported the specific parameters included in reduced-feature optimized data sets as a means to describe important parameters and regions of interest in the peripapillary retina and visual fields. This information has been omitted from the present study because selecting these regions can be somewhat misleading because of strong correlations (i.e., dependence) among adjacent peripapillary and visual field locations that can result in one important location being excluded by the inclusion of another related, equally important location. In addition, it is possible that in more advanced glaucoma, or when using a larger training set, different sectors would be identified as most important for the classification task.
AUROC results from our MLC techniques may be somewhat overestimated because we used cross-validation instead of truly independent training and test sets. Although RVM and SSMoG were trained and tested on different data, each data set was generated from the same rather homogeneous pool. This fact may exaggerate somewhat the differences in classification ability between MLCs and standard OCT and SAP parameters, although we were careful to employ separate training and test sets at all steps in training, testing, and optimizing (when applicable) our MLCs. In addition, establishing classifier performance, per se, was not the main purpose of this study; rather, we sought to determine whether combining structural and functional measurements in a novel way could improve classification accuracy and therefore relative accuracy was most important.
It is also likely that our reference standard criteria affected AUROC results. It is clear based on the range of SAP MD present in the healthy eyes, that some eyes with glaucomatous visual field damage were included in this group, and it is likely that some healthy eyes with suspicious optic discs were included in the glaucoma group. These situations combined very likely resulted in smaller overall AUROCs. However, we chose these criteria to decrease, as much as is practical, the bias in favor of either structural or functional tests and, again, establishing classifier performance, per se, was not the main purpose of the study. In addition, it is possible that other MLCs could more accurately separate our data. Our decision to use these particular classifiers was based on experience gained from our prior studies. We chose to use RVM because it has performed well with imaging data in the past and we chose to use SSMoG because MoG has performed well with visual field data in the past.
Overall, the current results suggest that RVM and SSMoG classifiers trained on OCT and/or SAP measurements can successfully discriminate between healthy and glaucomatous eyes. In addition, techniques designed to optimize the MLCs can improve their performance. Classification by RVM and SSMoG trained on OCT and/or SAP measurements is better than classification based on individual OCT parameters and SAP global indices. Combining OCT and SAP measurements using these MLC techniques improved performance marginally compared to the same classifiers trained on OCT and SAP data alone, although AUROCs for combined analysis displayed a consistent increasing trend.
Supported by National Eye Institute Grants EY011008, EY008208, and EY013928, and participant incentive grants in the form of glaucoma medication at no cost from Alcon Laboratories Inc., Allergan, Pfizer Inc., and SANTEN Inc.
Submitted for publication August 17, 2007; revised October 8, 2007; accepted January 16, 2008.
Disclosure:
C. Bowd, Lace Elettronica (F);
J. Hao, None;
I . M. Tavares, None;
F.A. Medeiros, Carl Zeiss Meditec, Inc. (F, R), Heidelberg Engineering (R), Reichart Instruments (R);
L.M. Zangwill, Carl Zeiss Meditec, Inc. (F), Heidelberg Engineering (F), OptoVue (F);
T . -W. Lee, None;
P.A. Sample, Carl Zeiss Meditec, Inc. (F), Haag-Streit (F), Welch-Allyn (F);
R.N. Weinreb, Carl Zeiss Meditec, Inc. (F, C), Heidelberg Engineering (F);
M.H. Goldbaum, None
The publication costs of this article were defrayed in part by page charge payment. This article must therefore be marked “
advertisement” in accordance with 18 U.S.C. §1734 solely to indicate this fact.
Corresponding author: Christopher Bowd, Hamilton Glaucoma Center—178, Department of Ophthalmology, University of California, San Diego, La Jolla, CA 92037-0946;
[email protected].
Table 1. Comparison of OCT and SAP Indices and MLC (RVM, SSMoG) Output between Healthy and Glaucomatous Eyes
Table 1. Comparison of OCT and SAP Indices and MLC (RVM, SSMoG) Output between Healthy and Glaucomatous Eyes
Technique | Healthy Eyes, SD (n = 69) | Glaucomatous Eyes, SD (n = 156) | P |
OCT mean RNFL thickness (μm) | 98.1, 9.8 | 83.1, 15.6 | <0.0001 |
OCT inferior RNFL thickness (μm) | 130.7, 25.9 | 102.7, 29.3 | <0.0001 |
OCT superior RNFL thickness (μm) | 126.8, 22.0 | 108.9, 28.4 | <0.0001 |
SAP MD (dB) | −0.70, 1.41 | −3.12, 3.43 | <0.0001 |
SAP PSD (dB) | 1.73, 0.75 | 3.65, 3.19 | <0.0001 |
OCT RVM full (32 features) | 0.532, 0.194 | 0.770, 0.212 | <0.0001 |
SAP Thr RVM full (53 features) | 0.541, 0.204 | 0.777, 0.240 | <0.0001 |
SAP TD RVM full (52 features) | 0.513, 0.226 | 0.770, 0.241 | <0.0001 |
SAP PD RVM full (52 features) | 0.530, 0.251 | 0.750, 0.237 | <0.0001 |
OCT RVM optimized (16 features) | 0.513, 0.190 | 0.768, 0.219 | <0.0001 |
SAP Thr RVM optimized (20 features) | 0.498, 0.226 | 0.779, 0.237 | <0.0001 |
SAP TD RVM optimized (12 features) | 0.481, 0.233 | 0.783, 0.243 | <0.0001 |
SAP PD RVM optimized (10 features) | 0.484, 0.287 | 0.796, 0.244 | <0.0001 |
OCT SSMoG | 0.509, 0.163 | 0.744, 0.232 | <0.0001 |
SAP Thr SSMoG | 0.381, 0.182 | 0.654, 0.242 | <0.0001 |
SAP TD SSMoG | 0.374, 0.179 | 0.642, 0.247 | <0.0001 |
SAP PD SSMoG | 0.273, 0.193 | 0.523, 0.281 | <0.0001 |
OCT SSMoG optimized | 0.475, 0.195 | 0.758, 0.238 | <0.0001 |
SAP Thr SSMoG optimized | 0.397, 0.196 | 0.714, 0.244 | <0.0001 |
SAP TD SSMoG optimized | 0.332, 0.187 | 0.660, 0.262 | <0.0001 |
SAP PD SSMoG optimized | 0.235, 0.202 | 0.536, 0.304 | <0.0001 |
OCT+Thr RVM full (85 features) | 0.483, 0.218 | 0.788, 0.225 | <0.0001 |
OCT+TD RVM full (84 features) | 0.490, 0.225 | 0.792, 0.240 | <0.0001 |
OCT+PD RVM full (84 features) | 0.523, 0.216 | 0.773, 0.258 | <0.0001 |
OCT+Thr RVM optimized (24 features) | 0.461, 0.227 | 0.798, 0.234 | <0.0001 |
OCT+TD RVM optimized (29 features) | 0.465, 0.230 | 0.787, 0.241 | <0.0001 |
OCT+PD RVM optimized (24 features) | 0.479, 0.227 | 0.786, 0.227 | <0.0001 |
OCT+Thr SSMoG | 0.349, 0.147 | 0.692, 0.236 | <0.0001 |
OCT+TD SSMoG | 0.411, 0.149 | 0.670, 0.230 | <0.0001 |
OCT+PD SSMoG | 0.298, 0.139 | 0.583, 0.254 | <0.0001 |
OCT+Thr SSMoG optimized | 0.390, 0.162 | 0.738, 0.242 | <0.0001 |
OCT+TD SSMoG optimized | 0.415, 0.157 | 0.747, 0.224 | <0.0001 |
OCT+PD SSMoG optimized | 0.310, 0.147 | 0.647, 0.268 | <0.0001 |
Table 2. AUROCs and Sensitivities at Fixed Specificities for Classifying Eyes as Healthy or Glaucomatous for All Techniques/Parameters
Table 2. AUROCs and Sensitivities at Fixed Specificities for Classifying Eyes as Healthy or Glaucomatous for All Techniques/Parameters
Technique | AUROC, SE | Sensitivity at 0.75 Specificity | Sensitivity at 0.90 Specificity |
OCT mean RNFL thickness | 0.777, 0.03 | 0.64 | 0.58 |
OCT inferior RNFL thickness | 0.767, 0.03 | 0.70 | 0.45 |
OCT superior RNFL thickness | 0.680, 0.04 | 0.48 | 0.30 |
SAP MD | 0.775, 0.03 | 0.67 | 0.49 |
SAP PSD | 0.772, 0.03 | 0.71 | 0.45 |
OCT RVM full (32 features) | 0.789, 0.03 | 0.72 | 0.56 |
SAP Thr RVM full (53 features) | 0.777, 0.03 | 0.69 | 0.56 |
SAP TD RVM full (52 features) | 0.783, 0.03 | 0.65 | 0.48 |
SAP PD RVM full (52 features) | 0.745, 0.03 | 0.59 | 0.44 |
OCT RVM optimized* (16 features) | 0.806, 0.03 | 0.74 | 0.61 |
SAP Thr RVM optimized* (20 features) | 0.809, 0.03 | 0.77 | 0.55 |
SAP TD RVM optimized* (12 features) | 0.815, 0.03 | 0.74 | 0.50 |
SAP PD RVM optimized* (10 features) | 0.802, 0.03 | 0.77 | 0.45 |
OCT SSMoG | 0.787, 0.03 | 0.72 | 0.53 |
SAP Thr SSMoG | 0.784, 0.03 | 0.70 | 0.49 |
SAP TD SSMoG | 0.790, 0.03 | 0.69 | 0.49 |
SAP PD SSMoG | 0.763, 0.03 | 0.70 | 0.46 |
OCT SSMoG optimized, † | 0.817, 0.03 | 0.67 | 0.51 |
SAP Thr SSMoG optimized, † | 0.827, 0.03 | 0.74 | 0.56 |
SAP TD SSMoG optimized, † | 0.841, 0.03 | 0.77 | 0.53 |
SAP PD SSMoG optimized, † | 0.801, 0.03 | 0.70 | 0.50 |
OCT+Thr RVM full (85 features) | 0.834, 0.03 | 0.74 | 0.65 |
OCT+TD RVM full (84 features) | 0.826, 0.03 | 0.73 | 0.60 |
OCT+PD RVM full (84 features) | 0.789, 0.03 | 0.71 | 0.53 |
OCT+Thr RVM optimized* (24 features) | 0.845, 0.03 | 0.78 | 0.62 |
OCT+TD RVM optimized* (29 features) | 0.830, 0.03 | 0.78 | 0.63 |
OCT+PD RVM optimized* (24 features) | 0.820, 0.03 | 0.76 | 0.56 |
OCT+Thr SSMoG | 0.842, 0.03 | 0.76 | 0.67 |
OCT+TD SSMoG | 0.838, 0.03 | 0.76 | 0.67 |
OCT+PD SSMoG | 0.829, 0.03 | 0.71 | 0.56 |
OCT+Thr SSMoG optimized, † | 0.865, 0.02 | 0.80 | 0.54 |
OCT+TD SSMoG optimized, † | 0.869, 0.02 | 0.80 | 0.68 |
OCT+PD SSMoG optimized, † | 0.848, 0.02 | 0.74 | 0.65 |
Table 3. Sensitivities and Positive Likelihood Ratios
Table 3. Sensitivities and Positive Likelihood Ratios
Specificity | RVM | | | SSMoG | | |
| Sensitivity | Cutoff | Positive Likelihood Ratio | Sensitivity | Cutoff | Positive Likelihood Ratio |
0.75 | 0.78 | 0.631 | 3.10 (2.04, 4.71) | 0.80 | 0.517 | 3.21 (2.11, 4.86) |
0.85 | 0.70 | 0.748 | 4.66 (2.63, 8.24) | 0.75 | 0.578 | 5.0 (2.83, 8.83) |
0.90 | 0.63 | 0.801 | 6.28 (3.06, 13.00) | 0.68 | 0.642 | 6.80 (3.32, 14.00) |
0.96 | 0.56 | 0.859 | 14.00 (4.41, 45.00) | 0.65 | 0.677 | 16.00 (5.12, 52.00) |
BowdC, ChanK, ZangwillLM, et al. Comparing neural networks and linear discriminant functions for glaucoma detection using confocal scanning laser ophthalmoscopy of the optic disc. Invest Ophthalmol Vis Sci. 2002;43:3444–3454.
[PubMed]GoldbaumMH, SamplePA, ChanK, et al. Comparing machine learning classifiers for diagnosing glaucoma from standard automated perimetry. Invest Ophthalmol Vis Sci. 2002;43:162–169.
[PubMed]LietmanT, EngJ, KatzJ, QuigleyHA. Neural networks for visual field analysis: how do they compare with other algorithms?. J Glaucoma. 1999;8:77–80.
[PubMed]BiziosD, HeijlA, BengtssonB. Trained artificial neural network for glaucoma diagnosis using visual field data: a comparison with conventional algorithms. J Glaucoma. 2007;16:20–28.
[CrossRef] [PubMed]BowdC, MedeirosFA, ZhangZ, et al. Relevance vector machine and support vector machine classifier analysis of scanning laser polarimetry retinal nerve fiber layer measurements. Invest Ophthalmol Vis Sci. 2005;46:1322–1329.
[CrossRef] [PubMed]BishopCM, TippingME. Variational relevance vector machines.BoutilierC GoldszmidtM eds. Uncertainty in Artificial Intelligence. 2000;45–63.Microsoft Research Cambridge, UK.
TippingME. Sparse Bayesian learning and the relevance vector machine. J Mach Learn Res. 2001;1:211–244.
VapnikV. Statistical Learning Theory. 1998;Wiley New York.
VapnikV. The Nature of Statistical Learning Theory. 2000; 2nd ed.Springer New York.
BowdC, ZangwillLM, BerryCC, et al. Detecting early glaucoma by assessment of retinal nerve fiber layer thickness and visual function. Invest Ophthalmol Vis Sci. 2001;42:1993–2003.
[PubMed]ZangwillLM, BowdC, BerryCC, et al. Discriminating between normal and glaucomatous eyes using the Heidelberg Retina Tomograph, GDx Nerve Fiber Analyzer, and Optical Coherence Tomograph. Arch Ophthalmol. 2001;119:985–993.
[CrossRef] [PubMed]CaprioliJ. Discrimination between normal and glaucomatous eyes. Invest Ophthalmol Vis Sci. 1992;33:153–159.
[PubMed]MardinCY, PetersA, HornF, JunemannAG, LausenB. Improving glaucoma diagnosis by the combination of perimetry and HRT measurements. J Glaucoma. 2006;15:299–305.
[CrossRef] [PubMed]ShahNN, BowdC, MedeirosFA, et al. Combining structural and functional testing for detection of glaucoma. Ophthalmology. 2006;113:1593–1602.
[CrossRef] [PubMed]ZangwillLM, MedeirosFA, BowdC, WeinrebRN. Optic nerve imaging: recent advances.GrehnF StamperR eds. Glaucoma. 2004;63–91.Springer-Verlag Berlin.
MedeirosFA, ZangwillLM, BowdC, BerndAS, WeinrebRN. Fourier analysis of scanning laser polarimetry measurements with variable corneal compensation in glaucoma. Invest Ophthalmol Vis Sci. 2003;44:2606–2612.
[CrossRef] [PubMed]BishopCM, TippingME. Bayesian regression and classification.SuykensJ HorvarthG eds. Advances in Learning Theory: Methods, Models and Applications. 2003;267–285.IOS Press Amsterdam.
TrespV. Committee machines.HuYH HwangJ-N eds. Handbook of Neural Network Signal Processing. 2002;1–21.CRC Press Boca Raton, FL.
BengtssonB, BiziosD, HeijlA. Effects of input data on the performance of a neural network in distinguishing normal and glaucomatous visual fields. Invest Ophthalmol Vis Sci. 2005;46:3730–3736.
[CrossRef] [PubMed]DeLongER, DeLongDM, Clarke-PearsonDL. Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. Biometrics. 1988;44:837–845.
[CrossRef] [PubMed]ZangwillLM, ChanK, BowdC, et al. Heidelberg retina tomograph measurements of the optic disc and parapapillary retina for detecting glaucoma analyzed by machine learning classifiers. Invest Ophthalmol Vis Sci. 2004;45:3144–3151.
[CrossRef] [PubMed]JaeschkeR, GuyattGH, SackettDL. Users’ guides to the medical literature. III. How to use an article about a diagnostic test. B. What are the results and will they help me in caring for my patients? The Evidence-Based Medicine Working Group. JAMA. 1994;271:703–707.
[CrossRef] [PubMed]Burgansky-EliashZ, WollsteinG, ChuT, et al. Optical coherence tomography machine learning classifiers for glaucoma detection: a preliminary study. Invest Ophthalmol Vis Sci. 2005;46:4147–4152.
[CrossRef] [PubMed]KassMA, HeuerDK, HigginbothamEJ, et al. The Ocular Hypertension Treatment Study: a randomized trial determines that topical ocular hypotensive medication delays or prevents the onset of primary open-angle glaucoma. Arch Ophthalmol. 2002;120:701–13.discussion 829–830
[CrossRef] [PubMed]MigliorS, ZeyenT, PfeifferN, Cunha-VazJ, TorriV, AdamsonsI. Results of the European Glaucoma Prevention Study. Ophthalmology. 2005;112:366–375.
[CrossRef] [PubMed]BrigattiL, HoffmanD, CaprioliJ. Neural networks to identify glaucoma with structural and functional measurements. Am J Ophthalmol. 1996;121:511–521.
[CrossRef] [PubMed]BowdC, ChiouC, HaoJ, et al. Relevance vector machine for combining HRT II and SWAP results for discriminating between healthy and glaucoma eyes (abstract). Acta Ophthalmol Scand. 2006;84:569.
HuangML, ChenHY. Development and comparison of automated classifiers for glaucoma diagnosis using stratus optical coherence tomography. Invest Ophthalmol Vis Sci. 2005;46:4121–4129.
[CrossRef] [PubMed]HuangML, ChenHY, LinJC. Rule extraction for glaucoma detection with summary data from StratusOCT. Invest Ophthalmol Vis Sci. 2007;48:244–250.
[CrossRef] [PubMed]