In the present study, we used an ROC regression modeling technique to evaluate the influence of atypical scan patterns and severity of disease on the diagnostic accuracy of GDx VCC and ECC in glaucoma. This modeling approach was initially described by Medeiros et al.
15 for evaluation of the influence of covariates on the performance of diagnostic tests in glaucoma. This methodology allows the evaluation of the influence of covariates on the diagnostic performance of the test, so that ROC curves for specific values of the covariate of interest can be obtained. Also, it allows adjustment for the possible confounding effects of other covariates. Details of the modeling procedure have been described previously.
15 16 In brief, the ROC
X, XD (
q) is the probability that a diseased individual with disease-specific covariates
X D (that is, covariates specific to diseased subjects such as disease severity, for example) and common covariates
X (covariates common to both diseased and healthy subjects) has test results
Y D that are greater than or equal to the
qth quantile of the distribution of tests results from nondiseased individuals. That is, when the specificity of the test is 1 −
q, the sensitivity is ROC
X, XD (
q). The general ROC regression model can be written as:
\[\mathrm{ROC}_{\mathrm{X,XD}}(q){=}{\Phi}({\alpha}_{1}{+}{\alpha}_{2}{\Phi}^{{-}1}(q){+}{\beta}X{+}{\beta}_{\mathrm{D}}X_{\mathrm{D}})\]
where the coefficients α
1 and α
2 are the intercept and slope of the ROC curve, respectively, and Φ is the normal cumulative distribution function. If the coefficient for a specific variable
X (β) is greater than zero, then the discrimination between diseased and nondiseased subjects increases with increasing values of this covariate. Similarly, if the coefficient for the disease-specific covariate
X D (β
D) is greater than zero, then diseased subjects with larger values of this covariate are more distinct from nondiseased subjects than are diseased subjects with smaller values of
X D.
In the present study, an ROC model was fitted to assess the influence of the common covariate TSS on the diagnostic performance of the GDx ECC and VCC parameters. The model was adjusted for the disease-specific covariate severity and the common covariate age. The following ROC regression model was fitted for each parameter evaluated:
\[\mathrm{ROC}_{\mathrm{X,XD}}(q){=}{\Phi}({\alpha}_{1}{+}{\alpha}_{2}{\Phi}^{{-}1}(q){+}{\beta}_{1}\mathrm{ECC}{+}{\beta}_{2}\mathrm{ECC}{\times}{\Phi}^{{-}1}(q){+}{\beta}_{3}\mathrm{TSS}{+}{\beta}_{4}\mathrm{TSS}{\times}\mathrm{ECC}{+}{\beta}_{5}severity{+}{\beta}_{6}age)\]
where
ECC is a binary variable indicating the type of test (GDx VCC was used as the reference test), and
TSS is a continuous variable quantifying the presence of atypical patterns of retardation. An interaction term between the variable
ECC and Φ
−1 (
q) was included to verify whether the performance of the ECC differed by varying amounts depending on the false-positive rate
q (or specificity 1 −
q)—that is, whether the shapes of ROC curves for VCC and ECC tests were different. The interaction term between ECC and TSS was included to assess whether the influence of ARPs was similar or different between GDx VCC and ECC tests. Finally, to adjust for severity of disease and age, the model included a variable
severity indicating severity of glaucomatous damage as measured by the AGIS score, and a variable
age indicating patient’s age. No variable selection method was used. Because the models were developed for hypothesis testing, there was little concern for parsimony. The full prespecified model fit, including all variables, results in more accurate probabilities for tests of variables of interest.
17
Parameters were estimated using probit regression. To obtain confidence intervals for regression parameters, a bootstrap resampling procedure was used (
n= 500 resamples).
18 As measurements from both eyes of the same subject are likely to be correlated, the use of standard statistical methods for parameter estimation can lead to underestimation of standard errors and to confidence intervals that are too narrow.
19 Therefore, to account for the fact that both eyes of some subjects were used for analyses, the cluster of data for the study subject was considered as the unit of resampling when calculating standard errors. This procedure has been used to adjust for the presence of multiple correlated measurements from the same unit.
18 20
Statistical analyses were performed using commercial software (STATA ver. 9.0; StataCorp, College Station, TX; and SPSS ver. 13.0; SPSS Inc., Chicago, IL). The α level (type I error) was set at 0.05. The study had a power of 0.85 to detect a difference of 0.05 between ROC curve areas for the parameter NFI, assuming α = 0.05.