Glaucoma screening has historically been challenging because of a poor balance of sensitivity and specificity of screening procedures. IOP, automated visual field testing, frequency-doubling perimetry, optic disc evaluation, and nerve fiber analysis by photographs and automated methods have all been evaluated as potential screening devices, with variable results.
The meta-analysis performed by Mowatt et al.
25 reviewed 40 studies of nine screening tests for the detection of glaucoma. In judging the performance of these tests by diagnostic odds ratios, these researchers found that FDT, oculokinetic perimetry, and HRT II (Retinal Tomograph II; Heidelberg Engineering, Heidelberg, Germany) were promising tests, whereas ophthalmoscopy, standard automated perimetry, retinal photography, and IOP performed relatively poorly as single tests. They concluded that no test or group of tests was clearly superior for glaucoma screening.
The Baltimore Eye Survey was a population-based study that evaluated IOP, vertical C/D ratio, and narrowest neuroretinal rim width as screening parameters for glaucoma.
24 The investigators found no cutoff points that gave a good balance between sensitivity and specificity for any parameter or combination thereof—this despite the use of expert evaluation of stereoscopic disc photography as one of the screening parameters.
A study in the United Kingdom found that by adding visual field results, optic disc evaluation, and IOP, a sensitivity and specificity of >0.9 were obtained.
26 Tests of visual function such as visual acuity, contrast sensitivity, and color discrimination did not improve the model. However, this was not a population-based glaucoma screening study, since the participants included a small group of persons without glaucoma and about half that number with suspected glaucoma. Thus, the results cannot be directly applied to population screening for glaucoma, because most persons do not have glaucoma.
In our study, while the vertical C/D ratio has the best balance of specificity and sensitivity, it does not achieve levels that would make it a useful screening test for population-based glaucoma screening in the general population or in high-risk subgroups. Nevertheless, we have shown that a combination of screening tests can be used in a simple decision algorithm to screen and identify persons with glaucoma, with high sensitivity and specificity. While our results do not suggest that glaucoma screening in Latino populations is materially different from screening in other populations, that possibility should be considered when extrapolating our results to other population groups. We calculated likelihood ratios which have the advantage of consistency across different levels of disease prevalence in the population being tested. However, there are also known differences in the levels of these screening parameters seen in various populations. For example, even within the Latino community, Latinos in Arizona (Proyecto VER) had higher levels of IOP among participants with OAG than were found in Latinos in Los Angeles among participants with OAG. It is not clear whether this difference is due to different study methods in measurement or determining glaucoma or to population differences within the Latino community (e.g., percentage with Native American ancestry, which was substantially different in the two studies). Given these limitations, an appropriate interpretation of these results is that our study is exploratory and provides a different methodological approach to screening for glaucoma. The use of a multiparameter algorithm such as we have developed herein may eliminate some of the ambiguity found with single-parameter evaluations; however, further studies are necessary to validate these findings. Indeed, a generalized algorithm may include race/ethnicity as a parameter subject to the CART decision process. A remaining limitation of this algorithm, however, is that from a practical standpoint, not all the required parameters may be readily available. In a future analysis, we propose an objective measure of optic disc parameters with automated testing to determine whether there is good a balance between sensitivity and specificity. The strategy of screening a high-risk population neither improved nor detracted from the sensitivity and specificity of the various glaucoma screening parameters, suggesting that a screening program would be equally effective in both settings.
In summary, we have validated the findings of previous studies highlighting that no single parameter is highly sensitive and specific for population-based screening of glaucoma. However, a combination of IOP, HVF, and optic disc assessments used in the algorithmic manner described herein yields the best approach to screening for glaucoma. Other population-based studies should develop similar algorithmic approaches to diagnosis based on parameters that are specific and applicable to their populations.
Supported by Grants U10 EY-11753 and EY-03040 from the National Eye Institute, Bethesda, Maryland, and by an unrestricted grant from Research to Prevent Blindness, Inc, New York, New York.
The authors thank the LALES External Advisory Committee (members shown in the Appendix) for their advice and contributions.