Abstract
purpose. To use physical attributes of redness to determine the accuracy of four bulbar redness grading scales, and to cross-calibrate the scales based on these physical measures.
methods. Two image-processing metrics, fractal dimension (D) and percentage of pixel coverage (% PC), as well as photometric chromaticity were selected as physical measures, to describe and compare grades of bulbar redness among the McMonnies/Chapman-Davies scale, the Efron Scale, the Institute for Eye Research scale, and a validated scale developed at the Centre for Contact Lens Research. Two sets of images were prepared by using image processing: The first included multiple segments covering the largest possible region of interest (ROI) within the bulbar conjunctiva in the original images; the second contained modified scale images that were matched in size and resolution across scales, and a single, equally-sized ROI. To measure photometric chromaticity, the original scale images were displayed on a computer monitor, and multiple conjunctival segments were analyzed. Pearson correlation coefficients between each set of image metrics and the reference image grades were calculated to determine the accuracy of the scales.
results. Correlations were high between reference image grades and all sets of objective metrics (all Pearson’s r ≥ 0.88, P ≤ 0.05); each physical attribute pointed to a different scale as being most accurate. Independent of the physical attribute used, there were wide discrepancies between scale grades, with almost no overlap when cross-calibrating and comparing the scales.
conclusions. Despite the generally strong linear associations between the physical characteristics of reference images in each scale, the scales themselves are not inherently accurate and are too different to allow for cross-calibration.
Red eye, clinically known as bulbar hyperemia, is an increased dilation of blood vessels in the bulbar conjunctiva that gives the eye its red appearance and is a prominent sign of ocular irritation. The recognition of change in redness is crucial for clinicians in management of the ocular surface, particularly in contact lens research and practice. Commonly, redness is estimated in a patient’s eye by subjectively comparing it to references that represent different levels of severity for the condition and assist in monitoring changes over time. The references are descriptive,
1 2 illustrative,
3 4 5 6 or computer generated
7 8 and are presented in the form of grading scales. The subjectivity in grading is a criticism linked to the use of grading scales, and weak repeatability for inter- and intraobserver assessments is of particular concern for clinical practice.
9 10 Aside from variability introduced by observer use, grading scales have been criticized for technical difficulties,
10 11 such as unequal steps, references not capable of covering the whole range of the scale, or biased depiction of references for different levels of severity. Hence, it has been recommended that the different grading scales not be interchanged.
8 12 13 14
Repeatability has been the main focus of most research studies of traditional grading, with respect to differences between observers,
3 11 12 15 between grading scales,
8 12 or between levels of observer training,
6 16 17 or compared with novel objective techniques measuring the physical attributes of redness.
9 14 18 19 20 21 22 23 The physical attributes to describe conjunctival redness have included various quantitative
13 20 21 22 24 (e.g., number of vessels or percentage of vessel coverage) and colorimetric (e.g., chromaticity levels or red intensity ratios)
9 18 19 23 variables that were determined using either digital image processing or photometric techniques. However, we found only three studies in which the physical attributes of the scales, per se, were analyzed.
13 14 19
An interval or ratio scale level has been recommended for grading scales,
22 since it ensures uniform separation of reference images across the scale range; that is, a change from 10 to 20 on a 100-point scale represents the same difference as a change from 70 to 80 on the same scale. The extent of blood vessel coverage (%
PC, an objective measure of redness) has been used to examine scales and compare them, but not specifically to investigate the separation of the steps of the scales.
13 14 19
In this study, we introduce fractal analysis, a new technique for analysis of grading scales, and compare it to %
PC 13 14 19 and photometric chromaticity.
25 Fractal analysis has been shown to be a powerful objective technique for detecting changes in various biological systems.
26 27 It describes the complexity of the object or pattern by estimating the degree of branching of the vascular tree in the respective biological system.
28 Fractals found in nature are so-called random fractals, objects that are scale invariant over a finite range, which means that they look the same under different degrees of magnification or scale (e.g., the branches of a tree). They are quantized by a fractal dimension,
D, describing the degree of branching. In a two-dimensional photograph of vascular branches in the eye, the fractal dimension
D can take on any decimalized value between 0 and 2.
Figure 1shows simulated examples of vascular branching of the bulbar conjunctiva and the range of the expected fractal dimension,
D. With respect to the eye, fractal analysis has been used to simulate corneal neovascularization
29 and has been successfully applied to investigate the vessel structure in normal and diseased retinas.
28 30 31
The purpose of this study was to estimate the accuracy of the grading scales by comparing the distribution and separation of the reference images of illustrative redness grading scales to objective physical attributes of redness defined by fractal analysis and photometric chromaticity and to use these measures to cross-calibrate the scales.
Scale Version 1: Greatest Conjunctival Area Coverage.
Scale Version 2: Size-Matching of Reference Images.
Fractal Analysis.
To determine fractal dimensions and % PC for the largest conjunctival area possible of each reference image, the results for the individual segmented ROIs of scale version 1 and the photometric measures were averaged to represent a global estimate of the conjunctival redness.
The Pearson’s product moment correlation coefficient (Pearson’s r) was used to estimate the strength of linear association between scale steps and physical attributes of the scales (D, % PC, and photometric chromaticity).
The purpose of this study was to estimate the accuracy of the grading scales by comparing the distribution and separation of the reference images of illustrative redness grading scales to objective physical attributes of redness defined by fractal analysis and photometric chromaticity, and to use these measures to cross-calibrate the scales.
One limitation of the study was that scale reference images were provided at different resolutions (72–300 dpi). As with all image-processing techniques, the highest spatial resolution possible is advantageous, particularly when small spatial details form the object of interest. However, even at the lowest resolution used in this experiment, there was a systematic relationship between grading scale steps and the estimated fractal dimension. If we were to attempt to glean recommendations from these resolution data, our results suggest that images acquired only at the common resolution (72 dpi) of a screen display were sufficient to quantify redness reasonably, based on fractal dimension. Most common clinical digital image-acquisition instrumentation would provide much higher resolution.
The use of fractal analysis to assess changes in vascular branching is an emerging strategy in clinical research.
27 28 31 This is the first study in which fractal analysis was used to evaluate vascular structures in the conjunctiva, as well as to compare differences in the physical attributes between grading scale images. The strong correlations (all
r ≥ 0.89;
P ≤ 0.05) between physical measures to describe conjunctival redness (%
PC 13 14 19 22 and photometric chromaticity
25 ) and fractal dimensions indicate that fractal analysis is capable of describing changes in severity of bulbar redness.
The preprocessed scale images showing the changes in severity across the whole scale range are displayed in
Figure 5 . The physical attributes derived from these images (%
PC and
D) correlated highly with the grading scale steps
(Table 3) . The types of fractal dimensions (D̄,
D sc,
D e, and
D sce) calculated by FracLac showed only minimal differences in the raw data for any scale for the same preprocessing procedure, which resulted in very small variations of the Pearson correlation coefficients (
Table 3 ; differences within a scale ±0.01). Therefore the slope-corrected, most efficient covering fractal dimension (
D sce), which eliminates periods of no change in the data by using the lowest number of boxes, was selected to illustrate the results of fractal analysis for scale versions 1
(Fig. 6a)and 2
(Fig. 6b) .
The results of this study showed high levels of linear association between grading scale reference levels and physical attributes (
Table 3 ; range of 0.88 ≤
r ≤ 1.0) for all grading scales. A Pearson correlation of 1.0 represents a perfectly linear association between scale grades and physical attributes; grading scales that exhibit this feature may be characterized at least as interval. For the purposes of this study, we decided to define the accuracy of a grading scale by the level of linear association it exhibited between scale steps and associated physical attribute (Pearson’s
r,
Table 3 ). Thus, the most accurate grading scale would be the scale for which this correlation was the highest, and the least accurate scale the one with the lowest Pearson correlation level.
In our study, a Pearson correlation of
r = 1.0 between scale reference grades and one physical attribute was found for each grading scale except for the MC-D scale. However, each physical attribute extracted from the images pointed to a different scale as being most accurate. If the accuracy of a scale were defined only on the amount of vessel coverage across the scale range, the IER scale would be the most accurate (
Table 3 , %
PC). Based on fractal dimension (
Table 3 , all types of
D), the Efron and the VBR scales were more accurate. With the third physical attribute of redness in our study, photometric chromaticity, the VBR scale showed the highest linear association with the reference grades and therefore might be described as the most accurate scale.
The high accuracy of the VBR scale with respect to chromaticity is not surprising, since this scale was developed based on a combination of subjective estimates and photometric chromaticity measures.
6 The reference images for the Efron scale were painted, perhaps focusing on highlighting certain features and simultaneously avoiding confounding artifacts.
39 Our study supports this intention, as we determined consistently strong associations with degree of vascular branching and the amount of vessel coverage, whereas the linear association between photometric chromaticity changes and scale steps was lowest of all scales.
These results show that estimating the accuracy of a grading scale is closely related to the technique and the physical attributes used. Overall, fractal dimension, % PC, and photometric chromaticity all were capable of detecting changes in the severity of redness. The high linear associations between scale steps and physical attributes indicate that the anticipated change in the scales could be determined with the physical attributes used, and that all scales thus may be considered accurate. Superior or inferior accuracy of a scale, however, should be defined solely by indicating the physical attribute that was used. Based on our results, it seems that each scale describes one characteristic of redness best: The VBR and MC-D scales best describe redness in terms of photometric chromaticity, the Efron scale with respect to changes of vascular branching (D), and the IER scale with respect to changes in the area that is covered (% PC). The consistently high correlation levels for the VBR scale (all r ≥ 0.97), however, suggest that this scale is the least affected by the selection of the physical attributes or the preprocessing procedure.
The first goal of this study was to determine the accuracy of bulbar redness grading scales based on their correlation levels between scale grades and three physical measures (D, % PC, and photometric chromaticity). Based on our results, all scales can be considered accurate according to the criteria we specified; however, it seems that each scale describes one characteristic of redness best: the VBR and MC-D scales best describe redness in terms of photometric chromaticity, the Efron scale with respect to changes of vascular branching (D), and the IER scale with respect to changes in the area that is covered (% PC).
The second intent of this series of analyses was to cross-calibrate the scales so that they could be compared via the physical measures. We have shown that an objective cross-calibration between scales would be technically possible, but as is apparent because of the wide discrepancies and the relatively low overlap, it cannot be recommended. Differences between acquisition methods, image quality, and physically obtained measures show that the differences between grading scales are too severe to allow for cross-calibration. Among many things, this highlights the need for standardization, inasmuch as scales are proposed with few physical similarities, bringing into question whether the numbers that represent the steps on the scale (and therefore the numbers derived using the scales) are measurements at all.
Supported by a Canada Foundation for Innovation (CFI) Equipment Grant.
Submitted for publication October 9, 2007; revised December 4, 2007; accepted February 28, 2008.
Disclosure:
M. M. Schulze, None;
N. Hutchings, None;
T. L. Simpson, None
The publication costs of this article were defrayed in part by page charge payment. This article must therefore be marked “
advertisement” in accordance with 18 U.S.C. §1734 solely to indicate this fact.
Corresponding author: Marc M. Schulze, Centre for Contact Lens Research, School of Optometry, University of Waterloo, 200 University Avenue West, Waterloo, Ontario, Canada N2L 3G1;
[email protected].
Table 1. Original File Types for the Grading Scale Reference Images and Their Associated Image Size and Resolution
Table 1. Original File Types for the Grading Scale Reference Images and Their Associated Image Size and Resolution
| Original File type | Size (px × px) | Resolution (dpi) |
VBR | TIFF | 1524 × 1012 | 300 |
IER | TIFF | 700 × 525 | 100 |
Efron | JPEG | 1628 × 1399 | 72 |
MC-D | TIFF | 372 × 271 | 100 |
Table 2. Grayscale Brightness Values, Standard Deviation and Signal-to-Noise Ratio for Each Grading Scale and Each 8-bit RGB Component
Table 2. Grayscale Brightness Values, Standard Deviation and Signal-to-Noise Ratio for Each Grading Scale and Each 8-bit RGB Component
| Red | | | | Green | | | | Blue | | | |
| s n | a max | a min | SNR | s n | a max | a min | SNR | s n | a max | a min | SNR |
VBR | 12 | 255 | 190 | 14.7 | 3 | 247 | 114 | 32.9 | 4 | 242 | 102 | 30.9 |
IER | 13 | 255 | 117 | 20.5 | 2 | 251 | 103 | 37.4 | 3 | 253 | 76 | 35.4 |
Efron | 6 | 255 | 147 | 25.1 | 1 | 255 | 20 | 47.4 | 2 | 255 | 45 | 40.4 |
MC-D | 24 | 255 | 121 | 14.9 | 7 | 242 | 40 | 29.2 | 10 | 210 | 42 | 24.5 |
Table 3. Pearson Correlation Coefficients between Scale Grades and Their Associated Physical Attributes
Table 3. Pearson Correlation Coefficients between Scale Grades and Their Associated Physical Attributes
| VBR | | IER | | Efron | | MC-D | |
| 1 | 2 | 1 | 2 | 1 | 2 | 1 | 2 |
D̄ | 0.97 | 0.98 | 0.93 | 0.96 | 0.99 | 1.00 | 0.88 | 0.95 |
D sc | 0.97 | 0.98 | 0.93 | 0.96 | 0.99 | 1.00 | 0.88 | 0.95 |
D e | 0.98 | 0.98 | 0.93 | 0.95 | 0.99 | 0.99 | 0.88 | 0.94 |
D sce | 0.97 | 0.98 | 0.92 | 0.94 | 0.99 | 0.99 | 0.88 | 0.95 |
% PC | 0.98 | 0.98 | 0.97 | 1.00 | 0.99 | 0.97 | 0.95 | 0.97 |
u′ | 0.99 | | 0.95 | | 0.94 | | 0.97 | |
u* | 1.00 | | 0.96 | | 0.96 | | 0.98 | |
The authors thank Charles McMonnies, Nathan Efron, and the International Association of Contact Lens Educators (IACLE) for providing high-resolution copies of the original reference images.
Slit Lamp Findings Classification Scale.. 1984;U.S. Food and Drug Administration Washington, DC.
MandellRB. Slit lamp classification system. J Am Optom Assoc. 1987;58(3)198–201.
[PubMed]McMonniesCW, Chapman-DaviesA. Assessment of conjunctival hyperemia in contact lens wearers. Part I. Am J Optom Physiol Opt. 1987;64(4)246–250.
[CrossRef] [PubMed]IER Grading Scales. Institute for Eye Research, Sydney, Australia. Available at: http://www.siliconehydrogels.org/grading_scales/SETUP.HTM. Accessed July 20, 2007.
EfronN. Clinical application of grading scales for contact lens complications. Optician. 1997;213(5604)26–35.
SchulzeM, JonesD, SimpsonT. The development of validated bulbar redness grading scales. Optom Vis Sci. 2007;84(10)976–983.
[CrossRef] [PubMed]SimpsonT, PritchardN. Continuous grading scales of ocular redness, papillae and corneal staining. Optom Vis Sci. 1995;72(suppl 12)77.
ChongT, SimpsonT, FonnD. The repeatability of discrete and continuous anterior segment grading scales. Optom Vis Sci. 2000;77(5)244–251.
[PubMed]FieguthP, SimpsonT. Automated measurement of bulbar redness. Invest Ophthalmol Vis Sci. 2002;43(2)340–347.
[PubMed]BaileyIL, BullimoreMA, RaaschTW, et al. Clinical grading and the effects of scaling. Invest Ophthalmol Vis Sci. 1991;32(2)422–432.
[PubMed]MurphyPJ, LauJS, SimMM, et al. How red is a white eye?—clinical grading of normal conjunctival hyperaemia. Eye. 2006;21:633–638.
[PubMed]EfronN, MorganPB, KatsaraSS. Validation of grading scales for contact lens complications. Ophthalmic Physiol Opt. 2001;21(1)17–29.
[PubMed]Perez-CabreE, MillanMS, AbrilHC, et al. Image processing of standard grading scales for objective assessment of contact lens wear complications. Proc SPIE Int Soc Opt Eng. 2004;5622(1)107–112.
WolffsohnJS. Incremental nature of anterior eye grading scales determined by objective image analysis. Br J Ophthalmol. 2004;88(11)1434–1438.
[CrossRef] [PubMed]EfronN. Grading scales for contact lens complications. Ophthalmic Physiol Opt. 1998;18(2)182–186.
[CrossRef] [PubMed]EfronN, MorganPB, FarmerC, et al. Experience and training as determinants of grading reliability when assessing the severity of contact lens complications. Ophthalmic Physiol Opt. 2003;23(2)119–124.
[CrossRef] [PubMed]EfronN, MorganPB, JagpalR. The combined influence of knowledge, training and experience when grading contact lens complications. Ophthalmic Physiol Opt. 2003;23(1)79–85.
[CrossRef] [PubMed]VillumsenJ, RingquistJ, AlmA. Image-analysis of conjunctival hyperemia: a personal-Computer based system. Acta Ophthalmol. 1991;69(4)536–539.
WillinghamFF, CohenKL, CogginsJM, et al. Automatic quantitative measurement of ocular hyperemia. Curr Eye Res. 1995;14:1101–1108.
[CrossRef] [PubMed]OwenCG, FitzkeFW, WoodwardEG. A new computer assisted objective method for quantifying vascular changes of the bulbar conjunctivae. Ophthalmic Physiol Opt. 1996;16(5)430–437.
[CrossRef] [PubMed]GuillonM, ShahD. Objective measurement of contact lens-induced conjunctival redness. Optom Vis Sci. 1996;73(9)595–605.
[CrossRef] [PubMed]PapasEB. Key factors in the subjective and objective assessment of conjunctival erythema. Invest Ophthalmol Vis Sci. 2000;41(3)687–691.
[PubMed]SorbaraL, SimpsonT, DuenchS, et al. Comparison of an objective method of measuring bulbar redness to the use of traditional grading scales. Contact Lens Anterior Eye. 2007;30(1)53–59.
[CrossRef] [PubMed]WolffsohnJS, PurslowC. Clinical monitoring of ocular physiology using digital image analysis. Contact Lens Anterior Eye. 2003;26(1)27–35.
[CrossRef] [PubMed]SimpsonT, ChanA, FonnD. Measuring ocular redness: first order (luminance and chromaticity) measurements provide more information than second order (spatial structure) measurements. Optom Vis Sci. 1998;75(suppl 12)279.
[CrossRef] [PubMed]MandelbrotBB. The Fractal Geometry of Nature. 1982;W.H. Freeman and Company New York.
MastersBR. Fractal analysis of human retinal blood vessel patterns: developmental and diagnostic aspects.MastersBR eds. Noninvasive Diagnostic Techniques in Ophthalmology. 1990;515–527.Springer-Verlag New York.
MastersBR. Fractal analysis of human retinal vessels. SPIE Proc. 1990;1357(1)250–256.
LandiniG, MissonG. Simulation of corneal neovascularization by inverted diffusion limited aggregation. Invest Ophthalmol Vis Sci. 1993;34(5)1872–1875.
[PubMed]AvakianA, KalinaRE, SageEH, et al. Fractal analysis of region-based vascular change in the normal and non-proliferative diabetic retina. Curr Eye Res. 2002;24(4)274–280.
[CrossRef] [PubMed]MastersBR. Fractal analysis of the vascular tree in the human retina. Ann Rev Biomed Eng. 2004;6(1)427–452.
[CrossRef] Rasband WS. ImageJ. Version 1.38x (07/13/2007). Bethesda, MD: National Institutes of Health. Available at: http://rsb.info.nih.gov/ij/.
SmithSW. The Scientist and Engineer’s Guide to Digital Signal Processing. Available at http://www.dspguide.com. Accessed February 26, 2008. ;
YoungIT, GerbrandsJJ, van VlietLJ. Image Processing Fundamentals. Accessible at: http://www.ph.tn.tudelft.nl/courses/fip/noframes/fip-statisti.html#heading24. Accessed June 7, 2007. ;
KarperienA. FracLac for ImageJ— FracLac Advanced User’s Manual. Accessible at http://rsb.info.nih.gov/ij/plugins/fraclac/fraclac-manual.pdf. Accessed July 21, 2007.. ;
CIE (Commission International de L’Eclairage). Recommendations on Uniform Color Spaces, Color-Difference Equations, Psychometric Color Terms. 1978. Report Supplement No. 2 of CIE Publ. No 15 (E-1.3.1) 1971.
WyszeckiG, StilesWS. Color Science: Concepts and Methods, Quantitative Data and Formulae. 1982; 2nd ed.John Wiley & Sons New York.
PoyntonCA. Gamma FAQ: Frequently Asked Questions about Gamma. Accessible at: http://www.poynton.com/notes/colour_and_gamma. Accessed February 26, 2008. ;
EfronN. Contact Lens Complications. 1999;33–39.Butterworth-Heinemann Oxford; Boston.
LandEH, MccannJJ. Lightness and Retinex Theory. J Opt Soc Am. 1971;61(1)1–11.
[CrossRef] [PubMed]FinkelsteinL. Widely, strongly and weakly defined measurement. Measurement. 2003;34(1)39–48.
[CrossRef] FinkelsteinL. Problems of measurement in soft systems. Measurement. 2005;38(4)267–274.
[CrossRef] RossiGB. Measurability. Measurement. 2007;40(6)545–562.
[CrossRef] PointerMR. Measuring Visual Appearance: a Framework for the Future. 2003; Report NPL COAM 19. Available at: http://publications.npl.co.uk/npl_web/pdf/coam19.pdf. Accessed April 10, 2007. ;