Abstract
purpose. To establish objectively measurable characteristics of the conjunctival
vasculature that correspond with the judgment of erythema by human
observers.
methods. Color images of bulbar conjunctiva from 21 subjects were
digitally analyzed to extract the following variables characteristic of
the scene: vessel width (W), number of vessels
(V), proportion of area occupied by vessels
(PA), relative redness both in vessels
(RRV) and in the whole image
(RRI), red-green difference both in vessels
(RGV) and in the whole image
(RGI), red-blue difference both in vessels
(RBV) and in the whole image
(RBI), and red hue value
(RHV). These data were compared with subjective
judgments by a panel of seven trained observers who independently rated
erythema in the same images, using a 0 to 4 scale with decimal
interpolation between grades.
results. Correlation analysis indicated significant associations
(P < 0.05) between the mean response of the human
observers and all the objective variables except RHV.
Associations with the morphometric variables PA (R 2 = 0.93) and V (R 2 = 0.90) were markedly stronger than
for the best colorimetric variable RBV (R 2 = 0.62).
conclusions. Judgments of erythema made by human observers do not rely primarily on
color but can be closely approximated by a univariate, linear model
involving only the proportion of the scene occupied by vessels. Under
the conditions of this study, grading of erythema by trained observers
can be considered to constitute measurement to at least an interval
level.
Observing erythema (i.e., the appearance of redness), is a
valuable clinical procedure, because changes in blood flow within the
conjunctiva or sclera accompany a wide range of ocular conditions. In
some circumstances, such changes are sufficiently gross that their
observation requires little expertise. There are, however, many
occasions when the subtlety of the event demands greater sensitivity.
To assist in the process of assessment, clinicians and researchers have
often resorted to the use of grading scales.
1 2 3 4 Thus, a
given presentation is gauged relative to a predetermined set of
criteria chosen to represent different degrees of the condition of
interest. Such scales vary in their design and may be either
descriptive,
5 artistically rendered,
6 photographic,
7 8 or computer generated.
9
As a clinical aid, grading scales have been convenient and useful;
however, their inherent subjectivity is a source of some concern.
Repeated viewing of the same scene, whether by different observers or
by the same individual on separate occasions, typically produces a
range of responses.
Presumably with this as well as other factors in mind, several groups
have applied objective methods to the problem of measuring ocular
surface vasculature.
10 11 12 13 14 15 16 17 A variety of parameters have
been used in this body of work in attempts to describe the changes
associated with vascular activity. These include vessel caliber, vessel
area, percentage vessel area, relative redness, number of vessel
segments, intervessel spacing, and vessel length or area, Faced with
such diversity, it seems natural to wonder about the nature of the
relationship between the array of morphologic and colorimetric factors
and the view of erythema arrived at by subjective means. Establishing
the relative importance of individual objective measures by comparing
them with subjective responses would seem a logical step therefore.
Some efforts in this direction have been made previously, although
apparently with little success. Willingham et al.
14 used
an objective system to measure the relative redness and percentage of
vessel coverage in each of the six reference photographs of an
independently produced scale.
7 After correlating the
objective results with the scale integers, they claimed good agreement
for both variables measured. However, because their data were all
derived from just one eye, the validity of this claim is questionable.
Owen et al.
15 made objective and subjective measurements
in a group of subjects but, unfortunately, the observed region of
conjunctiva was not the same in both cases. Perhaps as a result, they
did not compare their data over a range of responses. Finally, Guillon
and Shah
16 collected both objective and subjective data
from 129 individuals, assessing two vascular parameters, vessel width
and percentage of coverage. They were unable to demonstrate any
consistent linkages, however, probably because of the limited range of
conjunctival response present in their sample.
In the absence of other pertinent literature, the present study was
undertaken with the purpose of identifying associations between
individual, objectively measurable characteristics of the ocular
surface vasculature and typical subjective judgments of conjunctival
erythema. Knowledge of these factors and their relative importance
during the process of subjective judgment would be valuable, not only
to designers of objective measurement systems, but also to those either
using, or teaching the use of, subjective methods of assessment.
Relative Redness of Image.
Relative Redness of Vessels.
Red-Green Difference in Entire Image.
Red-Green Difference in Vessels.
Red-Blue Difference in Entire Image.
Red-Blue Difference in Vessels.
Red Hue Value.
Number of Vessels.
Vessel Width.
Proportion of Area Occupied by Vessels.
Preliminary checks indicated that all data were approximately
normally distributed (Kolmogorov–Smirnov;
P > 0.05).
The mean scores from all seven observers for each image together with
the associated SD are shown in
Figure 1 . Overall, on this set of images, the average SD of judgments was 0.33
grades. This implies that 95% of observations made by these observers
on the same image would be within a range of ±0.8 grades.
Taking mean observed grade (
OG) as ordinate in all
cases, a series of scatterplots was constructed, each having one of the
measured variables as its abscissa. These plots are shown in
Figure 2 . In several instances, evidence of a linear relationship between the
plotted variables was detected on visual inspection. Further
confirmation was made by calculating bivariate Pearson correlation
coefficients as shown in
Table 1 .
Overall, the strongest relationships found were those with V and PA. These variables accounted for
90% and 93%, respectively, of the variance in OG.
Vessel width, showed only a relatively weak association with OG (R 2 = 0.39).
Among the colorimetric variables, all except RHV were
significantly correlated with OG and had fairly similar
coefficients of determination. The association with the RBV was the strongest within this group
(R 2 = 0.62), although it was notably
weaker than that for V and PA. Attempts
to improve the fit still further by combining colorimetric and
morphometric information in a multivariate model were ineffective.
Given that the purpose of this study was to discover which
observable characteristics of the conjunctival surface contribute to
the perception of its redness, it seemed reasonable at the outset to
expect that some aspect of the color information contained within the
image would be an important factor. For this reason, several data
constructs were incorporated into the analysis that, in one sense or
another, encoded for redness. All these, with the exception of RHV, produced reasonably good fits to the observers’ data.
However, even the most successful measure (RBV), fell
considerably short of the conformity shown by the two morphologic
variables V and PA. That these measures were
individually capable of accounting for at least 90% of the variance in
the observers’ ratings, suggests that the process whereby this group
made their judgments primarily involved gauging the extent or quantity
of vessels in the scene rather than their color. Both these variables
can be thought of as indicators of vascular density and were highly
correlated with one another (R 2 =
0.93).
The finding that assessments of conjunctival erythema can be modeled
morphometrically is an important one so far as the design of an
objective measurement system is concerned. The ability to ignore color
permits monochromatic image acquisition, which is relatively
inexpensive and offers advantages in CCD size and sensitivity.
Furthermore, the amount of data involved is reduced by a factor of
three, allowing processing and storage to be simpler, faster, and less
demanding of computer resources.
The strong association between
PA and
OG has the
intriguing consequence that good, subjective erythema judgments should
be possible even from a monochromatic scene. Because the suggestion
that the “redness” of a black and white image could be judged may
be conceptually difficult, a supplementary experiment was performed to
test its accuracy. Five of the original observers, who were available,
were asked to re-view the test images. On this occasion, however, color
information was removed before viewing by converting the images from
their original RGB format to 8-bit gray scale. Because approximately 2
months had elapsed since the first viewing session, recall of previous
scores was deemed to be unlikely. Nevertheless, the order of
presentation was rerandomized for all observers. Correlating the mean
grading scores from the monochromatic viewing with those from the
original colored session yielded a coefficient of determination
(
R 2) of 0.97 (
P <
0.0001). Mindful of the criticisms leveled at correlation as a method
of indicating agreement,
19 the difference between the
gradings made on color and monochromatic images was plotted against
their mean, as shown in
Figure 3 . No evidence of a relationship with measurement size was indicated
(
R 2 = 0.1,
P = 0.16),
and the mean difference between color and monochromatic gradings was
0.13 ± 0.15 (SD).
Overall, across the range of measurement, this result indicates a
slight tendency for monochromatic judgments to underestimate those for
color, but only by a small amount. The limits of agreement suggest that
95% of color-monochrome differences would fall between approximately
0.5 and −0.2 grades. Neither the magnitude of the bias, nor its
associated error are large compared with the random variability
inherent in subjective grading, and it appears reasonable to conclude
that removal of color information from this kind of grading task did
not materially alter its outcome. This tends to confirm the suggestion
that, for this group of observers at least, erythema judgment is
essentially morphometric.
Apart from its methodologic implications, this study may also bear on
the way graded data are treated analytically. Until now, information of
this type has commonly been regarded as ordinal in nature. The reasons
for this and its ramifications have been comprehensively discussed
elsewhere
20 ; but briefly, ordinal scales simply require
that some meaningful order, or ranking, can be imposed onto the sample.
Because of this limitation, the application of even simple operations,
such as calculating the sample mean, have been seen as inappropriate.
Achieving the next higher level (i.e., interval measurement) requires
that successive scale increments be equally spaced. Clearly, this
presents a problem for most grading scales because of difficulty in
obtaining a suitable standard for comparison. However, an inspection of
Figure 2J plainly shows that the relationship between
OG (a
graded score), and
PA (a continuous variable) is closely
linear across the grading range used. A line fitted to the data has the
equation
OG = 9.0 ×
PA + 1.3
(
R 2 = 0.93;
P <
0.0001).
Thus, an increase of one erythema grade corresponded to a 0.11 increase
in PA, irrespective of whether the change was from Grade 1
to 2, from 3 to 4, or from 2.7 to 3.7. This appears to satisfy the
requirement for interval scale measurement. As such, the practice of
avoiding using statistics such as the mean and SD, or parametric
analytical techniques such as analysis of variance, simply on order of
measurement grounds does not seem justified.
An interesting further observation along these lines is that PA goes to zero when OG is approximately 1.3,
implying that grades below 1 are redundant. Making an axis translation
to account for this results in a scale the zero point of which could be
said to have a real physical meaning, (i.e., it corresponds to complete
absence of vascular detail; PA = 0). Interestingly,
having a meaningful zero point is a property required of ratio scale
measurement that constitutes the next, and highest, level of scaling.
In summary, this work has demonstrated that subjective erythema
judgment can be closely modeled using a linear, univariate,
morphometric approach and that, under the conditions of the study,
clinical grading displays at least interval level measurement
characteristics.
Supported by the Australian Government Co-operative Research Centre Scheme.
Submitted for publication July 23, 1999; accepted September 17, 1999.
Commercial relationships policy: N.
Corresponding author: Eric B. Papas, CRCERT, Rupert Myers Bldg. UNSW, Kensington, NSW 2052, Australia.
[email protected]
Table 1. Coefficients of Determination and P from Pearson Correlation
of Human Observations with Each Listed Variable
Table 1. Coefficients of Determination and P from Pearson Correlation
of Human Observations with Each Listed Variable
Variable | R 2 | P |
Relative redness image (RRI) | 0.49 | <0.0001 |
Relative redness vessels (RRV) | 0.55 | <0.0001 |
Red-green difference image (RGI) | 0.49 | <0.0001 |
Red-green difference vessels (RGV) | 0.48 | <0.0001 |
Red-blue difference image (RBI) | 0.52 | <0.0001 |
Red-blue difference vessels (RBV) | 0.62 | <0.0001 |
Red hue value (RHV) | 0.17 | 0.06 |
Number of vessels (V) | 0.90 | <0.0001 |
Vessel width (W) | 0.39 | 0.002 |
Percentage vessel area (PA) | 0.93 | <0.0001 |
The author thanks Isabelle Jalbert, Lisa Keay, Edward Lum, Padmaja
Sankaridurg, Cheryl Skotnitsky, Andrew Stephenson, and Rob Terry for
supplying their clinical skills.
Efron N, Brennan NA, Hore J, Rieper K. Temperature of the hyperemic bulbar conjunctiva. Curr Eye Res
. 1988;7:615–618.
[CrossRef] [PubMed]Donnenfeld ED, Perry HD, Wallerstein A, et al. Subconjunctival mitomycin C for the treatment of ocular cicatricial pemphigoid. Ophthalmology
. 1999;106:72–78.
[CrossRef] [PubMed]Terry RL, Schnider CM, Holden BA, et al. CCLRU standards for success of daily and extended wear contact lenses. Optom Vis Sci
. 1993;70:234–243.
[CrossRef] [PubMed]Papas EB, Vajdic CM, Austen R, et al. High oxygen transmissibility soft contact lenses do not induce limbal hyperaemia. Curr Eye Res
. 1997;16:942–948.
[CrossRef] [PubMed]Mandel RB. Slit lamp classification system. J Am Optom Assoc
. 1987;58:198–201.
[PubMed]Efron N. Clinical application of grading scales for contact lens complications. Optician. 1997;213(5604)26–34.
McMonnies CW, Chapman–Davies A. Assessment of conjunctival hyperaemia in contact lens wearers. Part I. Am J Optom Physiol Opt
. 1987;64:246–250.
[CrossRef] [PubMed]Cornea and Contact Lens Research Unit. C.C.L.R.U. Grading Scales. 1996; University of New South Wales Sydney, Australia.
Simpson TL, Pritchard N. Continuous grading scales of ocular redness, papillae and corneal staining. Optom Vis Sci. 1995;72(suppl 12)77.
Chen PCY, Kovalcheck SW, Zwiefach BW. Analysis of microvascular network in bulbar conjunctiva by image processing. Int J Microcirc Clin Exp
. 1987;6:245–255.
[PubMed]Proia AD, Chandler DB, Haynes WL, et al. Quantification of corneal neovascularisation using computerized image analysis. Lab Invest
. 1988;58:473–479.
[PubMed]Villumsen J, Ringquist J, Alm A. Image analysis of conjunctival hyperaemia. A PC based system. Acta Ophthalmol. 1991;69:536–539.
Conrad TJ, Chandler DB, Corless JM, et al. In vivo measurement of corneal angiogenesis with video data acquisition and computerized image analysis. Lab Invest
. 1994;70:426–434.
[PubMed]Willingham JF, Cohen KL, Coggins JM, et al. Automatic quantitative measurement of ocular hyperaemia. Curr Eye Res
. 1995;14:1101–1108.
[CrossRef] [PubMed]Owen CG, Fitzke FW, Woodward EG. A new computer assisted objective method for quantifying vascular changes of the bulbar conjunctivae. Ophthalmic Physiol Opt
. 1996;16:430–437.
[CrossRef] [PubMed]Guillon M, Shah D. Objective measurement of contact lens-induced conjunctival redness. Optom Vis Sci
. 1996;73:595–605.
[CrossRef] [PubMed]Cox I, Potvin R. Quantification of ocular bulbar conjunctival redness using computer-based image analysis. Optom Vis Sci
. 1996;73(suppl 12)231.
[CrossRef] [PubMed]Bailey IL, Bullimore MA, Raasch TW, et al. Clinical grading and the effects of scaling. Invest Ophthalmol Vis Sci
. 1991;32:422–432.
[PubMed]Bland JM, Altman DG. Statistical methods for assessing agreement between two methods of clinical measurement. Lancet. 1986;8476:307–310.
Barbeito R, Simpson TL. Should level of measurement considerations affect the choice of statistic?. Optom Vis Sci
. 1991;68:236–242.
[CrossRef] [PubMed]