Abstract
Purpose:
Even during steady fixation, people make small eye movements such as microsaccades, whose rate is altered by presentation of salient stimuli. Our goal was to develop a practical method for objectively and robustly estimating contrast sensitivity from microsaccade rates in a diverse population.
Methods:
Participants, recruited to cover a range of contrast sensitivities, were visually normal (n = 19), amblyopic (n = 10), or had cataract (n = 9). Monocular contrast sensitivity was estimated behaviorally while binocular eye movements were recorded during interleaved passive trials. A probabilistic inference approach was used to establish the likelihood of observed microsaccade rates given the presence or absence of a salient stimulus. Contrast sensitivity was estimated from a function fitted to the scaled log-likelihood ratio of the observed microsaccades in the presence or absence of a salient stimulus across a range of contrasts.
Results:
Microsaccade rate signature shapes were heterogeneous; nevertheless, estimates of contrast sensitivity could be obtained in all participants. Microsaccade-estimated contrast sensitivity was unbiased compared to behavioral estimates (1.2% mean), with which they were strongly correlated (Spearman's ρ 0.74, P < 0.001, median absolute difference 7.6%). Measurement precision of microsaccade-based contrast sensitivity estimates was worse than that of behavioral estimates, requiring more than 20 times as many presentations to equate precision.
Conclusions:
Microsaccade rate signatures are heterogeneous in shape when measured across populations with a broad range of contrast sensitivities. Contrast sensitivity can be robustly estimated from rate signatures by probabilistic inference, but more stimulus presentations are currently required to achieve similarly precise estimates to behavioral techniques.
Even while attempting to maintain steady fixation on a point target, humans make regular small, fixational eye movements. These fixational eye movements are typically made without knowledge and include drift, tremor, and microsaccades. Microsaccades are conjugate eye movements, typically of around 10 to 20 deg/s velocity and 25-millisecond duration. Our understanding of the function of microsaccades is incomplete; they appear to play a role in maintaining fixation on a target
1–3 while also negating perceptual fading due to neural adaptation during periods of steady fixation under laboratory conditions
4,5 (but see
6,7). The functions of fixational eye movements have been more extensively reviewed elsewhere.
8–11
The rate at which microsaccades are made is modulated by visual input. For example, brief presentation of a visible texture elicits a pattern of change in the microsaccade rate.
12,13 This pattern has been termed the “rate signature” and typically follows a sequence of post-stimulus inhibition, followed by a “rebound” increase in rate and a return to the baseline rate over a total period of approximately 500 milliseconds. Microsaccade rate signatures are elicited by any stimulus transient—even those that are not relevant to the task under investigation
14–16—and by motor preparation.
17 The latency and magnitude of both the inhibition and rebound phase are modulated by stimulus attributes such as contrast
13,15,18 and the expected frequency of occurrence.
19 As microsaccades are generated in the superior colliculus,
20–22 it is likely that this is the locus for the rate signature, although the exact mechanisms remain unclear. Recent evidence suggests that the rate signature can be modulated by cortical areas
23 and that the frontal eye fields may play a key role in the rebound phase.
24
Our group has previously shown in a small cohort of healthy observers that sensitivity to visual stimuli can be predicted from properties of the rate signature including the magnitude of the inhibition and rebound phases and the amplitude of the resulting fixational saccade.
13 Further, we have demonstrated the utility of a machine learning approach that can classify stimuli as seen or unseen, depending on the subsequently observed rate signature.
13 This method produced accurate and precise estimates of contrast detection thresholds in a small group of participants with normal vision; however, it has several limitations related to the downsampling of recorded information across subjects, trials, and time that was necessary due to the sparse distribution of microsaccade events in the data.
13 Downsampling of information across subjects relies on the assumption that rate signatures are similar in form across subjects. While this assumption held approximately true for the small population previously tested, it may not hold across wider clinical populations with much greater variation in age and visual and motor function. Downsampling across trials renders the method inflexible to variations in numbers of trials collected, limiting the use of the method outside the laboratory where competing priorities may influence the amount of time available to collect data. Finally, downsampling across time results in loss of potentially useful information for estimating contrast sensitivity from stimulus-induced changes in microsaccade rate.
The purpose of this study was to develop and test a tractable, practical method for estimating contrast sensitivity from microsaccade rate. Our intention was to develop a method that could be incorporated into conventional frameworks for psychophysical vision testing as well as produce a threshold estimate for a new observer with a microsaccade rate signature of unknown shape. We aimed for the method to be robust to a wide range of contrast sensitivities and to potential differences in the profile of the microsaccade rate signature; therefore, we measured rate signatures in a deliberately broad clinical population. Finally, we also aimed to estimate the time cost of using this approach compared to conventional behavioral measurements.
To measure a broad spectrum of contrast sensitivities due to optical or neural deficits, we recruited participants from three groups: older adults with cataract, adults with amblyopia, and younger adults (≤40) with normal or corrected-to-normal vision and no known ocular pathology. Our intention was not to compare factors among these groups but rather to recruit a diverse population on which to test the robustness of our methods. Therefore, we did not employ strict inclusion/exclusion criteria to each group. Participants were recruited from the staff and student population of the University of Nottingham and by advertisement in local media.
Volunteers with self-reported amblyopia or cataract first underwent a screening eye examination in which their condition was verified by means of self-reported history, objective and subjective refraction, visual acuity, slit-lamp biomicroscopy, and indirect ophthalmoscopy. Lens opacity was quantified using the LOCS III grading system.
25 Those with or suspected of having amblyopia also undertook a variety of binocular vision assessments to further characterize their condition, including TNO stereopsis testing, cover testing, and Bagolini striated lens testing to evaluate suppression or the presence of anomalous retinal correspondence. Amblyopia was classified as a difference in visual acuity of at least 0.20 logMAR between the two eyes with no apparent ocular pathology. Pelli-Robson contrast sensitivity was also measured for all participants.
One eye of each participant was selected for testing. For participants with amblyopia, the amblyopic eye was tested. For participants with cataract, we chose the eye with poorer visual acuity as long as the vision was still good enough to perform the experiment. For young participants with normal vision, we chose one eye at random.
Data were collected from each participant over 4 to 5 sessions, each a duration of approximately 1 to 1.5 hours. Appropriate refractive correction for the screen distance of 1.5 meters was provided to all participants as wide aperture lenses mounted in a trial frame. The screen distance was maintained by use of a chin and forehead rest. The untested eye was occluded by material that permitted transmission of infrared but not visible light (Optolite Infrared Acrylic; Instrument Plastics, Maidenhead, UK) to permit binocular infrared eye tracking.
During the first session, we initially measured contrast sensitivity functions to Gabor stimuli (random-phase sinusoidal gratings within Gaussian spatial envelope with standard deviation 2.3°) of a range of spatial frequencies (0.5–12 cycles/deg). Gabors were presented for one frame at 85 Hz refresh rate on a gamma corrected CRT display (NEC MultiSync FP2141SB, 1024 × 768 spatial resolution, mean luminance 59 cd/m
2; NEC Display Solutions, Tokyo, Japan) via a graphics card that enabled 14 bit luminance resolution (Bits#; Cambridge Research Systems, Kent, UK), using custom software written in PsychoPy
26,27 (version 1.83.01). On each trial, the Gabor was randomly oriented at either 45° or 135°, and the observer indicated the orientation by key press (3 or 1 on a numeric key pad, respectively), making their best guess if unsure (two alternative forced choice). Gabor contrast was adjusted on each trial by a 3-down-1-up staircase, changing contrast in logarithmic steps and terminating after 60 trials. The mean of the last 8 reversals was taken as the contrast detection threshold for the particular spatial frequency. Contrast was defined in Michelson units as:
\(\def\upalpha{\unicode[Times]{x3B1}}\)\(\def\upbeta{\unicode[Times]{x3B2}}\)\(\def\upgamma{\unicode[Times]{x3B3}}\)\(\def\updelta{\unicode[Times]{x3B4}}\)\(\def\upvarepsilon{\unicode[Times]{x3B5}}\)\(\def\upzeta{\unicode[Times]{x3B6}}\)\(\def\upeta{\unicode[Times]{x3B7}}\)\(\def\uptheta{\unicode[Times]{x3B8}}\)\(\def\upiota{\unicode[Times]{x3B9}}\)\(\def\upkappa{\unicode[Times]{x3BA}}\)\(\def\uplambda{\unicode[Times]{x3BB}}\)\(\def\upmu{\unicode[Times]{x3BC}}\)\(\def\upnu{\unicode[Times]{x3BD}}\)\(\def\upxi{\unicode[Times]{x3BE}}\)\(\def\upomicron{\unicode[Times]{x3BF}}\)\(\def\uppi{\unicode[Times]{x3C0}}\)\(\def\uprho{\unicode[Times]{x3C1}}\)\(\def\upsigma{\unicode[Times]{x3C3}}\)\(\def\uptau{\unicode[Times]{x3C4}}\)\(\def\upupsilon{\unicode[Times]{x3C5}}\)\(\def\upphi{\unicode[Times]{x3C6}}\)\(\def\upchi{\unicode[Times]{x3C7}}\)\(\def\uppsy{\unicode[Times]{x3C8}}\)\(\def\upomega{\unicode[Times]{x3C9}}\)\(\def\bialpha{\boldsymbol{\alpha}}\)\(\def\bibeta{\boldsymbol{\beta}}\)\(\def\bigamma{\boldsymbol{\gamma}}\)\(\def\bidelta{\boldsymbol{\delta}}\)\(\def\bivarepsilon{\boldsymbol{\varepsilon}}\)\(\def\bizeta{\boldsymbol{\zeta}}\)\(\def\bieta{\boldsymbol{\eta}}\)\(\def\bitheta{\boldsymbol{\theta}}\)\(\def\biiota{\boldsymbol{\iota}}\)\(\def\bikappa{\boldsymbol{\kappa}}\)\(\def\bilambda{\boldsymbol{\lambda}}\)\(\def\bimu{\boldsymbol{\mu}}\)\(\def\binu{\boldsymbol{\nu}}\)\(\def\bixi{\boldsymbol{\xi}}\)\(\def\biomicron{\boldsymbol{\micron}}\)\(\def\bipi{\boldsymbol{\pi}}\)\(\def\birho{\boldsymbol{\rho}}\)\(\def\bisigma{\boldsymbol{\sigma}}\)\(\def\bitau{\boldsymbol{\tau}}\)\(\def\biupsilon{\boldsymbol{\upsilon}}\)\(\def\biphi{\boldsymbol{\phi}}\)\(\def\bichi{\boldsymbol{\chi}}\)\(\def\bipsy{\boldsymbol{\psy}}\)\(\def\biomega{\boldsymbol{\omega}}\)\(\def\bupalpha{\unicode[Times]{x1D6C2}}\)\(\def\bupbeta{\unicode[Times]{x1D6C3}}\)\(\def\bupgamma{\unicode[Times]{x1D6C4}}\)\(\def\bupdelta{\unicode[Times]{x1D6C5}}\)\(\def\bupepsilon{\unicode[Times]{x1D6C6}}\)\(\def\bupvarepsilon{\unicode[Times]{x1D6DC}}\)\(\def\bupzeta{\unicode[Times]{x1D6C7}}\)\(\def\bupeta{\unicode[Times]{x1D6C8}}\)\(\def\buptheta{\unicode[Times]{x1D6C9}}\)\(\def\bupiota{\unicode[Times]{x1D6CA}}\)\(\def\bupkappa{\unicode[Times]{x1D6CB}}\)\(\def\buplambda{\unicode[Times]{x1D6CC}}\)\(\def\bupmu{\unicode[Times]{x1D6CD}}\)\(\def\bupnu{\unicode[Times]{x1D6CE}}\)\(\def\bupxi{\unicode[Times]{x1D6CF}}\)\(\def\bupomicron{\unicode[Times]{x1D6D0}}\)\(\def\buppi{\unicode[Times]{x1D6D1}}\)\(\def\buprho{\unicode[Times]{x1D6D2}}\)\(\def\bupsigma{\unicode[Times]{x1D6D4}}\)\(\def\buptau{\unicode[Times]{x1D6D5}}\)\(\def\bupupsilon{\unicode[Times]{x1D6D6}}\)\(\def\bupphi{\unicode[Times]{x1D6D7}}\)\(\def\bupchi{\unicode[Times]{x1D6D8}}\)\(\def\buppsy{\unicode[Times]{x1D6D9}}\)\(\def\bupomega{\unicode[Times]{x1D6DA}}\)\(\def\bupvartheta{\unicode[Times]{x1D6DD}}\)\(\def\bGamma{\bf{\Gamma}}\)\(\def\bDelta{\bf{\Delta}}\)\(\def\bTheta{\bf{\Theta}}\)\(\def\bLambda{\bf{\Lambda}}\)\(\def\bXi{\bf{\Xi}}\)\(\def\bPi{\bf{\Pi}}\)\(\def\bSigma{\bf{\Sigma}}\)\(\def\bUpsilon{\bf{\Upsilon}}\)\(\def\bPhi{\bf{\Phi}}\)\(\def\bPsi{\bf{\Psi}}\)\(\def\bOmega{\bf{\Omega}}\)\(\def\iGamma{\unicode[Times]{x1D6E4}}\)\(\def\iDelta{\unicode[Times]{x1D6E5}}\)\(\def\iTheta{\unicode[Times]{x1D6E9}}\)\(\def\iLambda{\unicode[Times]{x1D6EC}}\)\(\def\iXi{\unicode[Times]{x1D6EF}}\)\(\def\iPi{\unicode[Times]{x1D6F1}}\)\(\def\iSigma{\unicode[Times]{x1D6F4}}\)\(\def\iUpsilon{\unicode[Times]{x1D6F6}}\)\(\def\iPhi{\unicode[Times]{x1D6F7}}\)\(\def\iPsi{\unicode[Times]{x1D6F9}}\)\(\def\iOmega{\unicode[Times]{x1D6FA}}\)\(\def\biGamma{\unicode[Times]{x1D71E}}\)\(\def\biDelta{\unicode[Times]{x1D71F}}\)\(\def\biTheta{\unicode[Times]{x1D723}}\)\(\def\biLambda{\unicode[Times]{x1D726}}\)\(\def\biXi{\unicode[Times]{x1D729}}\)\(\def\biPi{\unicode[Times]{x1D72B}}\)\(\def\biSigma{\unicode[Times]{x1D72E}}\)\(\def\biUpsilon{\unicode[Times]{x1D730}}\)\(\def\biPhi{\unicode[Times]{x1D731}}\)\(\def\biPsi{\unicode[Times]{x1D733}}\)\(\def\biOmega{\unicode[Times]{x1D734}}\)\begin{equation}\tag{1}C = {{{L_{max}} - {L_{min}}} \over {{L_{max}} + {L_{min}}}}\end{equation}
where L
max and L
min represent the maximum and minimum luminance of the stimulus, respectively.
From the measured contrast sensitivity function, we then chose the highest spatial frequency that the individual participant could reliably detect at 75% contrast for further testing in the main procedure. During this procedure, the participants' eye movements were recorded binocularly at 500 Hz using a calibrated infrared eye tracking device (Eyelink 1000; SR Research Ltd, Ottawa, ON, Canada). Participants were instructed to fixate a central white dot (0.1° diameter) and avoid blinking where possible, while Gabor stimuli randomly oriented at either 45° or 135° appeared for one frame at jittered intervals between 1000 and 1400 milliseconds. The contrast of each Gabor was presented according to a method of constant stimuli procedure with 7 levels. Five levels were spaced around the previously measured detection threshold in 0.1 or 0.2 log unit steps depending on the participant; one level was of zero contrast, and the remaining level was of a high contrast (>50%) where task performance was expected to be maximal. Participants were instructed not to respond to these stimuli but to simply continue fixating on the central dot (passive trials). Interleaved at random intervals among the passive presentations were response trials in which the same stimulus was accompanied by an audible tone. Participants indicated the orientation of the Gabor in each response trial by key press (3 or 1 on a numeric keypad for 45° or 135°, respectively), as in the initial phase. Breaks were taken after every three trials to allow participants to blink freely before continuing. This was done to reduce tear film breakup and maintain participants' comfort. Longer rests were taken after approximately 5 to 8 minutes. Each session typically comprised 10 blocks, yielding a total of 400 passive trials and 80 response trials per contrast level.
Estimates of contrast sensitivity from microsaccades were compared to the reference standard of those obtained behaviorally from the interleaved response trials. Overall accuracy and precision was calculated as the mean difference and standard deviation difference from the behavioral estimate (including all trials) respectively across all participants.
For both methods, estimates of accuracy and precision were made with varied numbers of presentations per contrast level up to the total number collected (80 for the behavioral method, 400 for the microsaccade method); presentations were randomly sampled with replacement from the dataset, and this was repeated 200 times to derive bootstrap 95% confidence intervals on the estimates.
To assess the potential utility of the microsaccade method under conditions that produce approximately similar measurement precision to behavioral methods in common use, we considered relationships between contrast sensitivity estimates from microsaccade and behavioral responses with unequal presentation numbers. Specifically, relationships between contrast sensitivity estimates from microsaccades with 400 presentations per contrast level and behavioral responses with 20 presentations per contrast level were measured by Spearman correlation and Bland-Altman analysis.
31 Relationships between both microsaccade and behaviorally measured contrast detection thresholds and Pelli-Robson contrast sensitivity were also investigated by Spearman correlation. All analyses were conducted in R, version 3.2.0.
29
Microsaccade rate signatures elicited by visual stimuli are modulated by properties of the stimulus such as contrast.
13,15,18 Previous studies have demonstrated the potential utility of involuntary microsaccades as an objective marker of visual sensitivity, although these studies were limited to young, visually normal participants with a narrow range of contrast thresholds.
13,18 It follows that there may be a future use of such oculometric measures in objectively estimating visual sensitivity in clinical populations.
In the present study, we deliberately recruited participants with cataract and amblyopia, as well as young visually healthy participants, to test our methods on a clinically diverse population with varied ages and visual functions. We found considerable heterogeneity in microsaccade rate signatures across our study population, including in the timing of the individual phases, and indeed whether each distinct phase occurs at all (
Fig. 2). These between-individual differences in microsaccade signature bore no apparent relationship in our sample with contrast sensitivity, clinical condition (normal, amblyopia, or cataract), or age.
Given the apparent heterogeneity of rate signature shapes in the wider population observed herein, it is important that methods that estimate aspects of visual or nonvisual function from microsaccade rates are robust to differences in rate signature. In this study, we have demonstrated a tractable, probabilistic inference-based approach to estimating contrast sensitivity from the microsaccade rate signature. The method accommodates the heterogeneity of signatures we report and so works for any individual with a measurable microsaccade rate signature that exhibits a graded response to contrast, as was the case for all participants in this study.
Correlation between contrast detection thresholds estimated using gold-standard behavioral methods and from microsaccades was strong (
Fig. 3). Mean difference between estimates from the two methods was small (1%), indicating that there is very little bias in the estimates from microsaccades relative to behavioral estimates. Though median absolute differences between the two measures were 7.6% (indicating that for 50% of participants, differences between the two estimates were smaller than this), Bland-Altman analysis showed 95% limits of agreement spanning approximately 20% either side of the mean. This indicates that while the two measurements of contrast sensitivity were closely correlated, in some cases they did not exhibit strong agreement, leading to the wide 95% limits. This limited agreement between the two modalities is likely to be multifactorial; the limited precision of both measures is likely to be one factor, potentially leading to large differences between their estimates in some cases. Another source of discordance may be the differences in nonvisual processing
24 between converting a detected visual stimulus to a change in involuntary microsaccade behavior and making a sensory decision and voluntary motor output as required for a key press.
Precision of threshold estimates from microsaccades was worse than that of the behavioral method, and equating precision between the methods required microsaccade rate signatures to be measured across many more stimulus presentations per contrast level than would be made by an equivalently precise behavioral technique. However, the total time taken to measure contrast sensitivity using our method could be drastically reduced by approaches that reduce the amount of time required for each presentation. In the present study, we have observed microsaccade rate signatures following the presentation of single stimuli. Future work could be directed toward understanding how rate signatures triggered by multiple, sequentially presented stimuli combined. If the combined signatures can be separated, it would enable the presentation of stimulus “trains” in which multiple stimuli are presented in rapid sequence, and the microsaccade rate signature to all stimuli combined is then separated into responses to individual stimuli. Such an approach may allow multiple presentations to be made per second, either at the same contrast or at varied contrasts, greatly reducing test duration, but this remains a target for further work.
An advantage of our method is that it may be incorporated into conventional psychophysical frameworks. In this study, we took the simple approach of the method of constant stimuli that can be used with our method without adaptation. Adaptive methods such as staircase procedures or Bayesian techniques may also incorporate our method by repeatedly presenting the same stimulus in order to build up a microsaccade rate signature which can then be classified as “seen” or “not seen” depending on whether the scaled log-likelihood ratio between
P(observed microsaccades | maxrate) and
P(observed microsaccades | baserate) exceeds 0.5 (see
Fig. 1D). The procedure can then continue to present further stimuli in the usual way. In this format of testing, it is likely that the number of stimulus presentations contributing to one “seen/unseen” response will need to be at least 20 to approach the precision of a short behavioral method of constant stimuli procedure. It should be noted, however, that current clinical psychophysical procedures, such as automated perimetry, commonly use highly abbreviated thresholding procedures
32 that result in considerably lower precision than even a short method of constant stimuli procedure, as investigated herein.
33,34 Fewer stimulus presentations per “response” may be required for the microsaccade method to deliver sufficiently precise threshold estimates for clinical purposes that are currently adequately served by such methods.
Even given the poor precision of the threshold estimates produced by the current approach compared to behavioral methods, there remain advantages over any behavioral method that requires subjects to respond. Such methods may be unable to return an estimate of any kind in clinical populations where responses cannot be readily measured, such as young children and those with cognitive disabilities. In these populations, microsaccade-based measures may be able to elicit clinically useful estimates of visual sensitivity, providing an alternative or complement to electrophysiological tests. Of course, further development of our methods would be needed before they could be tested in such populations. For example, since fixation stability is likely to be poor in such populations, it may be necessary to develop gaze-contingent stimulus display methods that are compatible with measurement of microsaccade rate signatures. Future efforts to improve the clinical utility of microsaccade-based measures of visual sensitivity should also aim to improve precision and therefore reduce the time taken to produce clinically useful measurements. Further development of the algorithms used to detect microsaccades could help in this respect, as could better understanding of the causes of heterogeneity in rate signature shape and how these can be exploited.
In conclusion, the shape of microsaccade rate signatures following presentation of brief, transient stimuli varies considerably among individuals, though the graded response to contrast is ubiquitous across these variations in shape. We have demonstrated a simple, tractable method for estimating contrast sensitivity from microsaccade rate following presentation of visual stimuli. The method appears robust to variation in contrast sensitivity and to heterogeneity in the shape and features of the microsaccade rate signature. It is flexible to different numbers of stimulus presentations and can be incorporated into conventional psychophysical frameworks, albeit currently requiring considerably longer test time than laboratory behavioral methods to achieve similarly precise results. This method, therefore, holds potential for further development toward future clinical use, though the immediate focus of development needs to be on improving the efficiency of data collection to ensure clinical viability.
This work was supported by the Wellcome Trust [grant number WT097387]; the Medical Research Council [grant number MC_PC_15033]; and Fight for Sight [grant number 5059/5060].
Disclosure: J. Denniss, None; C. Scholes, None; P.V. McGraw, None; S-H. Nam, None; N.W. Roach, None