The video-based head tracker described in this article enables compensation for horizontal, vertical, and torsional in-plane head movements during pupil/iris-tracking videooculography with our tilting haploscope. Two adhesive dots affixed near the medial canthi were used to monitor head motion on the bite plate, which were tracked using the ordinary feature detection method already applied for pupil tracking. As a result, the strictly fixed position of the head was no longer a major requirement, allowing more comfort to the subject investigated. Furthermore, this technique minimized the need for calibration to just once before the experiment to obtain individual transformation information later required for conversion from two-dimensional video-oculographic image space to three-dimensional eye coordinate space (Fick coordinates).
Various other methods have been proposed to compensate for head-versus-camera movement,
9 10 11 12 13 but all require additional hardware or have resolution deficiencies. For example, though commercial video-oculography-based systems are available that account for head movements (Applied Science Laboratories, SR Research, EyeLink, Chronos Vision), they rely on additional sensors attached to the subject’s head and track sensor motion. Not only can these distract the subject, the mounting of the sensors is critical. Tracking errors will be introduced if proper positioning is not achieved. Ronsse et al.
9 suggested tracking three markers affixed to the subject’s head to determine head position. To measure the position of these points on the head, another camera must be used. Other approaches, such as the those proposed by Yoo et al.
10 and Zhu et al.,
11 involve tracking of corneal light reflexes in addition to the pupil to eliminate the influence of head movement. However, the relatively small pixel size of the corneal light reflex and, hence, the low feature resolution, may cause inexact detection of the features and thereby decrease tracking accuracy.
10 Image-based methods have been reported to estimate head position using distinctive facial features, such as eye and mouth corners.
12 13 These methods rely heavily on complex algorithms, and the whole face must be visible to extract the features, potentially decreasing the resolution of eye tracking because of the reduced pixel size of the eye’s image with respect to the whole image frame.
13
Our head-tracking solution does not require additional hardware or complex algorithms, and the subject is relieved of any additional equipment attached to the head. The main advantage of our dot-based head-tracking technique is that it provides a simple means for head movement compensation during pupil/iris-tracking video-oculographic recordings and can be incorporated into existing video-oculographic systems without loss of eye-tracking resolution.
Our validation experiments showed that head-tracker measurements and model head position were highly correlated (
R = 1.00) for all three degrees of freedom in the frontal plane. These results for accuracy and precision of the head tracker are consistent with those previously reported for eye tracking.
1
Assessment of performance of the head tracker with human subjects confirmed its ability to reduce erroneous eye movements. The remaining horizontal and vertical eye displacements seen in
Figure 4Brepresent the expected physiologic eye movement required to maintain fixation on the central target when tilting against the bite plate. The residual torsion represents the torsion in the same direction as the head tilt that is characteristically seen when an attempt is made to tilt the head rapidly, as was the case here.
14
Some limitations of our method should be discussed. First, even though the medial canthus has been reported as the most stable feature in the face and is relatively insensitive to facial expressions,
8 we are aware that facial gestures can still cause the dots to move (tracked as head motion from the head-tracker perspective). However, our algorithm accounts for false head movement caused by facial expressions by monitoring the inter-dot distance. In a separate experiment, we looked at the induced movements of the medial canthi with facial gestures, such as brow lifting, brow lowering, frown, and forced eye closing. All these gestures caused a change in the distance between right and left dots, so that whenever the head tracker detected a change greater than 1% in the immutable individual inter-dot distance, this motion was identified as a facial gesture, and the accompanying segment of recorded data was removed from further analysis.
Second, head movements that brought dots outside the field of view of each camera could not be compensated. The largest horizontal and vertical translations we could compensate for were 15.99 mm and 17.98 mm, respectively. Head tilts of up to 32.52° could be compensated for. The presence of the bite plate, however, restricted large head movements, and both cameras were adjusted for each subject before the experiment to ensure that the pupils and dots were centered within the viewing field of each webcam.
Third, out-of-plane head movements were not accurately compensated. To be complete, head motion would have to be described with 6 df (3 df translational, 3 df rotational). However, the purpose of our head tracker is to compensate for residual movement on the bite plate and forehead rest, which more effectively impede head motion in the remaining 3 df out of plane. The most head movement that could be expected from our specific haploscopic experimental design occurred while the apparatus and upper body of the subject were tilted 45° to the left or right, producing predominately false horizontal eye movement. Assessment of head-tracker performance during such a haploscopic tilting experiment showed that our method was capable of compensating for head-versus-camera motion that occurred during the tilting process. For the range of expected head motion on the bite plate encountered in an experiment, the simplification of considering only in-plane head motion appears reasonable.
In conclusion, tracking black dots placed near the inner canthi is a simple but effective method of compensating for horizontal, vertical, and torsional in-plane head movements during pupil/iris-based video-oculography. It will allow future investigations with our tilting haploscope in less cooperative subjects, including children, than has been possible. Analyzing patients in addition to healthy subjects, especially patients with congenital and acquired forms of superior oblique paresis, is of major importance in further exploring our hypothesis that basic cyclovertical deviations can masquerade as congenital superior oblique paresis.
3 Our goals are to better understand the mechanisms underlying cyclovertical strabismus and to understand the mechanisms of strabismus in general.