The complex numbers representing the amplitudes and phases of the 2-Hz first harmonic of the evoked response were coherently averaged over all trials for each stimulus condition for each observer. Coherent averaging uses amplitude and phase information. Group averages were computed in a similar way. For example, the average sine and cosine coefficients were calculated across observers before magnitude was calculated. For the plots of response magnitude
(Fig. 2) , we computed an error measure by pooling the errors on the sine and cosine coefficients in quadrature, as has previously been done for single-observer averages.
15 This measure assumes that the sine and cosine coefficients are uncorrelated. However, correlations occur between these measurements in the case of cross-observer averages because of the presence of individual differences in overall response amplitude. For significance testing, therefore, we used multivariate analysis of variance (MANOVA), which takes these correlations into account. MANOVA also correctly models the correlation structure of our repeated-measures design. When we plot data in the complex plane, we show two-dimensional standard errors
(Fig. 3) . These error bounds are often elliptical because of correlations between real and imaginary values that arise from individual differences in absolute amplitude. Note that these errors are the same as those that would be computed for a between-subjects design and that they do not reflect the
within-observer errors used to assess significance when the MANOVA was used to compare responses across conditions measured within a given group. In all the statistical evaluations, we used the last bin of the sweep to test for effects of stimulus condition because this bin generally had the largest response across conditions and observer group.