Microarray data first were assessed using Expression Console (Affymetrix, Inc) provided by Affymetrix to confirm quality control performance and establish the presence or absence of each probe set. All eight arrays underwent strict quality control standards as reported previously.
26,27 All were of good quality. We focused on the 31,468 (57%) non-control probe sets that were detected on at least two of the eight arrays. Principal component analysis (PCA) showed a clear separation between normal and strabismic cohorts, with 96% of overall variation explained by the first component and 2% of overall variation by the second component. Data were cleansed to exclude highly outlying replicate expression values. Specifically, any set of replicates having a coefficient of variation greater than 0.5 and a replicate value with a SD (across the four replicate expression values) of greater than 1 was excluded. (The maximum possible SD for four measures was 1.5; thus, if one measure was near 1.5, this indicated that the fourth replicate was at its maximum outlying capacity and was removed.) Only 2530 (1%) of the data values were excluded during this step. We found that this cleansing process allows us to identify notable outliers within a replicate set.
26,28 Expression values were averaged across replicates.
Table 2 presents expression ratios of strabismic/normal and the log-transformed expression ratios, using the log base 2 transform.
P values were calculated by unpaired Student's
t-test, assuming equal variance for each probe set.
P values were adjusted using a multiple testing correction (false discovery rate),
29 and significance thresholds were set at ≥3-fold change, either up or down, with an adjusted
P value of ≤ 0.05. WebGestalt Gene Kit Analysis ToolKit (from the public domain at
http://bioinfo.vanderbilt.edu/webgestalt, Vanderbilt University),
30 and Database for Annotation, Visualization and Integrated Discovery analysis (DAVID, version 6.7, from the public domain at
http://david.abcc.ncifcrf.gov/home.jsp)
31 were used to annotate and characterize genes that showed a ≥3-fold change (either up or down) between strabismic and normal groups. PCR array data were analyzed using web-based analysis and a downloadable template provided by SABiosciences (available in the public domain at
www.sabiosciences.com/pcrarraydataanalysis.php). We chose housekeeping genes for PCR array by analyzing all 89 genes on the array using geNorm analysis (from the public domain at
http://medgen.ugent.be/∼jvdesomp/genorm/). Genes deemed most stable over three normal and three strabismic groups were Calpain2 (
CAPN2) and Utrophin (
UTRN). Expression data for 87 genes (89 on array minus two designated housekeeping genes) obtained from each PCR array first were normalized to the expression of the two housekeeping genes (the geometric average Ct value of
CAPN2 and
UTRN combined), and then compared to other arrays. We combined the results from all arrays creating one normal (
n = 4) and one strabismic (
n = 4) group, which were compared for gene expression. The qPCR data were analyzed similarly. Gene expression was first normalized to housekeeping gene expression, then the difference of expression between strabismic and normal was calculated using the same equations as on the SABiosciences template. To validate the accuracy of the expression measures determined by the microarrays, expression values were compared to those obtained by qPCR for 10 genes. The qPCR and microarray data ratios for these 10 pairs showed a statistically significant correlation of 0.833,
P < 0.005.