DNA samples from 2231 individuals were sent for genotyping at the Center for Inherited Disease Research (CIDR). Samples were first genotyped using 385 STRP markers. The results of our linkage analysis of quantitative spherical equivalent using these markers has been published previously.
27 Given that SNP markers have been shown to provide increased information content and thereby greater power to detect linkage, all remaining available samples were regenotyped (Linkage Panel I; Illumina, San Diego, CA). Genotype data on 6008 SNPs were generated on 2170 samples with a minimum call rate per sample of 96%. Markers were dropped if they had either poorly defined (
n = 148) or atypical (
n = 10) clusters, leaving 5850 high-quality SNPs, of which 5525 were located on the autosomes. All SNPs had a call rate >96%. For the SNP markers, we then used Haploview 4.0 to identify linkage disequilibrium (LD) blocks, as LD can cause false-positive linkage signals (
http://www.broad.mit.edu/mpg/haploview/ The Broad Institute, Massachusetts Institute of Technology, Cambridge, MA). A total of 525 singletons and 43 trios from 431 families were used for LD calculations. SNP markers were excluded if they had (1) Hardy-Weinberg Equilibrium
P < 0.001, (2) Mendelian errors >1 per marker, or (3) call frequency <0.98. Blocks were defined based on solid spine of LD (i.e., the spine was extended if D′ > 0.8). Within an LD block, the SNP with the highest minor allele frequency was retained. After quality control of the marker data, we merged the SNP and STRP marker sets based on the Marshfield genetic map (research.marshfieldclinic.org/genetics/ Marshfield Clinic, Marshfield, WI). The Marshfield genetic position of the 385 microsatellite markers was obtained from the UCSC Genome Browser (
http://genome.ucsc.edu/; provided by the University of California at Santa Cruz) and UniSTS database (
http://www.ncbi.nlm.nih.gov/unists/ National Center for Biotechnology Information [NCBI], Bethesda, MD; and the website of Mammalian Genotyping Service at Marshfield, sponsored by the National Heart Lung blood institute (
http://www.marshfieldclinig.org/mgs/). We could not obtain a precise genetic position for marker
D2S1780, and thus this marker was dropped. To obtain a common genetic map for SNPs and STRPS we used 7756 microsatellite markers with known Marshfield genetic position from the USCS website to interpolate the genetic position for each SNP by using the physical position. Relationship errors were identified using PREST
34 and RELPAL (S.A.G.E., ver. 6.0) and residual Mendelian errors using MARKERINFO (S.A.G.E., ver. 6.0
29 ), and SIBPAIR.
35 We obtained clear evidence of misspecified relationships in 20 pedigrees, and these errors were resolved by removing the problematic individuals or re-assigning the relationship. Our final dataset consisted of 4892 SNP markers and 384 STRP markers on the 22 autosomes.