For eight of our samples, no control tissue was available (these comprised six lenses and two capsulorhexis specimens from eight individual donors; see
Table 1). Sequence data obtained from these samples were, therefore, examined with VarScan2 in unpaired mode (
Table 3). In the eight unpaired samples, 123 variants with frequencies of less than 25% were identified. This list was refined further by selecting variants covered to a depth of greater than 300 and supported by both forward and reverse strands. Of the 34 variants that met those conditions, 23 (67.5%) were detected in multiple samples (
Table 3). Notably, 21 of the 23 variants were located in
CYP family genes (
CYP2D6 and
CYP2A6). Moreover, the variants at positions 41354661 and 41354606 were observed at low frequency in 35 of the 39 tissue samples sequenced in this study. It is unlikely that the same point substitution would occur independently in multiple samples, implying that the variants detected in the
CYP family genes reflected sequencing or alignment errors. Manual inspection of the seven repetitive variants (
Table 3) revealed that the paired end reads did not map to a single gene, but rather to two members of the
CYP gene family located on the same chromosome. These imprecise alignments, a consequence of the high-sequence homology between
CYP genes, were a source of false positives, and variants associated with these genes were therefore discarded. Similar observations were obtained for the unique variants (see
Table 3) in
CYP2D6 (sample N26) and
CYP2A6 (sample N931). From the 34 variants identified initially, only nine (from samples N29, N793, and N934;
Table 3) passed all quality control filters, with frequencies ranging from 1.02% to 3.94%. Of these, four were nonsynonymous missense substitutions and a fifth, in
ALK, represented a splice site donor mutation.