Gene structure is generally well conserved among IF proteins,
particularly within the segment of gene that encodes for the rod
domain.
24 25 26 27 28 The number and location of introns is
strongly conserved within an IF class, mirroring the subgrouping of IF
proteins that is achieved through primary sequence analysis. Thus, IF
type can be determined by gene structure as well as by primary sequence
similarity.
29 Figure 2 shows the nucleotide sequence at the intron–exon boundaries for
introns that we have identified in the trout CP49 gene, along with the
approximate size of the introns, estimated by PCR-agarose gel
electrophoresis. We show that introns C, E, F, and G, which are common
to type I, II, and III IF proteins, are present in trout CP49. These
introns are identical in location and in the phase of the triplet codon
that they interrupt, with those shown for human CP49, and for type I
cytokeratins in general. Intron H, which is generally present in type
I, II, and III IF genes, and in the human CP49, is absent from trout.
Notably, intron H is also absent from the chicken CP49
gene,
30 as well as the type I cytokeratin
K19 gene.
31
Confirmation of intron B in trout genomic DNA was problematic. In mouse
and human CP49 this intron is in excess of 23 kb and is resistant to
PCR amplification. Several PCR primer sets that flanked the predicted
site of intron B worked well on trout lens cDNA but failed to produce a
product when trout genomic DNA was used as a template. This failure is
indirect evidence that is consistent with the presence of intron B in
the trout CP49 gene.
We identified an intron in the tail domain of the trout CP49 gene as
well (located in
Fig. 1 at nucleotide 1215).