The peptide assignments based on mass and retention time were confirmed by sequencing the peptides with MS/MS
(Figs. 2 3) . In MS/MS a precursor ion with a specific charge to mass ratio is allowed to enter a chamber where it interacts with argon gas. Collision-induced fragmentation of the peptides results in cleavage of amide bonds and the formation of N termini containing b-ions and C termini containing y-ions. Labeling of the b- and y-ions was performed according to nomenclature established by Biemann.
43 The MS/MS mass spectra were sequenced by hand, and submitted to the GPM Website to obtain an objective confidence measure of the identity of a particular peptide
(Tables 1 2) . The log GPM e-value reported for each peptide in
Tables 1 and 2is a measure of the probability that a spectrum matches the theoretical value by chance (e.g., a log e-value of −2 means that in 1 of 100 experiments, one would expect to see a particular score on the basis of chance).
44 Many peptides scored with high e-values and some peptides that were manually identified from Cx44 did not receive log e-values below the threshold of −1. In general, most automated peptide identifying algorithms have some difficulty identifying spectra when the precursor ion is in the +1 charge state characteristic of MALDI. These algorithms do much better with multiply charged precursor ions commonly produced in electrospray.
45 The reasons GPM failed to identify some of the manually identified Cx44 peptides are not entirely clear and may be particular to each peptide
(Table 1) . One example is the case of the high log e-value for one of the peptides with multiple modifications (peptide 228-242). This peptide was present with many different modifications, so only a small amount of the unmodified peptide was present. In its unmodified form, the peptide consequently produced a low signal resulting in a relatively low-quality MS/MS spectrum. On the other hand, the reason for the high log e-value for the 110-115 peptide is probably its small size. The explanation for the high log e-value obtained for the 110-132 peptide is less certain, but it may be because the GPM does not take into account internal cleavages, and this peptide contains several proline and aspartic acid residues that make peptides prone to such events. It is also interesting in this context to note that the two other peptides with the same C terminus (106-132 and 108-132) also had relatively high log e-values (−2.7 and −1.9, respectively), although the scores reached significance in both cases. Conversely, the three peptides that end at residue 129 (i.e., peptides lacking DDR from their C termini relative to the peptides with C termini at 132) all scored with much lower log e-values. These examples point to some of the challenges of determining the probability that a spectrum is matched to the correct peptide, and indicate that the expectation values GPM reports in some cases underestimate the true probability that a spectrum is correctly matched to a peptide.
Tandem MS was also performed on peaks that could not be assigned to proteins. Many of these turned out to be due to missed cleavages in crystallin proteins. However, several truncated peptides from Cx44 and Cx49 were also identified. Overall, we were able to obtain coverage of 30% of the complete Cx44 sequence and 46% coverage of the cytoplasmic domains of the protein. For Cx49 we were able to obtain coverage of 47% of the complete protein, and 71% coverage of the cytoplasmic domains. In total, we identified 14 Cx44 peptides, 5 of which included posttranslational modifications
(Table 1) , and 32 Cx49 peptides, 12 of which were posttranslationally modified
(Table 2) .