Although this relatively small number of genera predominated, sequences from over 800 genera were identified. For conjunctiva, on average, 245 different genera were identified per sample using 16S rDNA gene deep sequencing (range, 17–579). This finding raises the strong possibility that some of these are artefactual. Production of DNA-free reagents is challenging, and false-positive amplifications are common.
26,27 Quantitative PCR of the bacterial load in the conjunctiva showed that there is approximately 1 bacterium for every 20 human conjunctival epithelial cells collected on conjunctival swab. Based on this, we estimate there should be approximately 38 bacteria detected per conjunctival sample given that one diploid human genome weighs approximately 6.6 × 10
−12 g and 5 × 10
−9 g of DNA was processed for each 16S PCR sample. A reasonable assumption is that the majority of the genera identified are contaminants during sample processing. To assign a confidence measure to determined differences, we compared the bacteria recovered from conjunctival swabs with those of the environmental control. To determine those bacteria likely to show meaningful enrichment in conjunctiva, we first limited our analysis to those bacteria whose total 16S counts constituted 0.1% or greater of the conjunctival microbiome. This analysis reduced the number of genera from 887 to 42. We next employed a novel heuristic to rank these bacteria for ‘confidence'. We reasoned that confidence in a particular result would be a function of the magnitude of fold difference in the observed bacteria between conjunctiva and environmental samples, and the
P value for the difference between these sites (adjusted for multiple comparisons, see Methods for details of calculation of
P values). Because of the large number of samples analyzed,
P values for significant differences included 17 that were smaller than 1 × 10
−10. The confidence heuristic was calculated as log
10(fold difference/adjusted
P value), where the fold difference was mean conjunctival count/mean environmental count (
Table 2). Four of six highest confidence genera were
Corynebacteria, Propionibacteria, Staphylococcus, and
Streptococcus, which were all detected frequently in cultures and had confidence heuristics of greater than 10. However,
Pseudomonas and
Elizabethkingia, which conjointly accounted for 29% of all reads in conjunctiva, had confidence levels of 2 or less, a difference of over 10 orders of magnitude from these other bacterial genera. Examination of the distribution of confidence measures across the 42 genera (
Fig. 3) reveals a remarkably broad distribution, with 22 genera having confidence heuristics less than 5, while 17 genera have confidence heuristics at least 5 logs higher (i.e., ≥10). This latter set includes a number of genera detected by culture in some samples (e.g.,
Lactobacillus and
Haemophilus)
, as well as bacteria not detected by culture including
Anaerococcus, Parascardovia, Peptoniphilus, Actinomyces, Prevotella, Dietzia, Finegoldia, Dermacoccus, Fusobacterium, Porphyromonas, and
Mycobacterium. However, none of these species accounted for more than 0.8% of total conjunctival reads. Thus, we find the core cluster microbiome identified by deep DNA sequencing consists of the same four predominant species as detected by culture:
Corynebacteria, Propionibacteria, Staphylococcus, and
Streptococcus, likely with sporadic or very low amounts of several other species.