A grayscale heat map (
Fig. 1D, first image) was generated using gridwise GCL thicknesses in microns (
Supplementary Fig. S1) to provide a range of pixel values between 0 and 255. Various scaling proportions were tested, with no difference found in the final classification results. Clustering of data requires appropriate “feature selection” to be conducted, as excessive numbers of classes do not aid classification if they are not separable and lead to unnecessary burden of the model.
26 As a consequence, clustering is significantly affected by the applied strategy and statistical criteria. Therefore, we have adopted a clustering paradigm that has been well established in previous studies.
27,28,30,34,35,40 Specifically, the data were analyzed with unsupervised classification using ISODATA clustering for each decade subgroup (PCI Geomatica, Markham, ON, Canada) (
Fig. 1D, second image) generating clusters of locations within the macula with similar change in GCL thickness with age (theme classes). ISODATA clustering is a specific form of K-means clustering (a migrating means methods) and aids with feature selection by automated splitting of high variance classes and merging classes with low separability.
41 Unlike traditional K-means methods, this algorithm is not bound by a predefined number of classes, but allows the class numbers to be reduced or increased appropriately within a given range. The separability of identified theme classes was statistically verified using transformed divergence (D
T).
26 D
T value ranges from 0 to 2, with 0 referring to inseparable clusters and 2 indicating complete separation. A value of >1.9 corresponds to a probability of correct classification of >98%
42 and is commonly accepted as the cutoff for statistically significant separability for clustering studies,
27,28,30,32–35,40 as well as recommended by the manufacturer of the software.
43 Following classification and confirmation of separability, each distinct theme class was assigned a color for visualization in pseudocolor plot (
Fig. 1D, third image). In the initial model, a single peripheral point was clustered together with the central four points corresponding with the fovea, which is inconsistent with an a priori assumption that the foveal pit is anatomically distinct from the rest of the retina.
44 Thus, in developing the final model, the central four locations corresponding to the fovea were masked and the data were reanalyzed, resulting in the reassignment of the aforementioned peripheral point with no other change in the classification. The ISODATA algorithm and statistical analysis provided the maximum number of statistically separable classes. After identifying the highest number of statistically unique theme classes using ISODATA clustering (
N = 8 theme classes), the K-means algorithm was used to restrict the number of classes stepwise down to the lowest separable number (
N = 5) to further explore the effect of total number of theme classes on the provided model.