The performance of the system is analyzed with 33 single-visit FA sequences using the leave-one-out cross-validation strategy. Each sequence is tested with the AdaBoost classifier trained using the other 32 sequences. The number of weak classifiers involved is eight. It is hard to directly assess the correctness of the severity map, since the groundtruth cannot be easily generated. Instead, based on the understating that an accurate severity map leads to good segmentation, we measure the performance of the complete system in terms of over- and undersegmentation of CNV, comparing it against the manual segmentation, serving as the ground truth. Let
Rg and
Ra be the results of manual and automatic segmentation of image
I, respectively. Each pixel is classified as one of the following: TP =
Rg ∩
Ra, FP =
Ra − (
Rg ∩
Ra), TN = I − (
Rg ∪
Ra), and FN =
Rg− (
Rg ∩
Ra), where TP stands for true positive (CNV region correctly labeled); FP for false positive (background region incorrectly labeled); TN for true negative (background correctly labeled); and FN for false negative (CNV region incorrectly labeled). Over- and undersegmentations are measured respectively as:
OS is the fraction of the segmented CNV area which is the normal tissue, whereas US is the fraction of manually segmented CNV (ground truth) mistaken as the normal tissue by our system. The 2-D space defined by OS and US is a unit square
S, where the ideal segmentation result is the point of origin in
S, and the Euclidean norm of the 2-D space offers a measure of closeness to an ideal segmentation result. The accuracy of the system is defined as
The result is in the range of [0, 1].
Table 1 shows the experimental results of all 33 sequences. The average accuracy is 83.26%.
Figures 3a and
3b are the result of a sequence with an accuracy of 94.37%, which comes from 1.42% of oversegmentation and 7.83% of undersegmentation. There are four sequences with the accuracy below 70%, three of which were severely undersegmented and one of which was oversegmented. For the cases of undersegmentation, there is usually an early hypofluorescent ring along the border that is shown to be a fibrin deposit and is defined as part of classic CNV
16 by the retina specialist. As shown in
Figure 3c, our system does not recognize the early hypofluorescent ring around the CNV, resulting in a lower accuracy of 66.81%, with no oversegmentation and 46.94% undersegmentation. This sequence was the one with the lowest accuracy in our validation set. The impact of the fibrin deposit was particularly profound on cases with small CNV lesions.