Purchase this article with an account.
Aaron S Coyner, Ryan Swan, James M Brown, Jayashree Kalpathy-Cramer, Sang Jin Kim, J. Peter Campbell, Karyn Jonas, R.V. Paul Chan, Susan Ostmo, Michael F Chiang; Deep Learning for Image Quality Assessment of Fundus Images in Retinopathy of Prematurity. Invest. Ophthalmol. Vis. Sci. 2018;59(9):2762.
Download citation file:
© ARVO (1962-2015); The Authors (2016-present)
Accurate image-based ophthalmic diagnosis relies on clarity of fundus images. This has important implications for ophthalmic diagnosis, and for emerging methods such as telemedicine and computer-based image analysis. The purpose of this study was to implement a convolutional neural network (CNN) for automatically assessing the quality of fundus images in retinopathy of prematurity (ROP).
5,204 wide-angle fundus images were collected from preterm infants during routine ROP screening examinations. Images were assessed for quality via ability to diagnose ROP accurately, and labeled “acceptable” or “not acceptable.” Acceptable images included only those that were deemed acceptable for ROP diagnosis by the majority of image graders.2,550 images were used for training, 1,000 for validation, and 1,624 for testing. Additionally, 30 images of varying quality were selected for review by six individual expert graders who performed pairwise comparisons between images to rank them from worst to best quality. An overall expert consensus rank was developed from individual rankings.A CNN was trained using dropout and early stopping to avoid overfitting. For the set of 30 images, rank was established using the predicted probabilities of each image belonging to the acceptable image class.
CNN training was halted when validation accuracy for identifying acceptable vs. not acceptable images reached 95%. Test set accuracy was 94.5%, with area under the receiver operating curve (AUROC) equal to 0.972, and area under the precision-recall curve (AUPR) equal to 0.975.Individual expert ranks were highly correlated with one another (correlation coefficient [CC] 0.89-0.94) and with the consensus rank (CC 0.94-0.98). The CNN image rank correlated with individual expert ranks (CC 0.75- 0.82) and the consensus rank (CC 0.77).
A CNN can accurately distinguish acceptable vs. not acceptable quality retinal fundus photos acquired from preterm infants during routine ROP examinations. The CNN ranks images similarly to expert graders, which suggests that similar image features may be used for quality assessment. Future directions will explore the features used for quality assessment by the CNN.
This is an abstract that was submitted for the 2018 ARVO Annual Meeting, held in Honolulu, Hawaii, April 29 - May 3, 2018.
This PDF is available to Subscribers Only