Investigative Ophthalmology & Visual Science Cover Image for Volume 65, Issue 7
June 2024
Volume 65, Issue 7
Open Access
ARVO Annual Meeting Abstract  |   June 2024
Towards an Optimal (Self)-Supervised Learning Paradigm for Diabetic Retinopathy Detection
Author Affiliations & Notes
  • Christopher Clark
    Ophthalmology, University of Colorado Anschutz Medical Campus School of Medicine, Aurora, Colorado, United States
  • Scott Kinder
    Ophthalmology, University of Colorado Anschutz Medical Campus School of Medicine, Aurora, Colorado, United States
  • Jayashree Kalpathy-Cramer
    Ophthalmology, University of Colorado Anschutz Medical Campus School of Medicine, Aurora, Colorado, United States
  • Steve McNamara
    Ophthalmology, University of Colorado Anschutz Medical Campus School of Medicine, Aurora, Colorado, United States
  • Benjamin Bearce
    Ophthalmology, University of Colorado Anschutz Medical Campus School of Medicine, Aurora, Colorado, United States
  • Praveer Singh
    Ophthalmology, University of Colorado Anschutz Medical Campus School of Medicine, Aurora, Colorado, United States
  • Advaith Veturi
    Ophthalmology, University of Colorado Anschutz Medical Campus School of Medicine, Aurora, Colorado, United States
  • Footnotes
    Commercial Relationships   Christopher Clark None; Scott Kinder None; Jayashree Kalpathy-Cramer None; Steve McNamara None; Benjamin Bearce None; Praveer Singh None; Advaith Veturi None
  • Footnotes
    Support  None
Investigative Ophthalmology & Visual Science June 2024, Vol.65, 5658. doi:
  • Views
  • Share
  • Tools
    • Alerts
      ×
      This feature is available to authenticated users only.
      Sign In or Create an Account ×
    • Get Citation

      Christopher Clark, Scott Kinder, Jayashree Kalpathy-Cramer, Steve McNamara, Benjamin Bearce, Praveer Singh, Advaith Veturi; Towards an Optimal (Self)-Supervised Learning Paradigm for Diabetic Retinopathy Detection. Invest. Ophthalmol. Vis. Sci. 2024;65(7):5658.

      Download citation file:


      © ARVO (1962-2015); The Authors (2016-present)

      ×
  • Supplements
Abstract

Purpose : Diabetic Retinopathy (DR) is the leading cause of new cases of blindness in the US with early symptoms going completely unnoticed in some patients. Currently, there is no evaluation study exploring the most optimal strategy for training an Artificial Intelligence (AI) based DR classification model. To remedy this, in this study we compare existing Supervised Learning (SL) approaches with various Self-Supervised Learning (SSL) techniques, for different architecture backbones, pretraining strategies as well as generative vs contrastive SSL methodologies.

Methods : We started with a dataset of 35,126 fundus images acquired on different CFP imaging platforms with varying image qualities. For each patient, both right and left eye images were labelled 0 to 4, from “No DR” to “Proliferative DR”. We explored both SL and SSL techniques for two different overarching architectures, Convolutional Neural Networks (CNN) and Vision Transformers (ViT). SSL training consisted of a ResNet50 backbone architecture and self-DIstillation with NO labels (DINOv1) for contrastive SSL and a Vision Transformer Masked Auto-Encoder (ViTMAE) for generative SSL. Furthermore, SSL and SL models pre-trained on ImageNet (IN) were explored with the SSL fine-tuned a second time with domain-specific (DS) data. Each was then trained for downstream classification.

Results : Our four comparisons: A) Between ViT and ResNet50 SL only, we find statistically insignificant differences between the from-scratch versions but statistically significant results with IN fine-tuned ViT outperforming IN fine-tuned ResNet50. B) Between performing SSL with ImageNet only and IN plus DS data, we find statistically significant better performance of additional DS SSL for DINOv1, but statistically insignificant differences for ViTMAE. C) Between ViTMAE and DINOv1 SSL, we find statistically insignificant differences between their best versions, SSL with IN and DS data for DINOv1 and IN only for ViTMAE. D) Between SSL (from C) and SL, we find a very strong preference for SSL for either backbone.

Conclusions : Our results show the efficacy of fine-tuning on a large dataset for both SSL and SL overall. It also shows that SSL pipelines yield better results than SL alone, though the addition of domain-specific data needs to be explored more. These results suggest that SSL can help overcome some of the shortcomings with traditional SL approaches in the medical domain.

This abstract was presented at the 2024 ARVO Annual Meeting, held in Seattle, WA, May 5-9, 2024.

 

 

×
×

This PDF is available to Subscribers Only

Sign in or purchase a subscription to access this content. ×

You must be signed into an individual account to use this feature.

×