Investigative Ophthalmology & Visual Science Cover Image for Volume 65, Issue 7
June 2024
Volume 65, Issue 7
Open Access
ARVO Annual Meeting Abstract  |   June 2024
Frisén Grade Categorization in Papilledema using Vision Transformers
Author Affiliations & Notes
  • Asala Erekat
    Neurology, Icahn School of Medicine at Mount Sinai, New York, New York, United States
  • Brian Woods
    University of Galway College of Science and Engineering, Galway, Ireland
    Galway University Hospitals, Galway, Galway, Ireland
  • Joseph Branco
    Department of Ophthalmology, Westchester Medical Center, Valhalla, New York, United States
  • Jui-Kai Wang
    Department of Ophthalmology and Visual Sciences, Center for the Prevention and Treatment of Visual Loss, University of Iowa Hospitals and Clinics, Iowa City, Iowa, United States
  • Mona K Garvin
    Electrical and Computer Engineering, Center for the Prevention and Treatment of Visual Loss, University of Iowa Hospitals and Clinics, Iowa City, Iowa, United States
  • Randy H Kardon
    Department of Ophthalmology and Visual Sciences, Center for the Prevention and Treatment of Visual Loss, University of Iowa Hospitals and Clinics, Iowa City, Iowa, United States
  • David Szanto
    Neurology, Icahn School of Medicine at Mount Sinai, New York, New York, United States
  • Ciaran Eising
    Electronic and Computer Engineering, University of Limerick, Limerick, Ireland
  • Louis R Pasquale
    Department of Ophthalmology, Icahn School of Medicine at Mount Sinai, New York, New York, United States
  • Mark J Kupersmith
    Departments of Neurology, Ophthalmology and Neurosurgery, Icahn School of Medicine at Mount Sinai, New York, New York, United States
  • Footnotes
    Commercial Relationships   Asala Erekat None; Brian Woods None; Joseph Branco None; Jui-Kai Wang None; Mona Garvin University of Iowa, Code P (Patent); Randy Kardon None; David Szanto None; Ciaran Eising None; Louis Pasquale Twenty Twenty Inc, Skye Biosciences, Eyenovia, Code C (Consultant/Contractor); Mark Kupersmith None
  • Footnotes
    Support  New York Eye and Ear Infirmary Foundation
Investigative Ophthalmology & Visual Science June 2024, Vol.65, 1596. doi:
  • Views
  • Share
  • Tools
    • Alerts
      ×
      This feature is available to authenticated users only.
      Sign In or Create an Account ×
    • Get Citation

      Asala Erekat, Brian Woods, Joseph Branco, Jui-Kai Wang, Mona K Garvin, Randy H Kardon, David Szanto, Ciaran Eising, Louis R Pasquale, Mark J Kupersmith; Frisén Grade Categorization in Papilledema using Vision Transformers. Invest. Ophthalmol. Vis. Sci. 2024;65(7):1596.

      Download citation file:


      © ARVO (1962-2015); The Authors (2016-present)

      ×
  • Supplements
Abstract

Purpose : In previous work, we applied DenseNet architecture for classifying papilledema severity. This model correctly estimated swelling severity within one Frisén grade in 71.8% of cases. The present study explores the potential of Vision Transformers (ViT) as a novel methodology to discern papilledema from normal ocular fundus images and to classify the stages of papilledema severity more accurately. We hypothesize that ViT could improve the accuracy of automated Frisén grade classification for papilledema.

Methods : We adapted a pre-trained ViT model, RETFound, initially trained on a corpus of 1.6 million retinal images, for papilledema classification. Our dataset encompassed 5,908 fundus photographs from 165 patients in the Idiopathic Intracranial Hypertension Treatment Trial. After removing backgrounds and resizing images to 256x256 pixels, the dataset was split into 80% for training and the remainder for testing, with a portion of the training set (20%) earmarked for validation. The RETFound model was restructured to include a new head, enabling binary and six-class classification tasks.

Results : In binary classification, the ViT model exhibited an average accuracy of 99.76% and an F1-score of 99.87%. When applied on multi-class classification for Frisén grades, the model attained an average accuracy of 93.1% and an F1-score of 81.40%. Notably, the classification of grade 5 was less accurate when compared to the other grades, with only 66.67% of true grade 5 cases being correctly identified; the remaining were misclassified as grade 4, a variance ascribed to the limited representation of grade 5 within the dataset.

Conclusions : The RetFound ViT model has substantiated its capability to accurately discern and classify papilledema in fundus photography, achieving significant improvement when compared to the DenseNet. Furthermore, the ViT model has consistently delivered considerable accuracy throughout various papilledema grades. Future directions include applying this model to regression tasks for continuous severity grading, incorporating activation maps to identify defining features for each Frisén grade, and incorporating this output into larger fusion models, thus enhancing the model's diagnostic interpretability and clinical utility.

This abstract was presented at the 2024 ARVO Annual Meeting, held in Seattle, WA, May 5-9, 2024.

 

Figure 2 - Confusion Matrix for Frisén Grade Classification

Figure 2 - Confusion Matrix for Frisén Grade Classification

 

Figure 1 - Confusion Matrix for Binary Classification

Figure 1 - Confusion Matrix for Binary Classification

×
×

This PDF is available to Subscribers Only

Sign in or purchase a subscription to access this content. ×

You must be signed into an individual account to use this feature.

×