June 2021
Volume 62, Issue 8
Open Access
ARVO Annual Meeting Abstract  |   June 2021
Investigating the impact of saliency maps on clinician’s confidence in model predictions
Author Affiliations & Notes
  • Sara Beqiri
    University College London Division of Medicine, London, United Kingdom
  • Abdallah Abbas
    University College London Division of Medicine, London, United Kingdom
  • Edward Korot
    Byers Eye Institute, Stanford University, Stanford, California, United States
    Moorfields Eye Hospital NHS Foundation Trust, London, London, United Kingdom
  • Siegfried Wagner
    Moorfields Eye Hospital NHS Foundation Trust, London, London, United Kingdom
  • Robbert Struyven
    University College London, London, London, United Kingdom
  • Madeline Kelly
    University College London, London, London, United Kingdom
  • Ritvij Singh
    Imperial College London Faculty of Medicine, London, London, United Kingdom
  • Daniel Ferraz
    Moorfields Eye Hospital NHS Foundation Trust, London, London, United Kingdom
  • Mariana Batista Goncalves
    Moorfields Eye Hospital NHS Foundation Trust, London, London, United Kingdom
  • Josef Christian Huemer
    Moorfields Eye Hospital NHS Foundation Trust, London, London, United Kingdom
  • Hagar Khalid
    Moorfields Eye Hospital NHS Foundation Trust, London, London, United Kingdom
  • Laxmi Raja
    Moorfields Eye Hospital NHS Foundation Trust, London, London, United Kingdom
  • Pearse A Keane
    Moorfields Eye Hospital NHS Foundation Trust, London, London, United Kingdom
  • Footnotes
    Commercial Relationships   Sara Beqiri, None; Abdallah Abbas, None; Edward Korot, None; Siegfried Wagner, None; Robbert Struyven, None; Madeline Kelly, None; Ritvij Singh, None; Daniel Ferraz, None; Mariana Goncalves, None; Josef Huemer, None; Hagar Khalid, None; Laxmi Raja, None; Pearse Keane, None
  • Footnotes
    Support  None
Investigative Ophthalmology & Visual Science June 2021, Vol.62, 2297. doi:
  • Views
  • Share
  • Tools
    • Alerts
      ×
      This feature is available to authenticated users only.
      Sign In or Create an Account ×
    • Get Citation

      Sara Beqiri, Abdallah Abbas, Edward Korot, Siegfried Wagner, Robbert Struyven, Madeline Kelly, Ritvij Singh, Daniel Ferraz, Mariana Batista Goncalves, Josef Christian Huemer, Hagar Khalid, Laxmi Raja, Pearse A Keane; Investigating the impact of saliency maps on clinician’s confidence in model predictions. Invest. Ophthalmol. Vis. Sci. 2021;62(8):2297.

      Download citation file:


      © ARVO (1962-2015); The Authors (2016-present)

      ×
  • Supplements
Abstract

Purpose : Saliency maps have gained popularity in medical artificial intelligence (AI) studies due to their ability to highlight significant regions for model prediction, allowing greater interpretability of models and raising the prospects of their integration in clinical practice. This study explores whether saliency maps increase clinician confidence in diagnosis via an AI supported workflow.

Methods : 60,133 retinal fundus photography images and labels were gathered from public datasets. These were used to train a classification model on the Google AutoML platform to detect referable and non-referable diabetic retinopathy. Saliency maps were produced using the XRAI algorithm supported by the same platform, providing region-based attribution, suitable for fundus images.
A survey was sent to 6 participants, including 4 ophthalmologists with fellowship-level subspecialty training in retinal disease, and two ophthalmology trainees. They were shown 50 randomly selected images, each repeated 3 times. The image was first presented with the predicted class, secondly by class and Softmax score, thirdly by including a heatmap. After each assistance level, the readers rated their confidence in the prediction via a 5-point Likert scale.

Results : Paired tests showed a significant difference in confidence rating between all assistance types, with saliency maps performing the poorest (3.12±1.28, P<0.001). On a second analysis, we separated predictions into correct and incorrect groups. The unpaired test results show that heatmaps accompanying incorrect labels were associated with a lower confidence than correct ones (incorrect: 2.14±1.15, correct: 3.23±1.25, P<0.001). Similarly, we considered the impact of referable and non-referable cases separately. Results showed no significant difference in confidence from heatmaps between the categories (P=0.097).

Conclusions : Our results show that XRAI heatmaps are associated with lower confidence in model predictions compared to the other two strategies. We emphasize the importance of considering their impact specifically for incorrect predictions and non-referable DR. For the prior, saliency could cause confirmation bias, whereas for the latter, it may mislead if there are no positive findings. Whilst saliency may be of at least some value in model interpretability, for its implementation as diagnostic assistance, higher standards of localisation and meaningful rationale are required.

This is a 2021 ARVO Annual Meeting abstract.

×
×

This PDF is available to Subscribers Only

Sign in or purchase a subscription to access this content. ×

You must be signed into an individual account to use this feature.

×