Investigative Ophthalmology & Visual Science Cover Image for Volume 65, Issue 7
June 2024
Volume 65, Issue 7
Open Access
ARVO Annual Meeting Abstract  |   June 2024
Can a novel comparative AI for diabetic retinopathy based on pairwise comparisons make uncertainty more predictable and explainable?
Author Affiliations & Notes
  • Simon P Harding
    Eye and Vision Science, University of Liverpool, Liverpool, United Kingdom
    St. Paul's Eye Unit, Royal Liverpool University Hospital, Liverpool, United Kingdom
  • David Wong
    Eye and Vision Science, University of Liverpool, Liverpool, United Kingdom
  • Mark Johnson
    School of Health Sciences, The University of Manchester, Manchester, United Kingdom
  • Elizabeth Maitland
    School of Management, University of Liverpool, Liverpool, United Kingdom
  • Wenyue Zhu
    Novartis Pharmaceuticals UK Ltd, London, United Kingdom
    Eye and Vision Science, University of Liverpool, Liverpool, United Kingdom
  • Dongxu Gao
    School of Computing, University of Portsmouth, Portsmouth, Hampshire, United Kingdom
  • Philip Burgess
    Eye and Vision Science, University of Liverpool, Liverpool, United Kingdom
    St. Paul's Eye Unit, Royal Liverpool University Hospital, Liverpool, United Kingdom
  • Gabriela Czanner
    Applied Mathematics, Liverpool John Moores University, Liverpool, United Kingdom
  • Yalin Zheng
    Eye and Vision Science, University of Liverpool, Liverpool, United Kingdom
  • Footnotes
    Commercial Relationships   Simon Harding AI-Sight Ltd, Code O (Owner), AI-Sight Ltd, Code P (Patent); David Wong AI-Sight Ltd, Code O (Owner), AI-Sight Ltd, Code P (Patent); Mark Johnson AI-Sight Ltd, Code O (Owner), AI-Sight Ltd, Code P (Patent); Elizabeth Maitland AI-Sight Ltd, Code I (Personal Financial Interest); Wenyue Zhu AI-Sight Ltd, Code O (Owner), AI-Sight Ltd, Code P (Patent); Dongxu Gao AI-Sight Ltd, Code O (Owner), AI-Sight Ltd, Code P (Patent); Philip Burgess None; Gabriela Czanner AI-Sight Ltd, Code O (Owner), AI-Sight Ltd, Code P (Patent); Yalin Zheng AI-Sight Ltd, Code O (Owner), AI-Sight Ltd, Code P (Patent)
  • Footnotes
    Support  EPSRC UK EP/R014094/1
Investigative Ophthalmology & Visual Science June 2024, Vol.65, 3751. doi:
  • Views
  • Share
  • Tools
    • Alerts
      ×
      This feature is available to authenticated users only.
      Sign In or Create an Account ×
    • Get Citation

      Simon P Harding, David Wong, Mark Johnson, Elizabeth Maitland, Wenyue Zhu, Dongxu Gao, Philip Burgess, Gabriela Czanner, Yalin Zheng; Can a novel comparative AI for diabetic retinopathy based on pairwise comparisons make uncertainty more predictable and explainable?. Invest. Ophthalmol. Vis. Sci. 2024;65(7):3751.

      Download citation file:


      © ARVO (1962-2015); The Authors (2016-present)

      ×
  • Supplements
Abstract

Purpose : Addressing uncertainty and explainability is crucial for ensuring the reliability and trustworthiness of AI systems and for widespread application in clinical practice. We investigated a novel approach, embedding pairwise comparison and ranking of images within an AI environment and applicable to images in diseases with a classification scale.

Methods : We developed and tested an approach, which we term “comparative AI” inspired by the concept of “adaptive comparative judgment. AI models were initially trained using publicly available datasets (Messidor2, APTOS, Kaggle, IDRID) using pairwise comparison to judge which of two images had more severe retinopathy. The system was refined by incorporating Bayesian statistics. The AI was then applied to the DDR dataset by subdividing into validation and testing subsets. DDR images are ranked based on the likelihood of an image being worse/better than specified boundaries. The likelihood provides explainability to the system. We analysed the total uncertainty (both aleatoric and epistemic) by correlating grading errors with ranking and investigated the effect of uncertainty on the AI’s performance.

Results : The test database (1199 images) consisted of 66% no retinopathy, 16% mild, 13% moderate, 3% severe and 3% proliferative. The plot shows the distribution (number and ordinal location) of errors with significant clustering around each of 3 boundaries and normal distributions with truncation. The sensitivity, specificity, positive and negative predictive values were: 0.84,0.89, 64%,96% for mild v moderate; 0.94, 0.89,35%,99.6% for moderate v severe; 0.97, 0.92, 26%,99.9% for severe v proliferative. Other test sets with different prevalence showed similar distributions.

Conclusions : Using comparisons, AI errors are more likely when two images are close together in severity (similar to humans as described by the Weber and Fechner law). Clustering occurs around the boundary, rendering it predictable. We believe the concentration around a classification boundary is an emergent property of comparison-based AI, enabling easy identification and explanations for the AI’s zones of uncertainty. The significance is that such algorithms can be safe (high NPV) and efficient (high PPV) even with modest sensitivity and specificity and highlight cases requiring human judgement.

This abstract was presented at the 2024 ARVO Annual Meeting, held in Seattle, WA, May 5-9, 2024.

 

×
×

This PDF is available to Subscribers Only

Sign in or purchase a subscription to access this content. ×

You must be signed into an individual account to use this feature.

×