June 2022
Volume 63, Issue 7
Open Access
ARVO Annual Meeting Abstract  |   June 2022
Natural-Language Diagnostic Report Generation by Multi-Modal AI for Macular Diseases
Author Affiliations & Notes
  • Xufeng Zhao
    Ophthalmology, Peking Union Medical College Hospital, Dongcheng-qu, Beijing, China
  • Chunshi Li
    Ophthalmology, Dalian No.3 People's Hospital, Dalian, Liaoning, China
  • Xingwang Gu
    Ophthalmology, Peking Union Medical College Hospital, Dongcheng-qu, Beijing, China
  • Jingyuan Yang
    Ophthalmology, Peking Union Medical College Hospital, Dongcheng-qu, Beijing, China
  • Bing Li
    Ophthalmology, Peking Union Medical College Hospital, Dongcheng-qu, Beijing, China
  • Yuelin Wang
    Ophthalmology, Peking Union Medical College Hospital, Dongcheng-qu, Beijing, China
  • Xirong Li
    Key Lab of DEKE, Renmin University of China, Beijing, China
  • jianchun zhao
    Vistel AI Lab, Visionary Intelligence Ltd, Beijing, China
  • Jie Wang
    Vistel AI Lab, Visionary Intelligence Ltd, Beijing, China
  • Youxin Chen
    Ophthalmology, Peking Union Medical College Hospital, Dongcheng-qu, Beijing, China
  • Weihong Yu
    Ophthalmology, Peking Union Medical College Hospital, Dongcheng-qu, Beijing, China
  • Footnotes
    Commercial Relationships   Xufeng Zhao None; Chunshi Li None; Xingwang Gu None; Jingyuan Yang None; Bing Li None; Yuelin Wang None; Xirong Li None; jianchun zhao None; Jie Wang None; Youxin Chen None; Weihong Yu None
  • Footnotes
    Support  None
Investigative Ophthalmology & Visual Science June 2022, Vol.63, 2992 – F0262. doi:
  • Views
  • Share
  • Tools
    • Alerts
      ×
      This feature is available to authenticated users only.
      Sign In or Create an Account ×
    • Get Citation

      Xufeng Zhao, Chunshi Li, Xingwang Gu, Jingyuan Yang, Bing Li, Yuelin Wang, Xirong Li, jianchun zhao, Jie Wang, Youxin Chen, Weihong Yu; Natural-Language Diagnostic Report Generation by Multi-Modal AI for Macular Diseases. Invest. Ophthalmol. Vis. Sci. 2022;63(7):2992 – F0262.

      Download citation file:


      © ARVO (1962-2015); The Authors (2016-present)

      ×
  • Supplements
Abstract

Purpose : To investigate a multi-modal approach to automatically generate natural-language diagnostic reports for macular diseases, by detecting varied lesions and diseases from color fundus photographs (CFP) and optical coherence tomography (OCT) images and accordingly synthesizing descriptions using natural-language processing (NLP) techniques.

Methods : Three pre-trained deep neural networks were used to detect variable retinal lesions from the CFP / OCT images and predict diagnosis from both modalities independently. A sanity check was performed to ensure the coherence between the detected lesions and the predicted diagnosis. Then a rule-based NLP algorithm descripted the lesions, reported diagnosis and recommended treatment (Fig. 1). A test set of 172 eyes from 127 subjects were successively acquired from July 2020 through September 2020 in our clinic, including 113 normal eyes and 59 abnormal eyes with epiretinal membrane (ERM), dry age-related macular degeneration (AMD), wet AMD and diabetic retinopathy (DR). Each eye consisted of one CFP and 12 radial OCT B-scans. Evaluation of the generated reports was conducted by comparing with the performance of two junior ophthalmologists. A questionnaire was designed and cooperatively judged by two retina specialists to quantitatively grade each report’s readability, correctness of diagnosis and recommendations (Fig. 1). All reports were anonymized to avoid potential bias. Sensitivity and specificity per class was also analyzed.

Results : AI-based NLP reports achieved higher grades in correctness of diagnosis (9.13 vs 9.03 points,) and recommendations (8.55 vs 8.50 points) compared to the junior ophthalmologists, but there was no statistically significance (P=0.43 and 0.85, respectively). For readability, both groups had a satisfactory performance (9.87 vs 9.88 points, P=0.87). Sensitivity of AI-reports and junior ophthalmologists was 0.74 (95% CI, 0.66 - 0.82) and 0.75 (95% CI, 0.67 - 0.83), respectively. Specificity of AI-reports and junior ophthalmologists was 0.94(95% CI, 0.91 - 0.98) and 0.97 (95% CI, 0.95 - 0.99), respectively (Fig. 2).

Conclusions : NLP algorithms-generated diagnostic reports for macular diseases based on multi-modal AI system can achieve similar performance as junior ophthalmologists, suggesting this emerging concept's potential in primary eye service.

This abstract was presented at the 2022 ARVO Annual Meeting, held in Denver, CO, May 1-4, 2022, and virtually.

 

Generation and evaluation for the diagnostic report.

Generation and evaluation for the diagnostic report.

 

Sensitivty and specificity of both groups.

Sensitivty and specificity of both groups.

×
×

This PDF is available to Subscribers Only

Sign in or purchase a subscription to access this content. ×

You must be signed into an individual account to use this feature.

×