Investigative Ophthalmology & Visual Science Cover Image for Volume 65, Issue 7
June 2024
Volume 65, Issue 7
Open Access
ARVO Annual Meeting Abstract  |   June 2024
A natural language processing approach to identify patients with uveitic macular edema in the IRIS® Registry.
Author Affiliations & Notes
  • Peng Jin
    Verana Health, California, United States
  • Marie Humbert-Droz
    Verana Health, California, United States
  • Kristian Garcia
    Verana Health, California, United States
  • Helene Fevrier
    Verana Health, California, United States
  • Durga Borkar
    Verana Health, California, United States
  • Abhishek Nair
    Bausch & Lomb Americas Inc., New Jersey, United States
  • David Harrison
    Bausch & Lomb Americas Inc., New Jersey, United States
  • Zhongdi Chu
    Verana Health, California, United States
  • Footnotes
    Commercial Relationships   Peng Jin Verana Health, Code E (Employment); Marie Humbert-Droz Verana Health, Code E (Employment); Kristian Garcia Verana Health, Code E (Employment); Helene Fevrier Verana Health, Code E (Employment); Durga Borkar Verana Health, Code C (Consultant/Contractor), AbbVie, Code C (Consultant/Contractor), Apellis, Code C (Consultant/Contractor), Glaukos, Code C (Consultant/Contractor), Genentech, Code C (Consultant/Contractor), Iveric Bio, Code C (Consultant/Contractor); Abhishek Nair Bausch & Lomb Americas Inc., Code E (Employment), Bausch & Lomb Americas Inc., Code I (Personal Financial Interest); David Harrison Bausch & Lomb Americas Inc., Code E (Employment), Bausch & Lomb Americas Inc., Code I (Personal Financial Interest); Zhongdi Chu Verana Health, Code E (Employment)
  • Footnotes
    Support  None
Investigative Ophthalmology & Visual Science June 2024, Vol.65, 2428. doi:
  • Views
  • Share
  • Tools
    • Alerts
      ×
      This feature is available to authenticated users only.
      Sign In or Create an Account ×
    • Get Citation

      Peng Jin, Marie Humbert-Droz, Kristian Garcia, Helene Fevrier, Durga Borkar, Abhishek Nair, David Harrison, Zhongdi Chu; A natural language processing approach to identify patients with uveitic macular edema in the IRIS® Registry.. Invest. Ophthalmol. Vis. Sci. 2024;65(7):2428.

      Download citation file:


      © ARVO (1962-2015); The Authors (2016-present)

      ×
  • Supplements
Abstract

Purpose : To develop a natural language processing (NLP) algorithm to identify patients with active uveitic macular edema (UME) from electronic health records (EHR) data in the American Academy of Ophthalmology IRIS® Registry (Intelligent Research in Sight).

Methods : In order to identify patients with active, non-infectious UME, fellowship-trained retina specialists defined a combination of ICD-10 codes for macular edema (ME) and non-infectious uveitis in structured data and a list of UME keywords for non-historical ME in association with non-infectious uveitis in unstructured data. A heuristic NLP algorithm was then developed to identify patients with an active UME diagnosis at a given encounter based on the unstructured data definition using a SpaCy PhraseMatcher. Using IRIS Registry data from January 1st, 2016 to August 16th, 2023, notes from 500 randomly selected patients with UME keywords in their clinical records were labeled to determine their UME status: active UME or no/unknown active UME. This labeled dataset was split 7:3 for algorithm development and validation; the final algorithm was evaluated on the validation set using accuracy, sensitivity and specificity. Finally, the proposed NLP algorithm was used to identify patients with active UME in the IRIS Registry. The number of UME patients identified by the proposed NLP algorithm was compared to the number identified based on the ICD-10 codes alone.

Results : The algorithm achieved an accuracy, sensitivity and specificity of 0.83, 0.95 and 0.73, respectively, using the validation set. Out of 231,543 patients with UME keywords in their clinical records, 129,316 patients were confirmed with active UME at the encounter level by the proposed NLP algorithm. Alternatively, 40,277 patients were identified as having active UME diagnosis using the ICD-10 codes alone.

Conclusions : The proposed heuristic NLP algorithm demonstrated satisfactory performance in identifying patients with active UME in the IRIS Registry. UME patients are difficult to identify in real-world clinical research settings using structured data alone. This algorithm identified three times more patients with active UME compared to only using ICD-10 codes, providing an enhanced solution to conducting real-world evidence studies in the UME patient population.

This abstract was presented at the 2024 ARVO Annual Meeting, held in Seattle, WA, May 5-9, 2024.

×
×

This PDF is available to Subscribers Only

Sign in or purchase a subscription to access this content. ×

You must be signed into an individual account to use this feature.

×