March 2012
Volume 53, Issue 14
Free
ARVO Annual Meeting Abstract  |   March 2012
The Importance of Rating Scale Design in the Measurement of Visual Disability using Questionnaires Or Item Banks
Author Affiliations & Notes
  • Jyoti Khadka
    NH&MRC Centre for Clinical Eye Research, Discipline of Optometry and Vision Science, Flinders University, Adelaide, Australia
  • Colm McAlinden
    NH&MRC Centre for Clinical Eye Research, Discipline of Optometry and Vision Science, Flinders University, Adelaide, Australia
  • Vijaya K. Gothwal
    Meera and L B Deshpande Centre for Sight Enhancement, L V Prasad Eye Institute, Hyderabad, India
  • Ecosse L. Lamoureux
    Ophthalmology, University of Melbourne, Melbourne, Australia
  • Konrad Pesudovs
    NH&MRC Centre for Clinical Eye Research, Discipline of Optometry and Vision Science, Flinders University, Adelaide, Australia
  • Footnotes
    Commercial Relationships  Jyoti Khadka, None; Colm McAlinden, None; Vijaya K. Gothwal, None; Ecosse L. Lamoureux, None; Konrad Pesudovs, None
  • Footnotes
    Support  None
Investigative Ophthalmology & Visual Science March 2012, Vol.53, 5443. doi:
  • Views
  • Share
  • Tools
    • Alerts
      ×
      This feature is available to authenticated users only.
      Sign In or Create an Account ×
    • Get Citation

      Jyoti Khadka, Colm McAlinden, Vijaya K. Gothwal, Ecosse L. Lamoureux, Konrad Pesudovs; The Importance of Rating Scale Design in the Measurement of Visual Disability using Questionnaires Or Item Banks. Invest. Ophthalmol. Vis. Sci. 2012;53(14):5443.

      Download citation file:


      © ARVO (1962-2015); The Authors (2016-present)

      ×
  • Supplements
Abstract

Purpose: : To investigate whether differences in rating scale design (question format and response categories), for items with the same content, influences item calibration. Further, we aim to investigate whether rating scale differences lead to an overall difference in visual disability score measured by different patient reported outcome (PRO) instruments.

Methods: : Sixteen existing PROs suitable for cataract assessment, and with different rating scales, were self-administered by patients on a cataract surgery waiting list. Two hundred and twenty-six items measuring visual disability in their native rating scale format were selected to develop a visual disability item bank. Items were calibrated on an interval level scale in logits using Rasch analysis. Fifteen item content areas (e.g. reading newspapers, driving at night) appearing in at least 3 different PROs were identified. Within each content area, item calibrations were compared and their range calculated. Similarly, 5 PROs [Visual Disability Assessment (VDA); National Eye Institute Visual Function Questionnaire (NEIVFQ); Activities of Daily Vision Scale (ADVS); Technology of Patient Experience (TyPE); and Cataract Symptom Scale (CatScale)] having at least 3 items in common with the Visual Function (VF-14) were identified. Using these common items, average item measures of these 5 PROs were compared with the reference PRO (VF-14).

Results: : A total of 624 patients (mean age ± SD, 74·1 years ±9·4) participated. Items with the same content varied in their calibration by as much as two logits. Items with the content "reading the small print" had the largest range (1.99 logits) which was followed by "watching TV" (1.60). In reference to the VF-14 (0.00 logits), the rating scale of the VDA produced the most difficult items (1.13) followed by the NEIVFQ (0.66), ADVS (0.55), TyPE (0.43) and CatScale (0.24).

Conclusions: : Rasch analysis demonstrated that differences in rating scale design can have a significant effect on item calibrations beyond item content. Both question format and response category labels appear to influence item calibrations and ultimately, overall measurement of visual disability. Therefore, it is difficult to compare research findings using different PROs. Moreover, it would be inelegant to use items from different PROs in their native rating scale formats for an item bank where it is desirable that item calibration reflects item content only. A preferred strategy would be to fit all items to a common rating scale.

Keywords: quality of life • clinical (human) or epidemiologic studies: systems/equipment/techniques 
×
×

This PDF is available to Subscribers Only

Sign in or purchase a subscription to access this content. ×

You must be signed into an individual account to use this feature.

×