|
|
||||||||
1From the Department of Pharmaceutical Economics and Policy, School of Pharmacy, the 2Doheny Eye Institute, and the 3Departments of Ophthalmology and 4Preventive Medicine, Keck School of Medicine, University of Southern California, Los Angeles, California.
| Abstract |
|---|
|
|
|---|
METHODS. The Los Angeles Latino Eye Study (LALES) is a population-based study to assess the prevalence of eye disease and self-reported visual functioning in Latinos aged 40 or more years. Self-reported visual functioning was assessed by using English and Spanish versions of the NEI VFQ-25. Psychometric properties of the NEI VFQ-25, including internal consistency of the subscales and the individual items, were assessed through the Multi-trait Analysis Program-Revised (MAP-R) analysis. Adjusted mean and median subscale scores were compared between English and Spanish speakers to identify any systematic differences.
RESULTS. Of the 1917 participants from two census tracts, 1171 participants with no visual impairment were included in this analysis. The mean age of the participants was 52.3 years, 57% of the participants were female, and 67.5% of the participants were Spanish speaking. Median scores for Spanish-speaking participants were significantly lower than those of the English-speaking participants on four subscales: Ocular Pain, General Vision, Vision-Specific Mental Health, and General Health (P < 0.05). Internal consistency for three of eight measurable subscales for the study group was poor (Cronbach
< 0.6).
CONCLUSIONS. This study reveals psychometric inconsistencies in the NEI VFQ-25 when administered to visually normal Latinos. The difference in mean subscale scores between Spanish and English speakers must be integrated into the development of population norms of visual function. Further detailed psychometric evaluation is needed to determine the validity of this instrument in Latino populations.
To characterize visual functioning accurately in a population-based study and to assess the association between ocular disease and the patients perception of visual functioning, it is essential to have an instrument that is culturally sensitive and appropriate for Latinos.6 7 8 The purpose of assessing the cultural appropriateness of an instrument is to ensure that different participants experience the questionnaire equally so that responses are consistent between individuals with different characteristics. Language and acculturation are intimately connected.9 One major construct in the measurement of acculturation is the language in which an individual speaks, writes, reads, and thinks.10 11 To date, few studies have assessed measures of visual functioning in a visually normal Latino population.
The NEI VFQ-25 field test excluded nonEnglish-speaking participants.3 This field test included both individuals with ocular disease and a reference group of 122 participants who were examined at one of seven ophthalmology practices and who had no clinical evidence of ocular disease. Although a Spanish translation is currently available, only one study has validated this translation in a large, population-based Latino cohort.4 That study provides an initial assessment of the psychometric properties of the NEI VFQ-25 in a Latino population. Although differences in responses to items are reported, comparing Spanish- and English-speaking participants and those with visual impairment more than or equal to 20/40 with those with visual impairment worse than 20/40, Cronbach
and convergent and divergent validity (psychometric performance) are not assessed between English and Spanish speakers.
The NEI VFQ-25 was originally developed to assess visual functioning in individuals with ocular disease. However, it is important to establish benchmarks in visually normal individuals as an important comparator for visual functioning for those with ocular disease. These benchmarks can be acquired in two ways. First, they can be obtained from a clinic-based sample. This method was used in the NEI VFQ-51 field test, in which a reference group without ocular disease or visual impairment was selected from patients attending an eye clinic.1 A second method is to obtain the benchmark from a population-based sample. This approach has been used for the NEI VFQ-25 in a population-based sample in Arizona.4 In addition, it has been used by the International Quality of Life Assessment Project (IQOLA), which has established methods to translate a general quality-of-life instrument, the short form (SF)-36. In the current study, we used methodologies set forth by the IQOLA project to assess the psychometric properties in a general population. These norms can be used as a benchmark for comparison of visual functioning in persons with visual impairment or ocular disease across the continuum of visual acuity in both epidemiologic and clinic-based studies.
For this article, we assessed only visually normal participants, comparing the psychometric performance of the English and Spanish versions of the NEI VFQ in an entirely Latino population. It expands on the work by Broman et al.4 on a population-based sample of Hispanics from Arizona. Our results, and the results of Broman et al., can be used as a benchmark for comparison of visual functioning in persons with visual impairment or ocular disease.12 13
| Methods |
|---|
|
|
|---|
Study Group
Similar to methods used by the IQOLA project,12 15 a visually normal subgroup of LALES participants without evidence of visual impairment or ocular disease (e.g., visually impairing cataract, diabetic retinopathy, macular degeneration, or glaucoma) were identified for this analysis. Participants in the visually normal group had binocular visual acuity better than 20/40 and normal visual fields at initial examination. Visual acuity was measured binocularly using the Early Treatment Diabetic Retinopathy Study (ETDRS) distance charts (Lighthouse International, New York, NY) with the participants normal refractive correction (see Azen et al.16 for details). Visual field analysis was performed to assess peripheral vision (24-2 full-threshold or the 24-2 Swedish interactive test algorithm [SITA] standard test; Humphrey Field Analyzer II; Zeiss-Humphrey Systems, Dublin, CA). Participants with reliable visual fields and with no evidence of visual field loss were included in this study. The analysis was then performed again in all participants, both with and without visual impairment.
Self-Reported Visual Function
The NEI VFQ-25 is composed of 12 vision-specific subscales. Each subscale contains a minimum of one and a maximum of four items. The subscales include: General Health (one item), General Vision (one item), Near Vision Activities (three items), Distance Vision Activities (three items), Ocular Pain (two items), Vision-Specific Social Function (two items), Vision-Specific Role Difficulties (two items), Vision-Specific Mental Health (4 items), Vision-Specific Dependency (3 items), Driving Difficulties (two items), Color Vision (one item), and Peripheral Vision (one item). The NEI VFQ-25 is scored using standard algorithms.1 Each item was scored on a scale from 0 (lowest visual functioning) to 100 (best visual functioning). Items were then reverse coded, when appropriate, so that the directionality of the items and subscales were comparable. Item scores within a subscale were averaged to yield the subscale score (range, 0100). Interviewers administered the questionnaire in either English or Spanish, according to participants preference. Spanish speakers were defined as those who completed the questionnaire in Spanish; English speakers were defined as those who completed the questionnaire in English.
Covariates
The covariates included age, gender, income, education, employment status, and number of comorbidities. The number of comorbid medical conditions was computed as a summation of a list of 13 self-reported,10 nonocular-related medical conditions, including diabetes, arthritis, stroke or brain hemorrhage, high blood pressure, angina, heart attack, heart failure, asthma, skin cancer, other cancer, back problems, hearing problems, and other major health problems.
Statistical Analysis
First, descriptive statistics were generated to determine the distribution of both demographic and clinical characteristics. Demographic and clinical characteristics were then compared between Spanish and English speakers. The
2 test was used for discrete variables and Students t-test was used for continuous variables.
Psychometric Analysis
Psychometric performance was assessed through the Multi-trait Analysis Program-Revised (MAP-R), a program designed to evaluate questionnaires that have ordered-response choices.17 First, item-specific missing rates (number of times an item was not answered) for the NEI VFQ-25 subscales were compared between English- and Spanish-speaking participants. Next, the distribution of scores obtained for each scale were examined by calculating the percentage of the sample achieving the lowest possible score of zero (floor) and highest possible score of 100 (ceiling).18 In a heterogeneous sample, subscale scores should have less than 20% of observations at the theoretical floor (floor effect) or ceiling (ceiling effect).19 Even fewer floor effects were expected in this study, because this sample was specifically selected to have no evidence of visual impairment or ocular disease.
Cronbach
(a measure of the extent to which items within a single subscale correlate with the subscale score) was then calculated for each subscale (for the entire sample and for each language subgroup) as a measure of reliability of the subscales internal consistency.20 The acceptable minimum Cronbach
is 0.70.21
Item internal consistency (the degree to which each individual item measures the underlying construct) was measured through Pearson correlation of each item with the subscale to which it was assigned. If the hypothesis that the item measures the underlying construct represented by the subscale to which it is assigned was correct, then the correlation between that item and the subscale would be greater than 0.40. Item discriminant validity (the degree to which different constructs are correlated) was also evaluated with Pearson correlation coefficients for the entire sample and for Spanish- and English-speaking Latinos. Item discriminant validity evaluates the association of an individual item with the subscale that it is part of, compared with other subscales. An item should correlate most strongly with its own subscale when compared with every other subscale in the questionnaire. The acceptable cutoff for item discriminant validity is less than 0.40.17
Comparison of English and Spanish Subscale Scores
Analysis of covariance was used to compare the mean self-reported visual functioning subscale scores between Spanish- and English-speaking Latinos to determine whether there was a systematic difference in responses. The mean NEI VFQ-25 subscale scores were adjusted for the standard covariates (age, gender, income, education, employment status, and number of comorbidities). We then assessed the normality of the residuals in the parametric ANCOVA and determined that after adjusting for covariates, the distribution of the subscales remained nonnormal.
Because the self-reported visual functioning subscales were skewed, nonparametric statistics were used to compare the NEI VFQ-25 subscale scores between Spanish- and English-speaking Latinos to determine any systematic difference. For this analysis, subscale scores were first rank ordered. This rank was used as the dependent variable in a traditional ANCOVA, adjusting for the standard covariates (age, gender, income, education, employment status, and number of comorbidities), to determine whether differences in median scores were statistically significant.22
| Results |
|---|
|
|
|---|
|
|
|
|
|
Subscale Internal Consistency Reliability.
The Cronbach
21 for assessing reliability of the subscales internal consistency ranged from a low of 0.24 (Driving subscale, English speaker), to a high of 0.82 (Vision-Specific Dependency, Spanish speakers; Table 3 ). The acceptable minimum Cronbach
of 0.70 was achieved for only two of the eight subscales for which this could be computed. The Vision-Specific Dependency subscale met the minimum criteria for internal consistency in both the English and the Spanish versions, as well as the complete cohort. Vision-Specific Mental Health met the minimum criteria for the English speakers, whereas Vision-Specific Role Function achieved the minimum criteria for the Spanish speakers and the complete cohort. The subscales internal consistency was marginal (Cronbach
≥ 0.60 and < 0.70) for two subscales in both the English- and Spanish-speaking groups (Ocular Pain and Near Vision). Subscale internal consistency was also marginal for the Vision-Specific Mental Health subscale in Spanish-speaking Latinos and for Vision-Specific Role Function in English-speaking Latinos. In addition, the subscales internal consistency was poor (Cronbach
< 0.60) in three subscales (Driving Difficulties, Vision-Specific Social Function, and Distance Vision) in both the English- and Spanish-speaking groups.
Item Internal Consistency and Discriminant Validity.
The correlation coefficients for item internal consistency and discriminant validity ranged from 0.18 to 0.78 (Tables 4 5) . Seven of 21 (33%) items did not have itemsubscale correlations that correlated more strongly with their own subscales than with every other subscale in the questionnaire. When item validity was assessed, Spanish-speaking Latinos were less likely to achieve minimum internal consistency (62% achieved the minimum of 0.4) compared with English-speaking Latinos (81% achieved the minimum). Similarly, items completed by Spanish speakers were less likely to achieve minimum item discriminant validity (62%) compared with items completed by English-speaking Latinos (76%). Correlations between all items and scales are presented in Table 5 . Two Distance Vision items and both Driving items did not achieve either minimum reliable internal consistency or discriminant validity for both Spanish- and English-speakers. The Vision-Specific Mental Health scale did not achieve minimum criteria for the Spanish-speakers. One item ("How much of the time do you worry about your eyesight?") did not have minimum reliable internal consistency, whereas the other three Mental Health items ("I feel frustrated a lot of the time because of my eyesight"; "I have much less control over what I do because of my eyesight"; "I worry about doing things that will embarrass others or myself because of my eyesight") correlated more strongly with the Vision-Specific Dependency subscale than with the Mental Health subscale. One Vision-Specific Mental Health item ("I worry about doing things that will embarrass others or myself because of my eyesight") was more strongly correlated with Vision-Specific Dependency in the English speakers as well.
Comparison of Subscale Scores in English- and Spanish-Speaking Latinos.
Adjusted NEI VFQ-25 median scores were significantly different between visually normal Spanish- and English-speaking Latinos (Table 6) . Spanish speakers had significantly lower median General Health, General Vision, Vision-Specific Mental Health, and Ocular Pain scores after adjusting for differences in age, gender, income, employment status, education, and the number of comorbidities between the two groups. The magnitude of these differences ranged from 4 to 14.1 percentage points. No differences between median subscale scores were evident between interviewers.
|
| Discussion |
|---|
|
|
|---|
Overall, the psychometric performance of the NEI VFQ-25 in terms of reliable internal consistency in both Spanish- and English-speaking Latinos was found to be acceptable. The Cronbach
was acceptable for two subscales in both English (Vision-Specific Dependency and Vision-Specific Mental Health) and Spanish (Vision-Specific Dependency and Vision-Specific Role Function) at the greater than 0.70 level. The Cronbach
was low for a few scales. In particular, the Driving scale had a very low Cronbach
. Similarly, Broman et al.4 found Cronbach
< 0.70 in the Driving and Ocular Pain Subscales in the entire cohort of the population-based Proyecto VER (Video en Red) study. However, Broman et al. did not provide separate data on internal consistency for the 80% Spanish speakers, compared with the 20% English speakers. Recently, in an effort to improve internal consistency of the Driving subscale, Mangione et al.1 have added an additional item to the Driving subscale in both the English and Spanish versions.
The marginal and poor reliability of internal consistency may suggest that the subscale constructs are not homogenous or that the items are not appropriate measures of these constructs for Latinos.25 It is likely that supplementing other subscales of the NEI VFQ-25 with additional items from the NEI VFQ-51 and eliminating poor items may improve the reliability of the internal consistency of this questionnaire when used in both Spanish- and English-speaking Latinos.26
In our study, psychometric analysis at the item level revealed that 7 of 21 items (number of items in subscales with at least two items) failed to display reliable item internal consistency or item discriminant validity. This trend was more notable for Spanish-speaking than for English-speaking Latinos. Broman et al.4 report that there were differences between Spanish and English speakers in their responses to the NEI VFQ-25. Although they also found poor reliability of item internal consistency, they were able to identify only one item that had poor item discriminant validity (worry about eyesight). A direct comparison of the two studies is difficult, because Broman et al. did not report reliable internal consistency or item discriminant validity at the item level, by language spoken. Some of the possible explanations for our findings include cultural inappropriateness of the instrument, poor translation of the instrument, too many or too few item response choices, and a visually normal sample. As suggested earlier, one suggestion for improving item consistency is to increase the number of items in each subscale. Increasing the number of items in a subscale may improve the item consistency by providing a more accurate representation of the underlying construct that the subscale is measuring. Too few items may not reflect the full breadth of the construct and thus appear to be disparate. Of note, when Broman et al. used an inflation factor to adjust the reliability of item internal consistency for subscales with very few items, the one item that had poor reliability did not achieve acceptable reliability (0.30). The shorter version of the NEI VFQ, the NEI VFQ-25, was developed to decrease the time burden of administering the NEI VFQ without sacrificing any of the essential subscales. Our analysis suggests that this reduced number of test items has poor reliable internal consistency among Latinos.14 17 26 , Other explanations for these findings include problems with translation, respondents inability to understand question stems or response choices, and use of items that are not culturally appropriate. Our findings that these problems are present in both English- and Spanish-speaking Latinos suggest that the issue is less language dependent and more a function of culture or the sample. The finding that the comprehensibility of the Spanish translation and item stems was found to be appropriate by the LALES focus groups27 supports this conclusion. However, both Spanish- and English-speaking respondents in these focus groups reported that the response choices for items in the NEI VFQ-25 were too numerous and confusing. Another possible explanation may be the visually normal sample used for this analysis. When participants with visual impairment and/or abnormal visual fields were included in an analysis the same subscales continued to be below the threshold for minimal reliability.
The differences between our findings and those of Broman et al.4 may be explained by differences in the heritage of the Latinos in the two studies. The heritage of the LALES population is predominantly Mexican, whereas the Proyecto VER population from Arizona is likely to have a higher proportion of Native American heritage.32 Further evaluation of the impact of these issues on the psychometric properties of the NEI VFQ-25 should be conducted.
That there was no ceiling effect in some of the NEI-VFQ subscales in this visually normal sample is surprising. Because the participants in this analysis had no evidence of visual impairment, we would expect more than 20% of the participants to score at the highest possible range on all 12 subscales. Indeed, in the initial NEI VFQ-51 field test, 23% to 95% of the subscale scores of individuals with good visual acuity and visual fields were at the ceiling (all subscales except General Health [12% at ceiling] and General Vision [9% at ceiling]). Broman et al.4 did not quantify ceiling effects by subscale in the Proyecto VER population, although they reported that items with response choices measuring degree of difficulty and time spent on an activity were more likely to exhibit a ceiling effect. In the LALES, scores for visually normal participants did not exhibit a ceiling effect for three of the 12 subscales. In addition, in five subscales, a higher proportion of English-speaking Latinos scores were at the ceiling that of Spanish-speaking Latinos. English speakers were less likely to report any Ocular Pain or Vision-Specific Mental Health problems and were more likely to report problems with Distance Vision and Vision-Specific Role Function. These differences may reflect a difference in the cultural appropriateness and cultural relevance of different items to Spanish- and English-speaking participants. English-speaking Latinos may find certain items more or less meaningful in the context of their visual functioning compared with Spanish speakers. This would be reflected in differences, by language spoken, in which items are more or less frequently rated to reflect impairment or perfect visual functioning.24 The interpretation of this result would be enhanced by comparison of NEI VFQ-25 data in an English-speaking, non-Latino cohort of similar demographic and clinical characteristics. This comparison would clarify whether Latinos with no visual impairment have lower scores, regardless of language spoken, or whether this trend is limited to Spanish-speaking Latinos.
One explanation of the lower health status reported by Latinos is that this perceived poor health status is the result of racial/ethnic discrimination, culture, and other factors.19 Indeed, in our study, Spanish-speaking participants had statistically significantly lower median scores on 4 of the 12 NEI VFQ-25 subscales compared with English-speaking Latinos. This is consistent with the report of lower self-reported general quality of life, as measured by the SF-36, in other Latino cohorts.26 The association between language and attitudes and beliefs toward health and health care has been well documented in the literature.29 30 Few quality-of-life studies have assessed the relationship with ethnicity, although an individuals perception and response to illness is strongly determined by cultural factors.31 The value that different cultures allocate to the constructs measured by quality of life may differ, and the findings from our study may reflect these cultural differences. Because spoken language is a component of most measures of acculturation,29 32 the trend for Spanish speakers to score lower suggests that sociocultural issues play a significant role in determining self-perceived visual function.
One suggested explanation for the less than optimal scale of reliability for internal consistency, item internal consistency, and discriminant validity for some of the NEI-VFQ subscales in this sample may be a large proportion of cognitive impairment that is associated with increasing age. The Cognitive Abilities Screening Instrument (CASI)33 was administered as a cognitive status screen to a subsample of participants in the LALES. The frequency of cognitive impairment was 2.5% (n = 58). This low prevalence of cognitive impairment is unlikely to impact the results of this analysis.
Our findings demonstrate less than optimal performance of the NEI VFQ-25 in this visually normal Latino cohort. Several subscales achieved a less than optimal reliable subscale internal consistency, reliable item internal consistency, and discriminant validity. From the current analysis, it is difficult to determine whether these poor psychometric properties are due to the number and phrasing of the items and item response choices or to the cultural appropriateness of the items in general. Potential solutions to help improve the performance of the NEI VFQ-25 include revision or reduction of items, reduction of response choices, and controlling for other characteristics such as clinical health or mental status. In addition, item response analysis provides more information concerning item difficulty and the performance of individual item response choices.34 These studies are needed to conclude whether the NEI VFQ-25 is a valid measure of self-reported visual functioning for Latinos in population-based studies. The NEI VFQ-25 was designed to capture the impact of visual impairment on visual functioning along the entire spectrum of visual acuity.2 It is as important to understand the performance of this instrument as a measure of visual functioning along this continuum, including individuals without visual impairment.
| Appendix 1 |
|---|
|
|
|---|
| Footnotes |
|---|
Supported by National Eye Institute Grants U10-EY11753 and EY03040 and an unrestricted grant for the Research to Prevent Blindness.
Submitted for publication May 17, 2002; revised August 9 and December 9, 2002; accepted December 30, 2002.
Commercial relationships policy: N.
The publication costs of this article were defrayed in part by page charge payment. This article must therefore be marked "advertisement" in accordance with 18 U.S.C.
1734 solely to indicate this fact.
Corresponding author: Rohit Varma, Doheny Eye Institute, 1450 San Pablo St., DEI 4803, Los Angeles, CA 90033; rvarma{at}usc.edu.
| References |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
P. H. Miskala, N. M. Bressler, and C. L. Meinert Relative Contributions of Reduced Vision and General Health to NEI-VFQ Scores in Patients With Neovascular Age-Related Macular Degeneration Arch Ophthalmol, May 1, 2004; 122(5): 758 - 766. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |