|
|
||||||||
1From the College of Optometry, University of Houston, Houston, Texas; the 2Department of Preventive Medicine, University Hospital and Medical Center, Stony Brook, New York; the 3New England College of Optometry, Boston, Massachusetts; and the 4School of Optometry, University of Alabama at Birmingham, Birmingham, Alabama.
| Abstract |
|---|
|
|
|---|
METHODS. Best corrected monocular visual acuity (VA) of a subset of children (n = 86) enrolled in the Correction of Myopia Evaluation Trial (COMET; mean spherical equivalent refractive error -2.35 D with no more than 1.25 D astigmatism) was measured at baseline and 1 month later with ETDRS logMAR charts. Children started with logMAR 0.4 (6/15 or 20/50) and read each letter on all subsequent lines until they missed all letters in 1 line.
RESULTS. At baseline, the mean best corrected logMAR VA was 0.003 ± 0.076 (6/6 or 20/20 ± 3.8 letters) in the right eye and 0.008 ± 0.059 (6/6 or 20/20 ± 2.95 letters) in the left eye. The signed difference between VA measured at baseline and that measured at 1 month was not significantly different from zero in either eye. Repeatability was not associated with age, but a small, statistically significant association with gender was detected in the left eye, with boys approximately 2 letters more variable than girls. The
statistic (agreement within 1 line) was good to excellent.
CONCLUSIONS. Based on the 95% limits of agreement, the criterion for a statistically significant change in VA is no more than ±0.15 logMAR (or ±8 letters). This value is similar to those reported for adults and indicates that logMAR VA provides a repeatable measure of acuity in children.
Over the past several decades, efforts have been made to reduce the variability and enhance the precision of this fundamental measurement of visual function through improved standardization of acuity charts and testing protocols. Many of these advances were driven by the need for a clinically relevant and reliable outcome measure of visual function to be used in clinical trials. Perhaps the most widely reported and accepted modification has been the ETDRS logMAR chart, which is based on the early work of Bailey and Lovie.1 In addition to improvements in chart design,2 the ETDRS protocol3 standardizes the administration and scoring of VA. With these advances, VA has been used as an outcome measure in clinical trials designed to evaluate a variety of ocular treatments in adults involving the cornea,4 5 6 lens,7 retina,8 9 10 and optic nerve.11 12
Despite these advances, there has been surprisingly little energy devoted to applying these improvements in acuity assessment to children and determining the reliability of this key index of visual function when ETDRS charts and procedures are used. The introduction of the Glasgow acuity cards by McGraw et al.13 14 represents one attempt to apply the ETDRS principles to young, primarily preschool children. Although many of the ETDRS principles were incorporated into the design of the Glasgow acuity cards, the spacing of the letters on the Glasgow cards (Glasgow: 0.5-letter diameter spacing between letters; ETDRS: 1-letter diameter spacing between letters), the use of a crowding bar surrounding the 4-letter line array on the Glasgow cards (rather than 5 letters per line without a crowding bar on the ETDRS), the use of 6 letters with the Glasgow acuity cards rather than the 10 possible letters used with the EDTRS chart, and the single line presentation of the Glasgow cards rather than the entire ETDRS chart may result in differences in these two acuity measures in children. Although comparisons between Bailey-Lovie charts and the Glasgow acuity cards have been made in adults with good agreement between the two (95% confidence limits of ±0.07 logMAR, approximately 3 letters),14 comparisons between ETDRS charts and Glasgow cards in children have been limited by the younger ages of children who are tested by the Glasgow cards. In two other recent comparisons, only a limited number of children were tested15 or Landolt Cs were used instead of Snellen letters (Camparini M, Cassinari P, Ambrosoli D, Porta R, Macaluso C, ARVO Abstract 2068, 2001). Kheterpal et al.15 tested the unaided monocular VA of 18 children (mean age, 4.8 years; range, 3.58 years) referred for failing a vision screening. The same examiner tested these children on two occasions, separated by 1 week, using a logMAR chart and a matching response. Each letter was scored, and the 95% confidence limit in the right eye was ±0.21 logMAR (approximately 11 letters) and in the left eye was ±0.25 (approximately 13 letters). Campraini et al. measured VA in 40 children 3 to 12 years of age with Landolt Cs in an ETDRS-style chart, using both a standard and modified (ETDRS-Fast) ETDRS protocol (Camparini M, Cassinari P, Ambrosoli D, Porta R, Macaluso C, ARVO Abstract 2068, 2001). They found a standard deviation of the testretest difference for the standard protocol to be 5.57 letters. The 95% limits of agreement calculated from the standard deviation they report would be 11 letters. This value is the same as that found in the right eye by Kheterpal et al.15 The 95% limits of agreement improved to 7 letters with the more rapid modified protocol.
In this report, the repeatability of best corrected, monocular, logMAR ETDRS acuity was determined in 86 children. These children constitute a subgroup of the 469 children with myopia (age, 611 years) participating in COMET. The mean signed difference of the two acuity measures is evaluated by age, gender, examiner, and acuity level. Based on the 95% limits of agreement, the criterion for a statistically significant (and clinically meaningful) change in vision in children 6 to 11 years of age is approximately ±0.15 logMAR (±8 letters, or less than 2 lines) on the logMAR ETDRS chart.
| Methods |
|---|
|
|
|---|
To determine whether an abnormal eye alignment was induced by the random lens assignment, the studys Data Safety and Monitoring Committee requested that the first children enrolled in COMET be examined 1 month after enrollment. Children did not undergo refraction testing at the 1 month visit but the full 1-month examination included logMAR VA with the childs habitually worn correction (most often, this was the subjective refraction determined at baseline) and tests of eye alignment. One hundred fifty-two children were examined 1 month after enrollment and were potential subjects for this study of the repeatability of logMAR VA. However, 42 of these children were scheduled for an abbreviated examination that did not include logMAR acuity and were therefore not eligible for this analysis. Twenty-four additional children were excluded because their habitually worn correction differed slightly from the subjective refraction used to measure logMAR acuity at the baseline examination. Thus, the 86 children who had logMAR VA measured both at baseline and 1 month later with identical refractive corrections are the basis of this repeatability evaluation. The characteristics at baseline of these 86 children are shown in Table 1 along with all the 469 children enrolled in COMET. There were no statistically significant differences in the age, gender, refractive error or ethnicity distributions of the 86 children in this study when compared with all COMET children. However, the logMAR VA at baseline of the 86 children reported in this study was significantly different (OD, P = 0.04; OS, P = 0.02) from that of the 469 children enrolled in COMET, with the children in COMET showing slightly poorer VA than those in this study. Although the difference is statistically significant, the mean difference in logMAR VA in the two groups is not clinically meaningful, because it is only 0.018 logMAR in the right eye and 0.02 in the left eye (1 letter or less). Additional details regarding the baseline characteristics of the children enrolled in COMET are reported by Gwiazda et al.17
|
Procedure
Monocular distance VA was measured at baseline with the subjective refraction and 1 month later with the identical refractive correction. The refractive correction was placed in a refractor (model RT1200; Nidek, Gamagori, Japan) with the child appropriately aligned behind the refractor. All examiners were optometrists specifically trained in the modified ETDRS protocol used in COMET. A back-illuminated stand, 4 m from the patient, held the logMAR ETDRS acuity charts (Precision Vision charts 2121 and 2122). Chart lighting ranged from 158 to 279 cd/m2 at the four centers and was in compliance with the recommendations for standardizing the measurement of VA (80320 cd/m2), as summarized by Ferris and Bailey.18 19 Contrast of the letter charts at the four centers averaged 96.9% ± 0.83% with a range of 95.8% to 97.9%. Visual acuity was measured first in the right eye (chart 2121) followed by the left eye (chart 2122), both at baseline and at the second measurement 1 month later. Each of the two charts used ranged from 6/600 (20/200) to 6/3 (20/10) in 0.1 logMAR steps with 5 letters on each line. With this particular chart configuration, each letter was equivalent to 0.02 logMAR. Because all children were free of disease and were tested with their best correction, the ETDRS protocol was modified slightly by having the child begin with the 6/15 (20/50) line and continue until all letters were missed on 1 line, consistent with the recommendations of Carkeet.20 If the child reached the 6/3 (20/10) without missing an entire line, the 6/3 (20/10) line was the last line attempted. If a letter was missed on the 6/15 (20/50) line, children were directed to start with the 6/30 (20/100) line. Children were required to identify each letter and to guess if not sure. The examiner pointed to each new line and to each letter after the child had incorrectly identified a letter. Visual acuity was determined by scoring 0.02 logMAR for each letter correctly identified.
To evaluate the repeatability of the best corrected logMAR VA measured in these 86 children with myopia, the VA measured at baseline was subtracted from the VA measured at 1 month. In each child, the difference in VA obtained at these two visits was plotted against the average VA at baseline and 1 month. The SD of the signed difference between repeated measures was used to calculate the 95% limits of agreement. Results are presented in logMAR and number of letters. The number of letters is provided for a clinical reference and applies to the particular chart configuration used in the study. When the number of letters was rounded, it was to the next higher integer, because the number of letters was the unit of measurement.
Data Analysis
The statistical methods used in the data analysis were based on a hierarchical scheme that comprises data summaries, univariate analyses, and multivariate and modeling analyses. Data summaries were based on estimating distribution parameters such as the mean, median, quartiles, range, and SD for continuous measurements and the frequency and percent for categorical variables.
Where normality assumptions were met21 statistical comparisons for univariate analyses were based primarily on one or two independent sample(s) t-distributions. For example, a one-sample t-distribution was used to determine whether the difference in VA measurements between the baseline and 1-month visit was significantly different from zero. A two-sample t-test was used to detect a statistically significant difference in the measurements between 1-month and baseline, stratified by whether the measurements were performed on both occasions by the same examiner. Other univariate analyses used the Pearson linear correlation method. Nonparametric methods (Wilcoxon rank sum test) were used in cases in which the normality assumption was not met.
Multivariate and modeling analyses were based on linear multiple regression techniques to determine the magnitude and the significance of the effect of several covariates (e.g., age, same or different examiner, and gender) on repeatability of VA measurements. To evaluate whether repeatability varied by subgroups, two-factor interaction terms were also included in the multiple linear regression (i.e., different combinations of ages, gender, and same or different examiners). Except for situations involving multiple comparisons, the significance levels for testing a statistically significant difference were preset at
= 0.05. The Dunnett/Bonferroni adjustments were applied when multiple comparisons were made.
In addition to continuous data analysis, categorical
22 was also used for measuring agreement in VA measurements between baseline and 1 month. A
greater than 0.75 indicates excellent agreement, and a
between 0.40 and 0.75 suggests the agreement is fair to good.23
| Results |
|---|
|
|
|---|
|
To examine the effects of age, gender, and examiner on the repeatability of VA, a multivariate analysis was performed. Age was investigated as a categorical variable (age 69 years versus 1011 years, inclusive, based on the median age). Looking at the signed difference in VA (1-month minus baseline), there was no significant association with any of the three variables in the right eye. In the left eye, gender was significant (P = 0.032). VA in boys was more variable than girls by 0.037 logMAR (less than 2 letters). The effect of examiner approached significance in the left eye (P = 0.07) with the same examiners showing less variability in test results by 0.03 logMAR (less than 2 letters). The age of the child was not associated with the difference in VA on repeated measures in either eye.
Another approach to evaluation of the reproducibility of VA measured at two different times is to determine the level of agreement with the
statistic. Table 2 displays the
statistic and the 95% confidence limits for agreement between the baseline and 1-month visits to within 1 line of acuity, in each eye of all subjects and by gender and age. In both eyes, agreement to within 1 line is excellent in all subjects, in girls, and in children less than 10 years of age. Agreement in the right eye is good in those older than 10 years (0.71) and in boys (0.68), whereas the left eye shows excellent agreement in these two categories.
|
| Discussion |
|---|
|
|
|---|
|
Several potential sources of the variability indexed by the 95% limits of agreement were analyzed further. There was no statistical difference in the signed difference when the same examiner performed both measures of acuity compared with when different examiners took the two measures. In addition, when the effect of the examiner on the signed difference was explored with a multivariate analysis, the examiner was not associated with the variability in either eye. This result is important in the context of this and other clinical trials involving children, in which multiple examiners are common, and suggests that the standard protocol removes this potential source of bias. It should be noted that the power to detect such a small difference in repeatability with the same examiner compared with different examiners is compromised by the small number of children (59 tested with the same examiner and 27 with a different examiner). However, the power to detect a more clinically meaningful difference of 3 letters when the same examiner obtains both measures of acuity compared with different examiners obtaining the two estimates of acuity is 93% in this sample of children.
In addition to the examiner, the age and gender of the subjects are other potential sources of variability. Given the limited age range of the subjects in this study (611 years), it is not surprising that age was not associated with the sign difference in either the right or left eye. However, caution is advised when generalizing to children younger than age 6 who have more difficulty with chart formats and are frequently tested with isolated letters.27 A significant association was found for gender. The signed difference in boys showed approximately 2 letters greater variability than that in girls, but only in the left eye. The charts used to measure the acuity differed between the eyes. In addition, VA was always measured in the left eye after that in the right eye. Perhaps the greater variability demonstrated by boys in the left eye is related to the fact that boys tend to fidget more than girls.28 29 Fidgeting has been inversely associated with cognitive ability test scores, and these test scores have been directly associated with concentration in a sample of children (mean age, 9.4 years).29 However, this potential explanation is only partially consistent with the analysis of agreement (
statistic). Although the agreement was good23 in the right eye of males (
= 0.68), it was the poorest level of agreement of the repeated measures of acuity.
With the exception of the good agreement also found in the right eye in children 10 years of age or older (
= 0.71), all other
statistics were excellent (
> 0.75). It is important to recall that in calculating the
statistic, we used 1 line or 5 letters as the level of agreement to compare the VA measured at baseline with that measured 1 month later. This level of agreement was selected to provide the clinician with an indication of the expected repeatability based on a line of letters rather than individual letters and the agreement is, in general, excellent.
The criterion for a statistically significant change reported for this subset of COMET children is similar to the data summarized in Table 4 from different populations with a variety of conditions. However, caution must be exercised when generalizing to other populations a criterion derived from children with myopia tested with their best correction. This caution is still warranted, despite the fact that a multivariate analysis that controlled for age and gender indicated that baseline myopia did not influence the magnitude of the difference between the two VA measures (OD P = 0.53; OS: P = 0.11). Children with hyperopia, children with larger amounts of astigmatism, or children tested without correction may not show the same degree of repeatability because of additional sources of error from variable accommodation, spectacle lens aberrations, or blur interpretation. Several investigators have also suggested that estimates of error derived from clinically healthy individuals may not be the same as those for individuals with anomalies.30 31 Therefore, in cases of amblyopia, in which it is important to identify a significant change in acuity in response to treatment, alternative approaches may be needed, particularly if magnitude of difference scores are correlated with raw scores. One approach would be to derive separate estimates of reliability of different subgroups.30 Another approach would be to determine each individuals criterion through repeated testing of that individual.32 33 34 Such repeated testing would improve the variability by the square root of the number of estimates.
|
| Appendix 1 |
|---|
|
|
|---|
COORDINATING CENTER: Department of Preventive Medicine, University Hospital and Medical Center, Stony Brook, New York: Leslie Hyman (Principal Investigator), M. Cristina Leske (Co-principal Investigator), Mohamed Hussein (Co-Investigator/Biostatistician), Elinor Schoenfeld (Epidemiologist), Lynette Dias (Study Coordinator), Rachel Harrison (Study Coordinator), Elissa Schnall (Assistant Study Coordinator), Allison Schmertz (Project Assistant), Wen Zhu (Lead Programmer), Ahmed Yassin (Analysis), Ying Wang (Analysis), Lauretta Passanant (Project Assistant), and Phyllis Neuschwender (Administrative Assistant).
CLINICAL CENTERS:
New England College of Optometry, Boston, Massachusetts: Daniel Kurtz (Principal Investigator), Bruce Moore (Optometrist), Robert Owens (Primary Optician), Sheila Martin (Clinic Coordinator), and Stacey Hamlett (Backup Optician). Pennsylvania College of Optometry, Philadelphia, Pennsylvania: Mitchell Scheiman (Principal Investigator), Kathleen Zinzer (Optometrist), Timothy Lancaster (Primary Optician), Theresa Elliot (Backup Optician), and Mariel Torres (Clinic Coordinator). University of Alabama at Birmingham Medical Center, School of Optometry, Birmingham, Alabama: Wendy Marsh-Tootle (Principal Investigator), Bradley S. Bessant (Optometrist), James Raley (Optician), Angela Rawden (Backup Optician), Nicholas Harris (Clinic Coordinator), Cheryl Jackson (Clinic Coordinator), and Trana Mars (Backup Clinic Coordinator). University of Houston College of Optometry, Houston, Texas: Ruth E. Manny (Principal Investigator), Connie Crossnoe (Optometrist), Sheila Deatherage (Optician), Charles Dudonis (Optician), and Sally Henry (Clinic Coordinator).
NATIONAL EYE INSTITUTE, Bethesda, Maryland:
Donald Everett (Project Director, Collaborative Trials Branch).
COMMITTEES:
Data Safety and Monitoring: Robert J. Hardy (Chair), Argye Hillis, Don Mutti, Richard Stone, and Sr. Carol Taylor. Executive Committee: Jane Gwiazda (Chair), Leslie Hyman, Wendy Marsh-Tootle Donald Everett. Steering Committee: Jane Gwiazda (Chair), Mohamed Hussein, Leslie Hyman, Daniel Kurtz, M.Christina Leske, Ruth E. Manny, Wendy Mash-Tootle, Mitchell Scheiman, Donald Everett, Thomas T. Norton.
| Footnotes |
|---|
Presented in part at the American Academy of Optometry, Orlando, Florida, December 2000.
Supported by National Eye Institute Grants EY11740, EY11805, EY11756, EY11754, EY11755, and EY11752, Bethesda, Maryland; and by Essilor of America, Marchon Eyewear, Marco Technologies, and Welch Allyn.
Submitted for publication November 22, 2002; revised February 20, 2003; accepted March 10, 2003.
Disclosure: R.E. Manny, None; M. Hussein, None; J. Gwiazda, None; W. Marsh-Tootle, None
The publication costs of this article were defrayed in part by page charge payment. This article must therefore be marked "advertisement" in accordance with 18 U.S.C.
1734 solely to indicate this fact.
Corresponding author: Ruth E. Manny, 505 J. Davis Armistead Building, Houston, TX 77204-2020; rmanny{at}uh.edu.
| References |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
L. Ore, H. J Garzozi, A. Tamir, N. Stein, and M. Cohen-Dar Performance measures of the illiterate E-chart vision-screening test used in Northern District Israeli school children J Med Screen, June 1, 2008; 15(2): 65 - 71. [Abstract] [Full Text] [PDF] |
||||
![]() |
M Bach, J P Maurer, and M E Wolf Visual evoked potential-based acuity assessment in normal vision, artificially degraded vision, and in patients Br. J. Ophthalmol., March 1, 2008; 92(3): 396 - 403. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Bassan, C. Limperopoulos, K. Visconti, D. L. Mayer, H. A. Feldman, L. Avery, C. B. Benson, J. Stewart, S. A. Ringer, J. S. Soul, et al. Neurodevelopmental Outcome in Survivors of Periventricular Hemorrhagic Infarction Pediatrics, October 1, 2007; 120(4): 785 - 792. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. I. Chen, A. Chandna, A. M. Norcia, M. Pettet, and D. Stone The Repeatability of Best Corrected Acuity in Normal and Amblyopic Children 4 to 12 Years of Age Invest. Ophthalmol. Vis. Sci., February 1, 2006; 47(2): 614 - 619. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Virgili, C. Cordaro, A. Bigoni, S. Crovato, P. Cecchini, and U. Menchini Reading Acuity in Children: Evaluation and Reliability Using MNREAD Charts Invest. Ophthalmol. Vis. Sci., September 1, 2004; 45(9): 3349 - 3354. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Harrad, C. Williams, and J. Sparrow Treatment of unilateral visual impairment on preschool vision screening: Mild amblyopia should still be treated BMJ, February 7, 2004; 328(7435): 348 - 348. [Full Text] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |