|
|
||||||||
1 From the Department of Vision Sciences, Aston University, Birmingham; 2 Business School, Aston University, Birmingham, United Kingdom; and the 3 School of Optometry, University of Waterloo; the 4 Department of Ophthalmology, University of Toronto; and the 5 Glaucoma Research Unit, Toronto Hospital, Ontario, Canada.
| Abstract |
|---|
|
|
|---|
METHODS. The LF components were calculated using a two-factor analysis of variance (ANOVA) with replications and were determined between each pair of three successive HFA program 30-2 fields for each patient from two groups, each containing 30 patients with primary open-angle glaucoma. The interval between examinations for the first group was 6 to 9 months and for the second group was 3 weeks.
RESULTS. The group mean values for LF(Ho) ranged from 1.50 to 2.19 dB and for LF(He) from 1.70 to 2.05 dB. The average difference between examinations was within ±0.35 dB for each component, and the 95% limits of agreement for the two groups, respectively, were ± 2.31 and ± 2.39 dB for the LF(Ho) and ± 2.36 and ± 2.09 dB for the LF(He). The estimate of the 90% confidence limit for the LF(Ho) was 3.30 dB and for the LF(He), 3.60 dB. Little relationship was present between the LF components and the modulus differences in mean deviation (MD), the corrected pattern SD (CPSD), or the mean MD, mean short-term fluctuation, and mean CPSD, of the two fields.
CONCLUSIONS. Estimation of the LF components and of the corresponding confidence limits yields an expression of the normal between-examination variability of two consecutive fields that can be used as a reliability index. A value outside the confidence limits indicates the necessity for a confirmatory follow-up field.
| Introduction |
|---|
|
|
|---|
The outcome of any given single visual field examination can be influenced by many factors including pupil size,1 2 3 refractive error,4 5 6 media opacities,7 8 9 10 learning,11 12 fatigue,13 14 15 16 medical therapy,17 18 and random variations in the physiological status of the patient.19 These factors are such that the variability within and between any two successive examinations can mask or even mimic true deterioration in the visual field. The variation in the threshold estimate within a perimetric examination at a given stimulus location is known as the short-term fluctuation (SF), and the variation between examinations is known as the long-term fluctuation (LF).20 The SF is generally evaluated for a given number of stimulus locations but can also be considered for each individual stimulus location. It essentially represents the measurement error associated with the determination of threshold.19 The LF is the variance additional to that of the SF and is present between two or more examinations. It is divided into two components: the homogeneous component, LF(Ho), which affects all locations equally, and the heterogeneous component LF(He), which varies between locations.21 The two components each represent summary measures of the fluctuation occurring across all the specified locations and, as such, are global measures. Several alternative measures of between-examination variation have been proposed for the field as a whole21 22 23 24 25 or for a given stimulus location.24 26 27 28 29 30 These additional measures, which have also been labeled as the LF, describe the total variation and do not differentiate between the two classically defined components of between-examination fluctuation, LF(Ho) and LF(He).
Knowledge of the LF has been considered a prerequisite for quantitative comparison of visual fields.23 Nevertheless, the LF is not routinely specified, despite the overriding need to separate fluctuation from true progression.20 This omission is perhaps due to the computational complexity of the calculation. Progressive loss is frequently only identified retrospectively from a series of visual field examinations obtained over a period of several years or more, despite suggestions that confirmation of deterioration should be based on repeat examinations performed over a period of weeks.31 Additional confirmatory examinations add to the financial and resource burdens and may delay appropriate therapeutic intervention. Ideally, repeat examinations would be performed in patients only when the additional information would enhance the management of the glaucoma (i.e., only in those in whom the LF lies outside the normal limits encountered in stable glaucoma).
The Humphrey Field Analyser (HFA; Humphrey Systems, Dublin, CA.) has become a standard for automated perimetry. Two algorithms of the system (Full Threshold and Fastpac) undertake a double determination of threshold (Programs 30-2 or 24-2) at each of 10 specified stimulus locations within 21° eccentricity. The SF is calculated from these 10 specified locations, but the corresponding components of LF have never been specified. The two HFA algorithms also undertake additional double determinations of threshold in cases of decreased patient reliability or in cases of irregularity of the field loss between adjacent locations.32 The effect on the magnitude of the LF components when these additional double determinations of sensitivity are included is also unknown.
The overall purpose of the study was to determine, in stable POAG, the magnitude of the LF between two consecutive visual field examinations over a routine (long) follow-up and over a confirmatory (short) follow-up, thereby aiding the identification of progressive loss from fluctuation in sensitivity. The detailed goals were threefold. First, to compare the magnitudes and distributions of the LF(Ho) and LF(He) components between a pair of successive examinations over each of the two types of follow-up. Second, to determine the degree of similarity in the magnitudes of the components derived from the succeeding pair of examinations. Third, to determine the relationship between the magnitude of the LF components and the summary visual field indices.
| Methods |
|---|
|
|
|---|
Long Follow-up
The protocol for the derivation of the LF over the long follow-up
involved a retrospective study. The sample comprised 30 patients with
stable POAG drawn from the Glaucoma Service of the Toronto Western
Hospital and was selected to provide the broadest possible range of
field loss. Three consecutive visual field examinations (HFA Program
30-2; stimulus size III), each separated by 6 to 9 months, were
selected from the randomly assigned eye of each patient. The three
fields were chosen on the basis that they were stable and had been
preceded by at least two earlier examinations (to minimize the effects
of learning) and that the optic nerve head appearance, intraocular
pressures, and therapeutic modality remained unchanged for each patient
during the period in question. All patients had a distance visual
acuity of 6/12 or better during the selected period. Patients with
clinically significant cataract, receiving pilocarpine therapy, or with
ocular history in addition to glaucoma were excluded from the study.
The sample was not controlled for the type of ocular medical therapy
between patients, with the exception of pilocarpine. The mean age of
the sample was 64.8 ± 11.4 (SD) years. The mean deviation (MD) at
the initial presentation ranged from -0.2 to -23.7 dB (mean,
-8.3 ± 7.3 dB [SD]) and the pattern standard deviation in full
(PSD) from 1.3 to 16.5 dB (mean, 6.7 ± 3.4 dB). The group mean
intraocular pressure over the three examinations was 15.0 ± 2.3
mm Hg (SD). The controlled intraocular pressure did not vary by more
than 3 mm Hg for any patient over the study course. The mean interval
between each of the three examinations was 7.9 ± 1.3 months.
The stability of the optic nerve head appearance had been evaluated using stereo observation by a single experienced clinician (GET) in terms of the occurrence of new or increased notching, new or increased saucerization, increased cup-to-disc area ratio, or new hemorrhage. The nerve head evaluation was undertaken with the investigator masked to the outcome of the visual field examinations.
The usual protocol for examination of the visual field had been strictly followed, and all fields were deemed to be reliable in terms of the incidence of false responses to the catch trials. Stability of the visual field over each of the three examinations was confirmed from inspection of the Overview, Change Analysis, and Glaucoma Change Probability Analysis Statpac (Humphrey) printouts by one of the authors, experienced in visual field interpretation (JMW) who was masked to the outcome of the optic nerve head evaluation and statistically in terms of the stability of the visual field indices MD, corrected PSD (CPSD), and SF.33 For the statistical analysis, three separate measures of the MD were evaluated: the weighted MD of all 76 locations, the corresponding unweighted MD, and the unweighted MD derived from the 10 standard stimulus locations at which a double determination of threshold is undertaken. The weighted MD was available from the perimeter printout and is a summary measure of the differences between the measured values of sensitivity and the corresponding expected age-matched normal values. The unweighted indices were calculated because the details of the HFA weighting function have not been published. Two measures of the SF were analyzed: the weighted SF derived from the HFA printout and the unweighted SF. Three measures of the CPSD were analyzed: the weighted CPSD from the HFA printout, the unweighted 76 location CPSD, and the unweighted CPSD derived from the 10 standard locations. The unweighted mean deviation and unweighted corrected pattern SD require the age-matched normal sensitivity at each stimulus location for their calculation; the necessary normal values were obtained from the manufacturer. The values were identical with the normal data used in the software (Statpac; Humphrey).32 A separate repeated measures analysis of covariance (ANCOVA) was undertaken for each of the three indices, MD, SF, and CPSD. Age was considered as a covariate and subtype of each index (i.e., weighted 76 point, unweighted 76 point, or unweighted 10 point) and examination (i.e., first, second, and third) as within-subject factors. The three measures of each index were each stable over the three examinations (MD P = 0.148; SF P = 0.644; CPSD P = 0.475).
The LF(Ho) and LF(He) components were calculated by a two-factor analysis of variance (ANOVA) with replications that uses the sensitivities from the double determination of threshold at each of a given number of stimulus locations that are common to the two examinations.21 22 23 The replications comprised the double determinations of threshold at each of the 10 stimulus locations used by the field analyzer for calculation of the SF. The procedure assesses the difference in the average overall sensitivity between the two examinations, LF(Ho), and whether the difference varies between stimulus locations, LF(He). The two components are separately derived from the root of the corresponding mean square estimate of the ANOVA corrected for the aggregate measurement error across the two examinations.21 22 23 34 The LF(Ho) and LF(He) components for each patient were calculated between the first and second, and between the second and third, fields of the series (see Appendix). The within-subject factors were stimulus location and examination order. Custom software was used to convert and analyze the HFA data files (HFA Tools, 1989; Microsystems Technology, Waterloo, Ontario, Canada).
Although the true values of the LF components can only be positive or
zero, the formulae generally used to estimate these components allow
negative values, and negative values of both components have been
reported.23
A negative value for the variance estimate,
jk2, corresponding to the LF(He) was
obtained in cases in which the magnitude of the mean square estimate
was considerably smaller than that of the variance estimate of the
measurement error (
2; see Appendix).
Similarly, a negative value for the variance estimate
k2 was obtained when the mean square
estimate corresponding to the LF(Ho) was small relative to that of
LF(He). In such cases, the mean square estimates corresponding to the
measurement error and to the particular negative variance estimate were
combined to produce a pooled aggregate error, the magnitude of which
better described the between-examination fluctuation. In all cases, the
components of the LF were expressed in decibels as the square root of
the given variance estimate.
Short Follow-up
The protocol for the derivation of the LF over the short
examination interval involved a prospective study. The sample comprised
30 patients with POAG selected from the database of the Glaucoma
Service of the Toronto Western Hospital. Each patient was individually
matched, as closely as possible, to a patient in the long follow-up on
the basis of the appearance of the field defect in the designated eye
(i.e., the number, severity, and location of the pattern deviation
probability symbols and the magnitudes of the MD and CPSD;
P = 0.79), in terms of intraocular pressure (group mean
15.4 ± 2.21 mm Hg; P = 0.110) and of age (group
mean, 67.8 ± 7.14 years; P = 0.532). The group
mean difference between the long and short follow-ups for the MD was
1.32 ± 1.18 dB and for the CPSD was 2.85 ± 1.58 dB. All
patients were experienced in automated perimetry having had numerous
previous examinations. The exclusion criteria and optic nerve head
evaluation were identical with that for the long follow-up.
Each patient underwent three field threshold tests (HFA Program 30-2; stimulus size III) in the designated eye over a period of 6 weeks with an interval of 3 weeks between each examination. The short follow-up examinations followed a protocol identical with that of the long follow-up and were performed by one experienced perimetrist. All fields were deemed to be reliable in terms of the incidence of false responses to the catch trials.
The assessment of the visual field stability of the three fields over the 6-week period of the short follow-up was performed as described for the long follow-up by one author (JMW), who was masked to the study arm from which the fields were drawn and to all other clinical information. The respective ANCOVAs showed that the three separate indices were each stable over the three examinations (MD P = 0.877; SF P = 0.126; CPSD P = 0.064), including all subtypes. The LF(Ho) and LF(He) components for each patient were then separately calculated between the first and second and the second and third fields of the series, using a procedure identical with that of the long follow-up.
Analysis
Stability of the LF(Ho) and LF(He) components and of the error
term between the three successive examinations of each arm of the study
were determined using a separate repeated measures analysis of
covariance (ANCOVA) for each component. Age was considered as a
covariate, examination order as a within-subject factor, and type of
study as a between-subjects factor. The magnitudes of the LF(Ho) and
LF(He) and of the error term were not significantly different over the
three examinations in either arm of the study: LF(Ho) P = 0.940; LF(He) P = 0.426; error P =
0.652.
| Results |
|---|
|
|
|---|
|
Only 14 of the 30 patients in the long follow-up exhibited additional double determinations of threshold at stimulus locations that were common to the first and second examinations, and only 16 patients had additional double determinations at locations common to the second and third examinations. The corresponding figures for the short follow-up were 29 and 25 patients, respectively. The mean number of additional double determinations of threshold across pairs of examinations was 2.2 in the long follow-up group and 2.9 in the short follow-up group. The LF(Ho) and LF(He) components derived from all available double determinations were, in general, higher in magnitude than the corresponding components derived from the 10 standard double determinations of threshold (Table 2) . The higher group mean for the LF components derived from all available double determinations of threshold is to be expected, inasmuch as a second threshold determination is undertaken at a stimulus location exhibiting either decreased patient reliability and/or irregularity of the field loss between adjacent locations.32 However, this difference did not reach statistical significance for either component (LF[Ho] P = 0.132; LF[He] P = 0.082), regardless of examination pair (LF[Ho] P = 0.671; LF[He] P = 0.357) or length of follow-up (LF[Ho] P = 0.622; LF[He] P = 0.461).
|
| Discussion |
|---|
|
|
|---|
The mean values of the LF(Ho) and LF(He) were larger than those reported previously. The early theory relating to the components of the LF was based on results from Program JO of the Octopus automated perimeter (Interzeag, Schlieren, Switzerland), which determined the threshold twice at each of 49 stimulus locations within 26° eccentricity. With this perimeter, the values for the LF(Ho) and LF(He) were 0.5 and 0.2 dB, respectively, in normal subjects; 0.9 and 0.4 dB in those with suspected glaucoma, and 1.2 and 0.5 dB in those with glaucoma.23 The disparity in the values between the Humphrey and Octopus perimeters is likely to be due in part to the differences in the thresholding algorithms,35 in the number of stimulus locations at which a double determination of sensitivity is undertaken, and in the stimulus parameters of the two instruments.36
The variance estimates were obtained using established methodology21 22 23 that originally had ignored the presence of negative LF components. The treatment of negative values of the variance estimate is a notoriously challenging statistical issue. The procedure described here incorporated an approach for the handling of negative estimates of LF. This difference in methodologies accounts, to some extent, for the disparity between our results and those reported for the Octopus, because the effect of the modified technique is to skew the distribution toward a more positive value. Another approach would have been either to allocate a value of zero to the negative values or to omit such values from the summary distributions. The former was considered inappropriate, because it would have ignored the influence of the measurement error but would have resulted in smaller estimates, whereas the latter would not have been representative of the clinical reality. Alternative statistical techniques for approximating interval estimates have also been described.37 38 The probable effect of the method for dealing with negative estimates of variances used in the study is to overestimate the magnitude of those components of the LF that exhibit a negative variance. The consequence of this overestimation in relation to the identification of progressive visual field loss is the provision of a conservative standard for the definition of excessive long-term fluctuation.
Calculation of the LF for the standard algorithm of the HFA (Program 30-2 or 24-2) is effectively limited to the 10 standard double determinations of threshold situated within 21° eccentricity. The use of 10 stimulus locations merely samples the extent of the fluctuation occurring across the field on the assumption that it is representative of the field as a whole and potentially ignores the influence of the relatively greater variance at the more peripheral locations.29 In any patient, the precision of the estimate of the LF for the field as a whole is affected by the degree of similarity in the position and depth of the field loss for the complete field compared with that manifested at the 10 locations. However, any sampling error derived from using the 10 locations is constant between any two tests of the same program for any patient. An alternative would be to increase the number of stimulus locations at which a double determination of threshold is routinely undertaken. The resultant LF would be more representative of the LF for the field as a whole but would be acquired at the expense of an increased examination time. The longer examination duration would produce a greater fatigue effect,16 particularly in the second eye examined, with the consequent reduction in the accuracy of the estimate of the true threshold. Inclusion of all double determinations of threshold consistent between any two consecutive fields would theoretically seem to offer an opportunity for a more representative estimate of the various LF components. However, these supplementary double determinations are seldom common between the second field of the given pair and the following field and, in any case, did not yield LF components that were statistically different in magnitude than those based only on the 10 double determinations.
The LF(Ho) may appear similar to the difference in the MD between two successive examinations, because both measures represent descriptions of the change in sensitivity occurring across the entire field. The difference in the MD provides a measure of alteration in the overall height of the visual field and represents the difference between two linear averages of measured values distorted from the true average by a weighting function applied to the MD (adjusting for the variability of the threshold at each stimulus location).32 However, the LF(Ho) component derived by ANOVA is an expression of the variability of the measured height of the field based on a linear model involving an interaction term that allows for the inherent error in the measurement of threshold. Similarly, the LF(He) may appear similar to the difference in the CPSD between the two successive fields. However, the difference between the CPSDs provides a measure of alteration in the overall shape of the visual field, whereas the LF(He) component represents the difference in the variability of the measured shape of the visual field. That there is no relationship between the LF(Ho) component and the difference in the MD and also between the LF(He) and the difference in the CPSD emphasizes the distinction between the change in measurement (i.e., the index) and the variability present across the visual field between examinations. The magnitude of the LF components might also be expected to change as a function of the severity of field loss, because of ceiling and floor effects of the measured sensitivity; however, this was not the case. Because the LF cannot be inferred from the magnitude of the visual field indices, the importance of the LF in identifying progressive visual field loss cannot be overstated.
The stability of the two LF components over both the short and long follow-ups, together with there being no relationship with the severity of field loss, suggests that the confidence limits associated with each LF component could be used as a reliability index of the relationship between any two consecutive fields. Such an index would be based on the variability inherent in the stable glaucomatous visual field, rather than on the basis of the variability inherent in the normal field. It can be argued that having confirmed the presence of glaucomatous field loss by reference to the properties of the normal visual field, glaucomatous progression should be evaluated by reference to the properties of the stable glaucomatous visual field. This latter approach has been adopted in the Glaucoma Change Probability Analysis of the HFA Statpac for Windows.39 40
The resultant values for each component of the LF were therefore separately bootstrapped41 for each pair of successive fields to identify the error distribution associated with the 90th percentiles (i.e., the empiric confidence limits). Bootstrapping is a statistical technique of sampling with replacement and increases the maximum information content from a given sample size. A sampling size of 27 was drawn and replaced 200 times for each LF component, and the 90% confidence limits were determined (Table 3) . The construction of a 95th percentile based on a sample of 30 patients was thought to be statistically inappropriate.
|
|
| Appendix 1 |
|---|
|
|
|---|
![]() |
The mean square estimates (MSE) for Vk and
LVjk were then reduced to their
constituent variances to remove the accompanying error variance
Ejkl by the hypothesis34
![]() | (1) |
![]() | (2) |
![]() | (3) |
The LF(He) component (
jk) was calculated
by substituting the mean square error of the error term
2 from equation 3
into equation 2
. The LF(Ho) component
(
k) was calculated in a similar manner by
substituting equation 2
into equation 1
.
Treatment of Negative Components of LF(Ho) and LF(He)
In cases in which either or both of the variance estimates,
k2 and
jk2, corresponding to the LF(Ho)
and LF(He) components, respectively, were found to be negative, the
mean square estimate Vk and
LVjk of either
k2 or
jk2,was combined with the mean
square estimate of Ejkl to give a
corrected error variance,
2. The
resultant root mean square of
2 was
considered to be more representative of the actual aggregate error.
This corrected error variance was achieved by adding the sums of
squares of the original error estimate to the sums of squares of the
source of the negative value
![]() | (4) |
2 is the corrected aggregate
error; SSerror and
dferror are the sums of squares and
degrees of freedom of the error variance, respectively; and
SSx and dfx
are the sums of squares and degrees of freedom of the source of the
negative value (i.e., either LV or V). Because
the estimate of
2 becomes
2, the interaction variance,
,
was recalculated by substitution into equation 2
. Because
2 =
2, then
![]() | (5) |
Therefore,
jk2 was a
more accurate assessment of
jk2
from which to calculate the LF(He). Because the magnitude of the
aggregate error variance masks the statistical effect of the negative
value, the corrected aggregate error variance is the most
representative value of the negative effect. If both
k2 and
jk2 were found to be negative,
then the combined error estimate calculation (equation 4)
included the
sums of squares of both Vk and
LVjk from which they are derived.
| Footnotes |
|---|
Submitted for publication September 14, 1999; revised March 2, 2000; accepted March 22, 2000.
Commercial relationships policy: R (JW); F (JF); N (NH, MH, GT).
Corresponding author: Natalie Hutchings, School of Optometry, University of Waterloo, Waterloo, Ontario N2L 3G1, Canada. nhutchin{at}sciborg.uwaterloo.ca
| References |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
P. Fogagnolo, A. McNaught, M. Centofanti, L. Rossetti, and N. Orzalesi The Effects of Intraocular Pressure Reduction on Perimetric Variability in Glaucomatous Eyes Invest. Ophthalmol. Vis. Sci., October 1, 2007; 48(10): 4557 - 4563. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. G. Strouthidis, A. Scott, A. C. Viswanathan, D. P. Crabb, and D. F. Garway-Heath Monitoring Glaucomatous Visual Field Progression: The Effect of a Novel Spatial Filter Invest. Ophthalmol. Vis. Sci., January 1, 2007; 48(1): 251 - 257. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. W. Gillespie, D. C. Musch, K. E. Guire, R. P. Mills, P. R. Lichter, N. K. Janz, and P. A. Wren The Collaborative Initial Glaucoma Treatment Study: Baseline Visual Field and Test-Retest Variability Invest. Ophthalmol. Vis. Sci., June 1, 2003; 44(6): 2613 - 2620. [Abstract] [Full Text] [PDF] |
||||
![]() |
S K Gardiner and D P Crabb Frequency of testing for detecting visual field progression Br. J. Ophthalmol., May 1, 2002; 86(5): 560 - 564. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Hutchings, S. L. Hosking, J. M. Wild, and J. G. Flanagan Long-term Fluctuation in Short-Wavelength Automated Perimetry in Glaucoma Suspects and Glaucoma Patients Invest. Ophthalmol. Vis. Sci., September 1, 2001; 42(10): 2332 - 2337. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |