|
|
||||||||
1From the Hamilton Glaucoma Center, Department of Ophthalmology, University of California San Diego, La Jolla, California; and the 2Department of Ophthalmology, University of São Paulo, São Paulo, Brazil.
| Abstract |
|---|
|
|
|---|
METHODS. The study included 223 patients with suspected glaucoma during an average follow-up of 63.3 months. Included subjects had a suspect optic disc appearance and/or elevated intraocular pressure, but normal visual fields. Conversion was defined as development of either repeatable abnormal visual fields or glaucomatous deterioration in the appearance of the optic disc during the study period. The association between baseline GPS and conversion was investigated by Cox regression models.
RESULTS. Fifty-four (24.2%) eyes converted. In multivariate models, both higher values of GPS global and subjective stereophotograph assessment (larger cup–disc ratio and glaucomatous grading) were predictive of conversion: adjusted hazard ratios (95% CI): 1.31 (1.15–1.50) per 0.1 higher global GPS, 1.34 (1.12–1.62) per 0.1 higher CDR, and 2.34 (1.22–4.47) for abnormal grading, respectively. No significant differences (P > 0.05 for all comparisons) were found between the c-index values (equivalent to area under ROC curve) for the multivariate models (0.732, 0.705, and 0.699, respectively).
CONCLUSIONS. GPS values were predictive of conversion in our population of patients with suspected glaucoma. Further, they performed as well as subjective assessment of the optic disc. These results suggest that GPS could potentially replace stereophotograph as a tool for estimating the likelihood of conversion to glaucoma.
Recently, several imaging technologies have become available for structural evaluation of the optic disc and RNFL and to assist in the diagnosis of glaucoma. One of these technologies, confocal scanning laser ophthalmoscopy (CSLO) provides objective measurements of optic disc topography that have been shown to predict progression in those with suspected glaucoma.6 7 As part of the ocular hypertension treatment study, Zangwill et al.6 showed that CSLO stereometric parameters and the Moorfields regression analysis (MRA) had a significant predictive value in discriminating those ocular hypertensive patients who converted to glaucoma from those who did not.
Previous CSLO measures used for evaluation of optic disc topography have been limited by the need for an examiner to approximate the optic disc margin with a contour line to calculate stereometric parameters and the MRA. This requirement added an undesirable subjectivity to the examination and may have resulted in significant differences in the topographic parameter values obtained by different examiners.8 Also, accurate tracing of the contour line optimally requires simultaneous visualization of optic disc photographs,8 which obviously decreases the value of CSLO as a replacement method for stereophotographs in the assessment and documentation of the optic disc. To overcome this limitation, a new version of CSLO (HRT-3; Heidelberg Engineering GmbH, Dossenheim, Germany) includes the Glaucoma Probability Score (GPS), an index that is independent of the contour line traced by the examiner. The GPS is based on a three-dimensional model of the entire topographical image and has been shown to discriminate glaucomatous from normal subjects in cross-sectional studies.9 10
The purpose of the present study was to evaluate the ability of the contour-line–independent parameter GPS to predict the development of visual field loss or optic disc deterioration in persons with suspected glaucoma. Further, the predictive ability of this parameter was compared to that of subjective stereophotograph assessment.
| Methods |
|---|
|
|
|---|
Subjects were defined as having suspected glaucoma according to the clinical examination by two glaucoma specialists (FAM, RNW). Included were those with suspect optic disc appearance (as determined by subjective assessment) and/or elevated intraocular pressure (>21 mm Hg). All subjects had normal and reliable standard automated perimetry (SAP) visual fields at baseline, as defined later in the paper. Eligible subjects were required to have had a visual field examination and optic disc stereophotograph taken close in time to a baseline HRT scan used for evaluation. Baseline was set at the first occurrence of this matching, and the HRT date was used as the baseline date. The average time interval between examinations was 1.4 months (median: 0.6 months, first quartile: 0.2 months, third quartile: 1.7 months). Only subjects with open angles on gonioscopy were included. Subjects were excluded if they presented best-corrected visual acuity less than 20/40, spherical refraction outside ± 5.0 D and/or cylinder correction outside 3.0 D, or any other ocular or systemic disease that could affect the optic nerve or the visual field. One eye of each patient was randomly selected for analysis.
For each eye, central corneal thickness (CCT) was calculated as the average of three measurements obtained during the same visit using an ultrasound pachymeter (Pachette GDH 500; DGH Technology, Inc, Philadelphia, PA).
Standard Automated Perimetry
Only patients with normal and reliable visual fields on the baseline were included. Standard automated perimetry (SAP) visual fields were obtained using either 24-2 Full Threshold or Swedish Interactive Thresholding Algorithm (SITA) (Humphrey Field Analyzer; Carl Zeiss Meditec, Dublin, CA) strategies. Only tests with reliable results (
33% fixation losses, false positives, and false negatives) were included.
Glaucomatous conversion by visual field was defined as the development of three consecutive abnormal examinations during follow-up, or two consecutive when these were the last examination results available during follow-up. An abnormal result followed by a normal one was not considered conversion. An abnormal visual field was defined as a pattern standard deviation (PSD) with P < 0.05 and/or a glaucoma hemifield test (GHT; Humphrey Perimeter; Carl Zeiss Meditec, Inc., Oberkochen, Germany) with results outside normal limits. Two experienced glaucoma specialists verified that the visual field defects were consistent with glaucoma.
Stereophotograph Grading
Simultaneous stereoscopic optic disc photographs (TRC-SS; Topcon Instrument Corp of America, Paramus, NJ) were reviewed with a stereoscopic viewer (Pentax Stereo Viewer II; Asahi Optical Co., Tokyo, Japan). Baseline stereophotographs were evaluated by two masked, experienced graders and classified as glaucomatous or normal. Glaucomatous optic disc appearance was defined based on the presence of neuroretinal rim thinning, excavation, notching, or characteristic retinal nerve fiber layer defects. Vertical cup-to-disc ratio (CDR) was assessed by visually estimating the CDR based on the contour of the cup. The average value between examiners was calculated and used for analysis. For progression assessment, each patients most recent stereophotograph was compared with the baseline one. Each grader was masked to the temporal sequence of the photographs. Definition of change was based on focal or diffuse thinning of the neuroretinal rim, increased excavation, and the appearance or enlargement of RNFL defects.
Discrepancies between the two graders were resolved either by consensus or by adjudication of a third experienced grader. Only photographs with adequate quality were included. From an initial group of 310 patients who fulfilled inclusion criteria, 5 (2%) subjects had poor-quality photographs at baseline, and 29 (9%) did not have follow-up stereophotographs to assess progression and were excluded from further analysis.
Confocal Scanning Laser Ophthalmoscopy
CSLO images were acquired using either the HRT-I or –II (Heidelberg Engineering, GmbH) and analyzed on each respective machine, using HRT-3 software. Further details on these instruments have been described previously.11 12 13 Only 15° images were used. For each patient, three topographical images were obtained and then combined and automatically aligned to make a single mean topography used for analysis. Magnification errors were corrected using patients corneal curvature measurements. Good-quality images required a focused reflectance image with a standard deviation not greater than 50 µm and centered GPS analysis. From an initial group of 310 patients who fulfilled inclusion criteria, 15 (5%) were excluded because the 15° HRT baseline image could not be retrieved, 26 (8%) were excluded after quality control of the HRT mean image, 4 (1%) were excluded because the HRT was not able to run the GPS analysis, and 11 (4%) were excluded as a result of highly off centered analysis of the GPS algorithm.
The GPS is obtained using a new automated analysis independent of either contour line tracing or a reference plane. The software analyzes the optic disc and parapapillary retina topography and builds a three-dimensional (3-D) model using five shape-based measures: cup size, cup depth, and rim steepness (referring to the optic disc) and vertical (superior to inferior) and horizontal (nasal to temporal) parapapillary nerve fiber layer curvatures. The values of the parameters are then fed into a machine-learning classifier analysis, a relevance vector machine (RVM), which compares the patients results to previously defined healthy and glaucomatous models. Glaucomatous eyes usually present with flatter RNFL curvature and increased cup size, depth, and slope (rim steepness). The final GPS is the probability that the model has structural differences from the normal model that are compatible with glaucomatous damage. The higher the GPS, the more similar it is to the glaucoma model.
GPS results were obtained for the global region, as well as for six predefined sectors (with 0° as temporal): temporal superior (45°–90°), nasal superior (91°–135°), nasal (136°–225°), nasal inferior (226°–270°), temporal inferior (271°–315°), and temporal (316°–44°).
Follow-up and Definition of Study Endpoints
Conversion to glaucoma in this study was defined by either visual field test results or optic disc stereophotograph evaluation, whichever came first. Eyes in which a confirmed visual field defect developed or optic disc deterioration on stereophotographs were referred to as converters.
For converters, follow-up time was defined as the time between the HRT baseline visit and the date of the first abnormal visual field result or the first optic disc stereophotograph showing deterioration (the study endpoint). For nonconverters in both visual field and optic disc evaluation, follow-up time was defined as the time between the HRT baseline visit and date of last available visual field test result or stereophotograph on DIGS, whichever came first. During follow-up time, each patient was treated at the discretion of the attending ophthalmologist.
Statistical Analysis
The primary purpose of the study was to determine whether the baseline GPS is predictive of progression. Other variables analyzed as potential risk factors were age, baseline IOP, CCT, and the baseline SAP visual field index PSD. Hazard ratios (HRs) for the association between GPS parameters and the development of a documented progression were obtained by Cox proportional hazards models. We report HRs from univariate models, which do not adjust for the presence of other factors, as well as adjusted HRs from multivariate Cox proportional hazards models. For the multivariate models, we report hazard ratios after adjustment for age, baseline IOP, CCT, and SAP PSD. These variables have been reported to be significantly associated with the risk of development of glaucomatous visual field loss or optic disc deterioration among patients with ocular hypertension or suspected glaucoma.14 15 16
We also evaluated the ability of subjective stereophotograph evaluation (grading and vertical cup/disc ratio) in predicting the development of progression. Univariate hazard ratios were reported for stereophotograph grading (glaucoma versus normal) as well as for vertical CDR. Adjusted HRs were also reported for these variables after adjustment for age, baseline IOP, CCT, and SAP PSD.
As the magnitude of a hazard ratio for a particular variable depends on its unit of measurement, a direct comparison of HRs would be an inappropriate way of comparing the predictive abilities of GPS and stereophotograph assessment. For this purpose, we used the c-index, as suggested by Harrell.17 The c-index is similar to the area under the receiver operating characteristic (ROC) curve and is frequently used to evaluate the discriminating ability of predictive models in survival data. It is calculated as the proportion of all usable subject pairs in which the predictions and outcomes are concordant. If the predicted survival time is larger for the subject who actually survived longer, the predictions of the pair are concordant with the outcomes. In predicting the time to an event, c is calculated by including all possible pairs of subjects, at least one of whom has experienced the event (viz., progression). Two subjects survival times cannot be ordered if both subjects are censored or if one has failed and the follow-up time of the other is less than the failure time of the first.18 A c-index of 0.5 indicates random predictions, whereas 1.0 indicates perfect prediction. The c-index was calculated for multivariate models, including GPS, and adjusting for age, baseline IOP, CCT, and SAP PSD, as well as for multivariate models including stereophotograph parameters and adjusting for the same variables. Therefore, each multivariate model contained the combination of an optic disc parameter (objective versus subjective) plus other variables previously identified as significantly associated with the risk of the development of glaucoma. To test for the significance of the difference in discrimination between two models, we used the rcorrp.cens function from Harrells Hmisc/Design library.17 This computes U statistics for testing whether the predictions of one model are more concordant with actual observations than those of another model.
To adjust for potentially confounding effects of treatment, these analyses were also performed incorporating treatment as a time-dependent covariate.
Statistical analyses were performed with commercial software packages (SPSS, ver. 15.0; SPSS, Chicago, IL; Stata, ver. 9.0; StataCorp, College Station, TX; and S-PLUS ver. 6.0; Insightful Corp., Seattle, WA). The
level (type I error) was set at 0.05.
| Results |
|---|
|
|
|---|
|
|
|
|
|
|
The c-index was used to evaluate the predictive abilities of the multivariate models (Table 2) . Models containing global or sectoral GPS results had very similar c-index values. The c-index for the multivariate model containing GPS global was 0.732. For the multivariate model containing stereophotograph grading (glaucoma versus normal), the c-index was 0.699, whereas it was 0.705 for the multivariate model containing vertical cup–disc ratio. There were no statistically significant differences in the predictive abilities of multivariate models containing GPS global results, stereophotograph CDR, or stereophotograph grading (P > 0.05 for all comparisons). Similar results were found when analyses were repeated using treatment as a time-dependent covariate.
Spatial agreement between visual field loss and GPS abnormalities was evaluated for those who developed a repeatable abnormal visual field. Twenty-four subjects presented an inferior visual field defect at conversion. Of these, there were 6 with an outside normal limits classification for GPS global analysis and 6 for temporal, 5 for nasal, 10 for inferior (inferior temporal and inferior nasal), and 10 for superior (superior temporal and superior nasal) visual field defects. Thirty-two subjects presented a superior visual field defect at conversion (10 had loss in both hemispheres). Of these, there were 8 with an outside normal limits classification for GPS global analysis and 9 for temporal, 9 for nasal, 14 for inferior (inferior temporal and inferior nasal), and 12 for superior (superior temporal and superior nasal) visual field defects. In several cases, there was more than one sector outside normal limits, and there was also an overlap of subjects with superior and inferior visual field defects.
Table 4 shows the HRs for the association of each variable and progression as assessed by stereophotograph evaluation and by visual fields separately. When stereophotographs were used as the only endpoint, only CCT and stereophotograph assessment at baseline were significantly associated with the risk of progression. It is important to note, however, that the low number of patients who converted by stereophotograph during follow-up did not allow precise estimation of HRs in this analysis, and the CIs were considerably larger than in the complete analysis. When only consecutive abnormal visual fields were considered as the endpoint, results were similar to those of the complete analysis.
|
|
| Discussion |
|---|
|
|
|---|
After adjustment for other potential risk factors, each 0.1 increase in the GPS global score was associated with a 31% higher risk of reaching the study endpoint. No significant improvement in predictive ability was noted when GPS results in the six predefined sectors were considered. When manufacturers suggested cutoffs were used, an outside normal limits result on the GPS final classification was associated with an approximately fivefold higher risk of glaucoma, compared to a within normal limits result. For example, at 5 years of follow-up, the probability of conversion was 11% for those with within normal limits baseline GPS final classification, compared with 25% for those with borderline GPS and 45% for those with results outside normal limits.
Several investigators have previously evaluated the ability of HRT parameters to predict development of visual field defects or optic disc deterioration in patients with suspected glaucoma. Bowd et al.7 demonstrated that HRT II MRA results and machine learning classifiers developed using stereometric parameters were able to detect abnormalities in optic disc topography before the development of visual field loss. Zangwill et al.6 showed that several baseline HRT topographic measurements, alone or in combination with baseline clinical and demographic factors, were significantly associated with the development of glaucoma among participants with ocular hypertension in the Ocular Hypertension Study (OHTS), suggesting that this instrument could be a useful predictive tool in this population. All HRT parameters included in these studies, however, required tracing of a contour line outlining the optic disc margin for their calculation. Although the HRT has been developed to provide objective measurements of optic disc topography, the contour line requirement still imposes limitations on the objectivity and accuracy of this test. In fact, improper contour line drawing is recognized as a frequent source of error and misinterpretation of HRT exams.8 With the development of the GPS analysis, these limitations have been largely overcome. GPS calculations are independent of contour-line tracing and do not rely on a reference plane. Therefore, use of the GPS parameter removes the subjectivity that would be added by relying on contour line tracing performed by an examiner. To our knowledge, the present study is the first to demonstrate that the GPS parameters are also significantly predictive of conversion in patients with suspected glaucoma.
We also compared the performance of the GPS to that of subjective expert stereophotograph assessment for predicting which patients would have glaucoma develop during follow-up. Both methods had similar ability to discriminate converters from nonconverters, as assessed by the c-index. For the multivariate model containing GPS global, age, baseline IOP, CCT, and SAP PSD, the c-index was 0.732. A c-index of 0.73 indicates that, in approximately 73% of the cases, the model allocated a higher predicted probability for a subject who actually converted than for a subject who did not. For multivariate models containing stereophotograph grading and vertical cup–disc ratio, corresponding values were 0.699 and 0.705. It is interesting to note that the c-indexes for the multivariate models reported in our study are similar to those reported in other studies conducted to evaluate predictive models to estimate risk of glaucoma. Medeiros et al.19 found c-indexes from 0.68 to 0.73 when predictive models derived from the OHTS were applied to estimate risk of conversion to glaucoma in a group of 126 ocular hypertensive patients followed for approximately 8 years. Similarly, a predictive model developed from combined results of the OHTS and the European Glaucoma Prevention Study (EGPS) had a c-index of 0.74 for discriminating ocular hypertensive subjects who converted to glaucoma from those who did not.20
Several cross-sectional studies have previously compared the performances of objective structural assessment by imaging instruments and subjective stereophotograph evaluation. Girkin et al.21 and DeLeón-Ortega et al.22 showed that subjective stereophotograph assessment outperformed HRT parameters in discriminating patients with glaucomatous visual field loss from normal subjects. Both studies, however, are limited by their cross-sectional design and lack of a reference standard that was completely independent of the tests being evaluated. Although visual fields were reported to be the main test used to classify patients in both studies, it is very likely that clinical examination of the optic disc was used at some point to classify participants, due to the cross-sectional design and glaucoma clinic-based samples in the studies. The appearance of the optic disc on clinical examination is more likely to be related to its appearance on stereophotographs than to results of imaging instruments, and this tendency could introduce a bias in favor of stereophotograph assessment. Also, although diagnostic accuracy measures obtained from the studies just mentioned are useful in providing an initial evaluation of the ability of these tests to detect glaucomatous damage, it is clear that in clinical practice a clinician does not need an imaging test just to help differentiate a patient with repeatable glaucomatous visual field loss from a healthy subject without suspect findings. In fact, clinicians are most interested in the ability of the test to diagnose or predict damage in patients with suspected disease who do not show any clear abnormality such as repeatable visual field loss.23 The longitudinal design used in our study largely overcomes these limitations when imaging instruments and stereophotographs are compared.
Our study has limitations. Patients were not randomized for treatment or no treatment, and the decision as to whether to initiate treatment might have been based on disc assessment results and other risk factors. It is possible that patients with more risk factors or a more suspect appearance of the optic disc were the ones who received treatment during follow-up. It might be argued that these patients would be less likely to progress—a situation that would underestimate the true predictive value of baseline optic disc evaluation for conversion to glaucoma. However, this probably would affect the predictive ability of both methods of optic disc assessment evaluated in our study. Further, when treatment was included as a time-dependent covariate, no significant differences were seen on the results. It must be emphasized that stereophotographs in our study were obtained by specialized personnel and evaluated by highly trained individuals from a reading center, blinded to chronological order. Such expertise is unlikely to be available to most eye care providers, and thus the performance of subjective assessment of optic disc morphology was probably overstated.
It is known that, in a considerable number of patients, damage to the optic nerve head occurs despite any evidence of functional loss.1 2 Therefore, structural and functional assessments were used to define conversion in our study. When we performed separate analyses using stereophotographs or visual fields as endpoints, significant differences were observed on the predictive abilities of the baseline variables. It is noteworthy that baseline stereophotograph assessment seemed to perform better than GPS when stereophotographs were used as the only endpoint. This, however, may reflect a bias of using the same method as the predictive variable and for determination of the endpoint, which likely resulted in overestimation of the predictive ability of stereophotograph assessment. Also, the relatively small number of patients in whom conversion was identified by stereophotograph resulted in large CIs and less precise estimates of the predictive abilities of the baseline variables. Therefore, it is likely that a more fair comparison of the predictive abilities of GPS and stereophotographs was obtained in the analysis that used visual fields as the endpoint measure.
It also might be argued that some patients included in our study already had glaucomatous optic neuropathy (GON) at baseline and therefore would not truly have suspected disease. However, the primary purpose of the study was to assess the predictive performance of an objective method of structural assessment, the HRT GPS parameter, without relying on subjective assessment of optic disc morphology. To accomplish this, it was necessary to define the inclusion criteria based solely on visual fields. Although the inclusion of subjects with GON at baseline could overestimate the predictive ability of GPS, the design of our study replicates the clinical situation in which this technology is used to replace subjective assessment of the optic disc for evaluation of the risk of development of glaucoma. It is also important to emphasize that, although an outside normal limits result on the GPS was associated with increased likelihood of progression to visual field loss or optic disc damage, most of the subjects with GPS outside normal limits did not reach the endpoint during the study period. For example, for final GPS classification, 41 (18%) of all patients had an outside normal limits result at baseline. Of these, 17 (42%) had conversion to glaucoma and 24 (58%) did not. Further follow-up will be necessary to ascertain whether these patients will have visual field damage in the future or whether these were false-positive results.
Although stereophotograph assessment and GPS were found to have similar predictive abilities in our study, their agreement was only fair. As these two methods use different techniques to measure different aspects of the optic disc and RNFL, the low agreement is not surprising. Stereophotograph evaluation relies on subjective assessment by graders, which takes into account several features of the optic nerve that would not be evaluated by the GPS, such as presence of hemorrhages, parapapillary atrophy, and localized RNFL defects. On the other hand, GPS analysis is an objective method, and its classification relies on cutoffs to achieve predetermined specificity levels according to a comparison to a normative database. Further, the GPS was originally designed as a global parameter, and modifications to evaluate sectoral optic nerve damage included division of the optic nerve head by sectors. However, only two of the original five parameters are provided in sectors—rim steepness and cup size—whereas cup depth and vertical and horizontal nerve fiber layer curvatures are only given as global measurements. Therefore, the sectoral parameters do not incorporate information about localized damage to the RNFL or localized changes in cup depth. The inability to measure local damage may also explain, at least in part, the weak relationship between the abnormal GPS sector and the subsequent area of VF loss found in our study. In fact, the performance of sectoral parameters has not been demonstrated to be superior to that of GPS global for detection of glaucomatous damage.24
In conclusion, the results of our study demonstrate that the HRT parameter GPS is able to predict development of glaucomatous field loss or optic disc deterioration in patients with suspected glaucoma. Further, they demonstrate that GPS analysis has predictive ability similar to that of subjective stereophotograph assessment by glaucoma experts. These findings suggest that objective assessment of the optic disc topography using the HRT GPS technology has the potential to replace subjective optic disc evaluation as a predictive tool for estimating the likelihood of conversion in a patient with suspected glaucoma.
| Footnotes |
|---|
Submitted for publication January 30, 2007; revised May 23, August 26, and November 4, 2007; accepted March 13, 2008.
Disclosure: L.M. Alencar, None; C. Bowd, None; R.N. Weinreb, Heidelberg Engineering (F, R), Carl Zeiss Meditec (F, R); L.M. Zangwill, Heidelberg Engineering (F), Carl Zeiss Meditec (F); P.A. Sample, Carl Zeiss Meditec (F); F.A. Medeiros, Carl Zeiss Meditec (F, R), Heidelberg Engineering (R)
The publication costs of this article were defrayed in part by page charge payment. This article must therefore be marked "advertisement" in accordance with 18 U.S.C.
1734 solely to indicate this fact.
Corresponding author: Felipe A. Medeiros, Hamilton Glaucoma Center, University of California, San Diego, 9500 Gilman Drive, La Jolla, CA 92093-0946; fmedeiros{at}eyecenter.ucsd.edu.
| References |
|---|
|
|
|---|
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |