|
|
||||||||
From the Vision Science Program, School of Optometry, University of California at Berkeley, Berkeley, California.
| Abstract |
|---|
|
|
|---|
METHODS. Eighteen diabetic patients were examined at baseline and at three annual follow-ups. Ophthalmic examinations, including fundus photographs and mfERG testing, were performed at each visit. Thirty-five retinal zones were constructed from the 103-element stimulus array, and each zone was assigned the maximum IT z-score within it based on 30 age-similar control subjects. Logistic regression was used to investigate the development of retinopathy in relation to baseline mfERG IT delays and additional diabetic health variables. Receiver operating characteristic (ROC) curves were used to evaluate the models.
RESULTS. Retinopathy developed in 77 of the 1208 retinal zones, of which 25 had recurring retinopathy. Multivariate analyses yielded baseline mfERG IT, duration of diabetes, and blood glucose concentration as the most important predictors of recurring retinopathy. mfERG ITs were not predictive of transient retinopathy. ROC curves based on the multivariate model for the prediction of recurring retinopathy resulted in an area under the curve of 0.95, sensitivity of 88%, and specificity of 98%. Ten-fold cross-validation confirmed the high sensitivity and specificity of the model.
CONCLUSIONS. The development of recurring retinopathy over a 3-year period can be well predicted by using a multivariate model based on mfERG implicit time. Multifocal ERG delays are promising candidate measures for trials of novel therapeutics directed at preventing or slowing the progression of NPDR.
The suggestion of neural dysfunction in diabetes, in addition to vascular dysfunction, goes back many decades.6 The means of studying this neural dysfunction in humans has involved electrophysiology of the eye and visual psychophysics. Several excellent reviews note deficits in full-field electroretinogram (ERG) oscillatory potentials, foveal cone ERGs, 30-Hz flicker delays, nonphotic electro-oculogram responses, visual acuity, contrast sensitivity, color vision, motion thresholds, and other functional measures.7 8 9
In the past several years our group has been investigating the implicit time of the multifocal electroretinogram (mfERG) as a prospective predictor for the development of diabetic retinopathy. The mfERG has been used to assess retinal function in many retinal disorders, including diabetic retinopathy.10 11 12 13 The implicit time measure of the mfERG is spatially associated with retinopathy,14 correlates with retinopathy severity,14 and is a predictor for the development of retinopathy over a 1-year period.15 This has not been shown for the amplitude measure of the mfERG. The first prominent positive peak (P1) of the mfERG response that our group has investigated is the easiest to measure, and the implicit time measure of P1 has one tenth of the coefficient of variation of the amplitude measure.16 It has also been shown that the mfERG timing abnormalities (delays) can be used in a quantitative model to predict areas of retinopathy development with good accuracy: sensitivity of 86% and specificity of 84%.17
This quantitative model was the first to make predictions of diabetic retinopathy lesions in discrete retinal areas. The study, by Han et al.,17 involved only one follow-up visit and thus could not examine whether the lesions they predicted were either more transient or sustained in nature. More recently, a review by Bearse et al.16 included new data that extended the study by Han et al.17 for another year. Bearse et al. examined only cumulative retinopathy occurrence, however, and did not examine the time course of the lesions.16
A retinal area that shows retinopathy lesions over a longer period represents more significant pathophysiological alterations—namely, increased vascular permeability and hypoxia. Thus, investigating these areas may be more clinically important than those areas that show retinal lesions that are present only transiently. It is well known that the very earliest clinical signs of diabetic retinopathy wax and wane. One study found that over a 2-year period, 52% of microaneurysms show spontaneous resolution.18
We addressed two primary questions in this study. First, given that predictions are more difficult over longer intervals,16 19 20 can we still predict diabetic retinopathy lesions with good accuracy over a longer follow-up period? Second, given that early diabetic retinopathy lesions come and go,18 21 are mfERG implicit times equally predictive of diabetic retinopathy lesions that may be transient as it is of those that are persistent?
To answer both of these questions, we continued our original longitudinal study for two more years, which allowed us to examine mfERG measures in areas without retinopathy at baseline and then the development of transient or more sustained retinopathy over the next 3 years.
| Methods |
|---|
|
|
|---|
|
|
The nature of the study and any potential consequences of study participation were explained to the study subjects, and written informed consent was obtained from all subjects before data collection. Procedures complied with the tenets of the Declaration of Helsinki, and the University of California Committee for the Protection of Human Subjects approved the protocol.
mfERG Recording
mfERGs were recorded with a commercial system (VERIS 4.3; EDI, Redwood City, CA). Subjects pupils were fully dilated with 1.0% tropicamide and 2.5% phenylephrine. The cornea was anesthetized with 0.5% proparacaine before a Burian-Allen bipolar contact lens electrode (Hansen Ophthalmic, Solon City, IA) was placed on the tested eye. A ground electrode was clipped to the right ear lobe, and the contralateral eye was occluded.
The stimulus consisted of 103 scaled hexagonal elements that subtended
45° of the retina (Fig 1A) . Subjects fixated a small target in the center of the stimulus array during the 8-minute recording session. Each hexagonal element was temporally modulated between white (200 cd/m2) and black (<2 cd/m2), according to a 215– 1 binary m-sequence.25 Room lights were kept on throughout the study session, providing an ambient illumination approximately equal to the average luminance of the stimulus. Data acquisition occurred in 16 segments that were each approximately 30 seconds in duration. The quality of recordings and fixation stability were monitored in real time, and contaminated segments were discarded and repeated. The retinal signals were amplified 100,000 times and band-pass filtered 10 to 100 Hz.26 The mfERGs were processed using one iteration of artifact rejection and spatial averaging with one sixth of the surrounding responses before exporting the signals for data analysis.
Data Analysis
The mfERGs were analyzed as has been reported in detail.17 The first-order kernel local mfERG implicit times were measured by using the template-scaling method.27 Waveform templates were constructed from the mean local waveforms of the normal subjects. Each template was then multiplicatively scaled in amplitude and time until the maximum achievable least squares fit to the subjects local response was obtained. The subjects implicit time for a particular response was then derived as the time from the focal flash onset to the first prominent positive peak (P1) of the response (Fig. 1B) . The mean and SD of each local mfERG implicit time measure was calculated from the normative data, and these were used to calculate a z-score for each local mfERG implicit time obtained from the diabetic subjects.
The 103 mfERG hexagonal stimulus elements were grouped into 35 zones (Figs. 1C 1D) , as reported previously,17 to be spatially conservative. The zones were arranged in an approximately symmetric manner across the test area. Using zones instead of individual stimulus elements allows for possible spatial mismatches that could occur when mapping retinal lesions identified in photographs onto the mfERG stimulus array. Each zone was assigned the maximum z-score of the two to three elements that it consisted of. In accord with previous studies,10 14 we set an a priori rule that elements that had statfits greater than or equal to 0.75 were not allowed to determine the z-score for a zone. A statfit of zero would mean that a response perfectly matched the scaled response template, an unexpected result. Conversely, a statfit of 1.0 would mean the template fitted the response as well as a flat line, implying that the response was extremely noisy or even absent. A statfit of less than 0.75 has been shown to indicate a false alarm rate less than 3%.27 The a priori rule for trace rejection was unnecessary, because 3708 retinal elements (36 eyes x 103 retinal areas) were analyzed via the template scaling method, and none of these had a statfit greater than or equal to 0.75. The mean ± SD was 0.24 ± 0.09, with a range from 0.06 to 0.63, indicating very good fits of the scaled templates to the local responses.
Statistical Analysis
We investigated three dichotomized outcomes of retinopathy occurrence. A cumulative outcome (cumulative incidence) was defined as any zone with diabetic retinopathy at any time during the course of the three follow-ups. A transient outcome was defined as any zone with diabetic retinopathy that occurred at only one follow-up visit and was absent on the other two follow-up visits. Finally, a recurring outcome was defined as any zone having diabetic retinopathy that was present at two (not necessarily consecutive) follow-up visits or all three follow-ups. Retinal zones with baseline retinopathy were not included in the analysis, as we were interested in predicting the subsequent development of retinopathy in this study. All statistical analyses were performed in commercial software (Stata 9.2; StataCorp LP, College Station, TX). A criterion of P < 0.05 was used to define statistically significant results, except in the case of multiple t-tests and relative risks (RRs) where the conservative P = 0.017 (0.05/3 tests) was used.28
Logistic regression was performed to investigate the association between new retinopathy occurrence and mfERG implicit time, and several other risk factors: diabetic duration, blood glucose level, age, gender, diabetic type, and baseline retinopathy status. Because the sampling units (mfERG zones) may be spatially correlated within an eye, generalized estimating equations (GEEs) were applied to account for this possible correlation.29 30 The GEE method allows the specification of a working correlation structure in the data. A compound symmetric covariance structure was chosen that assumes a common covariance between mfERG zones within the eyes of a subject and independence between subjects. It is the ability to specify this structure that allows the method to estimate models that account for correlation. The correlation model is used to correct the variance-covariance matrix in a weighted regression. For example, highly correlated zones would have less influence on the final regression estimates. If the GEE method were not used, the covariance among the observations would not be accounted for and the standard errors of the regression coefficients would be artificially small. Robust standard errors were used with the GEE method to ensure appropriate inferences in the presence of any disparity between the empiric and specified covariance structures.
Since measures from the patients two eyes might also be correlated, we assumed the same covariance structure between eyes as we did within eyes. The appropriateness of this structure, common with unordered data, was confirmed by the similarity between the naïve and robust variance estimators. Using the GEE method allows the formulation of an accurate model that accounts for covariance within and across eyes of subjects. Thus, analyses proceeded using both eyes of the subjects.
In building predictive models, we first performed univariate analyses of each risk factor as we have done previously.16 17 We then built a preliminary multivariate model out of those risk factors that have been shown to be the most useful in the prior studies: mfERG implicit time, diabetic duration, and blood glucose level. Next, standard forward stepwise regression was used based on the univariate analyses. The inclusion of an eye variable was used in the multivariate analyses to detect confounding, and none was found. For a risk factor to enter and remain in the model, criterion P < 0.05 and 0.10 were used, respectively.
Probabilities derived from logistic regression analyses were then used in receiver operating characteristic (ROC) curve analyses. Empiric ROC curves were derived from the known outcome of a zone and its associated probability derived from logistic regression. The area under the ROC curve (AUC) was used as a measure of discrimination and the optimal sensitivity and specificity were reported for each ROC curve.31 32 33
Ten-fold cross-validation was used both in model selection and validation.34 35 36 In this technique, the data are randomly parsed into 10 data sets. Each set is used in validating models derived from the remaining nine sets. Generalized accuracy is then obtained from the average of the 10 validations.
| Results |
|---|
|
|
|---|
Relative Risks and Outcome Type
We first wanted to understand the general association between baseline mfERG implicit times and retinopathy development. Overall, retinal zones with abnormal implicit time z-scores (
2) at baseline had a 71% increased risk of development of diabetic retinopathy over the 3 years (cumulative outcome, relative risk [RR] = 1.71, P = 0.017, 95% confidence interval [CI] = 1.10–2.68). Outcomes classified as transient retinopathy were not significantly associated with abnormal implicit time z-scores (RR = 0.75, P = 0.41, 95% CI = 0.38–1.48). Recurring retinopathy outcomes were significantly associated with abnormal baseline implicit time z-scores (RR = 7.6, P < 0.001, 95% CI = 3.21–18.03). These results suggest that retinal areas with recurring retinopathy underlie the marginally significant association found with the cumulative retinopathy outcome.
Baseline Implicit Times and Outcome Type
We then examined whether baseline mfERG implicit times were more abnormal in those retinal zones in which retinopathy developed than those in zones that remained free of retinopathy, and also whether the degree of abnormality varied by the type of outcome (cumulative, transient, or recurring). Table 2 shows that for outcomes classified as cumulative, there were marginal differences between the mean implicit time z-scores of zones in which retinopathy developed compared with zones in which it did not (P = 0.04). Zones that had retinopathy transiently did not differ statistically from zones in which it never developed (P = 0.16); however, there were significant abnormalities in zones in which recurring retinopathy developed (P < 0.001). Thus, recurring retinopathy is the only outcome that showed significant differences in baseline implicit times between areas in which retinopathy did and did not develop over 3 years.
|
An ROC curve was plotted to assess the performance of the recurring model (Fig. 2A) . Using the probabilities derived from the logistic regression model and the known retinopathy outcome (occurring or not), we plotted a function of sensitivity versus 1—specificity for various cutoff or criterion probabilities. The AUC ranges from 0 to 1.0 with 0.5 and 1.0 representing chance and perfect performance, respectively. ROC analysis of this model yielded an AUC of 0.83 with a sensitivity of 84% and specificity of 73% at a cutoff probability of 0.03.
|
|
Ten-fold Cross-Validation of Multivariate Models
We cross-validated both the full and reduced multivariate models as a way of model selection and, more important, to obtain an estimate of the generalized accuracy of the models. Cross-validation gave 78% (SEM = 11%) sensitivity and 98% (SEM = 0.4%) specificity for the full model and 86% (SEM = 6%) sensitivity and 98% (SEM = 0.4%) specificity for the reduced model at cutoff probabilities of approximately 0.04 and 0.12, respectively. Thus, the reduced model performed as well as the full model, and because it is more parsimonious, it is the preferred predictive model.
| Discussion |
|---|
|
|
|---|
ROC analysis showed that, with a cutoff probability of 0.12, the model has high accuracy (88% sensitivity, 98% specificity) in discriminating between those retinal areas that remain free of retinopathy (or have only transient retinopathy) and those areas in which recurring retinopathy develops. The model was validated with 10-fold cross-validation, which showed that it had accuracy (86% sensitivity, 98% specificity) extremely close to that empirically derived by ROC analysis. Cross-validation showed that the inclusion of diabetes type was unnecessary in the final model, and that if it were included, it would result in a lower sensitivity.
The predictive risk factors in this study are shared with our previous models,16 17 with the exception of baseline retinopathy status. Hyperglycemia has been well established as a risk factor for diabetic retinopathy and progression in large-scale, randomized, controlled clinical trials.37 38 Similarly, duration of disease has been established as a prominent risk factor for retinopathy development in a large-scale epidemiologic study.39 40 In the original study by Han et al.17 the presence of any baseline retinopathy within an eye was a predictive factor, with a large OR, but also a wide CI. This result was because 11 of 12 diabetics in whom new retinopathy lesions developed 1 year later had some retinopathy at baseline, and retinopathy developed in only 1 of 16 who did not have any retinopathy at baseline. In this study, 4 of 6 eyes in which new retinopathy lesions developed and recurred over 3 years had retinopathy at baseline, and 1 of 30 eyes without baseline retinopathy had recurring retinopathy. Thus, baseline retinopathy status was not significant in the 3-year model, because 25% fewer eyes with baseline retinopathy had the recurring outcome measure than in the study by Han et al.
Han et al.17 evaluated their multivariate model by applying it to data obtained from a second-year follow-up in the contralateral eye of eight subjects who were originally in the model-making group and to four new subjects who were not used in the derivation of the model.17 It is unknown whether formal cross-validation of the data on the model would have significantly affected the final model reported by Han et al.
It is important to note that the implicit time of the mfERG provides the only locally variable parameter in the multivariate model and, therefore it alone allows the model to predict development of retinopathy in a specific retinal area. Knowledge of a retinal area at high risk of retinopathy also makes it possible, of course, to identify specific eyes and patients at high risk.
The mfERG P1 component analyzed in this study is generated primarily by bipolar cells,41 42 which lie within the inner nuclear layer of the retina. Thus, the neurons primarily involved in generating the index of retinal function that we are investigating lie in the same intraretinal location as the vascular cells that are implicated in diabetic retinopathy lesions. Several recent reports have identified definite neuroglial dysfunction in diabetic mice. At the cellular level, there are neural alterations in the absence of morphologic changes in the vasculature.43 44 45 46 In the first study to quantify neural cell loss in diabetes, Barber et al.45 found that the inner nuclear layer, where the primary generators of the mfERG P1 implicit time lie, had the greatest reduction in thickness. Thus, these findings in mice complement the electrophysiology studies of human diabetic subjects.
A potential utilization of mfERG implicit time in randomized clinical trials would be as part of the inclusion criteria. In general, clinical trials enroll a large number of patients over a 3- to 5-year period. Often, the incidence of the primary or secondary outcome measures are low, and it is critical that they enroll patients at high risk for progressing toward or reaching those end points.47 The mfERG implicit time has been shown to be predictive of new retinopathy lesions over a short period. Thus, its inclusion as entry criteria into a clinical trial would identify patients at much greater risk for development of diabetic retinopathy lesions. Enrolling these patients would ensure that evaluation of a novel therapeutic for preventing or slowing the progression of diabetic retinopathy would be more efficacious.
The development of surrogate outcomes also remains an important and critical task, as the current end points are of limited value in therapeutics targeted for earlier stages of diabetic retinopathy.3 4 The implicit time of the mfERG has been shown to be a highly reproducible measure48 49 50 that is correlated to diabetic retinopathy lesions14 51 and predictive of development of retinopathy.16 17 These are critical requirements for a surrogate outcome measure and in fact they satisfy three of the four criteria set forth by Berger.51 The fourth criterion, evidence of modulation in the candidate measure by therapeutic intervention, can only be evaluated once the mfERG implicit time is used in a clinical trial, though it has been proposed that complete validation of a surrogate endpoint in a diabetic retinopathy trial should not be a prerequisite for drug approval.3
Models of the type we have formulated could be extended to predict specific lesion types—in particular, diabetic macular edema—and also to predict progression of retinopathy. We have already begun these longitudinal studies. Another step in evaluating the utility of mfERG implicit times in diabetic retinopathy is to know how and whether they change in eyes with diabetes, in eyes with diabetic retinopathy, in areas with retinopathy, and in areas without retinopathy. These questions are currently under investigation and have been reported in preliminary form (Bearse MA et al. IOVS 2006;47:ARVO E-Abstract 4732).
In this 3-year longitudinal prospective study, we examined the predictive ability of the mfERG to identify sites of new diabetic retinopathy. The study complements the most recent advances made in experimental diabetes that suggest that neuroglial dysfunction occurs before the overt development of morphologic changes in the vasculature. Further, it extends our previous work showing predictive ability of the technique over 1- and 2-year periods by examining retinal areas with recurrent retinopathic lesions. These lesions, which are indicative of more severe retinopathy and thus are of greater clinical interest, were associated with greater neural abnormalities at baseline than were lesions that were transient. The mfERG may be of value when used either as inclusion criteria to enroll, or as a surrogate outcome measure to be evaluated, in clinical trials of novel therapeutics for earlier stages of diabetic retinopathy.
| Acknowledgements |
|---|
| Footnotes |
|---|
Submitted for publication September 5, 2007; revised November 30, 2007; accepted February 14, 2008.
Disclosure: J.S. Ng, None; M.A. Bearse, Jr., None; M.E. Schneck, None; S. Barez, None; A.J. Adams, None
The publication costs of this article were defrayed in part by page charge payment. This article must therefore be marked "advertisement" in accordance with 18 U.S.C.
1734 solely to indicate this fact.
Corresponding author: Jason S. Ng, UC Berkeley School of Optometry, 360 Minor Hall, Berkeley, CA 94720-2020; jsnng{at}berkeley.edu.
| References |
|---|
|
|
|---|
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |