|
|
||||||||
1From the Center for Human Genetics Research, Vanderbilt University Medical Center, Nashville, Tennessee; the 2Center for Human Genetics, Duke University Medical Center, Durham, North Carolina; the 3Department of Medicine and Miami Institute for Human Genomics, University of Miami Miller School of Medicine, Miami, Florida; and the 4Miami Institute for Human Genomics, University of Miami, Miami, Florida.
| Abstract |
|---|
|
|
|---|
METHODS. The haplo.stats program was used to test for association of the protective haplotypes after adjusting for age in the dataset of 584 sporadic cases and 248 control samples. Logistic regression modeling and likelihood ratio tests were used to investigate an interaction between a particular haplotype and smoking status. The HBAT option of FBAT was used to confirm the associations in an independent dataset of 201 families.
RESULTS. Two protective (P) haplotypes in a family-based dataset (P1 = CAATTTAG, P = 0.021; and P2 = CGGCTTAG, P = 0.018) were identified for the first time. Age-adjusted score statistics provided support for these protective haplotypes in the case–control dataset (P1 frequency in cases
13%, in controls
20%, P = 0.001; P2 frequency in cases
5%, in controls
8%, P = 0.077). There was also tentative evidence of an interaction between one of the protective haplotypes and cigarette smoking (P = 0.04 likelihood ratio test for P2–smoking interaction).
CONCLUSIONS. Replication of the association between the protective haplotypes and decreased AMD susceptibility provides increased evidence that these associations have biological meaning. The suggestion of a haplotype–smoking interaction adds to the growing body of evidence that smoking is an important environmental covariate in AMD that should be considered in genetic studies. Identification of the protective variant(s) carried within these haplotypes is critical for understanding the etiology of AMD.
Both genetic and nongenetic risk factors are known to be involved in the pathogenesis of AMD. Epidemiologic studies have successfully identified many nongenetic or lifestyle risk factors for AMD, including increased age, smoking, and body mass index.3 4 The connection between smoking and risk of AMD has been particularly well documented. A recent literature review reported that 13 of 17 studies showed a statistically significant association between smoking and AMD.5 Pooled data from 9523 participants in three population-based cohort studies in the United States, The Netherlands, and Australia show an estimated odds ratio for current smokers versus never smokers at 2.83 (1.15–6.93) for geographic atrophy and 2.35 (1.30–4.27) for late AMD, which includes cases with geographic atrophy and choroidal neovascularization.6 The adjusted population attributable risk percentage for smoking is 20%, suggesting that smoking is likely to be the largest known modifiable risk factor for AMD.7
A strong genetic component has long been suspected in the etiology of AMD, based on the results of familial aggregation studies,8 9 10 segregation analysis,11 and twin studies.12 13 14 15 16 17 A meta-analysis of the six largest genomic linkage screens identified the chromosome regions 10q26, 1q, 2p, 3p, and 16 as the most likely regions to harbor AMD susceptibility loci.18 In Caucasian populations, the association between AMD risk and the Y402H variant in the complement factor H (CFH) gene located on chromosome 1 has been validated by multiple studies,19 20 21 22 23 24 25 26 27 and promising initial reports of a strong association between a polymorphism in the LOC387715 gene on chromosome 10 and AMD risk have also been replicated.7 24 28 29 A recent meta-analysis estimates that the odds ratio for rs10490924 in LOC387715 as 2.5 (95% confidence interval [CI], 2.2–2.9) in heterozygotes for the risk allele and 7.3 in homozygotes (95% CI, 5.7–9.4).30 In addition, genetic and lifestyle risk factors for AMD have been tied together by the discovery that the effect of the LOC387715 variant on AMD risk is modified by cigarette smoking.7 The most recently reported major AMD susceptibility loci are the factor B (BF) and complement component 2 (C2) genes in the major histocompatibility complex class III region of chromosome 6.29 31
The precise role of CFH in the AMD process is still being elucidated, but it is thought that the nonsynonymous substitution of histidine for tyrosine in exon 9, which lies in the protein's heparin binding site, may disrupt the ability of CFH to attenuate complement response and lead to damage of ocular tissue.22 Recently, data supporting the existence not only of a risk haplotype, but also of protective haplotypes in the CFH gene have been reported.19 32 33 Hageman et al.19 used a tag single nucleotide polymorphism (SNP) approach to select eight SNPs in the CFH gene for haplotype analysis in a large case–control sample of individuals of European-American descent ascertained at the University of Iowa (Iowa City, IA) and Columbia University (New York, NY). Two haplotypes had protective effects, with the SNPs that distinguish the protective haplotypes concentrated between exons 2 and 11. Using a subset of four of these eight SNPs, Okamoto et al.32 confirmed a protective haplotype in a Japanese case–control dataset. Whereas this haplotype had a frequency of 12% in cases versus 21% in controls in the individuals of European-American descent and an odds ratio of 0.54 (0.41–0.73) in the Japanese sample, the haplotype had a frequency of 18% in cases and 35% in controls and an odds ratio of 0.63 (0.26–0.67).
In contrast, CFH SNPs in a separate Caucasian dataset have also been selected for haplotype analysis using a stepwise regression approach.33 Notably, the Y402H variant was not selected as having the strongest influence on AMD risk. Instead, five SNPs spanning the latter half of the CFH gene from intron 9 to 18 were chosen. Two of the four most common haplotypes had protective effects (haplotype 1 frequency of 14% cases, 44% in controls; haplotype 2 frequency of 3% in cases, 11% in controls).
Our goal was to study the presence of the risk and protective haplotypes in CFH and to examine the possibility of a haplotype–smoking interaction, as smoking appears to be one of the most important environmental contributors to AMD, and previous studies have shown that the effect of genetic risk factors for AMD may be modified by smoking.7
| Materials and Methods |
|---|
|
|
|---|
|
|
Statistical Methods
Haploview36 was used to examine the linkage disequilibrium (LD) patterns between SNPs in each dataset, to verify that all SNPs were in Hardy-Weinberg equilibrium, and to perform single marker tests of association. Of the 33 SNPs we genotyped within the region of complement activation (RCA) gene cluster, alleles at eight tagging SNPs in the CFH gene identical with those of Hageman et al.19 were used to define haplotypes (rs3753394, rs529825, rs800292, rs3766404, rs1061170, rs203674, rs3753396, and rs1065489). Of the 36 haplotypes observed at least once, only four haplotypes had an overall frequency greater than 5%; these four haplotypes accounted for 81.9% of the observed haplotypes. We estimated haplotype frequencies and tested association of each haplotype with a frequency of at least 5% in our case–control dataset with age-adjusted score statistics using the haplo.stats software package.37 We calculated score statistics in the overall AMD dataset (grade 3, 4, and 5 AMD cases compared with grade 1 and 2 controls) and also in a subset of grade 5 neovascular AMD cases compared with grade 1 controls. Empiric probabilities were determined through simulation by using the default simulation options in haplo.stats (P precision criterion Pthreshold = 0.25, so that simulations continued until the sample standard error was less than Pthreshold times the probability, minimum number of simulations = 1000, and maximum number of simulations allowed = 20,000). To determine whether the potential protective haplotypes were underrepresented in cases after taking into account the overrepresentation of the risk haplotype in cases, the frequency of the haplotype of interest in cases was compared to the expected frequency of the haplotype given no unequal transmission according to the
2 test. This test has been used to show that the frequency of the APOE epsilon 2 allele is reduced in Alzheimer's disease, even after allowing for the increased frequency of the risk APOE epsilon 4 allele in cases of Alzheimer's.38 A logistic regression model considering P1, P2, and risk haplotypes with the neutral haplotype as the baseline gave similar results. Haplotypes were tested for association with AMD in the family-based dataset by using the HBAT option of the FBAT software package.39
In the case–control dataset of all AMD individuals we used the haplo.glm module of haplo.stats to build a regression model to study haplotype–smoking interactions.40 According to the recommendation of Lake et al.,40 we modeled only haplotypes with a frequency greater than 5%. Four haplotypes in our dataset met this criterion, and one was selected as the baseline haplotype. Though the selection of the baseline neutral haplotype is somewhat arbitrary, we chose the TGGTTTGT haplotype (Fig. 1) because it was the most frequent haplotype occurring in the same percentage of cases and controls (13.5% in controls and 13.8% in cases), and its association score statistic comparing all AMD individuals (grades 3, 4, and 5) to all controls (grades 1 and 2) indicated neither a protective nor a risk effect (haplotype score = –0.997, P = 0.311). Smoking status was obtained from patient questionnaires and dichotomized by classifying individuals who had smoked >100 cigarettes in their lifetime as "smokers." The full model contained age, smoking, the three most frequent haplotypes (later denoted as protective haplotype P1, protective haplotype P2, and the risk haplotype), a coefficient for the combined effect of the haplotypes with a frequency less than 5%, and four haplotype–smoking interaction terms. The neutral haplotype was used as the referent group in this model. A separate coefficient was used to estimate the effect of the rare haplotypes instead of pooling them with the baseline haplotype, to ensure that any effect of the rare haplotypes would not confound the neutral baseline haplotype. We followed up the P2–smoking interaction with a likelihood ratio test to see whether a model containing the P2–smoking interaction term is significantly different from a model that does not contain this interaction. First, we used a full model containing all haplotypes with a frequency greater than 5%, age, smoking, and the haplotype–smoking interaction terms compared with a reduced model without the four interaction terms. Because we were particularly interested in only the effect of the P2–smoking interaction, we also performed a likelihood ratio test comparing the full model to a reduced model missing only the P2–smoking interaction. Finally, a stratified regression analysis was used to examine the effect of P2 separately in smokers and nonsmokers after adjustment for age.
|
| Results |
|---|
|
|
|---|
|
|
|
Confirmation of the Risk Haplotype
The risk haplotype was very strongly associated with AMD in the overall case–control dataset and in cases with neovascular AMD (P
1.0–5 in both analyses; Table 4 ). The association was confirmed in the overall family-based analysis (P = 0.04) and trended toward significance in the family-based dataset when only individuals with neovascular AMD were classified as affected (P = 0.067; Table 5 ).
Protective Haplotype–Smoking Interaction
Because smoking is a known risk factor for AMD and the frequencies of the haplotypes in cases and controls varied by smoking status (Table 6) , we built a regression model incorporating smoking and haplotype–smoking interactions. The full regression model considered age, smoking, haplotype, and haplotype–smoking interactions. The main effects of age, smoking, the risk haplotype, and the pooled rare haplotypes were statistically significant (Table 7) . The interactions between P2 and smoking (P = 0.008) and the pooled rare haplotypes and smoking (P = 0.032) appear to be significant, but a more formal likelihood ratio test is needed for a true assessment of their significance. The likelihood ratio test between the full model with age, smoking, haplotypes with a frequency greater than 5%, and the haplotype–smoking interactions compared to a model without the four haplotype–smoking interaction terms trended toward significance (
2 = 8.37, df = 4, P = 0.08).
|
|
2 = 4.22, df = 1, P = 0.04). Furthermore, a stratified regression analysis testing the effect of P2 in smokers versus nonsmokers after adjustment for age and controlling for the effects of the other common haplotypes showed that the protective effect of P2 bordered on significance in smokers (P = 0.070), but not in nonsmokers (P = 0.196), lending additional confidence to the haplotype–smoking interaction. | Discussion |
|---|
|
|
|---|
Although statistical analyses and confirmation in multiple independent datasets of both families and singleton cases and controls has provided strong support for a protective effect of these haplotypes, functional experiments are needed to determine which variant(s) on these haplotypes are exerting the protective influence and the underlying biological mechanism by which they are operating. The CFH protein is composed of 20 short consensus repeats (SCRs) and contains binding sites for complement component C3b, heparin, C-reactive protein (CRP), and sialic acid.41 CFH functions as a brake on the alternative complement cascade by preventing the activation of C3 to C3a and C3b and by inactivating existing C3b. Sialic acid, heparin, and CRP all facilitate the interaction between CFH and C3b42 43 44 and thus lead to a stronger inhibitory signal. Of the eight SNPs defining the protective haplotypes, three are nonsynonymous amino acid substitutions (rs800292 I62V, rs1061170 Y402H, and rs1065489 D936E), one is a synonymous substitution (rs3753396 Q672), one is a putative promoter variant, and three are intronic changes. The valine-to-isoleucine substitution at rs800292 is located within SCR 1 of CFH, which is one of three C3b binding sites within the protein.41 The putative promoter polymorphism rs3753394 may influence transcription. Thus, both of these variants may have functional consequences for the CFH protein.
However, the protective effect does not seem to be explained by any of the eight polymorphisms alone (Fig. 1) . The P1, P2, and risk haplotypes all carry the same allele for the putative promoter variant, and P1 and P2 carry different alleles for the I62V, intron 1, and intron 6 polymorphisms, and so none of these variants alone seems plausible as the source of the protective effect. In addition, both protective haplotypes share the alleles TTAG at the last four SNPs of the haplotype, but the A and G alleles at the last two SNPs in P1 and P2 are also found on the risk haplotype, and the T allele at the intron 10 SNP is also found on the neutral haplotype. Therefore, these variants are unlikely to exert a strong protective effect.
As expected, both protective haplotypes carry the nonrisk T allele at the Y402H polymorphism, but several lines of evidence suggest that the absence of the Y402H risk allele is not sufficient to explain the protective effect. First, the nonrisk T allele is also present on the neutral haplotype. Moreover, the frequency of both P1 and P2 was increased in controls, even after allowing for the decreased frequency of the risk haplotype in controls. Finally, in the Japanese the Y402H C allele did not contribute significantly to increased risk for AMD,45 46 but other risk and protective haplotypes in CFH were present,32 which suggests that despite the strong single-locus Y402H effect in Caucasians, additional non-Y402H variation in CFH modulates susceptibility to AMD.
This work, as well as the previous report of an interaction between smoking and the LOC387715 variant rs10490924 on chromosome 10 that modifies AMD risk,7 demonstrate the importance of considering known lifestyle risk factors for AMD when performing genetic studies. Smokers have decreased plasma levels of CFH,47 but little is known about the biological function of LOC387715 and how smoking affects it. Future molecular work aimed at unraveling the complex interplay between smoking, CFH, LOC387715, and AMD will surely provide intriguing results.
One concern with haplotype analysis in general is that when using case–control data, the haplotype for an individual is often not directly observed and may have to be inferred using the person's genotype data. The haplo.glm function avoids the need to infer haplotypes directly by using a weighted regression approach that incorporates the posterior probability of an individual's having a particular haplotype into the model.40 The interpretation of haplo.glm results is more difficult when significant haplotype ambiguity exists. This haplotype ambiguity is reduced as the proportion of individuals in the dataset with high conditional haplotype probabilities increases. In our case, 86% of individuals had posterior probabilities greater than 0.95, suggesting that, at worst, only modest haplotype ambiguity was present.
Multiple testing is a significant issue in genetic studies and cannot be ignored. We limited the problem of multiple testing in this study in several ways. First, we tested only haplotypes with a frequency greater than 5% for association with AMD, rather than test all observed haplotypes. Second, we believe that confirmation of the protective and risk haplotypes in two independent datasets of different types (case–control and family-based) increases confidence that these associations are true-positive results. Finally, only two haplotype–smoking interactions were tested by likelihood ratio methods (an overall test of all haplotypes interacting with smoking and a specific test of the P2–smoking interaction), not all possible haplotype–smoking interactions.
In summary, our data provide support for the presence of two protective haplotypes in the CFH gene. A potential interaction between one of the protective haplotypes and smoking was also identified. Further work is needed to characterize the variant(s) on the haplotypes that are exerting the protective influence and how cigarette smoking may modify this effect.
| Acknowledgements |
|---|
| Footnotes |
|---|
Submitted for publication December 1, 2006; revised April 16, 2007; accepted July 11, 2007.
Disclosure: K.L. Spencer, None; M.A. Hauser, None; L.M. Olson, None; N. Schnetz-Boutaud, None; W.K. Scott, None; S. Schmidt, None; P. Gallins, None; A. Agarwal, None; E.A. Postel, None; M.A. Pericak-Vance, None; J.L. Haines, None
The publication costs of this article were defrayed in part by page charge payment. This article must therefore be marked "advertisement" in accordance with 18 U.S.C.
1734 solely to indicate this fact.
Corresponding author: Jonathan L. Haines, Center for Human Genetics Research, 519 Light Hall, Vanderbilt University Medical Center, Nashville, TN 37232; jonathan{at}chgr.mc.vanderbilt.edu.
| References |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
K. L. Spencer, M. A. Hauser, L. M. Olson, S. Schmidt, W. K. Scott, P. Gallins, A. Agarwal, E. A. Postel, M. A. Pericak-Vance, and J. L. Haines Deletion of CFHR3 and CFHR1 genes in age-related macular degeneration Hum. Mol. Genet., April 1, 2008; 17(7): 971 - 977. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |