|
|
||||||||
From the Division of Ophthalmology and Visual Science, Tottori University Medical School, Yonago, Japan.
| Abstract |
|---|
|
|
|---|
METHODS. The transcriptional differences of primary and recurrent pterygia were first determined by microarray analyses. Computational analyses were used to extract the biological significance of the genes accurately, and a significant functional classification of the genes was made by unsupervised methodologies. After confirming the functional classification for primary and recurrent pterygia by a clustering algorithm, a support vector machine (SVM) algorithm was applied. Based on a machine learning technique, the minimum number of genes that can accurately classify primary and recurrent pterygia was determined.
RESULTS. Clustering analyses classified primary and recurrent pterygia transcriptomes and identified 10 clusters associated with distinct biological processes. When the SVM algorithm was applied to the microarray-analyzed products from three primary and three recurrent cases, periostin, TIMP-2, and L-3-phosphoserine phosphatase homolog (PSPHL) were identified as the minimum set of predictors with 100% accuracy. A differential expression of these genes in primary and recurrent pterygia was confirmed by immunohistochemistry. When the 24 patients with primary disease and the 8 patients with recurrent disease were analyzed with this gene set, an accuracy of classification of 84.38% was achieved.
CONCLUSIONS. Periostin, TIMP-2, and PSPHL can be used as predictor genes for the recurrence of pterygia. Their biological activities may explain the events leading to recurrences of pterygia and thus may be genes to target for pharmaceutical interventions.
To determine the cause of recurrences more precisely, it is necessary to undertake a detailed analyses of the etiology of pterygia at the molecular level. Our initial analysis of the pterygia transcriptome2 and the observations of others3 4 confirmed that the mechanisms associated with the development of pterygia are multifaceted, and thousands of transcripts are differentially expressed in pterygia and normal conjunctiva.
Histopathologically, the tissues of recurrent and primary pterygia are very similar although quite different from normal conjunctival tissues. Eyes with recurrent pterygia are known to have a higher rate of recurrence after surgery with increased cellular proliferative responses in the subepithelial fibrovascular layers.5 This indicates that the mechanisms determining the genesis and the recurrence of a pterygium may not be the same. Because this also suggests transcriptional alterations in the tissues of recurrent pterygia, it seemed reasonable to assume that the tissues of recurrent pterygia possess molecular signatures relating to their recurrence.
To identify the recurrence-related genes, we used a differential whole genome scanning approach with a computational analysis of gene expression to isolate a functionally significant classification of genes. A clustering algorithm identified functional gene clusters and accurately classified tissues obtained from primary and recurrent cases. With a support vector machine (SVM) algorithm,6 we extracted the minimum gene set that would accurately classify primary and recurrent pterygia. We showed that only three genes are necessary to differentiate primary and recurrent pterygia: periostin, tissue inhibitor of metalloproteinase-2 (TIMP-2), and L-3-phosphoserine phosphatase homolog (PSPHL).
| Materials and Methods |
|---|
|
|
|---|
Reference RNA was obtained from pooled RNA extracted from 12 normal nasal conjunctival specimens of volunteers undergoing cataract surgery. The study protocol conformed to the tenets of the Declaration of Helsinki, and the procedures used were approved by the Tottori University Ethics Committee. A signed informed consent was obtained from all patients.
Microarray Procedures
To minimize the influence of confounding factors (e.g., age, gender, hormonal effects, race, UV exposure time) and the effects of geographical location, we initially studied three primary and three recurrent pterygia from postmenopausal women. Their mean age of the patients with primary pterygia was 66.3 ± 3.4 years and of those with recurrent pterygia was 74.0 ± 2.1 years.
Pterygia head tissues (
3 x 3 mm) were collected during surgery by two authors (DM and YI) who used, as exactly as possible, the same surgical procedures. The tissues were immediately transferred into storage medium (RNAlater; Ambion, Austin, TX). Total RNA was isolated from the tissue samples (RNA STAT-60; Tel-Test, Inc., Friendswood, TX) and purified (RNeasy Mini Kit; Qiagen, Hilden, Germany), according to the manufacturers protocols. Total RNA was reverse-transcribed and amplified (Amino Allyl MessageAmp aRNA kit; Ambion) for Cy5 and Cy3 dye labeling. The Cy5 dye was used to generate the experimental cRNA probe from the pterygia tissues, and the Cy3 dye was used to generate the reference cRNA probe from normal conjunctiva. The labeled cRNA probes were hybridized on oligo microarrays (AceGene; Hitachi, Tokyo, Japan) corresponding to 30,336 genes, and scanned (FLA-8000 scanner; Fuji Film, Tokyo, Japan). The intensities of the fluorescent signals were quantified with a computer program (DNASIS Array program; Hitachi). After background subtraction, the signal from the gene spots was adjusted to compensate for excitation differences between the two dyes. Then, the fluorescent signal was corrected for image intensity, background–spatial artifacts, and chip-to-chip comparisons, using a custom database constructed by the gene array program (DNASIS Array).
Cluster Analysis
Before clustering and display, the logarithm of the ratio of the measured fluorescence for each gene was centered by subtracting the arithmetic mean of all ratios measured for that gene (DNASIS STAT program; Hitachi). The centering made all subsequent analyses independent of the amount of each genes mRNA in the reference pool. We extracted the genes by normalized fluorescence ratios for the 30,336 transcriptions on the arrays by applying different selection criteria for subset selection.
We applied a hierarchical clustering algorithm on the genes using the matching coefficient of Weinstein et al.7 and Eisen et al.8 as a measure closely linked to the average clustering. For visual display of the rows and columns in the initial data, tables were reordered to conform to the structures of the dendrogram. The data in the table were represented graphically by coloring each gene by the measured fluorescence ratio. Each gene in the cluster-ordered data table was replaced by a graded color, pure red through black to pure green, representing the mean-adjusted ratio for each gene.
Construction and Evaluation of Support Vector Machine-Based Classifier
We attempted to extract the essential genes by transforming this task into one that differentiated the two groups using a minimum number of genes (i.e., a minimum number to construct a binary classifier). For this purpose, a machine learning algorithm is a well-known approach used for its efficiency, similar to that of a neural network. Of these, we applied the SVM algorithm developed by Vapnik.9
The SVM algorithm design facilitates the generation of efficient classifiers especially for untrained data points. Another advantage is that the SVM relies only on machine-selected data points as support vectors, thus minimizing the dimensional complexity of the classifier. Using the DNASIS STAT program, classifiers were generated with defined sets of genes. To evaluate their performance, we used the leave-one-out cross-validation method. In this procedure, classifiers for n data points were generated by supervised training with n– 1 data points. Next, data not used for training, were tested for correctness of the answer by the generated classifier. This procedure was repeated n times to test all untrained data points. The mean percentage of questions answered correctly was defined as the classification accuracy. For comparison with other classification algorithms, including the multilayer perception (MLP) method and the k-nearest-neighbor (KNN) method, the leave-one-out cross-validation experiment was performed to evaluate the effectiveness of the classification.
Extraction of Classifier Genes for Recurrence by SVM
To generate the initial classifiers, genes were selected by application of the Mann-Whitney test. When the classifier was 100% accurate in the classification, gene sets that were used to generate the classifier served as the starting template. We then randomly discarded a gene from the classifier to remove any redundant genes from the template, and then the accuracy of classification was tested by the leave-one-out cross-validation method. The discarding procedure was repeated as long as the remaining gene sets maintained 100% accuracy of classification. If the discarding of a gene resulted in a loss of accuracy, the procedure was canceled and returned to the previous step of random discarding. The resultant minimum gene set was defined as classifier genes of pterygia recurrence.
Real-Time RT-PCR
One microgram of 32 pterygia head RNA samples and 11 normal nasal conjunctiva tissues were reversed transcribed using random hexamers (Superscript III; Invitrogen, Carlsbad, CA). The c-DNAs were amplified and quantified by thermocycler (LightCycler; Roche, Mannheim, Germany, with a QuantiTect SYBR Green PCR kit; Roche). The sequences of the used real-time PCR primer pairs are listed in Table 1 . Primers were designed using Primer 3 (http://fokker.wi.mit.edu/primer3/ developed by Steve Rozen and Helen Skaletsky, Whitehead Institute Center for Genome Research, Cambridge, MA) setting the importance on primer dimers, self-priming formation, and mispriming. All primer pairs that have an equal optimal annealing temperature of 58°C and similar GC content were selected and generated by Sigma-Aldrich Corp. (St. Louis, MO). The specificity of the primers was verified by agarose gel electrophoresis and melting curve analysis. To ensure equal loading and amplification, we normalized all products to GAPDH transcript as an internal control. Actual copy numbers of the products were calculated based on cloned templates of respective genes using the second derivative maximum method applied to threshold cycles of fluorescence detection.
|
Statistical Analyses
Data are presented as the mean ± SEM. Statistical analysis was performed by unpaired Students t-test (two tailed) or the Mann-Whitney test, as appropriate. P < 0.05 was considered significant.
| Results |
|---|
|
|
|---|
Cluster Analyses
When these genes were used for cluster analyses, the resultant dendrogram successfully generated two clusters for tissues from primary and recurrent pterygia (Fig. 1) . This finding indicated that the transcriptional information in the extracted genes set was accurate and sufficient to provide a basis for the classification.
|
|
Supervised Classification
We initially hypothesized that complex biological processes associated with recurrences might well be simplified to representative genes that might trigger or regulate recurrences. These genes may still be traceable after surgical intervention. If this interpretation is valid, an accurate classification may be achieved when a suitable algorithm is applied, even though a minimum number of genes is used as input.
To apply a classification algorithm, we initially used the 184 genes of a twofold change. The performance of the classifier was tested by using leave-one-out cross-validation tests. The SVM generated classifiers with 100% accuracy. In contrast, MLP generated classifiers with 83.33% accuracy while KNN generated classifiers with 66.67% accuracy. Because the SVM classifiers were totally accurate, it was used to extract the recurrence-associated classification genes.
We extracted three classifier genes—periostin, TIMP-2, and PSPHL—that showed 100% accuracy of classification. Of interest, no other minimum sets of genes were found to have 100% accuracy. Therefore, we reasoned that these three genes are the minimally required parameters for detecting recurrences in the initial small sample groups. When these genes were tested to generate classifiers using MLP and KNN methods, accuracy of the resultant classifiers was 100% and 83.33%, respectively (Table 3) .
|
|
To determine whether all the 32 pterygia RNAs were correctly classified by these genes, they were analyzed by real-time PCR and compared with control conjunctiva (Fig. 2b) . Of the three genes, five times more transcripts of periostin were detected in recurrent than in primary cases (primary: 3.5 x 109 ± 8.4 x 108 copies/µg RNA; recurrent: 2.1 x 1010 ± 1.0 x 1010 copies/µg RNA, P < 0.05; control: 5.2 x 105 ± 3.3 x 105 copies/µg RNA). Recurrent cases also had more TIMP-2 transcripts, but the difference was not statistically significant (primary: 1.1 x 109 ± 5.6 x 108 copies/µg RNA; recurrent: 1.5 x 109 ± 9.2 x 108 copies/µg RNA, P = 0.746; control: 1.9 x 107 ± 7.4 x 10 6 copies/µg RNA). For PSPHL, the recurrent cases had less transcripts, but the difference was not statistically significant (primary: 6.4 x 1016 ± 1.0 x 1016 copies/µg RNA; recurrent: 4.2 x 1016 ± 2.0 x 1016 copies/µg RNA; P = 0.321, control: 1.5 x 1010 ± 7.8 x 109 copies/µg RNA).
To validate that these transcriptional alterations are actually translated, immunohistochemical analysis was performed. Immunoreactivity to the three proteins was detected in all pterygia tissue. Consistent with the transcriptional analysis, periostin and TIMP-2 were more prominently expressed in the recurrent tissue, whereas the expression of PSPHL was reduced (Fig. 3) . Their distinctive localization suggested an interesting perspective on their involvement in the disease process. First, periostin was localized to the basement membrane of pterygia epithelial cells and diffusedly distributed in the superficial stroma (Figs. 3a 3b) . In recurrent pterygia tissues, periostin staining was more intense and diffuse but centered on the basement membrane. Together with the most remarkable change in periostin expression, this strongly suggests that recurrence involves alterations of pathologic processes, especially in the basement membrane of the diseased epithelia.
|
Evaluation of SVM-Based Classifier
To test whether the three genes provide sufficient dimensionality, the copy number–based SVM classifiers generated by the three primary and three recurrent cases were evaluated by the accuracy for all the pterygia samples (n = 32; Table 3 ). The performance of the classifier was tested by the leave-one-out cross-validation procedure. These classifiers predicted with 84.38% accuracy in differentiating the two groups. When one of the three genes was omitted in training the SVM-based classifiers, the classification accuracy decreased to 50%, 83.33%, and 83.33% without periostin, TIMP-2, and PSPHL, respectively. The high accuracy of the three genes further indicates that they are candidate genes in recurrent pterygia.
To evaluate the classification efficacy of SVM-based classifier, the MLP and KNN methods were also used to generate classifier using the three genes copy numbers for all the samples. The performance of the classifier was again tested by the leave-one-out cross-validated procedure. Under these conditions, the MLP-based classifier predicted a recurrence with 83.33% accuracy. The KNN-based classifier predicted with 16.67% accuracy, again showing that the SVM based-classifier performed with the highest accuracy.
| Discussion |
|---|
|
|
|---|
A pterygium is known to have altered growth characteristics that may indicate an involvement of neoplastic pathways in the pathophysiology of genesis and recurrence. For example, pterygia fibroblasts display characteristics of transformed cells including the loss of heterozygosity and microsatellite instability.12 13 In addition, the pterygial epithelium displays reduced apoptosis and in recurrent pterygia, an increased proliferation is noted in the fibrovascular layer.5
Previous observations suggest that the development and recurrence of pterygia have certain properties of carcinogenesis. How do the identified prognostic factors, periostin, TIMP-2, PSPHL, influence its recurrence after surgical removal? We identified periostin as a statistically significant determinant of recurrence. It is a 90-kDa heparin-binding N-glycosylated protein, originally isolated as an osteoblast-specific factor, that functions as a cell-adhesion molecule for preosteoblasts.14 15 16 Periostin contains fasciclin domains, interacts with
vß3 and
vß5 integrins, and increases fibronectin-dependent motility of epithelial cells.17 Its interaction with
vß5 at sites of focal adhesion also suggests its contribution to cell adhesion and motility.
Of interest, periostin has a high amino acid homology with the TGF-ß-induced protein ß-igh3, which promotes the adhesion and spread of fibroblasts and is a gene mutated in granular and lattice dystrophy. Recently, periostin has also been shown to be overexpressed in colon cancer, and it also promotes metastasis and angiogenesis. At the signaling level, periostin interacts with
vß3 to activate the Akt kinase/protein kinase B (Akt/PKB) pathway, leading to increased cell survival.18 Thus, the presumed neoplastic nature of pterygia recurrence may well be explained by periostin.
Collagen types I, II, III, and IV constitute a large part of the extracellular matrix of pterygial tissue where extensive collagen remodeling is observed.1 19 In the degradation and remodeling of extracellular matrix, the MMPs, a family of zinc-dependent endopeptidases, are known to be critical components. The MMPs also participate in tumor cell invasion, metastasis, and angiogenesis.20 MMPs have also gained significant interest for their involvement in pterygia. Earlier, an abundant expression of MMPs was found in pterygia tissue, whereas little or no expression was observed in normal conjunctiva.1 Of the MMPs, MMP-1, -2, -3, -7, and -9 are known to be involved. Di Girolamo et al.21 suggested that MMP-1 may be one of the principle MMPs in the UVB-related pathogenesis of pterygia, because MMP-1 and not MMP-2, MMP-9, or the TIMPs, was strongly and dose dependently induced in cultured pterygia epithelial cells by UVB.
The activities of MMPs are physiologically regulated by alterations of gene expression, activation of latent zymogens, tissue localization, and inhibition by tissue inhibitors of metalloproteinase (TIMPs). Many studies have shown that the TIMPs have a central role in pterygia.22 23 24 25 Based on the physiological roles of TIMPs, synthetic inhibitors of MMPs have been developed for potential therapeutic use. Our results identified TIMP-2 as one of prognostic determinant of pterygia recurrence. TIMP-2 complexes with MMPs, including MMP-1, -2, -3, -7, -8, -9, -10, -13, -14, and -15, contribute to the control of MMP activity.26 In addition to MMP inhibition, TIMP-2 also acts as an activator of MMP-2 via the ternary MT1-MMP/MMP-2/TIMP-2 complex.27 Independent of the well-known MMP-dependent activities, TIMP-2 has been shown to abrogate angiogenic factor-induced endothelial cell proliferation in vitro and angiogenesis by its
3ß1 integrin-mediated binding.28 This binding has also been shown to result in a negative regulation of growth factor activation that required the activity of the protein tyrosine phosphatase (PTP) SHP-1. These observations suggest a suppressive role of TIMP-2 in pterygia recurrence. Our observations of relatively higher copy numbers of TIMP-2 in recurrent cases might be explained by its presumable counterbalancing role in recurrences.
TIMP-2 is also known to be pluripotential. It stimulates quiescent cells to proliferate and functions as a metanephric mesenchymal growth factor.29 30 These properties may further complicate its role as a recurrence-suppressing factor.
PSPH (PSPHL) is the other classification–determinant factor. It is the rate limiting enzyme for the synthesis of serine by hydrolysis of phosphoserine,31 and is a known marker of neoplasticity in the lung and colon cells.32 Considering the possible neoplastic aspects of pterygia, the reduction of PSPHL in recurrent pterygia is not easily explained. Our detection of PSPHL may reflect the nonregulated nature of pterygia tissue outgrowth, possibly originating from limbal stem cells of the primary pterygia. However, the present data do not answer the question of whether PSPHL is actually recurrence promoting or suppressing.
Our immunohistochemical data showed another perspective of PSPHL. The location of periostin and TIMP-2 expression strongly suggests that the recurrence process would significantly affect the basement membrane of the epithelial cells (Fig. 3) . In the context of basement membrane involvement, differentiation of adenocarcinoma cells is promoted by contact with the basement membrane. Of interest, PSPH is involved in this process,33 which may suggest that the impaired induction of PSPHL in recurrent pterygia is secondary to drastic alterations of the basement membrane structure.
Our finding of differential expression patterns in primary and recurrent pterygium tissues by immunohistochemistry further confirmed the protein level and supports the importance of genes filtered by transcriptional differences. Our findings of periostin by immunochemistry in recurrent pterygia may well explain the neoplastic nature of pterygias recurrence and may result in increased proliferation. Based on the location of the classifier genes, we propose that interaction of pterygia epithelial cells and basement membrane is critical in the process of pterygia recurrence.
In conclusion, of the large number of genes that are differentially expressed in primary and recurrent pterygia tissues, we extracted periostin, TIMP-2, and PSPHL as classification determinants. Understanding the mechanisms of action of these factors may help develop methods leading to less recurrence of pterygia.
| Footnotes |
|---|
Disclosure: C.-H. Kuo, None; D. Miyazaki, None; N. Nawata, None; T. Tominaga, None; A. Yamasaki, None; Y. Sasaki, None; Y. Inoue, None
The publication costs of this article were defrayed in part by page charge payment. This article must therefore be marked "advertisement" in accordance with 18 U.S.C.
1734 solely to indicate this fact.
Corresponding author: Dai Miyazaki, Division of Ophthalmology and Visual Science, Tottori University Medical School, 36-1 Nishimachi, Yonago, Japan; dm{at}grape.med.tottori-u.ac.jp.
| References |
|---|
|
|
|---|
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |