Skip to main content

A six-gene prognostic model predicts overall survival in bladder cancer patients



The fatality and recurrence rates of bladder cancer (BC) have progressively increased. DNA methylation is an influential regulator associated with gene transcription in the pathogenesis of BC. We describe a comprehensive epigenetic study performed to analyse DNA methylation-driven genes in BC.


Data related to DNA methylation, the gene transcriptome and survival in BC were downloaded from The Cancer Genome Atlas (TCGA). MethylMix was used to detect BC-specific hyper-/hypo-methylated genes. Metascape was used to carry out gene ontology (GO) enrichment and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analyses. A least absolute shrinkage and selection operator (LASSO)-penalized Cox regression was conducted to identify the characteristic dimension decrease and distinguish prognosis-related methylation-driven genes. Subsequently, we developed a six-gene risk evaluation model and a novel prognosis-related nomogram to predict overall survival (OS). A survival analysis was carried out to explore the individual prognostic significance of the six genes.


In total, 167 methylation-driven genes were identified. Based on the LASSO Cox regression, six genes, i.e., ARHGDIB, LINC00526, IDH2, ARL14, GSTM2, and LURAP1, were selected for the development of a risk evaluation model. The Kaplan–Meier curve indicated that patients in the low-risk group had considerably better OS (P = 1.679e−05). The area under the curve (AUC) of this model was 0.698 at 3 years of OS. The verification performed in subgroups demonstrated the validity of the model. Then, we designed an OS-associated nomogram that included the risk score and clinical factors. The concordance index of the nomogram was 0.694. The methylation levels of IDH2 and ARL14 were appreciably related to the survival results. In addition, the methylation and gene expression-matched survival analysis revealed that ARHGDIB and ARL14 could be used as independent prognostic indicators. Among the six genes, 6 methylation sites in ARHGDIB, 3 in GSTM2, 1 in ARL14, 2 in LINC00526 and 2 in LURAP1 were meaningfully associated with BC prognosis. In addition, several abnormal methylated sites were identified as linked to gene expression.


We discovered differential methylation in BC patients with better and worse survival and provided a risk evaluation model by merging six gene markers with clinical characteristics.


Bladder cancer (BC) is one of the most difficult to treat and costly cancers due to its relapse tendency and chemoresistance [1]. In total, 76,000 new cases and 16,000 deaths are attributed to BC in the USA per year [2]. With such a large patient population, accurately diagnosing and effectively treating BC have become difficult challenges for basic medical researchers and urologists.

Epigenetic dysregulation is an important mechanism of tumorigenesis that affects the expression of numerous genes [3]. Aberrant DNA methylation, i.e., hyper- or hypomethylation, on CpG islands of promoters is one such mechanism, resulting in aberrant gene expression and having a major impact on the biological behaviour of BC [4, 5]. DNA methylation could also serve as a good biomarker for clinical diagnosis because of its stable and easily detectable attributes in many types of clinical specimen [6, 7]. Dulaimi et al. [8] reported that the detection of hypermethylation in the APC, RASSF1A, and ARF genes in BC patients may act as a non-invasive method for early diagnosis. Casadio et al. [9] also indicated that the methylation frequencies of HIC1, GSTP1 and RASSF1A could predict BC recurrence. Ohad et al. [10] found that CDH13 is downregulated by promoter methylation in BC patients, and this may be closely associated with tumour development.

The TCGA project aims to catalogue and discover major molecular changes to create a comprehensive “map” of the human cancer genome [11]. The multiple dimensions of data and massive samples not only provide a more comprehensive view of cancer but also enable the finding of better biomarkers, which could affect cancer treatment and prognosis [12]. DNA methylation data are also included in the massive data set, and a computational protocol called MethylMix can distinguish disease-specific hyper/hypomethylation genes, both of which are publicly available [13]. Several studies have been conducted to assess methylation-driven genes using the MethylMix algorithm and TCGA database [13,14,15].

In this study, we identified BC-related methylation-driven genes by using the data from the TCGA database. By coupling DNA methylation and gene transcriptome data, we identified methylation-driven genes and further constructed a model of DNA methylation status to predict prognosis in BC patients.

Materials and methods

Data processing and analysis

We downloaded DNA methylation, gene transcriptome and clinical survival data of BC patients from TCGA [16]. There were 437 samples with DNA methylation data (21 normal and 416 cancer), 430 samples with gene transcriptome data (19 normal and 411 cancer), and 404 patients with valid survival data. These data are an open resource, and no ethical issues were involved.

First, we applied the Limma package in R to extract the DNA methylation data. Next, we used the edgeR package to obtain the gene expression data. A comprehensive analysis was performed to obtain the following three data matrices: DNA methylation (normal, cancer) and gene expression. Subsequently, we used MethylMix [13, 17] to compare DNA methylation of cancer with that of normal tissue to detect specific genes, particularly BC-specific hyper/hypomethylation genes, and the methylation level of these genes was described as ‘transcriptionally predictive’. A mixture model of each gene was built, and Wilcoxon rank tests were computed with the following parameters: set as logFC > 0, P < 0.01, and Cor < − 0.3.

Functional enrichment and pathway analysis

Metascape [18] integrates several authoritative data resources, such as GO, KEGG, UniProt and DrugBank, so that it can execute pathway enrichment and biological process annotation to provide comprehensive and detailed information for each gene [19]. GO enrichment and KEGG pathway analyses of the genes identified by MethylMix were performed. Only terms with P < 0.01, a minimum count of 3 and an enrichment factor > 1.5 were considered significant.

Construction of the risk assessment model

First, the genes identified by MethylMix were applied to a univariate Cox regression. Second, we used a LASSO regression to narrow the range of target genes because the predictor variable was much larger than the sample content in the gene expression data. A strong correlation often exists between the variables, which is suggestive of high dimensionality and collinearity, and this method could decrease the characteristic dimension [20]. Then, we built a multivariate Cox regression model to select the genes that were most tightly associated with survival [21]. In addition, we validated this model in subgroups based on different characteristics. The following 12 subgroups based on different clinical characteristics and 9 subgroups based on different mRNA subtypes and mutational signatures [5] were subjected to further tests: high grade (n = 381), low grade (n = 20), stage I (n = 2), stage II (n = 128), stage III (n = 139), stage IV (n = 133), muscle-invasive (n = 368), non-muscle-invasive (n = 4), no distant metastasis (n = 193), distant metastasis (n = 11), lymph node metastasis (n = 169), no lymph node metastasis (n = 235), or Msig 1 (n = 28), Msig 2 (n = 220), Msig 3 (n = 99), Msig 4 (n = 55), basal squamous (n = 137), luminal (n = 26), luminal infiltrated (n = 77), luminal papillary (n = 140), and neuronal (n = 20).

The sensitivity and specificity of the model in the diagnosis of BC were analysed by a time-dependent ROC curve.

Furthermore, an OS-associated nomogram including the risk score and clinical factors was designed using the rms [22] and the Hmisc [23] packages in R. Calibration curves were drawn, and the concordance index (C-index) was computed to assess the efficiency of the nomogram.

Survival analysis

Kaplan–Meier curves were used to distinguish the connection between these genes and prognosis. A subgroup analysis was performed by dividing the patients based on clinical characteristics. A methylation/methylation site and gene expression matched survival analysis was carried out to explore the prognostic significance of these genes individually. The relationships between gene expression and methylated sites were additionally examined.

Data processing

All data analyses were performed with R [24]. Student’s t-test was used to evaluate the differences between two groups. The log-rank test was applied in the Kaplan–Meier survival examination.


TCGA data acquisition and filtering methylation-driven genes

In total, 167 genes were identified (Fig. 1; Additional files 1, 2) by applying MethylMix to the three matrices, and a mixture model of each gene was constructed (Fig. 2). Intuitively, the relationship between the peak curve and the black bar indicates whether a gene is hyper- or hypomethylated.

Fig. 1

Heatmap of 167 BC-related methylation-driven genes. Red to green shows a trend from hypermethylation to hypomethylation

Fig. 2

Mixture models of 6 of the 167 genes. The x-axis indicates the degree of methylation, the y-axis indicates the proportion at different degrees, the curve indicates the peak value, and the black bar indicates the normal methylation degree (af)

Functional enrichment and pathway analysis

The Metascape analysis shows the top 20 clusters of enriched sets (Fig. 3). These genes were enriched in the molecular function (MF) categories structural constituents of muscle and RNA polymerase II distal enhancer sequence-specific DNA binding. For biological process (BP), these genes showed enrichment in anterior/posterior pattern specification, chordate embryonic development, intrinsic apoptotic signalling pathway in response to DNA damage and so on (Additional file 3). The KEGG pathway data were enriched in Glutathione metabolism and Cardiac muscle contraction.

Fig. 3

Metascape analysis. a Network of enriched sets coloured by ID. Threshold: 0.3 kappa score; similarity score > 0.3. b Heatmap coloured by P-values

Construction of the risk assessment model

The results of the univariate Cox regression analysis of 167 genes were used in the LASSO regression to identify robust markers. A set of twelve genes (DAPP1, TCEAL7, PAXIP1-AS1, TDRD1, NUPR1, ARHGDIB, LINC00526, IDH2, ARL14, KLHDC7A, GSTM2, and LURAP1) and their coefficients were computed (Fig. 4a, b). Then, multivariate Cox regression analyses were performed, and a six-gene model was constructed according to their methylation levels and coefficients. Risk score = (ARHGDIB * 4.533910954) + (LINC00526 * 1.999499891) + (IDH2 * − 2.048441591) + (ARL14 * 0.779318158) + (GSTM2 * − 1.375204374) + (LURAP1 * − 1.504186188).

Fig. 4

Identification of prognostic genes in BC patients. a LASSO coefficients. b Plots of the cross-validation error rates. The dashes signify the value of the minimal error and greater λ value. c Risk score distribution in the two groups. d Survival overview in the two groups. e Heatmap of six genes in the two groups. f Survival curve of the two groups. g Time-dependent ROC curve for 3-year survival prediction

The risk score of each BC patient was computed, and the patients were assigned to the low-risk (n = 202) or high-risk (n = 202) group based on the median cut-off value (Additional file 4, Fig. 4c). Intuitively, the number of deaths was significantly higher in the high-risk group (Fig. 4d). The distribution of the six genes across all samples showed that the patients in the low-risk group were likely to have a higher degree of methylation of IDH2, GSTM2 and LURAP1. In contrast, the patients in the high-risk group were inclined to have higher methylation of ARHGDIB, LINC00526, and ARL14 (Fig. 4e). The Kaplan–Meier analysis of all patients (Fig. 4f) indicated that the survival of the patients in the low-risk group was significantly better than that of the patients in the high-risk group (P = 1.679e−05). The AUC of the survival assessment model of the six methylation-driven genes was 0.698 at 3 years of OS (Fig. 4g).

We further tested the survival assessment model by Kaplan–Meier analysis in subgroups. Of the 12 subgroups classified by clinical characteristics, there were no enough cases for stage I (n = 2), non-muscle-invasive (n = 4), and distant metastasis (n = 11), and all patients in the low grade group (n = 20) were alive, the test in the remaining 8 groups showed the same results as in all patients (Fig. 5a–h). Although the P-value in the stage III group was not statistically significant (Fig. 5h), these patients all showed the same predictive trends. Of the 9 subgroups (Additional file 4) classified by different mRNA subtypes or mutational signatures of BC [5]. The Kaplan–Meier curves (Additional file 5) show that this model is still effective in the Msig2, Msig3, luminal infiltrated, luminal papillary, and neuronal groups. Thus, the model has a certain reliability and practicability in evaluating prognosis.

Fig. 5

Kaplan–Meier survival curves. Validation of the six-gene model based on different clinical characteristics (ah)

Establishment and evaluation of the nomogram

We designed a nomogram to predict the survival probability of each patient. In the nomogram, each predictor was assigned a score. Based on the Cox analysis results (Table 1), six genes were integrated in the nomogram to predict the survival probability of BC patients (Additional file 6). The hypermethylation of ARHGDIB, LINC00526, and ARL14 is a risk factor for OS. Similarly, we carried out an analysis of the risk score and five clinical factors (Table 2; Fig. 6a). Based on the univariate Cox analysis, four factors (race, age, gender, and stage) and the risk scores were included in the multivariate Cox analysis (the factor ‘grade’ was not suitable for further analysis according to R). We constructed a nomogram to predict the OS probability (Fig. 6b). The C-index of this model was 0.694 (Fig. 6c). The predicted survival rate is close to the actual survival situation, and the prediction accuracy is similar to the ROC curve.

Table 1 Coefficients based on a Cox regression analysis of six genes
Table 2 Coefficients based on a Cox regression analysis of the risk score and clinical factors
Fig. 6

Six-gene model for survival prediction. a Multivariate Cox proportional hazard model of the risk score and clinical factors. b OS-associated nomogram. c Nomogram calibration plots. ***P < 0.001

Prognostic assessment of methylation-driven genes

The survival status evaluation of 6 genes was computed by the Survival package in R. IDH2 and ARL14 were identified as independent prognostic indicators (Fig. 7a, b), and the hypomethylation of IDH2 and hypermethylation of ARL14 were related to worse prognosis in BC patients. The methylation/methylation-site and gene expression matched evaluation was additionally carried out to discover the prognostic value. We found that a high expression and hypomethylation of ARHGDIB and ARL14 were meaningfully correlated with better prognosis (Fig. 7c, d). Among the six genes, 6 methylated sites in ARHGDIB (Fig. 8a–f), 1 methylated site in ARL14 (Fig. 8g), 3 methylated sites in GSTM2 (Fig. 8h–j), 2 methylated sites in LINC00526 (Fig. 8k, l) and 2 methylated sites in LURAP1 (Fig. 8m, n) were significantly associated with BC prognosis. The hypermethylation of 5 sites in LURAP1 and GSTM2 is associated with better prognosis; in contrast, the hypermethylation of another 9 sites in ARHGDIB, LINC00526 and ARL14 is associated with poor prognosis. This result is consistent with the results shown in Figs. 4e and 5a. The hypermethylation of IDH2, LURAP1, and GSTM2 may act as a protective factor in BC patients. Other three genes, i.e., ARHGDIB, LINC00526, and ARL14, may have the opposite effect. Additionally, several abnormally methylated sites were identified as linked to gene expression (Table 3; Additional file 7).

Fig. 7

Kaplan–Meier survival curves. Hyper/hypomethylation analysis of ARL14 and IDH2 (a, b). Methylation and gene expression matched analysis of ARL14 and ARHGDIB (c, d)

Fig. 8

Kaplan–Meier survival curves. Six methylation sites in ARHGDIB (af). One methylation site in ARL14 (g). Three methylation sites in GSTM2 (hj). Two methylation sites in LINC00526 (k, l). Two methylation sites in LURAP1 (m, n)

Table 3 Correlation between gene expression and methylated sites


Urothelial carcinoma is generally classified as non-muscle-invasive bladder cancer (NMIBC) or muscle-invasive bladder cancer (MIBC). The standard treatment for NMIBC is transurethral resection, and the universal treatment for MIBC is radical cystectomy, but a considerable number of NMIBC patients (50% to 80%) have tumour recurrence [1, 2]. Pathological staging is a key factor in current clinical decision making and prognosis of BC; nevertheless, the clinical outcomes of patients with the same stage often differ, indicating that the current staging system is not sufficient to reflect biological heterogeneity, and accurately determining the prognosis of patients is challenging. A new prognostic evaluation model based on molecular entities could guide individualized treatment and improve the therapeutic effect.

DNA methylation is an epigenetic modification that affects the interaction between DNA and regulatory factors, which, in turn, regulates gene expression [25]. Hypermethylation inhibits gene expression, while hypomethylation promotes gene expression. In addition, the DNA methylation status is faithfully inheritable through cell division but also revisable, it plays a very important role in the dynamic regulation of expression. Numerous studies based on either a genome-wide view or a gene-specific view have demonstrated that DNA methylation drives abnormal gene expression and is a crucial factor in the development and progression of tumours [26]. Therefore, the methylation profiles of methylation-driven genes in tumour patients could serve as potential biomarkers [27]. This phenomenon in BC patients is extensive, and many genes have been suggested to be factors involved in pathogenesis and are used as diagnostic and prognostic biomarkers [28, 29]. Our study provides a comprehensive view of methylation-driven genes in BC, and a prognosis model based on the methylation profile of six genes was developed and has implications for both basic research and clinical applications.

We identified a cohort of 167 methylation-driven genes in BC. The functional annotation demonstrated that these genes are widely scattered in diverse biological processes and pathways ranging from signal transduction, gene regulation, and development to metabolism and cell structure. These results demonstrate that DNA methylation is involved in the dysregulation of genes with distinct functions and suggest possible mechanisms by which DNA methylation is functionally linked to outcomes in BC patients.

Six genes (IDH2, GSTM2, LURAP1, ARHGDIB, LINC00526, and ARL14) with methylation profiles closely related to survival were selected by a LASSO Cox regression. Based on their methylation level and coefficients with survival, a prognostic model was developed. The verification of this model in the whole patient set and subsets grouped by either clinical or molecular characteristics showed that the low-risk group has a better survival status. The AUC of the ROC curve of the whole cohort based on this model was 0.698 at 3 years of OS.

For further potential application of this model in clinical work, a nomogram was generated. The nomogram integrates multiple predictors and simplifies the statistical prediction model to the probability of outcome events; thus, the survival probability of individual patients can be calculated. The predicted survival rate is close to the actual survival situation (C-index: 0.694), and the nomogram has a prediction effectiveness similar to that of the ROC curve. These results indicate the excellent predictive ability of this model in the prognosis of BC patients.

The six genes included in the model were further analysed individually. The hypomethylation of IDH2 and hypermethylation of ARL14 were associated with poor prognosis, and a high expression matched hypomethylation of ARHGDIB and ARL14 was meaningfully correlated with better prognosis. Further analysis of the methylation sites showed that the hypermethylation of 5 sites in LURAP1 and GSTM2 is associated with better prognosis, and the hypermethylation of another 9 sites in ARHGDIB, LINC00526 and ARL14 is associated with poor prognosis in BC. Additionally, the methylation levels at several methylation sites were correlated with the expression levels of the associated genes, all with negative correlations, indicating that these individual methylation sites alone contributed to expression regulation.

The methylation levels of these six genes contributed to the risk score of this model either positively or negatively. Some of this contribution could be functionally explained by previous studies, but the remainder lacks explanation, as information regarding the role of these genes in cancer is very limited.

The methylation levels of ARHGDIB, LINC00526 and ARL14 are positively related to poor survival. ARHGDIB (Rho GDP dissociation inhibitor GDI beta), which is also known as RhoGDI2, is a member of the guanine nucleotide dissociation inhibitor (GDI) family [30]. Mounting evidence suggests that the reduced expression of ARHGDIB is associated with the development of several types of cancer and that its hypermethylation contributes to its reduced expression [31]. The CpG islands of ARHGDIB were relatively hypermethylated in cases of ovarian cancer relapse after chemotherapy [32]. Huang et al. [33] demonstrated that ARHGDIB is significantly associated with OS in lung cancer patients. In BC, the reduced expression of ARHGDIB is associated with shorter disease-free survival time [34,35,36]. In our study, the methylation level matched gene expression analysis of ARHGDIB, and the analysis of CpG sites showed that hypomethylation in the ARHGDIB gene is associated with better survival. Our result is consistent with the results of previous studies. LINC00526 is a long intergenic non-protein-coding RNA, and one study has demonstrated that it suppresses glioma progression [37]. ARL14 (ADP Ribosylation Factor Like GTPase 14) is a protein-coding gene that participates in GTP binding and signal transduction [38]. However, information regarding the role of ARL14 in cancer is lacking.

The methylation level of IDH2, LURAP1 and GSTM2 is negatively related to poor survival. IDH2 is a protein-coding gene. The function of IDH2 in cancer has been relatively well documented. Li et al. [39, 40] found that IDH2 promotes lung cancer cell growth and serves as a novel therapeutic target in lung cancer. Mutations of IDH2 are frequently observed in acute myeloid leukaemia [41], colon cancer [42, 43], and gliomas [44], causing alterations in metabolism and DNA methylation; these mutations could represent a possible mechanism of tumorigenesis [44] and provide potential avenues for therapeutic intervention. We found that hypermethylation in IDH2 is associated with a better prognosis in BC patients. In our study, the relationship among GSTM2, LURAP1 and prognosis showed similar characteristics to IDH2. Hypermethylation at 3 sites in GSTM2 and 2 sites in LURAP1 is correlated with a better prognosis. GSTM2 is a subtype of glutathione S-transferase (GSTs) that performs functions such as eliminating free radicals and is involved in cell protection and the regulation of cell growth. Consistent with our findings, Kresovich et al. [45] found that a high methylation level in the GSTM2 promoter could be involved in ER/PR-negative breast cancer progression. Ashour et al. [46] proved that the epigenetic silencing of GSTM2 is a common phenomenon in prostate cancer that could be used as a molecular marker for diagnosis.

To the best of our knowledge, these six genes have not been previously studied as a prognostic model in BC patients. Further verification of this model in other types of clinical specimen, such as urine sediment cells and circulating tumour cells from BC patients, could provide more information regarding its potential clinical application. For urologists, accurate prognostic assessments are critical for selecting the optimal treatment. Our nomogram is a predictive model that combines gene information and clinical factors to provide a prognostic indication for clinicians.

However, this study also has certain limitations. First, this is a retrospective study, and the application of this model requires further verification by increasing the sample size and performing prospective studies. Second, the treatments that the patients have received are highly heterogeneous and incomplete, thus we could not include this information in our analysis. Improving these aspects for future in-depth studies could further increase the persuasiveness of these results.

In summary, we screened methylation-driven genes in BC, and a six-gene model was constructed based on methylation profiles. This model was validated in groups with different disease characteristics and could be expected to serve as a predictive tool for clinical outcomes and guide personalized anticancer treatment. In addition, we analysed the relationships between individual CpG islands associated with these six genes and survival, which may provide important bioinformatic clues for mechanistic research related to the development and progression of BC.


Based on public data from the TCGA database, we used MethylMix in R and a LASSO Cox analysis to screen methylation-driven genes associated with prognosis in BC patients. A prediction model based on methylation of six genes (IDH2, GSTM2, LURAP1, ARHGDIB, LINC00526, ARL14) was constructed. The verification in different subgroups demonstrated the validity and consistency of the model. A nomogram was further constructed to predict the likelihood of OS. The ROC curve, nomogram calibration plots and comprehensive survival analysis of each gene revealed that this model is an effective predictive model that can be used as a prognostic marker in BC patients. These results indicate that methylation detection may be an important means to help establish a new evaluation system for prognosis and act as a therapeutic target for antitumour drug development.

Availability of data and materials

All data generated or analysed in this study are included in this published article.



least absolute shrinkage and selection operator


bladder cancer


gene ontology


Kyoto Encyclopedia of Genes and Genomes


The Cancer Genome Atlas


overall survival


biological process


molecular function


concordance index


non-muscle-invasive bladder cancer


muscle-invasive bladder cancer


confidence interval


hazard ratio


  1. 1.

    Sanli O, Dobruch J, Knowles MA, Burger M, Alemozaffar M, Nielsen ME, Lotan Y. Bladder cancer. Nat Rev Dis Primers. 2017;3:17022.

    PubMed  Article  Google Scholar 

  2. 2.

    Siegel RL, Miller KD, Jemal A. Cancer statistics, 2016. CA Cancer J Clin. 2016;66(1):7–30.

    Google Scholar 

  3. 3.

    Pietzak EJ, Bagrodia A, Cha EK, Drill EN, Iyer G, Isharwal S, Ostrovnaya I, Baez P, Li Q, Berger MF, et al. Next-generation sequencing of nonmuscle invasive bladder cancer reveals potential biomarkers and rational therapeutic targets. Eur Urol. 2017;72(6):952–9.

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  4. 4.

    Esteller M. Epigenetics in cancer. N Engl J Med. 2008;358(11):1148–59.

    CAS  PubMed  Article  Google Scholar 

  5. 5.

    Robertson AG, Kim J, Al-Ahmadie H, Bellmunt J, Guo G, Cherniack AD, Hinoue T, Laird PW, Hoadley KA, Akbani R, et al. Comprehensive molecular characterization of muscle-invasive bladder cancer. Cell. 2017;171(3):540–56.

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  6. 6.

    Chung W, Bondaruk J, Jelinek J, Lotan Y, Liang S, Czerniak B, Issa JP. Detection of bladder cancer using novel DNA methylation biomarkers in urine sediments. Cancer Epidemiol Biomark Prev. 2011;20(7):1483–91.

    CAS  Article  Google Scholar 

  7. 7.

    Dudziec E, Goepel JR, Catto JW. Global epigenetic profiling in bladder cancer. Epigenomics. 2011;3(1):35–45.

    CAS  PubMed  Article  Google Scholar 

  8. 8.

    Dulaimi E, Uzzo RG, Greenberg RE, Al-Saleem T, Cairns P. Detection of bladder cancer in urine by a tumor suppressor gene hypermethylation panel. Clin Cancer Res. 2004;10(6):1887–93.

    CAS  PubMed  Article  Google Scholar 

  9. 9.

    Casadio V, Molinari C, Calistri D, Tebaldi M, Gunelli R, Serra L, Falcini F, Zingaretti C, Silvestrini R, Amadori D, et al. DNA Methylation profiles as predictors of recurrence in non muscle invasive bladder cancer: an MS-MLPA approach. J Exp Clin Cancer Res. 2013;32:94.

    PubMed  PubMed Central  Article  Google Scholar 

  10. 10.

    Shoshany O, Mano R, Margel D, Baniel J, Yossepowitch O. Presence of detrusor muscle in bladder tumor specimens—predictors and effect on outcome as a measure of resection quality. Urol Oncol. 2014;32(1):17–40.

    Article  Google Scholar 

  11. 11.

    Zhang Z, Li H, Jiang S, Li R, Li W, Chen H, Bo X. A survey and evaluation of web-based tools/databases for variant analysis of TCGA data. Brief Bioinform. 2018.

    PubMed  Article  Google Scholar 

  12. 12.

    Xiong J, Bing Z, Guo S. Observed survival interval: a supplement to TCGA pan-cancer clinical data resource. Cancers (Basel). 2019;11(3):280.

    Article  Google Scholar 

  13. 13.

    Gevaert O, Tibshirani R, Plevritis SK. Pancancer analysis of DNA methylation-driven genes using MethylMix. Genome Biol. 2015;16:17.

    PubMed  PubMed Central  Article  Google Scholar 

  14. 14.

    Gao C, Zhuang J, Li H, Liu C, Zhou C, Liu L, Sun C. Exploration of methylation-driven genes for monitoring and prognosis of patients with lung adenocarcinoma. Cancer Cell Int. 2018;18:194.

    PubMed  PubMed Central  Article  Google Scholar 

  15. 15.

    Lu T, Chen D, Wang Y, Sun X, Li S, Miao S, Wo Y, Dong Y, Leng X, Du W, et al. Identification of DNA methylation-driven genes in esophageal squamous cell carcinoma: a study based on The Cancer Genome Atlas. Cancer Cell Int. 2019;19:52.

    PubMed  PubMed Central  Article  Google Scholar 

  16. 16.

    GDC Data Portal. Accessed 4 Apr 2019.

  17. 17.

    MethylMix. Accessed 4 Apr 2019.

  18. 18.

    Metascape. Accessed 4 Apr 2019.

  19. 19.

    Zhou Y, Zhou B, Pache L, Chang M, Khodabakhshi AH, Tanaseichuk O, Benner C, Chanda SK. Metascape provides a biologist-oriented resource for the analysis of systems-level datasets. Nat Commun. 2019;10(1):1523.

    PubMed  PubMed Central  Article  Google Scholar 

  20. 20.

    Tang Z, Liu Z, Li R, Yang X, Cui X, Wang S, Yu D, Li H, Dong E, Tian J. Identifying the white matter impairments among ART-naive HIV patients: a multivariate pattern analysis of DTI data. Eur Radiol. 2017;27(10):4153–62.

    PubMed  Article  Google Scholar 

  21. 21.

    Ren S, Huang S, Ye J, Qian X. Safe feature screening for generalized LASSO. IEEE Trans Pattern Anal Mach Intell. 2018;40(12):2992–3006.

    PubMed  Article  Google Scholar 

  22. 22.

    rms. Accessed 15 Apr 2019.

  23. 23.

    Hmisc. Accessed 15 Apr 2019.

  24. 24.

    R. version 3.5.2, Accessed 20 Mar 2019.

  25. 25.

    Casadevall D, Kilian AY, Bellmunt J. The prognostic role of epigenetic dysregulation in bladder cancer: a systematic review. Cancer Treat Rev. 2017;61:82–93.

    CAS  PubMed  Article  Google Scholar 

  26. 26.

    Koch A, Joosten SC, Feng Z, de Ruijter TC, Draht MX, Melotte V, Smits KM, Veeck J, Herman JG, Van Neste L, et al. Analysis of DNA methylation in cancer: location revisited. Nat Rev Clin Oncol. 2018;15(7):459–66.

    CAS  PubMed  Article  Google Scholar 

  27. 27.

    Andresen K, Boberg KM, Vedeld HM, Honne H, Jebsen P, Hektoen M, Wadsworth CA, Clausen OP, Lundin KE, Paulsen V, et al. Four DNA methylation biomarkers in biliary brush samples accurately identify the presence of cholangiocarcinoma. Hepatology. 2015;61(5):1651–9.

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  28. 28.

    Kandimalla R, van Tilborg AA, Zwarthoff EC. DNA methylation-based biomarkers in bladder cancer. Nat Rev Urol. 2013;10(6):327–35.

    CAS  PubMed  Article  Google Scholar 

  29. 29.

    Goessl C, Muller M, Straub B, Miller K. DNA alterations in body fluids as molecular tumor markers for urological malignancies. Eur Urol. 2002;41(6):668–76.

    CAS  PubMed  Article  Google Scholar 

  30. 30.

    Mehta D, Rahman A, Malik AB. Protein kinase C-alpha signals rho-guanine nucleotide dissociation inhibitor phosphorylation and rho activation and regulates the endothelial cell barrier function. J Biol Chem. 2001;276(25):22614–20.

    CAS  PubMed  Article  Google Scholar 

  31. 31.

    Ridley AJ. Rho proteins and cancer. Breast Cancer Res Treat. 2004;84(1):13–9.

    CAS  PubMed  Article  Google Scholar 

  32. 32.

    Zeller C, Dai W, Steele NL, Siddiq A, Walley AJ, Wilhelm-Benartzi CS, Rizzo S, van der Zee A, Plumb JA, Brown R. Candidate DNA methylation drivers of acquired cisplatin resistance in ovarian cancer identified by methylome and expression profiling. Oncogene. 2012;31(42):4567–76.

    CAS  PubMed  Article  Google Scholar 

  33. 33.

    Huang T, Yang J, Cai YD. Novel candidate key drivers in the integrative network of genes, microRNAs, methylations, and copy number variations in squamous cell lung carcinoma. Biomed Res Int. 2015;2015:358125.

    PubMed  PubMed Central  Google Scholar 

  34. 34.

    Theodorescu D, Sapinoso LM, Conaway MR, Oxford G, Hampton GM, Frierson HJ. Reduced expression of metastasis suppressor RhoGDI2 is associated with decreased survival for patients with bladder cancer. Clin Cancer Res. 2004;10(11):3800–6.

    CAS  PubMed  Article  Google Scholar 

  35. 35.

    Ma L, Xu G, Sotnikova A, Szczepanowski M, Giefing M, Krause K, Krams M, Siebert R, Jin J, Klapper W. Loss of expression of LyGDI (ARHGDIB), a rho GDP-dissociation inhibitor, in Hodgkin lymphoma. Br J Haematol. 2007;139(2):217–23.

    CAS  PubMed  Article  Google Scholar 

  36. 36.

    Niu H, Li H, Xu C, He P. Expression profile of RhoGDI2 in lung cancers and role of RhoGDI2 in lung cancer metastasis. Oncol Rep. 2010;24(237):465.

    CAS  PubMed  Google Scholar 

  37. 37.

    Yan J, Xu C, Li Y, Tang B, Xie S, Hong T, Zeng E. Long non-coding RNA LINC00526 represses glioma progression via forming a double negative feedback loop with AXL. J Cell Mol Med. 2019;23(8):5518–31.

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  38. 38.

    Paul P, van den Hoorn T, Jongsma ML, Bakker MJ, Hengeveld R, Janssen L, Cresswell P, Egan DA, van Ham M, Ten BA, et al. A genome-wide multidimensional RNAi screen reveals pathways controlling MHC class II antigen presentation. Cell. 2011;145(2):268–83.

    CAS  PubMed  Article  Google Scholar 

  39. 39.

    Li JJ, Li R, Wang W, Zhang B, Song X, Zhang C, Gao Y, Liao Q, He Y, You S, et al. IDH2 is a novel diagnostic and prognostic serum biomarker for non-small-cell lung cancer. Mol Oncol. 2018;12(5):602–10.

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  40. 40.

    Li J, He Y, Tan Z, Lu J, Li L, Song X, Shi F, Xie L, You S, Luo X, et al. Wild-type IDH2 promotes the Warburg effect and tumor growth through HIF1alpha in lung cancer. Theranostics. 2018;8(15):4050–61.

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  41. 41.

    Marcucci G, Maharry K, Wu YZ, Radmacher MD, Mrozek K, Margeson D, Holland KB, Whitman SP, Becker H, Schwind S, et al. IDH1 and IDH2 gene mutations identify novel molecular subsets within de novo cytogenetically normal acute myeloid leukemia: a cancer and leukemia group B study. J Clin Oncol. 2010;28(14):2348–55.

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  42. 42.

    Li WL, Xiao MS, Zhang DF, Yu D, Yang RX, Li XY, Yao YG. Mutation and expression analysis of the IDH1, IDH2, DNMT3A, and MYD88 genes in colorectal cancer. Gene. 2014;546(2):263–70.

    CAS  PubMed  Article  Google Scholar 

  43. 43.

    Lv Q, Xing S, Li Z, Li J, Gong P, Xu X, Chang L, Jin X, Gao F, Li W, et al. Altered expression levels of IDH2 are involved in the development of colon cancer. Exp Ther Med. 2012;4(5):801–6.

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  44. 44.

    Borodovsky A, Seltzer MJ, Riggins GJ. Altered cancer cell metabolism in gliomas with mutant IDH1 or IDH2. Curr Opin Oncol. 2012;24(1):83–9.

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  45. 45.

    Kresovich JK, Gann PH, Erdal S, Chen HY, Argos M, Rauscher GH. Candidate gene DNA methylation associations with breast cancer characteristics and tumor progression. Epigenomics. 2018;10(4):367–78.

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  46. 46.

    Ashour N, Angulo JC, Andres G, Alelu R, Gonzalez-Corpas A, Toledo MV, Rodriguez-Barbero JM, Lopez JI, Sanchez-Chapado M, Ropero S. A DNA hypermethylation profile reveals new potential biomarkers for prostate cancer diagnosis and prognosis. Prostate. 2014;74(12):1171–82.

    CAS  PubMed  Article  Google Scholar 

Download references


Not applicable.


National Natural Science Foundation of China (81572772).

Author information




JY and ZC designed the study; LW and JS performed the data analysis; YH, SL, JZ and HD contributed the analysis tools; and LW wrote the paper. All authors read and approved the final manuscript.

Corresponding authors

Correspondence to Jin Yang or Zhiwen Chen.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Additional file 1: Table S1.

Expression level of 167 driven genes.

Additional file 2: Table S2.

Methylation level of 167 driven genes.

Additional file 3: Table S3.

Metascape functional analysis results.

Additional file 4: Table S4.

Clinical signatures, mRNA types and mutational signatures of all patients.

Additional file 5: Figure S1.

Kaplan–Meier curves based on mRNA types and mutational signatures.

Additional file 6: Figure S2.

Nomogram of six genes.

Additional file 7: Figure S3.

Correlation between methylated sites and gene expression.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (, which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Wang, L., Shi, J., Huang, Y. et al. A six-gene prognostic model predicts overall survival in bladder cancer patients. Cancer Cell Int 19, 229 (2019).

Download citation


  • Bladder cancer
  • Methylation
  • TCGA
  • LASSO Cox
  • Nomogram
  • Survival analysis