Establishment and validation of a prognostic signature for lung adenocarcinoma based on metabolism‐related genes
Cancer Cell International volume 21, Article number: 219 (2021)
Given that dysregulated metabolism has been recently identified as a hallmark of cancer biology, this study aims to establish and validate a prognostic signature of lung adenocarcinoma (LUAD) based on metabolism-related genes (MRGs).
The gene sequencing data of LUAD samples with clinical information and the metabolism-related gene set were obtained from The Cancer Genome Atlas (TCGA) and Molecular Signatures Database (MSigDB), respectively. The differentially expressed MRGs were identified by Wilcoxon rank sum test. Then, univariate cox regression analysis was performed to identify MRGs that related to overall survival (OS). A prognostic signature was developed by multivariate Cox regression analysis. Furthermore, the signature was validated in the GSE31210 dataset. In addition, a nomogram that combined the prognostic signature was created for predicting the 1-, 3- and 5-year OS of LUAD. The accuracy of the nomogram prediction was evaluated using a calibration plot. Finally, cox regression analysis was applied to identify the prognostic value and clinical relationship of the signature in LUAD.
A total of 116 differentially expressed MRGs were detected in the TCGA dataset. We found that 12 MRGs were most significantly associated with OS by using the univariate regression analysis in LUAD. Then, multivariate Cox regression analyses were applied to construct the prognostic signature, which consisted of six MRGs-aldolase A (ALDOA), catalase (CAT), ectonucleoside triphosphate diphosphohydrolase-2 (ENTPD2), glucosamine-phosphate N-acetyltransferase 1 (GNPNAT1), lactate dehydrogenase A (LDHA), and thymidylate synthetase (TYMS). The prognostic value of this signature was further successfully validated in the GSE31210 dataset. Furthermore, the calibration curve of the prognostic nomogram demonstrated good agreement between the predicted and observed survival rates for each of OS. Further analysis indicated that this signature could be an independent prognostic indicator after adjusting to other clinical factors. The high-risk group patients have higher levels of immune checkpoint molecules and are therefore more sensitive to immunotherapy. Finally, we confirmed six MRGs protein and mRNA expression in six lung cancer cell lines and firstly found that ENTPD2 might played an important role on LUAD cells colon formation and migration.
We established a prognostic signature based on MRGs for LUAD and validated the performance of the model, which may provide a promising tool for the diagnosis, individualized immuno-/chemotherapeutic strategies and prognosis in patients with LUAD.
Lung cancer is one of the most commonly diagnosed cancer types with high mortality worldwide in men and women . Lung adenocarcinoma (LUAD), which is considered a highly molecular heterogeneous disease, is a prevalent pathological subtype of lung cancer with an average 5-year survival rate of only 15 % [2,3,4]. Molecular targeted therapy for LUAD has been widely accepted in recent years, and the epidermal growth factor receptor (EGFR) gene, the anaplastic lymphoma kinase (ALK) gene, and the Kirsten rat sarcoma viral oncogene (KRAS) gene are important targets of LUAD [5,6,7]. Despite great clinical improvements in the molecular basis, diagnosis and treatment modalities of LUAD, the recurrence rate still remains high, and the survival rate remains poor [4, 8]. As LUAD has the tendency of early metastasis, and most of them are found at advanced stages, which may be the most important cause of high mortality in LUAD patients [9, 10]. There is an urgent need, therefore, to develop more reliable and more effective biomarkers for the early detection, diagnosis, prognosis and monitoring of LUAD.
Dysregulated metabolism has been recently identified as a well-recognized hallmark of cancer biology, meeting the requirements of rapid proliferation and preferential survival of cancer cells [11, 12]. In the 1920 s, Otto Warburg first discovered that cancer cells vigorously take up glucose and convert pyruvate into lactate despite the presence of sufficient oxygen, a phenomenon now widely termed aerobic glycolysis, or the Warburg effect [13, 14]. This phenomenon not only provide a niche for the survival and proliferation of tumor cells, but also has a profound effect on the tumor microenvironment . In addition, it has recently been reported that high concentrations of lactate in the tumor microenvironment are associated with distant metastasis and poor prognosis in a multitude of cancers, including LUAD [16, 17]. There is general agreement that the metabolic processes play an important role in the pathogenesis and progression of lung cancer. However, few studies have comprehensively analyzed the relationship between metabolism-related genes (MRGs) and the diagnosis, risk stratification, prognosis, and survival of LUAD by high-throughput biomarker sequencing.
In the present study, we constructed a prognostic signature based on MRGs from The Cancer Genome Atlas (TCGA) database, which was further validated in the GSE31210 dataset to explore an efficient metabolic biomaker for the more accurate stratification management of LUAD. In addition, a nomogram that combined six MRGs was created for predicting the 1-, 3- and 5-year OS of LUAD. The accuracy of the nomogram prediction was evaluated using a calibration plot. Cox regression analysis was applied to identify the prognostic value and clinical relationship of the signature in LUAD. Moreover, the high-risk group patients have higher expression of immune checkpoint molecules and are more sensitive to immunotherapy. Finally, we confirmed six MRGs protein and mRNA expression in six lung cancer cell lines and firstly found that ENTPD2 might played an important role on LUAD cells colon formation and migration.
Materials and methods
The transcriptomic and the corresponding clinical data of patients with LUAD were downloaded from TCGA (https://portal.gdc.cancer.gov/) database and the Gene Expression Omnibus (GEO; https://www.ncbi.nlm.nih.gov/geo/) database. The RNA-seq data, including 497 LUAD and 54 adjacent non-tumor cases from TCGA database and 174 LUAD cases from GSE31210 dataset were examined. The MRGs were identified from the metabolic pathway-related gene sets of “c2.cp.kegg.v7.0.symbols” in gene set enrichment analysis (GSEA). MRGs can be further analyzed only when they are included in the above data sets.
Differentially expressed MRGs and enrichment analysis
The differentially expressed MRGs in LUAD and normal lung tissues were detected using the R package limma and the Wilcoxon test method . |logFC|>1 and adjusted P < 0.05 were considered as significant. To explore the characteristic biological function and potential pathways of these MRGs, Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analysis were were carried out with R package clusterProfiler . Functional categories with a false discovery rate (FDR) smaller than 0.05 were presented.
Construction of the metabolism‑related signature for LUAD
To avoid the interference of irrelevant factors, patients with follow-up time between 0 day and 2000 days were included. The 426 LUAD samples with survival information in the TCGA dataset were taken as the training set for constructing the prognosis risk model, and the 174 LUAD samples with survival information in the GSE31210 dataset were explored for external validation. Firstly, univariate Cox analysis was used to screen out MRGs associated with the OS of patients with LUAD, and only MRGs with a P value < 0.001 were selected for subsequent analyses. To avoid the prognostic signature overfitting and narrow the genes for prediction of the OS, Lasso Cox regression was carried out using R “glmnet” package. MRGs detected via Lasso algorithm were evaluated by step wise multivariate cox regression analysis. By weighting the estimated cox regression coefficients, the model of tumor-related metabolism genes risk was constructed . The prognostic metabolism-related gene signatures were shown as risk score = Ʃ (βi × Expi), where βi, the coefficients, represented the weight of the respective signature and Expi represented the expression value. Based on the risk score formula, patients were assigned into low-risk group and high-risk group with the median risk score as the cut-off point. The Kaplan-Meier (K-M) survival curve was used the log-rank test to evaluate the differences in survival rate between the two groups. Furthermore, the receiver operating characteristic (ROC) curve was implemented by R “survival ROC” package  and the corresponding area under the ROC curve (AUC) was measured to assess the sensitivity and specificity of the metabolism-related signature.
Validation of the metabolism‐related signature for LUAD
To verify the prognostic value of metabolism-related signature, we used the GSE31210 dataset as the validation cohort. The same formula was used to calculate the risk scores for each patient. Survival and ROC curve analyses were implemented as described above. Finally, according to the results of multivariate cox regression analysis, a nomogram for predicting the likelihood of 1-year, 3-year and 5-year OS was constructed by R “rms” package. The calibration plots were used to evaluate the prognostic accuracy of the nomogram.
Analysis of these crucial MRGs expression level
Differential expression of these hub MRGs at the transcription level were examined by matching cancer and adjacent normal lung tissues from the TCGA database. For further validation of our analysis, The Human Protein Atlas (HPA) online database (http://www.proteinatlas.org/) was applied to identify the expression of these MRGs at the translational level .
Association of the prognostic signature and clinicopathological features
In addition, univariate and multivariate analyses were used to estimate the effect of risk score on OS and the clinicopathologic features (age, gender, clinical stage and pathological grading). We also explored the correlation between the expression of these MRGs and several clinical features. Time-dependent ROC curve was performed to compare the accuracy of the prediction between the clinicopathologic features and risk score.
Assessing the immuno-/chemotherapeutic response of the risk subtypes for LUAD patients
Immune checkpoint therapy has made important clinical advances and offered a new weapon against cancer, which would ideally be matched to those patients most likely to benefit [23, 24]. Then, we investigated the expression of crucial immunomodulators between low-risk group and high-risk group. Moreover, according to the Genomics of Drug Sensitivity in Cancer (GDSC, https://www.cancerrxgene.org), the chemotherapeutic response for common chemotherapy drugs of each LUAD patients was calculated by by R “pRRophetic” package.
Colon assay and Western blot analysis
Quantitative real-time PCR
Total RNA was extracted with the TRIzol Reagent (Invitrogen Carlsbad, CA, USA), and the concentration was measured using an ultraviolet (UV) spectrophotometer (UV-1201; Shimadzu Corporation, Kyoto, Japan). Reverse transcription (RT) was performed as described previously . Real-time PCR was conducted using the SYBR-Green PCR kit (Takara, Osaka, Japan) in a Rotor-Gene 3000 machine (Corbett Life Science, Sydney, Australia). The quantitative analysis of the transcription of CAT, LDHA and ENTPD2 was described previously . Each reaction was performed in a 25µL volume containing 2µL of cDNA, 0.5µL of 10µM per each primer, and 12.5 µL of the 2× SYBR-Green mixture. CAT: For: 5′-AGA TGC GGC GAG ACT TTC-3′, Rev: 5′-CAA CTG GGA TGA GAG GGT AG-3′. LDHA: For: 5′-CTG TAT GGA GTG GAA TGA ATG-3′, Rev: 5′ -GAT GTG TAG CCT TTG AGT TTG-3′. ENTPD2: For: 5′-GAC GCT GGT TCT TCA CAC − 3′, Rev: 5′ -CTC TTT GGG CAC ATC CTG-3′.
All statistical analyses were performed by version 3.6.1 of R software (https://www.r-project.org/) and version 5.28.1 of Perl software (http://www.perl.org). The Wilcoxon test was used to compare two paired groups. The Kaplan-Meier survival curves were compared with the log-rank test. If not otherwise stated, data were considered to be statistically significant with P value < 0.05.
Identification of differentially expressed MRGs in LUAD
According to the KEGG metabolic pathway-related gene sets, a total of 944 MRGs were obtained from the gene sets of “c2.cp.kegg.v7.0.symbols”. We matched these genes with the sequence data of LUAD related mRNA in the TCGA database and GSE31210 dataset, and only common mRNAs were used. Considering the cutoff criteria (adjusted P value < 0.05 and |log FC| > 1.0), 116 differentially expressed MRGs, which consist of 31 downregulated and 85 upregulated MRGs (Fig. 1), were selected for subsequent analysis.
Functional enrichment of the differentially expressed MRGs
To investigate the potential functional implication of these MRGs, 116 differentially expressed MRGs were further analyzed by GO functional enrichment analysis and KEGG pathway enrichment analysis. A total of 431 GO terms and 42 pathways were identified (adjusted P < 0.05). The top 30 enrichment GO analysis and top 30 enrichment KEGG analysis were displayed in Fig. 2. The top enriched GO terms in biological processes were carboxylic acid biosynthetic process and organic acid biosynthetic process, and those in cellular components were mitochondrial matrix, ficolin-1-rich granule lumen, and ficolin-1-rich granule, in terms of molecular function, genes were mostly enriched in terms of co-factor binding. In the KEGG pathway enrichment analysis, these genes were shown to be significantly associated with signaling pathway related to material synthesis and material metabolism, such as “biosynthesis of amino acids”, “arginine and proline metabolism”, “glycolysis/gluconeogenesis”, “carbon metabolism” and et al.
Establishment of metabolism‐related prognostic signature for LUAD
To identify MRGs associated with OS, a univariate cox proportional hazard regression analysis was initially performed on 116 differentially expressed MRGs in the TCGA database. The result showed that 12 MRGs were significantly associated with the OS (Fig. 3a; P < 0.001). Of the survival-related MRGs, 10 genes (ALDOA, TPI1, PKM, LDHA, GPI, PFKP, RRM2, TYMS, GNPNAT1, and ENTPD2) were considered risk factors (all P < 0.001; HRs, 1.0026–1.1103) and that their overexpression might reduce survival. While, overexpression of the remaining two genes (CAT and FBP1) (all P < 0.001; HRs,0.9747 and 0.9907, respectively) might improve the survival of patients. The Lasso regression analysis was then used to remove MRGs that may be highly related to other MRGs (Fig. 3b–c). Furthermore, a prognostic signature model was established based on multivariate Cox regression analysis. Finally, six MRGs were confirmed and applied to establish a metabolism-related signature (Fig. 3d). A prognostic model was constructed to evaluate the prognosis of each patient as follows: Risk score = (0.001709×expression value of ALDOA) + (-0.01187×expression value of CAT) + (0.082279×expression value of ENTPD2) + (0.030344×expression value of GNPNAT1) +(0.003499×expression value of LDHA) + (0.018476×expression value of TYMS).
Then, the risk score of each patient was calculated according to this prognostic model. Based on the median risk score, 426 LUAD patients were classified into a high risk group (n = 213) and low risk group (n = 213). The risk score, survival status and gene expression heatmap of these prognostic MRGs are presented in Fig. 4a–c. Kaplan-meier log-rank test indicated that patients in the high risk group showed markedly poorer OS than those in the low risk group (Fig. 4d). Areas under the curve value of the signature predicting the 1-, 3- and 5-year OS rates were 0.73, 0.703 and 0.854, indicating that this prognostic model exhibited a good sensitivity and specificity (Fig. 4e).
Validation of the metabolism‐related prognostic signature for LUAD
The GSE31210 dataset including 174 LUAD samples were used for the validation of the metabolism-related signature. According to the median risk score, we divided patients into high risk (n = 78) and low risk groups (n = 96). Consistent with the results derived from the TCGA database, the Kaplan-Meier curve demonstrated that patients in the high risk group exhibited markedly poorer OS than those in the low risk group (Fig. 5d). The risk score, survival status and gene expression heatmap of these prognostic MRGs were shown in Fig. 5a-c. The AUCs for 1-, 3- and 5-year OS rates were 0.654, 0.705 and 0.725 (Fig. 5e). A nomogram for predicting 1-, 3- and 5-year OS of patients with LUAD was constructed with the six prognostic genes that had most significant values in multivariate analysis (Fig. 6a). In addition, the calibration curve of the prognostic nomogram demonstrated good agreement between the predicted and observed survival rates for each of OS (Fig. 6b-d).
Analysis of these crucial MRGs expression level
To explore the six hub genes at the transcription level, the mRNA expression levels were analyzed using the TCGA database. The results demonstrated that the expression of ALDOA, ENTPD2, GNPNAT1, LDHA, and TYMS in LUAD tissues were all increased than those of adjacent normal lung tissues, while the expression of CAT was decreased than those of adjacent tissues (Fig. 7). Subsequently, we also investigated the expression of hub genes between low and high risk group. The results showed that high risk group had higher expression of five hub MRGs (ALDOA, ENTPD2, GNPNAT1, LDHA, and TYMS) than low risk group, while the expression of CAT was decreased than high risk group (Additional file 1: Figure S1, P < 0.001). To assess the six hub genes at the translational level, the protein expression levels were analyzed using the HPA database. The results showed that the protein level of ALDOA, ENTPD2, GNPNAT1, LDHA, and TYMS were increased in LUAD tissues than in normal tissues, matched their mRNA expression levels (Fig. 8). There is no difference between LUAD tissues and normal tissues for the protein level of CAT (Fig. 8b). Finally, to further investigate functional implication of six MRGs, GO and KEGG analysis were conducted in R software. GO analysis showed that six MRGs were mainly enriched in metabolic process, including purine nucleoside diphosphate, puriner ibonucleoside diphosphate, ribonucleoside diphosphate and nucleoside diphosphate metabolic process (Additional file 2: Figure S2a). KEGG analysis showed that six MRGs were significantly enriched glycolysis/gluconeogenesis, HIF-1 signaling pathway and carbon metabolism (Additional file 2: Figure S2b).
Clinical value of prognostic signature
Univariate and multivariate Cox regression analysis was conducted to evaluate the independent prediction ability of metabolism-related prognostic signature between the signature and other common prognostic factors, including age, gender, histological grade, pathological stage and TNM stage. Although univariate cox analysis indicated that pathologic stage, T stage, N stage and our model were markedly associated with OS (Fig. 9a; p < 0.001), after the multivariate analysis, only metabolism-related prognostic signature (p < 0.001) and pathological stage (p < 0.007) could be used as an independent prognostic factor (Fig. 9b). To further evaluate the clinical value of MRGs, the relationship between MRGs prognostic indicators and clinical features were investigated, and the results indicated that ALDOA, ENTPD2, GNPNAT1, LDHA, and CAT were differentially expressed in patients with various clinical features (Fig. 10). To validate the clinical value of the metabolism-related prognostic signature, the association between the risk score and clinical characteristics were subsequently assessed, and the results demonstrated that high risk scores were positively associated with survival status, gender, N stage, and pathologic stage in patients with LUAD (Fig. 10). To investigate more additional prognostic value of the prognostic signature, time-dependent ROC curve was performed. The AUC value of the risk score predicting the 0.5-, 1-, 2-, 3- and 5-year OS rates were 0.745, 0.779, 0.726, 0.716, 0.760 and 0.784, respectively, which was much higher than clinicopathologic features(Additional file 3: Figure S3). The results revealed that prognostic accuracy of the signature was superior to clinical risk factors.
Assessment of the immuno-/chemotherapeutic response in the risk subtypes for LUAD patients
Immune checkpoint therapy, which improved antitumor immune responses by regulating the activity of T cells, has been emerged as a new weapon against cancer. Thus, we analyzed the expression of immune checkpoint molecules (PD-L1, PD-L2, CTLA-4 and PD-1) between low-risk group and high-risk group in LUAD samples. As shown in Fig. 11a, high-risk patients with LUAD had significantly higher expression of immune checkpoint molecules than low-risk patients. Chemotherapy is an effective treatment for patients with advanced LUAD. The IC50 values of the low-risk and high-risk groups were calculated based on the GDSC data, as shown in Fig. 11b, the results indicated that no chemotherapeutic drugs with significant response sensitivity were found in the high-risk group. The above results demonstrateted that the poor prognosis of high-risk patients might be related to the immunosuppressive microenvironment and chemotherapy resistance. That is to say, the high-risk group patients have higher expression of immune checkpoint molecules and are more sensitive to immunotherapy.
The protein expression levels of ALDOA, ENTPD2, LDHA, TYMS and CAT were investigated in 5 lung cancer cell lines (A549, H460, H1299, H1975, PC9), normal airway epithelial cells (16HBE) as control. The same with the results we have developed from bioinformatics, ALDOA, ENTPD2, LDHA, TYMS were significantly increased in 5 lung cancer cell line, comparing with in 16HBE. CAT was significantly decreased in 5 lung cancer cell line, comparing with in 16HBE. The gray value of protein bands was quantified (Fig. 12).
The mRNA expression levels of CAT, ENTPD2 and LDHA were also investigated in 6 lung cancer cell lines (A549, H460, H1299, H1975, PC9, Lewis), 16HBE as control. The results were also same with protein data, ENTPD2 (Fig. 13a) and LDHA (Fig. 13b) were significantly increased, while CAT (Fig. 13c) was significantly decreased, comparing with in 16HBE.
Since ENTPD2 may be a good prognostic marker and therapeutic target for cancer patients, especially for those receiving immune therapy . We used ENTPD2 inhibitor POM-1 in 5 lung cancer cell lines, found that it could inhibit the formation of colonies in A549 and PC9, decreased colony-forming was also observed in H1975, which were all lung adenocarcinoma cell lines (Fig. 14b). The protein expression of ENTPD2 in 4 cell lines was confirmed by western blot after adding POM-1 (Fig. 14a). Most importantly, we found POM-1 could inhibit the migration of 5 lung cancer cell lines (Fig. 14c).
LUAD, which is highly heterogeneous in morphological characteristics and remarkably variable in prognosis, is the most prevalent subtype of non-small cell lung cancer (NSCLC) [4, 29]. More and more attention has been recently paid to the key role of gene signatures based on specific correlation in predicting the prognosis of LUAD because of the rapid advances in high-throughput technologies and bioinformatics methodology [30,31,32,33]. Moreover, the identification of novel gene signatures that predict the prognosis of patients is beneficial for the choice of treatment regimens and the improvement of survival rate [34, 35].
In recent years, interesting in dysregulated metabolism of cancer has been growing . Accumulating evidence showed that MRGs played a key role in cancer development and progression . Therefore, the identification of novel MRGs has lately become a hotspot in cancer research, both as a biomarker and potential therapeutic target. In this study, a total of 116 differentially expressed MRGs, which consist of 31 downregulated and 85 upregulated MRGs, were detected in the TCGA dataset. We found that 12 MRGs were most significantly associated with OS by using the univariate regression analysis in LUAD. After conducting the LASSO regression and multivariable Cox regression analyses, a novel prognostic signature which consisted of six MRGs (ALDOA, CAT, ENTPD2, GNPNAT1, LDHA, and TYMS) was established. Based on the gene signature, LUAD patients were classified into a high risk group and low risk group. Patients in the high risk group, which had a survival rate lower than 15 %, showed markedly poorer OS than the low risk group. The time-dependent ROC analysis demonstrated that the AUC for 1, 3, and 5 years were 0.73, 0.703, and 0.854, respectively, indicating that this prognostic signature have good sensitivity and specificity. The prognostic value of this signature was further successfully validated in the GSE31210 dataset. Moreover, the calibration curve of the prognostic nomogram demonstrated good agreement between the predicted and observed survival rates for each OS. Further analysis indicated that this signature could be an independent prognostic indicator after adjusting to other clinical factors. Moreover, the high-risk group patients have higher expression of immune checkpoint molecules and are more sensitive to immunotherapy. Finally, the signature was found to be associated with various clinicopathological features.
Furthermore, six genes (ALDOA, CAT, ENTPD2, GNPNAT1, LDHA, and TYMS) in this prognostic signature were selected as crucial MRGs. ALDOA is an important enzyme involved in the glycolysis pathway that is highly expressed in a wide range of cancers . Some studies also proved that the overexpression of ALDOA might contribute to tumorigenesis and the progression of cancers through modulation of HIF-1α signaling [39, 40]. Our results showed that ALDOA might be a tumor-promoting gene in LUAD. Abnormal expression or decreased activity of CAT can lead to an increase in intracellular ROS concentration, which directly or indirectly induces tumorigenesis [41, 42]. Consistently, our study found that compared with normal lung tissues, the mRNA level of CAT in LUAD tissues was down-regulated. GNPNAT1, a member of the GNAT protein superfamily, is a key enzyme in the metabolic pathway of N-acetylglucosamine synthesis . Zhao et al. reported that the overexpression of GNPNAT1 could promote the infiltration and adhesion of lung cancer cells . In line with their findings, we found GNPNAT1 was also increased in LUAD. LDHA catalyzes the conversion of pyruvate to lactate with concomitant oxidation of NADH to NAD+, which plays an essential role in metabolic pathways of the cancer cells . Recently, accumulating evidence showed that the overexpression of LDHA could promote the production of lactate, thus contributing to the acidification of the tumor microenvironment, which may limit the effect of anti-PD-L1 therapy [46, 47]. We also found increased LDHA in LUAD might associated with a poor prognosis. TYMS, a rate-limiting enzyme during the DNA synthesis, plays an important role in catalyzing the methylation of deoxyuridine monophosphate to deoxythymidine monophosphate . High levels of TYMS expression are related to worse responses to 5-FU, shorter survival times and other adverse clinical behaviors in a variety of solid tumors [49, 50]. Since only patients with low expression of TYMS can respond to 5-FU, individualized chemotherapy regimens can be selected according to the expression of TYMS and tumor classification . We also found TYMS as a part of the nomogram could predict LUAD patient prognosis.
In this study, six MRGs prognostic indicators were identified for the first time to be possibly associated with the survival outcome of LUAD. We also confirm the protein and mRNA expression in lung cancer cell lines by some experimental validation. To obtain a deep understanding of the selected genes, the functional annotation analyses of ENTPD2 were performed. ENTPD2 belongs to enzymes nucleoside triphosphate diphosphohydrolase family (NTPDase). NTPDase1(CD39) was played a key role in turning an ATP-mediated immune-stimulating into an adenosine-mediated immunosuppressant tumor microenvironment (TME) involving the coordinated control of inflammatory responses and tumor-associated antigen-specifific T cell immunity . While over-expression of ENTPD2 was a poor prognostic indicator for HCC, ENTPD2 inhibition was able to mitigate cancer growth and enhance the efficiency and efficacy of immune checkpoint inhibitors . In this study, we confirmed ENTPD2 played an important role on cell colon formation and migration in LUAD for the first time.
However, we should acknowledge that there are some limitations in the present study which should be addressed in future studies. First, the potential selection bias could not be ruled out because of the transcriptomic and the corresponding clinical data of patients with LUAD were obtained from public database. Second, the robustness of the prognostic signature must be validated in large prospective studies.
In summary, we identified a novel signature based on MRGs that could be applied to analyze the prognostic of patients with LUAD, and verified by the data from the GEO databases and experimental validation. Meanwhile, we firstly developed the function of ENTPD2 on cells colon formation and migration in 5 lung cancer cell lines. This signature may provide valuable information either for diagnosis or developing novel therapeutic options for LUAD patients in the future.
Availability of data and materials
All data generated or analyzed during the present study was downloaded from TCGA database, GEO database, HPA database and GDSC database.
The Cancer Genome Atlas
Gene Expression Omnibus
Kyoto Encyclopedia of Genes and Genomes
False discovery rate
Receiver operating characteristic
Tumour size/lymph nodes/distance metastasis, a tumour staging system used in oncology and constructed by the American Joint Committee on Cancer and the Union for International Cancer Control
Siegel RL, Miller KD, Jemal A. Cancer statistics, 2019. Cancer J Clin. 2019;69(1):7–34.
Travis WD. Reporting lung cancer pathology specimens. Impact of the anticipated 7th Edition TNM Classification based on recommendations of the IASLC Staging Committee. Histopathology. 2009;54(1):3–11.
Ferlay J, Colombet M, Soerjomataram I, Mathers C, Parkin DM, Pineros M, Znaor A, Bray F. Estimating the global cancer incidence and mortality in 2018: GLOBOCAN sources and methods. Int J Cancer. 2019;144(8):1941–53.
Cancer Genome Atlas Research N. Author Correction: Comprehensive molecular profiling of lung adenocarcinoma. Nature. 2018;559(7715):E12–2.
Paez JG, Janne PA, Lee JC, Tracy S, Greulich H, Gabriel S, Herman P, Kaye FJ, Lindeman N, Boggon TJ, et al. EGFR mutations in lung cancer: Correlation with clinical response to gefitinib therapy. Science. 2004;304(5676):1497–500.
Solomon BJ, Mok T, Kim D-W, Wu Y-L, Nakagawa K, Mekhail T, Felip E, Cappuzzo F, Paolini J, Usari T, et al. First-Line Crizotinib versus Chemotherapy in ALK-Positive Lung Cancer. N Engl J Med. 2014;371(23):2167–77.
Kim YT, Kim T-y, Lee DS, Park SJ, Park J-Y, Seo S-J, Choi H-S, Kang HJ, Hahn S, Kang CH, et al. Molecular changes of epidermal growth factor receptor (EGFR) and KRAS and their impact on the clinical outcomes in surgically resected adenocarcinoma of the lung. Lung Cancer. 2008;59(1):111–8.
Field JK, Oudkerk M, Pedersen JH, Duffy SW. Prospects for population screening and diagnosis of lung cancer. Lancet. 2013;382(9893):732–41.
Collins LG, Haines C, Perkel R, Enck RE. Lung cancer: diagnosis and management. Am Family Phys. 2007;75(1):56–63.
Riihimäki M, Hemminki A, Fallah M, Thomsen H, Sundquist K, Sundquist J, Hemminki K. Metastatic sites and survival in lung cancer. Lung Cancer. 2014;86(1):78–84.
Hanahan D, Weinberg RA. Hallmarks of Cancer: The Next Generation. Cell. 2011;144(5):646–74.
Hirschey MD, DeBerardinis RJ, Diehl AME, Drew JE, Frezza C, Green MF, Jones LW, Ko YH, Le A, Lea MA, et al. Dysregulated metabolism contributes to oncogenesis. Semin Cancer Biol. 2015;35:129–50.
Warburg O. On the origin of cancer cells. Science. 1956;123(3191):309–14.
Upadhyay M, Samal J, Kandpal M, Singh OV, Vivekanandan P. The Warburg effect: insights from the past decade. Pharmacol Ther. 2013;137(3):318–30.
Ruocco MR, Avagliano A, Granato G, Vigliar E, Masone S, Montagnani S, Arcucci A. Metabolic flexibility in melanoma: A potential therapeutic target. Semin Cancer Biol. 2019;59:187–207.
Gallo M, Sapio L, Spina A, Naviglio D, Calogero A, Naviglio S. Lactic dehydrogenase and cancer: an overview. Front Biosci (Landmark edition). 2015;20:1234–49.
Gottfried E, Kreutz M, Mackensen A. Tumor metabolism as modulator of immune response and tumor progression. Semin Cancer Biol. 2012;22(4):335–41.
Noble WS. How does multiple testing correction work? Nature Biotechnol. 2009;27(12):1135–7.
Yu G, Wang L-G, Han Y, He Q-Y. clusterProfiler: an R Package for Comparing Biological Themes Among Gene Clusters. Omics J Integrative Biol. 2012;16(5):284–7.
Zhang Z, Reinikainen J, Adeleke KA, Pieterse ME, Groothuis-Oudshoorn CGM. Time-varying covariates and coefficients in Cox regression models. Annals Transl Med 2018, 6(7).
Heagerty PJ, Lumley T, Pepe MS. Time-dependent ROC curves for censored survival data and a diagnostic marker. Biometrics. 2000;56(2):337–44.
Thul PJ, Akesson L, Wiking M, Mahdessian D, Geladaki A, Blal HA, Alm T, Asplund A, Bjork L, Breckels LM, et al: A subcellular map of the human proteome. Science 2017, 356(6340).
Postow MA, Callahan MK, Wolchok JD. Immune Checkpoint Blockade in Cancer Therapy. J Clin Oncol. 2015;33(17):1974–82.
Sharma P, Allison JP. The future of immune checkpoint therapy. Science. 2015;348(6230):56–61.
Yang W, Soares J, Greninger P, Edelman EJ, Lightfoot H, Forbes S, Bindal N, Beare D, Smith JA, Thompson IR, et al. Genomics of Drug Sensitivity in Cancer (GDSC): a resource for therapeutic biomarker discovery in cancer cells. Nucleic Acids Res. 2013;41(Database issue):D955–61.
Zhong G, Qin S, Townsend D, Schulte BA, Tew KD, Wang GY. Oxidative stress induces senescence in breast cancer stem cells. Biochem Biophys Res Commun. 2019;514(4):1204–9.
Yang A, Qin S, Schulte BA, Ethier SP, Tew KD, Wang GY. MYC Inhibition Depletes Cancer Stem-like Cells in Triple-Negative Breast Cancer. Cancer Res. 2017;77(23):6641–50.
Chiu DK, Tse AP, Xu IM, Di Cui J, Lai RK, Li LL, Koh HY, Tsang FH, Wei LL, Wong CM, et al. Hypoxia inducible factor HIF-1 promotes myeloid-derived suppressor cells accumulation through ENTPD2/CD39L1 in hepatocellular carcinoma. Nature communications. 2017;8(1):517.
Matsuda T, Machii R: Morphological distribution of lung cancer from Cancer Incidence in Five Continents Vol X. Japanese J Clin Oncol 2015, 45(4):404.
Wang X, Yao S, Xiao Z, Gong J, Liu Z, Han B, Zhang Z. Development and validation of a survival model for lung adenocarcinoma based on autophagy-associated genes. J Transl Med. 2020;18(1):149.
Li W, Gao LN, Song PP, You CG. Development and validation of a RNA binding protein-associated prognostic model for lung adenocarcinoma. Aging. 2020;12(4):3558–73.
Xue L, Bi G, Zhan C, Zhang Y, Yuan Y, Fan H. Development and Validation of a 12-Gene Immune Relevant Prognostic Signature for Lung Adenocarcinoma Through Machine Learning Strategies. Front Oncol. 2020;10:835.
Li W, Li N, Gao L, You C. Integrated analysis of the roles and prognostic value of RNA binding proteins in lung adenocarcinoma. PeerJ. 2020;8:e8509.
Wang Z, Wang Z, Niu X, Liu J, Wang Z, Chen L, Qin B. Identification of seven-gene signature for prediction of lung squamous cell carcinoma. OncoTargets Ther. 2019;12:5979–88.
Ma X, Ren H, Peng R, Li Y, Ming L. Identification of key genes associated with progression and prognosis for lung squamous cell carcinoma. PeerJ. 2020;8:e9086.
Porta C, Sica A, Riboldi E. Tumor-associated myeloid cells: new understandings on their metabolic regulation and their influence in cancer immunotherapy. FEBS J. 2018;285(4):717–33.
Pavlova NN, Thompson CB. The Emerging Hallmarks of Cancer Metabolism. Cell Metabol. 2016;23(1):27–47.
Asaka M, Kimura T, Meguro T, Kato M, Kudo M, Miyazaki T, Alpert E. Alteration of aldolase isozymes in serum and tissues of patients with cancer and other diseases. J Clin Lab Anal. 1994;8(3):144–8.
Ji S, Zhang B, Liu J, Qin Y, Liang C, Shi S, Jin K, Liang D, Xu W, Xu H, et al. ALDOA functions as an oncogene in the highly metastatic pancreatic cancer. Cancer Lett. 2016;374(1):127–35.
Saito Y, Takasawa A, Takasawa K, Aoyama T, Akimoto T, Ota M, Magara K, Murata M, Hirohashi Y, Hasegawa T, et al: Aldolase A promotes epithelial-mesenchymal transition to increase malignant potentials of cervical adenocarcinoma. Cancer Sci 2020.
Hasegawa Y, Takano T, Miyauchi A, Matsuzuka F, Yoshida H, Kuma K, Amino N. Decreased expression of catalase mRNA in thyroid anaplastic carcinoma. Jpn J Clin Oncol. 2003;33(1):6–9.
Glorieux C, Dejeans N, Sid B, Beck R, Calderon PB, Verrax J. Catalase overexpression in mammary cancer cells leads to a less aggressive phenotype and an altered response to chemotherapy. Biochem Pharmacol. 2011;82(10):1384–90.
Peneff C, Mengin-Lecreulx D, Bourne Y. The crystal structures of Apo and complexed Saccharomyces cerevisiae GNA1 shed light on the catalytic mechanism of an amino-sugar N-acetyltransferase. J Biol Chem. 2001;276(19):16328–34.
Zhao M, Li H, Ma Y, Gong H, Yang S, Fang Q, Hu Z: Nanoparticle abraxane possesses impaired proliferation in A549 cells due to the underexpression of glucosamine 6-phosphate N-acetyltransferase 1 (GNPNAT1/GNA1). Int J Nanomed 2017, 12:1685–1697.
Johnson KP, Hillman JD. Competitive properties of lactate dehydrogenase mutants of the oral bacterium Streptococcus mutans in the rat. Archives Oral Biol. 1982;27(6):513–6.
Hinzman CP, Aljehane L, Brown-Clay JD, Kallakury B, Sonahara F, Goel A, Trevino J, Banerjee PP. Aberrant expression of PDZ-binding kinase/T-LAK cell-originated protein kinase modulates the invasive ability of human pancreatic cancer cells via the stabilization of oncoprotein c-MYC. Carcinogenesis. 2018;39(12):1548–59.
Zhang J, Wolfgang CL, Zheng L. Precision Immuno-Oncology: Prospects of Individualized Immunotherapy for Pancreatic Cancer. Cancers 2018, 10(2).
Gangjee A, Yu J, McGuire JJ, Cody V, Galitsky N, Kisliuk RL, Queener SF. Design, synthesis, and X-ray crystal structure of a potent dual inhibitor of thymidylate synthase and dihydrofolate reductase as an antitumor agent. J Med Chem. 2000;43(21):3837–51.
Lee SW, Chen TJ, Lin LC, Li CF, Chen LT, Hsing CH, Hsu HP, Tsai CJ, Huang HY, Shiue YL. Overexpression of thymidylate synthetase confers an independent prognostic indicator in nasopharyngeal carcinoma. Exp Mol Pathol. 2013;95(1):83–90.
Formentini A, Henne-Bruns D, Kornmann M. Thymidylate synthase expression and prognosis of patients with gastrointestinal cancers receiving adjuvant chemotherapy: a review. Langenbeck’s Archives Surg. 2004;389(5):405–13.
Qiu LX, Tang QY, Bai JL, Qian XP, Li RT, Liu BR, Zheng MH. Predictive value of thymidylate synthase expression in advanced colorectal cancer patients receiving fluoropyrimidine-based chemotherapy: evidence from 24 studies. Int J Cancer. 2008;123(10):2384–9.
Canale FP, Ramello MC, Núñez N, Araujo Furlan CL, Bossio SN, Gorosito Serrán M, Tosello Boari J, Del Castillo A, Ledesma M, Sedlik C, et al. CD39 Expression Defines Cell Exhaustion in Tumor-Infiltrating CD8(+) T Cells. Cancer Res. 2018;78(1):115–28.
This research was supported by a Grant from the National Natural Science Foundation of China (Grant: 81700012).
Ethics approval and consent to participate
Consent for publication
All authors declare no conflict of interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
The expression of hub MRGs between low and high risk group. *** p < 0.001.
GO (a) and KEGG (b) analysis of hub MRGs.
Comparison of time-dependent ROC curves among the age, gender, Pathological_stage, T_stage, M_stage, N_stage, and prognostic signature. a 0.5-year OS; b 1-year OS; c 2-year OS; d 3-year OS; e 4-year OS; f 5-year OS.
About this article
Cite this article
Wang, Z., Embaye, K.S., Yang, Q. et al. Establishment and validation of a prognostic signature for lung adenocarcinoma based on metabolism‐related genes. Cancer Cell Int 21, 219 (2021). https://doi.org/10.1186/s12935-021-01915-x