- Research
- Open Access
- Published:
A novel NHEJ gene signature based model for risk stratification and prognosis prediction in hepatocellular carcinoma
Cancer Cell International volume 23, Article number: 59 (2023)
Abstract
Background
Non-homologous DNA end joining (NHEJ) is the predominant DNA double-strand break (DSB) repair pathway in human. However, the relationship between NHEJ pathway and hepatocellular carcinoma (HCC) is unclear. We aimed to explore the potential prognostic role of NHEJ genes and to develop an NHEJ-based prognosis signature for HCC.
Methods
Two cohorts from public database were incorporated into this study. The Kaplan–Meier curve, the Least absolute shrinkage and selection operator (LASSO) regression analysis, and Cox analyses were implemented to determine the prognostic genes. A NHEJ-related risk model was created and verified by independent cohorts. We derived enriched pathways between the high- and low-risk groups using Gene Set Enrichment Analysis (GSEA). CIBERSORT and microenvironment cell populations-counter algorithm were used to perform immune infiltration analysis. XRCC6 is a core NHEJ gene and immunohistochemistry (IHC) was further performed to elucidate the prognostic impact. In vitro proliferation assays were conducted to investigate the specific effect of XRCC6.
Results
A novel NHEJ-related risk model was developed based on 6 NHEJ genes and patients were divided into distinct risk groups according to the risk score. The high-risk group had a poorer survival than those in the low-risk group (P < 0.001). Meanwhile, an obvious discrepancy in the landscape of the immune microenvironment also indicated that distinct immune status might be a potential determinant affecting prognosis as well as immunotherapy reactiveness. High XRCC6 expression level associates with poor outcome in HCC. Moreover, XRCC6 could promote HCC cell proliferation in vitro.
Conclusions
In brief, this work reveals a novel NHEJ-related risk signature for prognostic evaluation of HCC patients, which may be a potential biomarker of HCC immunotherapy.
Introduction
Globally ranking the fourth leading cause of cancer- related death, hepatocellular carcinoma (HCC) is a malignant disease with poor prognosis [1]. Despite continuous progress in diagnosis and treatment, the 5-year survival rate of HCC remains poor due to the spread, metastases and high rate of recurrence. Several studies aimed to construct effective predictive models in HCC previously, including deep learning-based multi-omics model, radiomics model and gene signature models [2,3,4,5,6]. Studies have proved that m6A-related genes, ferroptosis-related genes, and aging-related genes were all associated with cancer prognosis [7, 8]. However, it is still hard to predict patient’s prognosis effectively, highlighting the need to identify HCC biomarkers.
Non-homologous DNA end joining (NHEJ) is the predominant DNA double-strand break (DSB) repair pathway in mammalian cells. NHEJ is also important for B cell and T cell development. Mutation or absence of NHEJ can result in immunodeficiency [9]. Several studies have reported that core NHEJ factors are overexpressed in certain tumor tissues. The dysregulation or hyperactivation of NHEJ machinery has been linked to cancer and resistance to anti-tumor treatment [10]. As such, NHEJ components have emerged as drug targets for cancer therapy [11], and DNA-dependent protein kinase (DNA-PK) inhibitors have entered clinical trials. DNA damage repair factors, like XRCC6, XRCC5, PARP1, were crucial and closely related to cancer development and progression [12, 13]. Whether NHEJ factors-based model is effective for HCC prediction is still unclear and worth to be revealed.
Since that there were only 20% of HCC patients which exhibited response to PD-1/PD-L1 antibody, patients’ stratification and selection is crucial and meaningful [14]. The efficacy of immunotherapy is partly dependent on immune infiltration, especially cytotoxic T cells [15]. Therefore, to identify patients who would benefit from immunotherapy, prediction of immune infiltration is favorable. Here, we also aimed to understand the correlation between NHEJ genes and immune infiltration in HCC.
In this study, we identified six prognosis-related NHEJ factors using the LASSO methods, on basis of which, we subsequently constructed a predictive model for HCC patients. We first used data from The Cancer Genome Atlas (TCGA) to construct an NHEJ-based signature associated with the survival rates of liver hepatocellular carcinoma (LIHC) patients. One of large data from the Gene Expression Omnibus (GEO) ,GSE14520, was then used to validate the predictive ability of this signature. Our model was proved to be effective to predict prognosis of HCC in two independent cohorts. In addition, The NHEJ-related model was confirmed to be associated with tumor immune infiltration and might be used to predict immunotherapy response of HCC.
Methods and materials
Patient data collection
The RNA-seq transcriptome data and clinical information of LIHC patients were extracted from TCGA (https://tcga-data.nci.nih.gov/tcga/) and GSE14520 databases (https://www.ncbi.nlm.nih.gov/geo/). Patients with missing survival data or overall survival (OS) < 30 days, or without definitive histopathological diagnosis were excluded. Patients with OS < 30 days were excluded from analysis, as these patients may have had too advanced disease or complications of treatments. The TCGA dataset (n = 356) served as a training cohort. The GSE14520 dataset (n = 225) was used as the validation cohort. The RNA-seq transcriptome data of TCGA dataset were downloaded in the format of fragments per kilobase of exon model per million mapped reads (FPKM) normalized. The count data of expression array from GSE14520 were acquired by “GEOquery” package. The different gene expression datasets were normalized using the “limma” and “SVA” R packages to remove the potential batch effect. NHEJ-related genes, shown in Supplementary Table S1, were selected and downloaded from hallmark gene sets in the Molecular Signatures Database (MSigDB).
Human HCC samples
The samples of HCC and the paired adjacent normal tissues were obtained from surgical resection at Sun Yat-sen Cancer Center (n = 175, from January 2013 to June 2015). All patients have pathology confirmed diagnosis of HCC. Patients with missing survival data or overall survival (OS) < 30 days were excluded. The tissue microarray was constructed containing a total of 175 pairs of HCC samples and matched adjacent normal tissues. Paired data were analyzed by paired t-test.
Construction and validation of the NHEJ signature
Thirteen NHEJ genes were first subjected to univariate Cox regression analysis (p < 0.05). Following this, the LASSO regression analysis was performed to narrow down the prognostically significant candidate NHEJ genes. Then, multivariate Cox regression analysis was used to determine the best weighting coefficient of each prognostically significant candidate NHEJ genes. The risk score was calculated using the following the equation according to the literature [16]: Risk score = \(\sum (\mathrm{expression level of each target gene }\times \mathrm{ corresponding coefficients})\).
According to the cut-off point of risk scores derived from maximally selected log-rank statistics, LIHC patients in the TCGA training cohort were divided into low and high-risk groups. The Kaplan–Meier method was utilized to estimate OS and the log-rank test was used to compare the differences of OS between the two groups.
To validate the NHEJ signature, the risk score of HCC cases in the GSE14520 dataset were calculated using the same formula as the TCGA cohort. Cases in the validation set were also divided into two groups according to the cut-off point of risk score obtained from the maximally selected log-rank statistics. Survival curves of the low- and the high-risk groups in the validation cohort were also estimated using the Kaplan–Meier method and were compared via the log-rank test.
Functional enrichment analysis
To investigate the potential molecular mechanisms of the NHEJ signature, GSEA were performed in the TCGA and GSE14520 datasets. The analyses of perform Genetic Ontology (GO) term and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway were conducted by GSEA 4.0.1 software. After 1000 permutations, significant enrichment was defined as the pathway with the value of false discovery rate (FDR) < 0.25 and normalized p < 0.05.
Evaluation of the immune landscape
The infiltrating immune cells levels were calculated by CIBERSORT [17] and microenvironment cell populations-counter (MCP)-counter algorithms [18] in each HCC sample and compared between the high-score and low-score groups. The Mann–Whitney U test was performed to compare the differential expression levels of PDCD1, CD274, PDCD1LG2 and CTLA4 between the two risk groups.
Establishment of a predictive nomogram based on the NHEJ signature
Using the TCGA training set, a nomogram integrating the NHEJ signature and clinical stage to predict individual survival was established. In addition, calibration curves and the area under the curve (AUC) for the OS probability at 1, 3, 5 years were plotted to evaluate the predictive accuracy of this nomogram in the TCGA set and the GEO validation set.
Additional information is provided in Additional file 1: Methods S1.
Statistical analysis
Continuous data are shown as the mean ± standard deviation (SD) and were compared using Student’s t-test. Categorical variables were analyzed using the chi-square (χ2) test. Cox regression analyses were performed to determine the significantly independent prognostic factors for OS. A prognostic nomogram model was established using the “rms” R package, while its predictive accuracy was assessed via the creation of calibration curves. Statistical analysis was performed using SPSS (version 22.0) and R software (version 4.0.1). The threshold of statistical significance was set at a p-value < 0.05.
Results
Identification of prognostic NHEJ genes
As represented in the flowchart (Additional file 1: Fig. S1), our study focused on the NHEJ pathway genes. After excluding 31 cases with unsatisfied follow-up or those lacking important clinical information, 343 cases from the TCGA training set were included to identify prognosis-related NHEJ genes. In addition, 229 cases from the GSE14520 dataset were used as verification cohort. Six NHEJ genes, DCLRE1C, FEN1, PRKDC, XRCC4, XRCC5 and XRCC6, were identified to be associated with HCC prognosis using univariate Cox regression (Table 1). All the six genes were negatively correlated with OS of HCC patients, indicating that NHEJ genes might act as oncogenes in HCC. Then the six genes were subjected to LASSO Cox regression, with a significant correlation between and OS at minimum values (Fig. 1A). Further disciplinary regression was performed to take λ.min criteria as independent risk factors for prognosis in patients with HCC (Fig. 1B). Finally, a six-NHEJ risk signature was derived according to 343 LIHC cases in the TCGA dataset, whose risk score was specifically calculated based on a linear combination of gene expression levels and their corresponding regression coefficients from the multivariate Cox analysis. The specific formula was as follows: Risk score = DCLRE1C × 0.249100386163019 + FEN1 × 0.155181162762813 + PRKDC × 0.227886948457229 + XRCC4 × 0.101357794279081 + XRCC5 × 0.204722206320064 + XRCC6 × 0.0928527312741607.
Identification of a prognosis-related NHEJ-based signature in the TCGA training cohort. A LASSO coefficients of prognosis-associated NHEJ genes, each curve represents a gene. B Selection of the optimal candidate genes in the LASSO model. The two dotted vertical lines were drawn at the optimal scores by λ.min criteria and 1-s.e. criteria (At λ.min criteria including all the six genes)
Prognostic Value of the NHEJ Signature in the Training Cohort
The cut-off value of risk scores was determined as 3.84 using the maximally selected log-rank statistics in the TCGA training cohort (Additional file 1: Fig. S2A) to divide the cases into low-risk and high-risk groups. The distribution of the risk score showed that more death events were observed in the high-risk group (Fig. 2A). We found that all six genes were significantly up-regulated in the high-risk group, which was consistent with their prognostic value (Fig. 2B). In the TCGA cohort, the Kaplan–Meier curve suggested that the OS of patients in the low-risk group was significantly longer than that of patients in the high-risk group (p < 0.001; Fig. 2C). Figure 2D showed the results of multivariate Cox regression analysis (HR = 2.50, 95% CI = 1.65–3.78, p < 0.001).
Assessment of prognostic value of the NHEJ signature model in the TCGA training cohort. A. The risk score, survival time and survival status in the training cohort. B The heatmap showing expression profiles of the 6 NHEJ genes. C Kaplan–Meier curves for the OS of patients in the high- and low-risk group. D Multivariate Cox regression analysis of NHEJ genes signature and other clinicopathological factors in the training cohort
Prognostic validation of the NHEJ signature in GSE14520 dataset
According to the risk score based on the maximally selected log-rank statistics, 229 cases were divided into the high- and low-risk groups in the GSE14520 validation cohort (Additional file 1: Fig. S2B). The distribution of the risk score showed that more death events were observed in the high-risk group (Fig. 3A). The six genes were also significantly up-regulated in the high-risk group in the GSE14520 cohort (Fig. 3B). The Kaplan–Meier curve suggested that the OS of patients in the low-risk group was significantly longer than that of patients in the high-risk group (p < 0.001; Fig. 3C). Figure 3D showed the results of multivariate Cox regression analysis (HR = 2.98, 95% CI = 1.49–5.99, p = 0.002).
Assessment of prognostic value of the NHEJ signature model in the GSE14520 validation cohort. A The risk score, survival time and survival status in the validation cohort. B The heatmap showing expression profiles of the 6 NHEJ genes. C Kaplan–Meier curves for the OS of patients in the high- and low-risk group. D Multivariate Cox regression analysis of NHEJ genes signature and other clinicopathological factors in the validation cohort
Functional enrichment analysis
We then performed the GSEA to verify differential pathways between low and high-risk group in order to investigate the underlying functional mechanism. In the high-risk group, KEGG enrichment analysis found that genes were primarily enriched in DNA replication, mismatch repair, homologous recombination, cell cycle, non-homologous end joining in both datasets. By contrast, multiple metabolism pathways were enriched in low-risk group mainly (Table 2, and Additional file 1: Fig. S3). GO enrichment analysis showed similar results and found that genes were primarily enriched in cell cycle DNA replication, double strand break repair in the high-risk group, while multiple metabolism pathways were enriched in low-risk group mainly (Additional file 1: Fig. S3). Full GSEA results are available in Additional file 2: Table S4 and Additional file 3: Table S5.
Tumor immunity landscape in HCC
To conform whether the NHEJ-related signature was associated with immune infiltration and immunotherapy, we employed the CIBERSORT algorithm. In the TCGA cohort, the infiltration of B cells memory, T cells follicular helper and macrophages M0 were significantly higher in the high-risk group. However, in the low-risk group, monocytes and mast cells resting were more abundant (Fig. 4A). In the GSE14520 cohort, the infiltration of B cells naïve, T cells CD4 naïve, T cells gamma delta and NK cells resting were higher in the high-risk group. B cells memory, T cells CD4 memory activated, Tregs, macrophages M0, dendritic cells resting, dendritic cells activating and neutrophils were significantly more abundant in the low-risk group (Fig. 4C). We also performed correlation analysis among the 22 types of immune cells and found notable correlation between immune cells in both cohorts, such as B cells naïve and Plasma cells, CD8 T cells and macrophages M2 (Fig. 4B, D). And cytotoxic lymphocytes, NK cells, neutrophils, endothelial cells were more abundant in the low-risk group and fibroblasts were higher in the high-risk group (Fig. 4E, F). Thus, we proved that our NHEJ signature was closely related to the immune microenvironment.
The results of immune infiltration analyses in the LIHC training cohort and GEO validation cohort. A Violin plot showing differences of infiltrating immune cell types between the low- and the high-risk group of CIBERSORT in TCGA cohort. B Correlation of risk scores and immune cell infiltration in TCGA cohort. C Violin plot showing differences of infiltrating immune cell types between the low- and the high-risk group of CIBERSORT in GSE14520 cohort. D Correlation of risk scores and immune cell infiltration in GSE14520 cohort. E MCP-counter show the differences of 22 types of immune cell infiltrated between the two risk groups in TCGA cohort. F MCP-counter show the differences of 22 types of immune cell infiltrated between the two risk groups in GSE14520 cohort
The expression levels of four immune checkpoint genes were further investigated between the low- and high-risk groups. Compared with HCC patients in the low-risk group, patients in the high-risk group expressed higher levels of PDCD1 and CTLA4 (Fig. 5A–D) in TGGA cohort. Importantly, infiltrating immune cells in the tumor overexpress PDCD1 as a strategy to evade immune responses. In GEO cohort, patients in the high-risk group expressed higher level of CTLA4 (Fig. 5E–H). In both cohorts, the high-risk group had a higher expression level of CTLA4. Data from different data base-derived analyses may have the regional heterogeneity of HCC. This could account for upregulation of PDCD1 in TCGA cohort but no difference in GEO cohort. Collectively, our results suggested an immunosuppressive landscape in HCC of high-risk group.
Predictive nomogram construction
To construct a predictive nomogram, we performed the multivariate Cox analysis and found that the risk score of the NHEJ signature and tumor stage were independent risk factors of OS in both training cohort and validation cohort as shown in Fig. 2D and Fig. 3D. These independently associated risk factors were used to form a nomogram (Fig. 6A). The resulting model was internally validated using the bootstrap validation method. The nomogram demonstrated good accuracy, with an unadjusted C index of 0.69 (95% CI, 0.64–0.75) in the training cohort and 0.75 (95% CI, 0.68–0.82) in the validation cohort. In addition, calibration plots graphically showed good agreement between the risk estimation by the nomogram and actual survival information (Fig. 6B, C). The AUC indicated that our nomogram was more effective to predict OS of HCC patients than the sole tumor stage in both training cohort and validation cohort (Fig. 6D, E). Therefore, our risk score-based nomogram was effective to predict HCC survival.
Development of a nomogram based on NHEJ genes signature for predicting OS of patients in TCGA cohort and GEO cohort. A The nomogram plot integrating NHEJ genes risk score, and stage. B The calibration plot for the probability of 1-, 3-, and 5-years OS in the TCGA training cohort. C Time ROC curves nomogram-based OS prediction in the TCGA training cohort. D The calibration plot for the probability of 1-, 3-, and 5-years OS in the GSE14520 cohort. E Time ROC curves nomogram-based OS prediction in the GSE14520 cohort
Deeply validation of the negative value of XRCC6 via basic exploration
Among six NHEJ genes, XRCC6 exhibited the highest expression level in TCGA RNA-seq data and was one of the core genes in NHEJ pathway. Also, XRCC6 expression was significantly elevated in HCC and elevated XRCC6 correlated with a worse OS in TCGA cohort (Fig. 7A, B). We performed IHC staining in 175 paired peritumor and tumor samples to verify the results. The representative IHC staining images of XRCC6 are shown in Fig. 7C. The protein level of XRCC6 in tumor tissues were significantly higher than paired peritumor tissues (Fig. 7D). Prognostic analysis showed that elevated XRCC6 correlated with a worse OS and progression-free survival (PFS) in the SYSUCC (n = 175) cohort (Fig. 7E, F). The expression of XRCC6 in HCC cells was also detected, which was up-regulated in most HCC cell lines (Additional file 1: Fig.S4A). PLC/PRF/5 cells exhibited the highest expression of XRCC6. Knockdown of XRCC6 was performed to explore its role in cell proliferation (Additional file 1: Fig.S4B, C) and suppressed HCC cell proliferation was observed in XRCC6 knockdown group (Fig. 7G).
Validation of XRCC6 upregulation in HCC samples and clinical associations. A XRCC6 levels have the highest expression among the six NHEJ genes from TCGA RNA-seq data. B OS rate of HCC patients categorized according to median XRCC6 expression in TCGA cohort. C The representative images of IHC stain of XRCC6 in peritumor and tumor samples. D The IHC score of XRCC6 in paired peritumor and tumor samples (n = 175 pairs) in SYSUCC cohort. E OS rate of HCC patients categorized according to median XRCC6 expression in SYSUCC cohort. F PFS rate of HCC patients categorized according to median XRCC6 expression in SYSUCC cohort. G Effects of sh-XRCC6 on proliferation abilities of PLC/PRF/5 cells measured by the CCK-8 assay
Discussion
Considering the increase in data-driven biological research, and ease of access to corresponding data from public databases, many studies concentrated on the relationship between RNA-seq data of specific gene sets and individual outcomes with the of numerous public databases [19]. For examples, pyroptosis-associated, platelet-related and ferroptosis-related genes were reported to predict prognosis and demonstrate immune infiltration in HCC patients [20,21,22]. However, there have not yet been studies of NHEJ-related genes for predicting the prognosis of HCC patients.
To clarify the relationship between NHEJ genes and the prognoses of patients with HCC, we constructed a novel prognostic risk score based on six NHEJ genes: DCLRE1C, FEN1, PRKDC, XRCC4, XRCC5 and XRCC6. The risk score was used to stratify HCC patients into tow risk categories and predicted their prognosis based on the TCGA database, and then was validated in the GEO cohort. The risk score was confirmed to be an independent prognostic factor for OS according to the multivariate Cox regression analysis. Further, a predictive nomogram based on NHEJ signature was developed and validated. Moreover, we found that this NHEJ risk signature was significantly related to different antitumor immune cell infiltration levels in the tumor microenvironment of HCC.
The connection between NHEJ and DSBs is widely accepted, as a conserved pathway to repair DSBs [23]. The genomic instability is an evolving hallmark of cancer [24] and the failure of DNA repair leads to a subsequent accumulation of mutations as well as structural aberrations, usually generating particularly aggressive tumors. It’s reported that disruption of NHEJ process can drive genomic instability and accelerate the development of HCC [25]. However, failure to repair DSBs can result in increased instability and cell death through apoptosis, an essential mechanism for removing pre-cancerous cells [26]. During tumor progression or on therapy-induced tumor evolution, the DDR machinery including NHEJ pathway can be reconstituted to cope with increased replication stress and elevated levels of endogenous DNA damage [27]. In this study, we identified three NHEJ genes (FEN1, PRKDC and XRCC6) were upregulated in liver cancer tissue based on TCGA data. Besides, six of total thirteen NHEJ genes were associated with poor prognosis. Noteworthy, PRKDC is with 2.1% the sixth most frequently mutated DNA repair gene in all cancers and is identified as a candidate driver of hepatocarcinogenesis or therapy resistance, exhibiting frequent copy number gains [28, 29]. A large population-based study in Taiwan Province of China shows that XRCC6 may play an important role in HCC carcinogenesis [30]. As a result of our study, GO and KEGG enrichment analysis indicated that cell cycle and DNA replication were both significantly enriched in the high-risk group. Consistently, XRCC6 knockdown suppressed cell proliferation in vitro. Perhaps these results suggest that HCC cells become more dependent on NHEJ mechanisms to survive, proliferate and acquire resistance to treatments.
Immunotherapy of cancer has been the last major breakthrough in the fight against cancer [31]. More recently, immune checkpoint inhibition (ICI) has emerged as a first-line treatment for advanced HCC [32]. Indeed, ICIs have largely improved the prognosis of patients with intermediate and advanced HCC. However, not all patients benefit from immunotherapy and most patients would eventually experience disease progression. Thus, predictive biomarkers of ICI response are urgently needed to guide treatment decision and patient selection. The tumor microenvironment (TME) of HCC is a complex and spatially structured mixture of hepatic non-parenchymal resident cells, tumor cells, immune cells and tumor-associated fibroblasts [33]. Tumor-associated macrophages (TAMs) have a key role in cancer-related inflammation and immune response/immune escape [34]. In the present study, the high-risk group had higher proportions of M0 macrophages. While analyzed with MCP-counter algorithm, fibroblasts were higher in the high-risk group. As the most abundant components of tumor stroma, cancer-associated fibroblasts (CAFs) have been involved in the progression of liver cancer. Numerous studies have reported that CAFs promote tumor immune escape by influencing the proportion and activity of tumor immune microenvironment (TIME) [35]. NK cells play a vital role in immune monitoring to prevent the development and progression of cancer. NK cell-based anti-HCC therapeutic approaches are becoming increasingly attractive [36]. We also observed that infiltrating proportions of NK cells were apparently higher in low-risk patients. Moreover, the high-risk group had a higher expression level of CTLA4. Taken together, these results revealed an immunosuppressive TME in high-risk group patients. Therefore, our results suggest that the risk score could provide a basis for immunotherapy to screen patients who respond to ICI treatment.
There are several limitations in our study. Firstly, we found that NHEJ-based risk model was closely related to the TIME of HCC patients. However, we failed to include and analyze immunotherapy cohorts to explore whether the model could predict its efficacy. Secondly, although the immune cell composition was calculated based on various algorithm, it is still inaccurate compared with IHC and flow cytometry. Thirdly, we proved the function of XRCC6 in HCC cells, the other NHEJ factors should also be verified in HCC. Lastly, the biological function of XRCC6 was just verified in vitro, further animal experiments should be performed in the future.
In conclusion, we constructed a risk signature and nomogram to predict prognosis and tumor immune infiltration of HCC in two independent cohorts with high accuracy. The NHEJ risk model has the potential to be used as a biomarker to develop more individualized treatment for HCC patients.
Data availability statement
Publicly available datasets were analyzed in this study. This data can be found here: TCGA databases (https://tcga-data.nci.nih.gov/tcga/) and the GEO databases (https://www.ncbi.nlm.nih.gov/geo/). In accordance with the journal’s guidelines, the data presented in this study are available on request from the corresponding author.
References
Sung H, Ferlay J, Siegel RL, Laversanne M, Soerjomataram I, Jemal A, Bray F. Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin. 2021;71:209–49.
Chaudhary K, Poirion OB, Lu L, Garmire LX. Deep learning-based multi-omics integration robustly predicts survival in liver cancer. Clin Cancer Res. 2018;24:1248–59.
Nam JY, Sinn DH, Bae J, Jang ES, Kim JW, Jeong SH. Deep learning model for prediction of hepatocellular carcinoma in patients with HBV-related cirrhosis on antiviral therapy. JHEP Rep. 2020;2: 100175.
Gan X, Luo Y, Dai G, Lin J, Liu X, Zhang X, Li A. Identification of gene signatures for diagnosis and prognosis of hepatocellular carcinomas patients at early stage. Front Genet. 2020;11:857.
Ji GW, Zhu FP, Xu Q, Wang K, Wu MY, Tang WW, Li XC, et al. Machine-learning analysis of contrast-enhanced CT radiomics predicts recurrence of hepatocellular carcinoma after resection: a multi-institutional study. EBioMedicine. 2019;50:156–65.
Xu X, Zhang HL, Liu QP, Sun SW, Zhang J, Zhu FP, Yang G, et al. Radiomic analysis of contrast-enhanced CT predicts microvascular invasion and outcome in hepatocellular carcinoma. J Hepatol. 2019;70:1133–44.
Liang JY, Wang DS, Lin HC, Chen XX, Yang H, Zheng Y, Li YH. A novel ferroptosis-related gene signature for overall survival prediction in patients with hepatocellular carcinoma. Int J Biol Sci. 2020;16:2430–41.
Tang R, Zhang Y, Liang C, Xu J, Meng Q, Hua J, Liu J, et al. The role of m6A-related genes in the prognosis and immune microenvironment of pancreatic adenocarcinoma. PeerJ. 2020;8: e9602.
Zhao B, Rothenberg E, Ramsden DA, Lieber MR. The molecular basis and disease relevance of non-homologous DNA end joining. Nat Rev Mol Cell Biol. 2020;21:765–81.
Sishc B, Davis A. The role of the core non-homologous end joining factors in carcinogenesis and cancer. Cancers. 2017;9:81.
Kefala Stavridi A, Appleby R, Liang S, Blundell TL, Chaplin AK. Druggable binding sites in the multicomponent assemblies that characterise DNA double-strand-break repair through non-homologous end joining. Essays Biochem. 2020;64:791–806.
Zhang N, Zhang Y, Qian H, Wu S, Cao L, Sun Y. Selective targeting of ubiquitination and degradation of PARP1 by E3 ubiquitin ligase WWP2 regulates isoproterenol-induced cardiac remodeling. Cell Death Differ. 2020;27:2605–19.
Bau DT, Tsai CW, Wu CN. Role of the XRCC5/XRCC6 dimer in carcinogenesis and pharmacogenomics. Pharmacogenomics. 2011;12:515–34.
Zongyi Y, Xiaowu L. Immunotherapy for hepatocellular carcinoma. Cancer Lett. 2020;470:8–17.
Braun DA, Hou Y, Bakouny Z, Ficial M, Sant’ Angelo M, Forman J, Ross-Macdonald P, et al. Interplay of somatic alterations and immune infiltration modulates response to PD-1 blockade in advanced clear cell renal cell carcinoma. Nat Med. 2020;26:909–18.
Huang HY, Wang Y, Wang WD, Wei XL, Gale RP, Li JY, Zhang QY, et al. A prognostic survival model based on metabolism-related gene expression in plasma cell myeloma. Leukemia. 2021;35:3212–22.
Newman AM, Liu CL, Green MR, Gentles AJ, Feng W, Xu Y, Hoang CD, et al. Robust enumeration of cell subsets from tissue expression profiles. Nat Methods. 2015;12:453–7.
Becht E, Giraldo NA, Lacroix L, Buttard B, Elarouci N, Petitprez F, Selves J, et al. Estimating the population abundance of tissue-infiltrating immune and stromal cell populations using gene expression. Genome Biol. 2016;17:218.
Zhai WY, Duan FF, Chen S, Wang JY, Lin YB, Wang YZ, Rao BY, et al. A novel inflammatory-related gene signature based model for risk stratification and prognosis prediction in lung adenocarcinoma. Front Genet. 2021;12: 798131.
Li X, Zhao K, Lu Y, Wang J, Yao W. Genetic analysis of platelet-related genes in hepatocellular carcinoma reveals a novel prognostic signature and determines PRKCD as the potential molecular bridge. Biol Proced Online. 2022;24:22.
Zhang Y, Ren H, Zhang C, Li H, Guo Q, Xu H, Cui L. Development and validation of four ferroptosis-related gene signatures and their correlations with immune implication in hepatocellular carcinoma. Front Immunol. 2022;13:1028054.
Li G, Zhang D, Liang C, Liang C, Wu J. Construction and validation of a prognostic model of pyroptosis related genes in hepatocellular carcinoma. Front Oncol. 2022;12:1021775.
Singh SK, Roy S, Choudhury SR, Sengupta DN. DNA repair and recombination in higher plants: insights from comparative genomics of arabidopsis and rice. BMC Genomics. 2010;11:443.
Negrini S, Gorgoulis VG, Halazonetis TD. Genomic instability–an evolving hallmark of cancer. Nat Rev Mol Cell Biol. 2010;11:220–8.
Saha J, Bae J, Wang SY, Lu H, Chappell LJ, Gopal P, Davis AJ. Ablating putative Ku70 phosphorylation sites results in defective DNA damage repair and spontaneous induction of hepatocellular carcinoma. Nucleic Acids Res. 2021;49:9836–50.
Kosicki M, Tomberg K, Bradley A. Repair of double-strand breaks induced by CRISPR–Cas9 leads to large deletions and complex rearrangements. Nat Biotechnol. 2018;36:765–71.
Trenner A, Sartori AA. Harnessing DNA double-strand break repair for cancer treatment. Front Oncol. 2019;9:1388.
Chae YK, Anker JF, Carneiro BA, Chandra S, Kaplan J, Kalyan A, Santa-Maria CA, et al. Genomic landscape of DNA repair genes in cancer. Oncotarget. 2016;7:23312–21.
Cornell L, Munck JM, Alsinet C, Villanueva A, Ogle L, Willoughby CE, Televantou D, et al. DNA-PK—a candidate driver of hepatocarcinogenesis and tissue biomarker that predicts response to treatment and survival. Clin Cancer Res. 2015;21:925–33.
Hsu CM, Yang MD, Chang WS, Jeng LB, Lee MH, Lu MC, Chang SC, et al. The contribution of XRCC6/Ku70 to hepatocellular carcinoma in Taiwan. Anticancer Res. 2013;33:529–35.
Galon J, Bruni D. Tumor immunology and tumor evolution: intertwined histories. Immunity. 2020;52:55–81.
Leslie J, Mackey JBG, Jamieson T, Ramon-Gil E, Drake TM, Fercoq F, Clark W, et al. CXCR2 inhibition enables NASH-HCC immunotherapy. Gut. 2022;71:2093–106.
Sangro B, Sarobe P, Hervás-Stubbs S, Melero I. Advances in immunotherapy for hepatocellular carcinoma. Nat Rev Gastroenterol Hepatol. 2021;18:525–43.
Kusano T, Ehirchiou D, Matsumura T, Chobaz V, Nasi S, Castelblanco M, So A, et al. Targeted knock-in mice expressing the oxidase-fixed form of xanthine oxidoreductase favor tumor growth. Nat Commun. 2019;10:4904.
Peng H, Zhu E, Zhang Y. Advances of cancer-associated fibroblasts in liver cancer. Biomarker ReS. 2022;10:59.
Parameswaran R, Ramakrishnan P, Moreton SA, Xia Z, Hou Y, Lee DA, Gupta K, et al. Repression of GSK3 restores NK cell cytotoxicity in AML patients. Nat Commun. 2016;7:11154.
Acknowledgements
The authors thank all participants for their important contribution and all public databases for providing data.
Funding
This work was supported by grants from the National Natural Science Foundation of China (No. 82103601).
Author information
Authors and Affiliations
Contributions
ZL and JLQ designed the project. ZL, ZKH, YXS and YCY performed the statistical analysis and figures output. ZL and ZKH contributed to manuscript writing. YN, BKL, and YFY helped with part of the English correction. JLQ critically revised the final manuscript. All authors read and approved the final maunscript.
Corresponding author
Ethics declarations
Ethics approval and consent to participate
The study was conducted in accordance with the Declaration of Helsinki, and approved by the Ethics Committee of Sun Yat-sen University Cancer Center (GZR2021-166). Written informed consent has been obtained from the patients to publish this paper.
Consent for publication
All authors read and approved the publication of the final manuscript.
Competing interests
The authors declare no competing interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Additional file 1: Methods S1.
Table S1. NHEJ gene set from MSigDB (http://www.gsea-msigdb.org). Table S2. Antibodies included in the study. Table S3. The primers and shRNA used in present study. Figure S1. Flow chart of data collection and analysis. Figure S2. Assessment of prognostic value of the NHEJ signature model in the TCGA and GEO cohort. Figure S3. Gene set enrichment analysis between the high- and low-risk subgroups in TCGA training cohort and the GEO validation cohort. Figure S4. Validation of XRCC6 upregulation in HCC samples and clinical associations.
Additional file 2: Table S4.
Gene set enrichment analysis between the high- and low-risk subgroups in TCGA training cohort.
Additional file 3:
Table S5. Gene set enrichment analysis between the high- and low-risk subgroups in GEO validation cohort.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
About this article
Cite this article
Lin, Z., Huang, Z., Shi, Y. et al. A novel NHEJ gene signature based model for risk stratification and prognosis prediction in hepatocellular carcinoma. Cancer Cell Int 23, 59 (2023). https://doi.org/10.1186/s12935-023-02907-9
Received:
Accepted:
Published:
DOI: https://doi.org/10.1186/s12935-023-02907-9
Keywords
- Hepatocellular carcinoma
- Non-homologous DNA end joining
- Immune infiltrated landscape
- Prognostic model
- Risk stratification