NEIL3 may act as a potential prognostic biomarker for lung adenocarcinoma

Background Lung adenocarcinoma (LUAD) is the leading cause of cancer-related death. This study aimed to develop and validate reliable prognostic biomarkers and signature. Methods Differentially expressed genes were identified based on three Gene Expression Omnibus (GEO) datasets. Based on 1052 samples’ data from our cohort, GEO and The Cancer Genome Atlas, we explored the relationship of clinicopathological features and NEIL3 expression to determine clinical effect of NEIL3 in LUAD. Western blotting (22 pairs of tumor and normal tissues), Real-time quantitative PCR (19 pairs of tumor and normal tissues), and immunohistochemical analyses (406-tumor tissues subjected to microarray) were conducted. TIMER and ImmuCellAI analyzed relationship between NEIL3 expression and the abundance of tumor-infiltrating immune cells in LUAD. The co-expressed-gene prognostic signature was established based on the Cox regression analysis. Results This study identified 502 common differentially expressed genes and confirmed that NEIL3 was significantly overexpressed in LUAD samples (P < 0.001). Increased NEIL3 expression was related to advanced stage, larger tumor size and poor overall survival (p < 0.001) in three LUAD cohorts. The proportions of natural T regulatory cells and induced T regulatory cells increased in the high NEIL3 group, whereas those of B cells, Th17 cells and dendritic cells decreased. Gene set enrichment analysis indicated that NEIL3 may activate cell cycle progression and P53 signaling pathway, leading to poor outcomes. We identified nine prognosis-associated hub genes among 370 genes co-expressed with NEIL3. A 10-gene prognostic signature including NEIL3 and nine key co-expressed genes was constructed. Higher risk-score was correlated with more advanced stage, larger tumor size and worse outcome (p < 0.05). Finally, the signature was verified in test cohort (GSE50081) with superior diagnostic accuracy. Conclusions This study suggested that NEIL3 has the potential to be an immune-related therapeutic target and an independent predictor of LUAD prognosis. We also developed a prognostic signature for LUAD with a precise diagnostic accuracy. Supplementary Information The online version contains supplementary material available at 10.1186/s12935-021-01938-4.

LUAD is only 5% because about 57% of patients are diagnosed with advanced stage and metastatic disease [3,4]. Patients with LUAD have improved overall survival (OS) after surgery and radiotherapy when diagnosed sooner. With the development of immune checkpoint inhibitors (atezolizumab, pembrolizumab, nivolumab, etc.), the survival of LUAD patients has significantly improved; thus, such treatments have attracted considerable attention [5,6]. However, limited data are available regarding the relationship between biomarkers and immune responses. Therefore, exploring and identifying effective immunerelated biomarkers for LUAD to reduce the mortality rates and develop innovative targeted therapies remains crucial.
In the past ten years, researchers have conducted new studies and explored meaningful genes using bioinformatics techniques. Here we identified 502 commonly expressed genes from three LUAD datasets (GSE32863, GSE33532, and GSE43458). After an extensive literature review, we found that DNA endonuclease VIII-like 3 (NEIL3) is a very promising DNA glycosylase [7]. DNA glycosylase is one kind of enzymes to initiate the base excision repair (BER) pathway, recognize, and excise damaged bases from DNA [7][8][9]. NEIL3 has a glycolysis domain and apurinic/apyrimidinic lyase activity that excises damaged bases and generates a single-strand break [7,9]. NEIL3 repairs telomere oxidative damage and protects telomere integrity during the S phase to enable accurate chromosome segregation in actively dividing cells [10]. Some findings based on a Neil3−/− mice model indicated that the NEIL3 mutation was linked to impaired B cell function and severe autoimmunity [11]. NEIL3 promotes the proliferation of cardiac fibroblasts and neural progenitor cells in the brain and tends to be overexpressed in cells with a high proliferative capacity, such as cancer cells and those in the bone marrow [8,[12][13][14]. The transcription of NEIL3 as a cell cycle dependent gene is regulated by BRG1 in breast cancer cells [15,16]. Moreover, NEIL3 is highly expressed in primary malignant melanomas, which may induce metastatic tumors [17]. Much evidence suggests that NEIL3 as a DNA repair gene is connected to tumorigenesis, therapy resistance, and poor prognosis in astrocytoma [18,19]. Evidence has shown that NEIL3 participates in regulating the cell proliferation process. However, the association between NEIL3, cancer prognosis, and immune response in LUAD is unclear.
Here we analyzed the correlation between NEIL3 expression and the clinical characteristics, and the prognosis of LUAD patients based on the gene expression and clinicopathology data of three different cohorts [The Cancer Genome Atlas (TCGA), Gene Expression Omnibus (GEO), and our patient group]. We also investigated the relationship between NEIL3 expression and tumorinfiltrating immune cells using Immune Cell Abundance Identifier (ImmuCellAI) and Tumor Immune Estimation Resource (TIMER). To better understand the role of NEIL3 in LUAD development, we performed a gene set enrichment analysis (GSEA) and screened out 370 NEIL3 co-expressed genes. Importantly, nine hub genes were independent predictors for LUAD; thus, we attempted to construct a prognostic signature and validate its prognostic accuracy in other cohorts.

Human tissue samples
Twenty-two pairs of frozen LUAD and adjacent noncancerous tissues obtained from the Affiliated Hospital of Nantong University between March 2018 and June 2019 were subjected to real-time quantitative polymerase chain reaction (RT-qPCR) and western blotting. Fresh tissues were frozen in liquid nitrogen and stored at − 80 °C until the experiment. LUAD tissue microarray and clinical data for 406 tumor samples and 50 normal lung tissue samples were derived from the Pathology Department of the Affiliated Hospital of Nantong University and used as our cohort. The inclusion criteria were as follows: (1) lobectomy and pneumonectomy with mediastinal lymph node dissection or sampling and (2) tumor diagnosed as invasive lung adenocarcinoma by postoperative pathology examination. The exclusion criteria were as follows: (1) no mediastinal lymph node dissection or sampling; (2) a history of other malignancies; (3) miss key clinical information such as age, sex, TNM stage, overall survival, distant and lymph node metastasis. Informed consent was obtained from all patients before the study. The ethical committee of Affiliated Hospital of Nantong University approved the study (number: 2018-L068), which was also conducted according to the Declaration of Helsinki.

Data resources and preprocessing
The GEO cohort included five gene expression datasets (http:// www. ncbi. nlm. nih. gov/ geo). GSE33532, GSE30219, and GSE43458 were used to screen out differentially expressed genes (DEGs) between LUAD and adjacent lung tissues by GEO2R (logFC > 1; adjusted p < 0.001), which included 40 LUAD tissues and 20 adjacent lung tissues, 85 LUAD tissues and 14 adjacent lung tissues and 80 LUAD tissues and 30 adjacent lung tissues, respectively. The GEO cohort also included GSE31210 (226 LUAD tissues and 20 adjacent lung tissues) and GSE50081 (127 LUAD tissues of stage I and II), which were used to explore the relationship between NEIL3 expression and the clinical outcomes of LUAD patients.
We utilized the gene expression profile and clinical information of the TCGA cohort (workflow type: HTSeq-FPKM; https:// portal. gdc. cancer. gov/ proje cts), and determined NEIL3 gene expression using R software (version: 3.5.3) and Strawberry Perl (version: 5.30.2.1). The TCGA cohort included 293 tumor samples and 54 normal lung tissues after the elimination of samples for which key clinical information was missing such as age, sex, tumor-node-metastasis stage (according to the 8th edition of the AJCC TNM staging system), overall survival, distant and lymph node metastasis. Meanwhile cases with a follow-up time of less than 90 days were deleted. Our work was performed in accordance with the TCGA publication requirements.

Functional enrichment analyses
GSEA, a powerful calculation software (http:// softw are. broad insti tute. org/ gsea/ index. jsp) based on Gene Ontology (GO) and the Kyoto Gene and Genomic Encyclopedia (KEGG), was used to investigate the possible biological functions of NEIL3. The cohort of LUAD patients was divided into high and low NEIL3 expression groups by median values. The gene sets used in this work (c2.cp.kegg.v5.2.symbols.gmt) were downloaded from the Molecular Signatures Database (http:// softw are. broad insti tute. org/ gsea/ msigdb/ index. jsp) [20]. We performed functional enrichment of the NEIL3 and co-expressed genes using the Bohao Online Enrichment Tool (http:// enrich. shbio. com/) [21]. When a false discovery rate (FDR) and the nominal p were less than 0.05, the enrichment results were deemed statistically significant.

Immune infiltrates analysis
TIMER is a user-friendly web tool that is used to investigate the molecular characterization of tumor immune system interactions including six major analytic modules (https:// cistr ome. shiny apps. io/ timer/). We evaluated the correlation between NEIL3 expression and the abundance of the six tumor-infiltrating immune subsets in LUAD: B cells, CD4+ T cells, neutrophils, CD8+ T cells, dendritic cells (DCs), and macrophages [22]. Immu-CellAI has a powerful ability for tumor-immune infiltration estimation, especially in the abundance of 18T-cell subsets (http:// bioin fo. life. hust. edu. cn/ ImmuC ellAI# !/) [23]. A gene set signature of TCGA LUAD patients was uploaded to the website. ImmuCellAI predicted the abundance of 24 immune cells in the sample including 18T-cell subsets. We measured the immune response of 24 immune cells to evaluate their association with NEIL3 expression in LUAD and performed "vioplot" package to visualize the data. At values of p < 0.05, the results were considered statistically significant.

Predictive signature construction and risk score calculation
The Cox proportional hazards model is often applied to survival analyses. We selected out nine prognostic signatures from among the 10 hub genes via "Survival" package and Cox regression analysis. We also performed a multivariate Cox regression analysis including NEIL3 and nine prognostic signatures, and constructed a 10-gene prognostic model to evaluate individual survival risk as follows: risk The optimal cutoffs and related specificity and sensitivity from receiver operating characteristic (ROC) curves were determined using a conventional method. Values of p < 0.05 were considered statistically significant.

Western blotting assay
Fresh tissues and protein lysis buffer (including protease inhibitors) were added in homogenizers and grinded fully for extracting proteins. Protein was measured using the bicinchoninic acid method, and separated on a 12% sodium dodecyl sulfate polyacrylamide gel via electrophoresis (cat. #XP00100BOX; Thermo, USA). The protein was then transferred onto polyvinyl difluoride membranes (cat. #88518; Thermo), and the membranes were blocked with 5% skim milk in Tris-buffered saline with Tween-20. After the membranes were incubated with rabbit anti-NEIL3 (1:2000 dilution; cat. #PA5-51022; Thermo) or β-actin overnight at 4 °C (1:10000 dilution; cat. #AM4302; Thermo) and then incubated with anti-rabbit secondary antibodies (1:10,000 dilution; cat. #A32733; Thermo) or anti-mouse secondary antibodies (1:2000 dilution; cat. #A-10654; Thermo) for 1 h. The enhanced chemiluminescence technique was used to develop the signals. The density of the protein bands was quantified by ImageJ software (National Institutes of Health, Bethesda, MD) and normalized to β-actin. Relative protein levels were calculated as the density ratios of interest protein to β-actin.

Statistical analysis
All statistical analyses were performed using R (v.3.5.3) and SPSS version 15.0 (SPSS Inc., Chicago, IL, USA). 22 pairs of tumor and normal tissues were used in Western blotting, which statistical power is 0.8291. 19 pairs of tumor and normal tissues were used in PCR, which statistical power is 0.8130. Statistical power calculations are performed using STATA v.15.0. To evaluate the correlation between NEIL3 expression and the other variables (sex, age, TNM stage, tumor size, distant metastasis, and OS), we performed Spearman correlation test, Chisquare Tests and Wilcoxon/Kruskal-Wallis test. We stratified the cohort into patients with a high or low median NEIL3 expression value or median risk score as the cut-off value. Multivariate Cox regression analysis was conducted to identify independent prognostic factors. Values of p < 0.05 were considered as statistically significant.

NEIL3 overexpression in LUAD
We thoroughly screened the LUAD data in the GEO database and selected three miRNA-sequencing datasets: GSE30219, GSE43458, and GSE33532. We identified 502 common DEGs among them (Fig. 1a), including 126 up-regulated DEGs in cancer tissues (Fig. 1b). After an extensive literature review, we found that NEIL3 is a very promising DNA repair gene. TIMER data showed that NEIL3 expression increased in 19 kinds of tumor tissues compared to adjacent normal tissues, especially in LUAD (Fig. 1c).
Analysis of the NEIL3 gene expression data in the TCGA and GSE31210 cohorts confirmed that NEIL3 is overexpressed in LUAD tissue (Fig. 2a, b). Therefore, RT-qPCR and western blotting were verified that the NEIL3 protein and mRNA levels were up-regulated in 22 LUAD tissues compared with the matched normal tissues (Fig. 2c, d). NEIL3 staining was positive in LUAD tissues but negative in normal lung tissue after immunohistochemistry staining (Fig. 2e). The results consistently showed that NEIL3 was obviously overexpressed in LUAD compared with the matched normal tissues (p < 0.001) (Fig. 2).

Correlation between NEIL3 expression and clinicopathological characteristics of LUAD patients
The characteristics of our 406 LUAD cohort subjects and 293 TCGA LUAD cohort subjects are summarized in Tables 1 and 2. All samples were divided into low and high expression groups by IHC-score and median expression level. The results revealed that NEIL3 overexpression was closely associated with advanced TNM stage (p < 0.05; Fig. 3a) and larger tumor size (p < 0.05; Fig. 3b, c). No significant correlation was noted between NEIL3 expression and other clinicopathological characteristics. Previous research has proved that NEIL3 could protect telomere integrity during the S phase and accurately segregate chromosomes in actively dividing cells [10], which The 126 overexpressed DEGs in lung adenocarcinoma. c NEIL3 expression in various cancer tissues and normal tissues. *p < 0.05, **p < 0.01, ***p < 0.001 may explain the advanced T classification in NEIL3-overexpressing patients.

NEIL3 overexpression predicts poor prognosis in LUAD patients
This study included all follow-up survival information of the GSE50081, GSE31210, TCGA datasets and our cohort. As is shown in Fig. 3d-g, Kaplan-Meier curve and log-rank test analyses indicated that up-regulated NEIL3 expression significantly reduced the OS and RFS of LUAD patients (p < 0.05). All risk factors and NEIL3 expression levels were subjected in the univariate Cox regression analysis (Table 3); of them, these were associated with OS: TNM stage, tumor size, lymph node metastasis, and NEIL3 expression (p < 0.001). Moreover, the multivariate Cox analysis result showed that increased NEIL3 expression [hazard ratio (HR) = 1.638, 95% confidence interval (CI) 1.338-2.005; p < 0.001] and advanced TNM stage (HR = 1.305, 95% CI 1.021-1.669; p < 0.05) were independent predictors of poor prognosis (Table 3 and Fig. 3h).

NEIL3 expression is associated with immune cell infiltration in LUAD
Patients with the same histological type of cancer may have different degree of immune infiltration cells that lead to diverse clinical outcomes [22,25]. The fact that an increased number of tumor-infiltrating lymphocytes in primary tumor tissue relates to good prognosis has been reported in several cancers, including LUAD [26]. The TIMER result showed that NEIL3 expression had a significant negative correlation with the infiltration of B cells, CD4+ T cells, and DCs (p < 0.05; Fig. 4a). The TIMER "Survival" module showed that high infiltrating levels of B cells benefit OS in contrast to NEIL3 expression (p < 0.05; Fig. 4b). We speculated that NEIL3 overexpression could affect OS by regulating the degree of B-cell infiltration in LUAD.
The ImmuCellAI analysis showed that the frequencies of natural T regulatory (nTreg), induced T regulatory (iTreg), Th1 cells, exhausted T cells, NK cells, effector memory and Gamma delta T cells exhibited a positive correlation with NEIL3 expression (p < 0.001), whereas the proportion of CD4 naïve cells, Tregulatory1 (Tr1), Th17, follicular helper T cell (Tfh), NK T, DC, and CD4 T cells exhibited a negative correlation (p < 0.001) (Fig. 4c). This funding suggested that NEIL3 has a regulatory effect on the formation of the LUAD immune microenvironment, especially on T-cell subsets, NK cells, and DCs. Figure 4d shows the correlation between different types of immune cell subsets. Exhausted T cells had the strongest positive correlation with Cytotoxic T lymphocyte (Pearson correlation = 0.73), while Cytotoxic T lymphocyte had the strongest negative correlation with CD4 naïve cells (Pearson correlation = − 0.66). Taken together, these findings indicate that NEIL3 plays a key role in the regulation of immune-infiltrating cells in LUAD.

KEGG and GO enrichment analysis of NEIL3 and co-expressed genes in LUAD
According to FDR < 0.050 and normalized enrichment score, the GSEA analysis revealed that the pathways of the cell cycle, nucleotide excision repair, DNA replication, mismatch repair, and the P53 signaling pathway were enriched in the high NEIL3 expression phenotype (Fig. 5a). The asthma and aldosterone-regulated sodium reabsorption pathways were enriched in the low NEIL3 expression cohort (Fig. 5a). It is worth noting that the cell cycle pathway, playing a crucial part in tumorigenesis and development, is associated with NEIL3 expression.  Subsequently, we analyzed mRNA sequencing data of TCGA cohort and acquired 370 genes co-expressed with NEIL3 (Additional file 3: Table S1, absolute Pearson correlation coefficient > 0.5, p < 0.001). As showed in the protein-protein interaction network (PPI), there were 323 genes (yellow dots) positively related to NEIL3, versus 47 genes (blue dots) negatively related to NEIL3 (Fig. 5b). Next, we performed GO and KEGG enrichment analyses and displayed the top 30 items. The GO analysis indicated that the co-expressed genes were significantly enriched in nuclear cell cycle DNA replication, DNA strand elongation, and DNA replication initiation  in the biological process group; condensed chromosome outer kinetochore and spindle midzone in the cellular component group; 3′-5′ DNA helicase activity in the molecular function group (Fig. 5c). Similarly, the KEGG pathway enrichment analyses showed that these genes were mainly enriched in the cell cycle, oocyte meiosis and DNA replication (Fig. 5d). Taking 370 co-expressed genes into analysis via the Cytoscape software cyto-Hubba plugin, we filtered 10 hub genes according to node degree: TOP2A, CCNA2, BUB1B, CDC45, BUB1, CDK1,  Table 4). Several hub genes (CDK1, CCNA2, CDC45, and CCNB2) played a vital role in cell cycle progression, whereas genomic instability contributed to potentiate tumorigenesis [27].

Establishment and evaluation of prognostic signature for LUAD patients
Based on the TCGA LUAD cohorts, we verified that all 10 of hub genes were conspicuously overexpressed in LUAD tissues versus normal lung tissues (p < 0.05; Additional file 1: Fig. S1). In addition, the high expressions of nine hub genes (all but TOP2A) were remarkably associated with poor prognosis in LUAD patients (p < 0.05; Additional file 2: Fig. S2).

Fig. 4 continued
Next, we attempted to establish a prognostic signature based on the expressions of NEIL3 and the other nine hub genes (Table 4). According to the median risk score (cutoff = 0.926), the LUAD patients were divided into two groups with discrete clinical outcomes for OS. Figure 6a shows the distribution of risk scores in the LUAD dataset. More deaths occurred in the high risk-score group than in the low risk score group (Fig. 6b). Meanwhile, the expressions of these 10 genes were up-regulated in the high-risk group (Fig. 6c). Figure 6d shows that patients in the low-risk group had significantly better OS than others in the Kaplan-Meier analysis (p < 0.05). Analysis of the association between risk core and various clinical features revealed that an increased risk score was correlated with more advanced stage, the larger tumor size and poor outcome (Fig. 6e, f, g).
The univariate and multivariate Cox analyses revealed that the risk score of a 10-gene signature was an independent prognostic factor for LUAD patients (p < 0.05, Fig. 7a, b). After ROC analysis, we observed a marked predictive advantage of the risk-score signature (area under the curve [AUC] = 0.679), stage (AUC = 0.739) and lymph node metastasis (AUC = 0.680) (Fig. 7c).
Finally, we selected a test cohort based on GSE50081 (127 LUAD patients with TNM stage I & II). In line with the TCGA cohort, the risk score of the 10-gene prognostic signature could be an independent prognostic factor for LUAD patients (p < 0.001; Fig. 7e, Table 5). Patients in the high risk score group had poor OS (p < 0.001; Fig. 6d), and the AUC values of the 1-, 3-, and 5-year ROC curves were 0.676, 0.788, and 0.766, respectively (p < 0.001; Fig. 7f ). These evidences strongly suggests that the 10-gene prognostic signature has superior diagnostic accuracy and may benefit LUAD patients in the early stage.

Discussion
LUAD, the most common type of malignant tumors, has significant morbidity and mortality rates [28]. Growing scientific research has focused on searching for effective treatment methods and sensitive biomarkers to improve the 5-year survival rate and life quality of LUAD patients. NEIL3, a DNA glycosylase of the BER pathway, repairs telomere oxidative damage and protects telomere integrity in cells with a high proliferative capacity during the S phase, which may explain the advanced Tumor stage in NEIL3 overexpressing patients [17]. In addition, some studies indicated NEIL3 as a cell cycle dependent gene was regulated by BRG1 in breast cancer cells and that NEIL3 overexpression may facilitate distant metastasis in primary melanoma [15,19]. Tran et al. found NEIL3 was overexpressed in a variety of tumors such as pancreatic adenocarcinoma, lower grade glioma, and kidney papillary cell carcinoma; furthermore, NEIL3 overexpressed tumors accumulate mutation and chromosomal variations [29]. As mentioned in the literature review, NEIL3 plays a crucial role in preventing autoimmunity and cell proliferation [11,13]. Here a comprehensive bioinformatics analysis was performed based on gene transcript profiles of LUAD from TCGA and GEO databases. Combined with the IHC-scores of our 406 patients cohort, NEIL3 expression was up-regulated in LUAD tissues and correlated with clinicopathological characteristics, especially advanced TNM stage and large tumor size. Meanwhile, the Cox regression analysis results demonstrated that NEIL3 may serve as an independent prognostic predictor in LUAD patients.
Over the past decade, immunotherapy has been a well-known promising cancer treatment with amazing achievements in treating various refractory malignancies. The pivotal strategy of immunotherapy is to interfere with immune checkpoints expressed in immune cells [30]. Immune cells are a crucial part of the immune microenvironment, including tumor infiltrating lymphocytes (TILs), tumor-associated macrophages (TAM), dendritic cells (DC), and myeloid-derived suppressor cells (MDSCs) [31,32]. As reported in the immunoediting theory, tumor invasive immune cells (TIICs) play a "double-edged sword" role in the development of cancers. A large number of studies have found the occurrence and development of lung adenocarcinoma not only depends on the lung cancer cells themselves but also is regulated by the tumor-infiltrating immune cells in the lung cancer microenvironment [33].
Based on the TIMER database review, NEIL3 expression had a significant negative correlation with B cells, CD4 + T cells, and DCs. The ImmuCellAI analysis revealed nTreg, iTreg, and Exhausted T cells were increased in the high NEIL3 expression group, whereas Th17 cells, DCs and CD4 + T cells were decreased. Th17 cells play a contradictory role in tumorigenesis and might be associated with secreting cytokines such as IL-17A, (See figure on next page.) Fig. 5 Functional enrichment analysis and protein-protein interaction network of NEIL3 and genes co-expressed with NEIL3. a KEGG signaling pathway enrichment analysis of the high and low NEIL3 expression groups via GSEA software. b The PPI network among genes co-expressed with NEIL3. The yellow dots indicate a positive correlation with NEIL3, whereas the blue dots indicate a negative correlation. c, d KEGG enrichment analysis and GO analysis of the co-expressed genes. e The 10 hub genes among the 370 co-expressed genes identified by the Cytoscape software CytoHubba plugin IL-17F, IL-21, and IFN-γ + Th17 cells as well as effector lymphocytes, including Th1, Tc1, and NK cells [34][35][36]. Treg cells could inhibit the activation of T lymphocytes by secreting cytokines such as IL-4, IL-10, and TGF-β, which regulate the immune function of tumor patients and promote the proliferation of lung cancer cells [37]. Evidence indicates that DCs induced antitumor immunity and inhibit the formation of new blood vessels in tumors in the lung cancer microenvironment [38]. These findings suggest that NEIL3 overexpression could increase the proportion of T regulatory cells and inhibit the antitumor function of Th17 cells and DCs, which predicted a poor OS and advanced TNM stage. Together this evidence demonstrates that NEIL3 plays a pivotal role in the regulation of immune infiltrating cells and could be considered a novel immune-related therapeutic target in LUAD. The molecular mechanism underlying NEIL3 gene and tumor microenvironment is still not known, which is our limitation and next crucial research question.
Equally important, GSEA analysis showed that the cell cycle and P53 signaling pathway were enriched in cases of high NEIL3 expression. As we all know, cell cycle proteins dysregulations is related to uncontrolled proliferation of a malignant tumor and becomes an attractive target in cancer therapy [39]. This work selected nine prognosisassociated hub genes among 370 genes co-expressed with NEIL3: CCNA2, BUB1B, CDC45, BUB1, CDK1, NCAPG, KIF23, UBE2C and CCNB2 (Table 3, Additional file 2: Fig. S2). CDK1, a cyclin-dependent kinase, plays an important part in regulating cell cycle progression and reportedly increases cellular proliferation in various cancers if dysregulated [40]. CCNA2 and CCNB2 are cyclin family proteins. CCNA2 was recognized as an effective prognostic marker in prostate, colon, lung, and liver cancers [41,42]. In line with CCNA2, CCNB2 increased the risk of multiple cancer prognoses such as such as adrenocortical carcinoma [43], lung cancer [44], breast cancer [45], and colorectal adenocarcinoma [46].
Our result showed that NEIL3 and nine hub genes were in a tight co-expressed relationship. Combining  Fig. 6 Relation between signature and cancer risk. a Distribution of risk score. b Proportion of deaths was increased in the high risk score group. c Hierarchical clustering of the 10 genes between the low and high risk groups. Red, up-regulated; green, down-regulated. d-g A higher risk score was remarkably associated with shorter overall survival, poorer clinical outcomes, more advanced TNM stage, and larger tumor size (p < 0.05) the expression levels of the nine hub genes and NEIL3, a 10-gene prognostic signature was constructed to accurately predict the prognostic risk of LUAD patients. Multivariate Cox regression analyses proved that the risk score was an independent prognostic factor correlated with TNM stage and tumor size. Analysis of the test cohort data achieved the same results. As early as 2007, Chen et al. attempted to develop gene signatures correlated with NSCLC clinical outcomes and finally developed a five-gene prognostic signature for NSCLC [47]. Since then, scholars have performed a great number of studies to explore novel gene signature as effective classifier to predict prognosis. Sun et al. [48] identified a immune-related prognostic signature, but the AUC of verification set GSE50081 was only about 0.617, and Wang et al. [49] identified a 4-gene signature, but the average AUC was less than 0.7. In addition, Liu et al. [50] identified a 22-gene signature through analysis of autophagy gene expression. Although the AUC is high, the large number of genes that needs to be detected makes this analysis impractical for clinical use. By contrast, our signature has a high AUC using 10 genes, which makes it conducive to clinical application and provides a theoretical basis for the development of new targeted therapies. NEIL3 expression and the 10-gene signature were proven to be independent predictors for LUAD patients using bioinformatics technology in this study. However, identifying NEIL3 in LUAD cell lines, exploring the underlying tumorigenesis mechanism, and designing novel potent selective NEIL3-targeted drugs require further research.

Conclusion
In summary, this study revealed that NEIL3 was overexpressed in LUAD and identified it as a new diagnostic biomarker for LUAD patients. Increased NEIL3 expression was related to advanced stage and larger tumor size as an independent diagnostic factor of poor prognosis in LUAD patients. Moreover, NEIL3 may play an important role in regulating immune-infiltrating cells such as regulatory T cells and DCs and may affect LUAD cell proliferations. The cell cycle and P53 signaling pathway is the major pathway affected by NEIL3 in LUAD. And finally, we constructed a 10-gene prognostic signature (NEIL3 and nine co-expressed hub genes) to accurately predict prognostic risk of LUAD patients. The results based on the test cohort proved that the 10-gene signature had precise diagnostic accuracy. Meanwhile, NEIL3 could be a promising biomarker for diagnosis and treatment and correlates with immune infiltration in LUAD.