Skip to main content

Up-regulation of CTD-2547G23.4 in hepatocellular carcinoma tissues and its prospective molecular regulatory mechanism: a novel qRT-PCR and bioinformatics analysis study



Dysregulated expression of long non-coding RNAs (lncRNAs) has been reported in the pathogenesis and progression of multiple cancers, including hepatocellular carcinoma (HCC). LncRNA CTD-2547G23.4 is a novel lncRNA, and its role in HCC is still unknown. Here, we aimed to clarify the expression pattern and clinical value of CTD-2547G23.4 and to investigate the prospective regulatory mechanism via bioinformatics analysis in HCC.


To identify differentially expressed lncRNAs in HCC, we downloaded RNA-Seq data for HCC and adjacent non-tumour tissues via The Cancer Genome Atlas (TCGA). CTD-2547G23.4 was selected by using the R language and receiver operating characteristic curve analysis. Furthermore, we validated the differential expression of CTD-2547G23.4 via Gene Expression Omnibus (GEO), ArrayExpress, Oncomine databases and quantitative real-time polymerase chain reaction (qRT-PCR). The relationship between the CTD-2547G23.4 level and clinic pathological parameters was also assessed. To further probe the role of CTD-2547G23.4 in HCC cell cycle, lentivirus-mediated small interfering RNA was applied to silence CTD-2547G23.4 expression in Huh-7 cell line. In addition, the related genes of CTD-2547G23.4 gathered from The Atlas of Noncoding RNAs in Cancer (TANRIC) database and Multi Experiment Matrix (MEM) were assessed with Gene Ontology (GO), Kyoto Encyclopedia of Genes and Genomes, Protein Analysis Through Evolutionary Relationships and protein–protein interaction (PPI) networks.


CTD-2547G23.4 expression was remarkably higher in 370 HCC tissue samples than that in adjacent non-tumour liver tissues (48.762 ± 27.270 vs. 14.511 ± 8.341, P < 0.001) from TCGA dataset. The relative expression level of CTD-2547G23.4 in HCC was consistently higher than that in adjacent non-cancerous tissues (2.464 ± 0.833 vs. 1.813 ± 0.784, P = 0.001) as assessed by real time RT-qPCR. The area under the curve of the summary receiver operating characteristic curve was 0.8720 based on TCGA, qRT-PCR and GEO data. Further analysis indicated that the increased expression levels of CTD-2547G23.4 were associated with the neoplasm histologic grade and vascular tumour cell type. The expression of CTD-2547G23.4 was significantly downregulated in CTD-2547G23.4 knockdown cells. Moreover, cell cycle analysis revealed that CTD-2547G23.4 depletion in Huh-7 cell line led to S phase arrest. Furthermore, 314 related genes identified by TANRIC and MEM databases were processed with a pathway analysis. The bioinformatics analysis indicated that CTD-2547G23.4 might play a key role in the progress of HCC through four hub genes, SRC, CREBBP, ADCY8 and PPARA.


Collectively, we put forward the hypothesis that the novel lncRNA CTD-2547G23.4 may act as an exceptional clinical index and promote the HCC tumourigenesis and progression via various related genes.


Hepatocellular carcinoma (HCC) is currently a highly prevalent human cancer, which is correlated with high mortality all over the world [1]. Moreover, the number of new cases of HCC is increasing with each passing year. The number of HCC patients in the United States is predicted to reach approximately 27,000 by 2020 [2]. The risk factors of HCC primarily include cirrhotic livers and chronic liver injury caused by virus infections [3]. Although developments in the screening and treatment of HCC have been rapid, the clinical outcome is still limited due to frequent recurrence and metastasis regulated by the activation of multiple signal transduction pathways [4,5,6]. Prior to our study, extensive research has focused on new biomarkers associated with HCC diagnosis, prognosis, and evaluation of treatment efficacy [7,8,9,10,11]. However, satisfactory biomarkers and therapeutic related genes of HCC are still rare and sought-after. Therefore, the identification of a novel reliable biomarker and further investigation of the molecular mechanisms for HCC, which could boost the diagnostic value and survival prediction, are imperative.

Long noncoding RNAs (lncRNAs) are transcriptional RNA molecules with little protein-coding capacity that are longer than 200 nucleotides [12]. Lately, new lncRNAs have been extensively discovered. Fortunately, studies on lncRNAs and their roles in various pathophysiological processes have created new avenues for cancer diagnosis and therapies [13]. LncRNA SPRY4-IT1 has been shown to promote proliferation and invasion of HCC through activating EZH2 and may be used as a new treatment biomarker [14]. Furthermore, Lv et al. [15] reported that lncRNA Unigene56159, as a ceRNA of miR-140-5p, thus promotes the migration and invasion of HCC cells. Despite the fact that several lncRNAs have been demonstrated to be indispensable in the biological process of HCC, screening novel lncRNAs which could be excellent diagnostic and prognostic markers are still urgently needed.

For the sake of exploring the novel lncRNAs which could serve as an appropriate clinical index and investigate the potential mechanisms responsible for HCC, CTD-2547G23.4 (ENSG00000274925, Exons: 1, Coding exons: 0, Transcript length: 3115 bps), a lncRNA, has not yet been explored to play a potential role in the molecular mechanism of cancers, was finally randomly chosen due to its good diagnostic value and its unknown biological function. Here, we proposed to uncover the mystery of CTD-2547G23.4.

Materials and methods

TCGA dataset and analysis of the differentially expressed lncRNAs

TCGA project provides RNA sequencing (RNA-Seq) data from 370 HCC cases and 50 adjacent liver tissue samples. The publicly available RNA-Seq data were downloaded directly from the TCGA portal ( via bulk download mode of the liver hepatocellular carcinoma (LIHC) (cancer type), RNASeqV2 (data type), and level 3 (data level) cancer tissues collected by the end of December 8, 2016. The lncRNA expression data were displayed as HTSeq-Counts. Next, analysis was carried out using the DESeq package in R language to compare the lncRNA expression data from HCC and their adjacent non-tumour tissues. Differentially expressed lncRNAs between HCC tissues and the adjacent non-tumour tissues were selected based on the following criteria: Padj < 0.05 and an absolute log2FC > 1. Visualization of the identified differentially expressed lncRNAs is shown in the form of a volcano plot that was created by using the ggplot2 package. Receiver operating characteristic (ROC) curve analyses were performed to identify the distinguishing capability of cancer from non-cancerous tissues. Based on the AUC values, the lncRNA CTD-2547G23.4, was finally selected for in-depth investigation.

Verification of CTD-2547G23.4 expression based on other databases

We also collected RNA-Seq or chip datasets from Gene Expression Omnibus (GEO) (, ArrayExpress ( and Oncomine ( databases. The following keywords were used: (“lncRNA” OR “lncRNAs”) AND (malignan* OR cancer OR tumour OR tumour OR neoplas* OR carcinoma) AND (hepatocellular OR liver OR hepatic OR HCC). We extracted all of the expression data for CTD-2547G23.4 and the clinical parameters from the online public databases.

Confirmation of CTD-2547G23.4 expression based on clinical samples

Hepatocellular carcinoma tissue samples and their adjacent normal liver tissues were gathered from 39 HCC patients from the First Affiliated Hospital of Guangxi Medical University, People’s Republic of China from January, 2012 to August, 2013. All formalin-fixed, paraffin-embedded (FFPE) clinical samples were acquired from the surgical resection of HCC patients who had not received tumour-specific therapy prior to surgery. The diagnoses were independently confirmed by two pathologists (Hai-wei Liang and Gang Chen), and the adjacent non-cancer hepatic tissues located at least 2 cm away from the macroscopically unaffected margins of the tumour were confirmed as being without cancer by microscopic analysis. The Ethics Committee of First Affiliated Hospital of Guangxi Medical University approved the protocol, and written informed consent was provided by HCC patients involved.

RNA extraction and quantitative real-time PCR

Total RNA was extracted from 39 HCC samples and corresponding adjacent non-tumour tissues using the Qiagen RNeasy FFPE Kit following the manufacturer’s protocol as previously reported [16, 17]. The quantification of CTD-2547G23.4 and glyceraldehyde-3-phosphate dehydrogenase (GAPDH) was performed by Applied Biosystems PCR7900. The sequences of the CTD-2547G23.4 primer were as follows: F: TTTGGTCTCTTCGGGTCATC, R: CTTAGCTGGACGCCTACCTG. For GAPDH, the primers were: F: GTAAGACCCCTGGACCACCA, R: CAAGGGGTCTACATGG CAACT. The expression were calculated using the 2−ΔCt method.

Cell culture

Huh-7, SMMC-7721, HepG2, Bel-7404 and HL-7702 cell lines, were all purchased from the Cell Bank of Chinese Academy of Sciences. Five human HCC cell lines were cultured in Dulbecco’s modified Eagle’s medium (DMEM, HyClone, Logan, UT), mixed with 10% fetal bovine serum at 37 °C in the presence of 5% CO2.

Lentivirus construction and transfection

To further probe the role of CTD-2547G23.4 in HCC cell cycle, lentivirus-mediated siRNA was applied to silence CTD-2547G23.4 expression in Huh-7 cell line. According to the CTD-2547G23.4 sequence from Ensemble (ENSG00000274925), three different target siRNA sequences were designed and named as knock down-1 (KD-1) (5′-CAGCCTTCATTCAGTGGCCAT-3′), KD-2 (5′-AGGCATTGACCAAGAAAGTAA-3′), KD-3 (5′-TCGCTTGAGGTCAGAAGTTTA-3′), with a negative control (NC) siRNA target sequence (5′-TTCTCCGAACGTGTCACGT-3′). Above-mentioned siRNA fragments were cloned into a GV248 vector (Shanghai GeneChem, China).

Cell cycle analysis

The fluorescence intensity of propidium directly detected by flow cytometry, which reflected the distribution status of DNA in the G0/G1, S and G2/M phases. Cells were seeded in 6 cm dish at a density of 4 ml/well. According to the manufacturer’s protocol, cells with 80% confluence were stained with propidium after lentivirus infection for 5 days, and determined by a flow cytometer (Guava easyCyte HT, Millipore).

Statistical analyses for the clinical implication of CTD-2547G23.4

Statistical analyses were performed utilizing the SPSS 24.0 statistical software package (Chicago, IL, USA), graphs and curves were constructed with the GraphPad Prism 7 software (GraphPad Software, San Diego, CA, USA). The quantitative values were expressed as the mean ± SD (range). Student’s t test was adopted for the comparison of two independent groups. We used ROC and summary ROC (SROC) curves to examine the feasibility of using the CTD-2547G23.4 expression level as a value for screening or detecting HCC. Then, the overall SMD with a 95% CI was assessed with STATA software version 12.0 (StataCorp, College Station, TX, USA). An observed SMD > 0 with a 95% CI not crossing zero indicated that CTD-2547G23.4 had a higher expression level in HCC tissues than in adjacent non-tumour tissues. The heterogeneity across datasets was analysed with the I2 statistics method. A P value less than 0.05 or an I2 more than 50% was considered to indicate a heterogeneous dataset, in which a random-effects model would be used for pooling data. Otherwise, a fixed-effects model was employed. All statistical tests were two-sided. The statistical results were considered to be significant with a P value less than 0.05.

Prospective related genes of CTD-2547G23.4 in HCC

The related genes of CTD-2547G23.4 were accumulated based on two online prediction databases: TANRIC ( and MEM ( databases. TANRIC database is a web resource of the Bioinformatics and Computational Biology department used to explore the interactions of lncRNAs in cancer and comprises experimentally supported mRNA related genes [18]. MEM is a gene expression query and visualization tool from a web-based multi-experiment that gathers substantial publicly usable gene expression data from the ArrayExpress database [19]. To identify more reliable pathways and gene network, we examined the intersecting genes from the above two sets of TANRIC and MEM.

Gene-enrichment and functional annotation analysis

To further understand the potential mechanism of CTD-2547G23.4 in HCC, the Database for Annotation, Visualization and Integrated Discovery (DAVID, was used to perform a GO enrichment analysis, KEGG and PANTHER pathway annotations. GO terms, KEGG and PANTHER pathways with P < 0.05 were considered significant. The enrichment map of the annotation analysis was generated by Cytoscape v3.4.0 to visualize the results.

PPI network construction analysis

The protein-to-protein (PPI) network was created by using the STRING software v10.0 ( and was drawn to reveal the connection among the overlapping related genes. The amount of nodes and edges were used to identify the most essential related genes for CTD-2547G23.4 in HCC. Hub genes were identified according to the numerical digit of the degrees of each of the nodes and edges. A P value less than 0.05 was regarded to be statistically significant. Pearson’s correlation coefficient was utilized to disclose the relationship between CTD-2547G23.4 and the expression level of hub genes.


Differentially expressed lncRNAs in HCC

Among all the available 60,244 mRNAs expression data downloaded from TCGA portal, 7589 lncRNA were involved. After calculation mentioned above, 441 significantly differentially expressed lncRNAs (334 lncRNAs up-regulated and 107 lncRNAs down-regulated) that met the criteria of log2FC > 1 and Padj < 0.05 were obtained (Figs. 1, 2). Subsequently, the diagnostic value of all 441 lncRNAs were analysed by ROC curve analyses. Among them, 46 lncRNAs with AUC values more than 0.900 (data not shown). We conducted the literature research on these lncRNAs and noticed that no investigation were reported concerning part of lncRNAs. CTD-2547G23.4 was finally randomly selected due to its significantly differential expression and unknown biological possesses according to literature.

Fig. 1
figure 1

Flow chart of the study procedures in this investigation

Fig. 2
figure 2

Volcano plot of the differentially expressed lncRNAs between HCC and adjacent non-tumour tissues. The volcano plot was created with the ggplot2 package of R language. The X axis indicates a log2 (fold change), and the Y axis indicates an − log10 (P value). Red represents high expression and green low expression. Black shows the lncRNA expression with both the logFC < 1 and − log10 (P value) < 0.05. Differentially expressed lncRNAs were calculated by DESeqR with 334 overexpressed lncRNAs and 107 underexpressed lncRNAs

The crucial role of CTD-2547G23.4 in the occurrence and progression of HCC evidenced by multiple databases

The expression level of CTD-2547G23.4 in HCC via various databases

We first assessed the extracted CTD-2547G23.4 data from TCGA database. The data indicated that CTD-2547G23.4 expression was higher in the 370 HCC samples (48.762 ± 27.270) than in adjacent non-tumour tissues (14.511 ± 8.341) (Fold change = 3.370, P < 0.001; Table 1, Fig. 3a). Subsequently, in other online databases (GEO, ArrayExpress and Oncomine databases), only two chip data (GSE27462 and GSE49713) were obtained from the GEO database, which provided CTD-2547G23.4 expression value in HCC tissues and adjacent non-tumour tissues (Fig. 3c, e). However, the expression of CTD-2547G23.4 did not differ significantly between HCC and adjacent non-tumour tissues in either microarrays. We also examined the relative expression of CTD-2547G23.4 in 39 pairs of HCC tissues matched with adjacent non-tumour tissues by qRT-PCR analysis normalized to GAPDH. The expression level of CTD-2547G23.4 was 2.464 ± 0.833 in HCC tissues, which was predominantly elevated than that in the adjacent non-cancerous tissues (1.813 ± 0.784) (P = 0.001, Table 2, Fig. 3g). To draw a comprehensive conclusion, we integrated the data from TCGA, GEO and in-house PCR using a meta-analysis (Additional file 1: Figure S1). The pooled SMD of CTD-2547G23.4 was 0.730 (95% CI − 0.400 to 1.860, P = 0.206; I2 = 92.0%, P < 0.001, Additional file 2: Figure S2A) by the random-effects model. Then, we omitted the GSE27462 and GSE49713 datasets due to small sample sizes, the overall result demonstrated that CTD-2547G23.4 was remarkably up-regulated in HCC (SMD = 1.470, 95% CI 0.190–2.740, P = 0.024; I2 = 95.0%, P < 0.001, Additional file 2: Figure S2C). Collectively, the above results certified that CTD-2547G23.4 was evidently elevated in HCC.

Table 1 Relationship between CTD-2547G23.4 expression and clinical parameters in TCGA
Fig. 3
figure 3

Relative level of CTD-2547G23.4 expression in HCC and adjacent non-cancerous tissues. a TCGA data were used to measure CTD-2547G23.4 expression in HCC tissues and matched adjacent non-tumour tissues. b Significance of CTD-2547G23.4 in HCC with the ROC curve analysis from TCGA. c CTD-2547G23.4 expression value in HCC tissues and adjacent non-tumour tissues from GSE27462. d ROC curve of CTD-2547G23.4 in HCC from GSE27462. e CTD-2547G23.4 expression level in HCC tissues and adjacent non-tumour tissues from GSE49713. f ROC curve of CTD-2547G23.4 in HCC from GSE49713. g qRT-PCR was used to measure CTD-2547G23.4 expression in HCC tissues and matched adjacent non-tumour tissues. h Diagnostic value of CTD-2547G23.4 in HCC with the ROC curve analysis by qRT-PCR analysis

Table 2 Relationship between CTD-2547G23.4 expression and clinical parameters in qRT-PCR

Further verification of CTD-2547G23.4 up-regulation in HCC by SROC

To further identify the capability of CTD-2547G23.4 in distinguishing cancer from non-cancerous liver tissues, ROC and SROC curve analyses were carried out. The AUC of CTD-2547G23.4 from TCGA data was 0.927 (P < 0.001, cut-off > 22.776, Fig. 3b). The AUC of CTD-2547G23.4 from GSE27462 and GSE49713 were 0.640 (P = 0.480, cut-off > 5.543, Fig. 3d) and 0.280 (P = 0.251, cut-off > 5.680, Fig. 3f) respectively. The AUC of CTD-2547G23.4 from the 39 HCC patients was 0.721 (P < 0.001, cut-off > 2.460, Fig. 3h). From meta-analysis, the AUC of SROC was 0.816 (95% CI 0.654–0.976) (Additional file 2: Figure S2B). The pooled sensitivity, specificity, positive likelihood ratio (PLR), negative likelihood ratio (NLR), diagnostic odds ratio (DOR) of CTD-2547G23.4 in these studies were 0.840 (95% CI 0.800–0.880), 0.67 (95% CI 0.570–0.760), 2.15 (95% CI 0.750–6.160), 0.31 (95% CI 0.130–0.740) and 8.96 (95% CI 1.930–41.560), respectively (Fig. 4a–e). Then, we omitted the GSE49713 dataset due to its low capability to distinguish the cancer from para-cancerous tissues. The AUC of SROC was 0.8690 (95% CI 0.758–0.980) (Additional file 2: Figure S2D). Based on the described above, CTD-2547G23.4 was up-regulated in HCC.

Fig. 4
figure 4

Forest plots exhibit diagnostic performance of CTD-2547G23.4 in HCC. a The pooled sensitivity. b The pooled specificity. c The pooled positive likelihood ratio. d The pooled negative likelihood ratio. e The pooled diagnostic odds ratio based upon the eligible datasets

Clinical implication of CTD-2547G23.4 in the progression of HCC

Considering the potential oncogenic role of CTD-2547G23.4 in the progression of HCC, we analysed the relationship between clinicopathological. Significantly different expression values of CTD-2547G23.4 were observed between high and low neoplasm histologic grades. The obvious difference also occurred between negative and positive vascular tumour cell types (Table 1). Regarding neoplasm histologic grade, CTD-2547G23.4 expression levels were overexpressed in samples with GIII + GIV vs. GI + GII (54.617 ± 27.838 vs. 45.739 ± 26.533, P = 0.003, Table 1). Samples with vascular tumour cell type versus without vascular tumour cell type had up-regulated CTD-2547G23.4 level (53.590 ± 31.448 vs. 45.899 ± 25.301, P = 0.028, Table 1). However, expression of CTD-2547G23.4 did not significantly correlate with other clinicopathological features. Other online databases did not provide specific clinicopathological parameters data. Simultaneously, we also investigated relationship between CTD-2547G23.4 expression and clinicopathological significance using PCR data. No significant difference concerning the CTD-2547G23.4 expression was detected between different groups of clinical parameters (Table 2), which could be probably due to insufficient size of cases we collected.

Knockdown of CTD-2547G23.4 arrests the cell cycle of HCC cells

To explore the potential role of CTD-2547G23.4 in HCC. We firstly detected the expression level of CTD-2547G23.4 in five HCC cell lines (Huh-7, SMMC-7721, HepG2, BEL-7404 and HL-7702) by qRT-PCR and discovered that CTD-2547G23.4 was widely expressed and subsequently chose Huh-7 cell line for the following investigation due to the highest expression level among five HCC cell lines (Fig. 5). To further probe the role of CTD-2547G23.4 in HCC cell cycle, lentivirus-mediated small interfering RNA (siRNA) was applied to silence CTD-2547G23.4 expression in Huh-7 cell line. The relative expression of CTD-2547G23.4 was significantly downregulated in CTD-2547G23.4 knockdown (KD) 2 group of Huh-7 cell line (0.270 ± 0.038 vs. 1.000 ± 0.087, P < 0.001, Fig. 6). Moreover, cell cycle analysis revealed that CTD-2547G23.4 depletion in Huh-7 cell line led to S phase arrest (Fig. 7).

Fig. 5
figure 5

Relative expression level of CTD-2547G23.4 in five HCC cell lines

Fig. 6
figure 6

Knockdown (KD) of CTD-2547G23.4 using lentivirus-mediated siRNA in Huh-7 cell line. a Microscopic analysis of Huh-7 cells 72 h following infection with lentivirus-mediated siRNA (NC negative control, GFP green fluorescent protein). b qRT-PCR analysis of CTD-2547G23.4 relative expression level in Huh-7 cells

Fig. 7
figure 7

Percentage of cells in G0/G1, S and G2/M phase of the cell cycle (*P < 0.05)

Related genes of CTD-2547G23.4 and gene-annotation enrichment analysis

The online TANRIC software collected 2045 related genes, and we identified more than 5002 genes from the MEM database. We eventually identified 314 overlapping genes (Fig. 8a) and the DAVID analysis was implemented to identify GO annotations and KEGG and PANTHER pathways. “Regulation of system process” was the most significantly enriched biological process (BP) (P = 0.002, Table 3). According to the cellular component (CC) analysis, genes mostly assembled at the neuronal projections (P < 0.001, Table 3). The genes from GO molecular functions (MFs) were enriched in metal ion binding (P < 0.001, Table 3). The three most significantly enriched annotations of GO categories were GO:0043005, GO:0044057 and GO:0046872 (Fig. 8b). In addition, the related genes of CTD-2547G23.4 in the KEGG enrichment analysis were shown to be particularly related to long-term potentiation (P < 0.001, Table 4, Fig. 8c) with seven genes (ADCY8, GRIN2C, CREBBP, GRIN2A, CALML5, ITPR3, CAMK2A) (Fig. 8d). The PANTHER pathway was mainly enriched in the heterotrimeric G-protein signalling pathway (P = 0.004, Table 4).

Fig. 8
figure 8

The enriched annotation pathways analysis of potential genes targeted by CTD-2547G23.4 in HCC. a Venn diagram of the overlap between the number of predicted target genes using the MEM and TANRIC databases. b The significantly enriched annotation of the Gene Ontology (GO) categories. c The significantly enriched annotation of the KEGG and PANTHER pathway. d KEGG pathway map illustrating the long-term potentiation signalling pathway in humans by DAVID 6.8 (

Table 3 Gene Ontology (GO) analysis of the potential targets of CTD-2547G23.4
Table 4 KEGG and PANTHER pathway analyses of the validated targets of CTD-2547G23.4

PPI network construction and module analysis

The PPI network contained 301 nodes and 88 edges (Fig. 9a). Among these genes, the degree values of more than 2 were defined as being indicative of hub genes (Fig. 9b). Sarcoma (SRC, degree = 13), cyclic AMP responsive element-binding protein (CREBBP, degree = 11), adenylate cyclase 8 (ADCY8, degree = 6) and peroxisome proliferator activated receptor alpha (PPARA, degree = 6) were the four most symbolic hub genes. We found that the PPARA and CREBBP expression value in HCC tissues was clearly down-regulated than in matched non-tumour tissues (P < 0.01, Fig. 10). The SRC expression level in HCC tissues was obviously up-regulated than in matched non-tumour tissues (P < 0.001, Fig. 10). The AUC values of PPARA, SRC, CREBBP and ADCY8 were 0.700 (P < 0.001), 0.693 (P < 0.001), 0.649 (P < 0.001) and 0.532 (P = 0.471), respectively. Through Pearson’s correlation analysis, PPARA (r = − 0.169, P = 0.001), SRC (r = 0.149, P = 0.004), CREBBP (r = 0.149, P = 0.004), and ADCY8 (r = − 0.254, P < 0.001) were significantly related to CTD-2547G23.4 in TCGA (Fig. 10).

Fig. 9
figure 9

Protein–protein interaction (PPI) network analysis. a PPI network of CTD-2547G23.4 target genes in HCC. b The 30 most significant potential hub genes targeted by CTD-2547G23.4 in HCC

Fig. 10
figure 10

Relative expression level of 4 hub genes in HCC and adjacent non-cancerous tissues from TCGA. a PPARA, d SRC, g CREBBP, j ADCY8. Diagnostic value of CTD-2547G23.4 in HCC with the ROC curve analysis. b PPARA, e SRC, h CREBBP, k ADCY8. Correlation analysis between CTD-2547G23.4 and hub genes through Pearson’s correlation. c PPARA, f SRC, i CREBBP, l ADCY8


The current study, to the best of our knowledge, was the first to investigate a novel lncRNA CTD-2547G23.4 in HCC, which was significantly up-regulated in HCC samples. We found that increased CTD-2547G23.4 expression was associated with the neoplasm histologic grade and vascular tumour cell type, and these clinic pathological characteristics could be reprehensive of poor outcome. In addition, SROC curve analysis demonstrated that CTD-2547G23.4 provided high diagnostic performance for the detection of HCC, with an AUC 0.8156 based on TCGA, GSE27462, GSE49713 and qRT-PCR data. Moreover, we also carried out a sequential in silico prediction for the related genes of CTD-2547G23.4 in HCC and observed that CTD-2547G23.4 targeted hub genes related to the tumourigenesis and development of HCC. Based on these findings, we proposed that CTD-2547G23.4 may be a candidate clinical index and exert an oncogenic role in the progression of HCC.

To date, an accumulating number of investigations have identified that aberrant lncRNA expression levels often play a crucial role in the biological process of HCC. For instance, the lncRNA HULC, which is highly up-regulated in HCC, should serve as a scaffold for ERK and YB-1 to enhance hepatocarcinogenesis [20]. Liu J et al. [21] determined that the lncRNA SNHG20 was significantly overexpressed in HCC tissues. Interestingly, its high expression level leads to EMT-induced HCC cell invasion via binding to the enhancer of EZH2. HOTAIR is a long non-coding RNA that is overexpressed in HCC tissues and drives HCC cell proliferation and progression [22]. These findings could also afford new insight into the molecular biological mechanisms of HCC that depend on lncRNAs.

More importantly, HCC is not frequently diagnosed until the late stage. Therefore, the identification of a suitable biomarker with good diagnostic performance is urgent. Alpha-fetoprotein (AFP) has been used as the most common clinical screening and diagnosis method for HCC. AFP has been reported to have a sensitivity from 39 to 64% and specificity from 76 to 91% [23]. The suboptimal sensitivity and specificity of AFP are probably due to a variety of factors. Recently, the versatile diagnostic role of lncRNAs in various cancer types, including HCC, has attracted the attention of many scholars. A meta-analysis of ten studies with 820 HCC patients and 785 healthy controls determined that lncRNAs had a high diagnostic significance for HCC, and their expression could theoretically be used as auxiliary biomarkers for confirmation of HCC [24]. For example, the overexpression of the lncRNA HULC in HCC could act as a promising biomarker for detecting and screening hepatocarcinogenesis [25, 26]. All of these studies suggest that lncRNA CTD-2547G23.4 may serve as a predictor for HCC diagnosis. Herein, we carried out SMD and SROC curve analyses. Simultaneously, the heterogeneity between the AUC of the two cohorts may be owing to the different methods used for the evaluation of the expression levels of CTD-2547G23.4 and the different number of samples.

We have demonstrated that CTD-2547G23.4 is overexpressed in HCC and might have potential diagnostic value for HCC patients. To investigate the clinical value of CTD-2547G23.4 in HCC diagnosis and prognosis, we analysed the relationship between the expression of CTD-2547G23.4 and clinic pathological characteristics. Surprisingly, through analysing the expression data from TCGA datasets, we observed that elevated expression of CTD-2547G23.4 was significantly related to a high neoplasm histologic grade and vascular tumour cell type. The histologic grade often reflects the tumour growth and invasion rate and predicts the clinical outcome [27]. Vascular tumour cell type is also closely related to the growth of tumours. These observations suggest that against CTD-2547G23.4 might be more effective in tumours of high histological grade and effective after resection in the prevention of recurrences. Furthermore, the varying degrees of the CTD-2547G23.4 expression may limit the effect of various types of therapy.

Protein–protein interaction network evaluation assisted to identify 30 hub genes that were the core related genes of CTD-2547G23.4 in HCC. Among these 30 hub genes, SRC, CREBBP, ADCY8, and PPARA were the four core genes. We next focused on the roles and functions of SRC, CREBBP and PPARA in HCC for deepgoing discussion.

The role of SRC, as the proto-oncogene encoding a tyrosine kinase, has been studied in multiple tumours for many years [28, 29]. Similarly, many studies have indicated that SRC is involved in various signalling pathways of HCC [30, 31]. Zhao et al. [32] demonstrated that SRC was markedly elevated in HCC tissues and related to clinical stage, pathological differentiation, and the status of lymph node metastasis. In addition, SRC was discovered to enhance HCC cell invasion and metastasis via phosphorylating the EGFR pathway [33]. Moreover, the underlying mechanism of EF24-suppressed invasion and migration of hepatocellular carcinoma has been shown to be through inducing the phosphorylation of SRC [34]. Thus, SRC is highly involved in HCC. But, the association between SRC and CTD-2547G23.4 has not been reported. As we predicted, SRC is the most significant hub gene of CTD-2547G23.4 in HCC. In-depth studies are essential to validate this correlation between SRC and CTD-2547G23.4 in HCC.

CREBBP is a transcriptional co-activator with an essential function in the liver through its regulation of gene expression and diverse processes such as gluconeogenesis, lipid metabolism, and cell proliferation [35]. Lately, some studies have shown that CREBBP is associated with apoptosis in HCC. Chen et al. [36] found that the CREB pathway was partly involved in tumour apoptosis caused by N-butylidenephthalide. Moreover, Abramovitch et al. [37] demonstrated that CREBBP played a central role in the anti-apoptotic effect in HCC through in vitro and in vivo experiments.

The PPARA gene encodes PPAR-alpha, which is a transcription factor that maintains hepatic metabolic homeostasis through the hepatocyte nuclear factor-4 alpha (HNF4A) gene [38]. The abnormal stimulation of PPAR has also been reported to generate HCC. In addition, Drakaki and his colleagues have proven that miR-9 plays key roles in the early stages of HCC oncogenesis through direct regulation of PPARA [39]. Yamasaki investigated cell proliferation in fenofibrate-treated HCC cells and elucidated that fenofibrate induced an antiproliferative effect via a PPAR-alpha-dependent mechanism [40]. These findings suggest that PPARA is vital in HCC and may be a molecular target for therapy. Due to these findings and the results of the PPI network analysis, we put forward a hypothesis that CTD-2547G23.4 modulates the process of HCC via targeting PPARA. Nevertheless, target gene in silico prediction algorithms have limited specificity. Therefore, further in vitro and in vivo assays of CTD-2547G23.4 potential biological function in those signaling pathways in HCC is essential to verify and illuminate the regulative mechanisms of CTD-2547G23.4 in HCC. Further investigations are required to fully elucidate this hypothesis.


In conclusion, we have demonstrated high lncRNA CTD-2547G23.4 expression in HCC and analysed the related genes and pathways of CTD-2547G23.4 through bioinformatics methods. CTD-2547G23.4 could target hub genes such as SRC, CREBBP and PPARA, which regulate the biological process in HCC. Our study has provided the first demonstration that lncRNA CTD-2547G23.4 could be a useful clinical index and furnished insights into the better understanding of the potential mechanism of CTD-2547G23.4 in HCC.



hepatocellular carcinoma


long non-coding RNA


The Cancer Genome Atlas


Gene Expression Omnibus


receiver operating characteristic


The Atlas of Noncoding RNAs in Cancer


Multi Experiment Matrix


Gene Ontology


biological process


cellular component


molecular function


Kyoto Encyclopedia of Genes and Genomes


Protein Analysis Through Evolutionary Relationships


protein–protein interaction


  1. Siegel RL, Miller KD, Jemal A. Cancer statistics, 2017. CA Cancer J Clin. 2017;67(1):7–30.

    Article  PubMed  Google Scholar 

  2. Vitale A, Peck-Radosavljevic M, Giannini EG, Vibert E, Sieghart W, Van Poucke S, Pawlik TM. Personalized treatment of patients with very early hepatocellular carcinoma. J Hepatol. 2017;66(2):412–23.

    Article  PubMed  Google Scholar 

  3. Bosetti C, Turati F, La Vecchia C. Hepatocellular carcinoma epidemiology. Best Pract Res Clin Gastroenterol. 2014;28(5):753–70.

    Article  PubMed  Google Scholar 

  4. Tao Y, Hu K, Tan F, Zhang S, Zhou M, Luo J, Wang Z. SH3-domain binding protein 1 in the tumor microenvironment promotes hepatocellular carcinoma metastasis through WAVE2 pathway. Oncotarget. 2016;7(14):18356–70.

    Article  PubMed  PubMed Central  Google Scholar 

  5. Jia Q, Dong Q, Qin L. CCN: core regulatory proteins in the microenvironment that affect the metastasis of hepatocellular carcinoma? Oncotarget. 2016;7(2):1203–14.

    Article  PubMed  Google Scholar 

  6. Yang N, Li S, Li G, Zhang S, Tang X, Ni S, Jian X, Xu C, Zhu J, Lu M. The role of extracellular vesicles in mediating progression, metastasis and potential treatment of hepatocellular carcinoma. Oncotarget. 2017;8(2):3683–95.

    PubMed  Google Scholar 

  7. Yang H, Zhang X, Cai XY, Wen DY, Ye ZH, Liang L, Zhang L, Wang HL, Chen G, Feng ZB. From big data to diagnosis and prognosis: gene expression signatures in liver hepatocellular carcinoma. PeerJ. 2017;5:e3089.

    Article  PubMed  PubMed Central  Google Scholar 

  8. Zhang X, Tang W, Chen G, Ren F, Liang H, Dang Y, Rong M. An encapsulation of gene signatures for hepatocellular carcinoma, microRNA-132 predicted target genes and the corresponding overlaps. PLoS ONE. 2016;11(7):e0159498.

    Article  PubMed  PubMed Central  Google Scholar 

  9. Li JJ, Luo J, Lu JN, Liang XN, Luo YH, Liu YR, Yang J, Ding H, Qin GH, Yang LH, et al. Relationship between TRAF6 and deterioration of HCC: an immunohistochemical and in vitro study. Cancer Cell Int. 2016;16:76.

    Article  PubMed  PubMed Central  Google Scholar 

  10. He R, Gao L, Ma J, Peng Z, Zhou S, Yang L, Feng Z, Dang Y, Chen G. The essential role of MTDH in the progression of HCC: a study with immunohistochemistry, TCGA, meta-analysis and in vitro investigation. Am J Transl Res. 2017;9(4):1561–79.

    PubMed  PubMed Central  Google Scholar 

  11. Ye ZH, Gao L, Wen DY, He Y, Pang YY, Chen G. Diagnostic and prognostic roles of IRAK1 in hepatocellular carcinoma tissues: an analysis of immunohistochemistry and RNA-sequencing data from the cancer genome atlas. OncoTargets Ther. 2017;10:1711–23.

    Article  Google Scholar 

  12. Fatica A, Bozzoni I. Long non-coding RNAs: new players in cell differentiation and development. Nat Rev Genet. 2014;15(1):7–21.

    CAS  Article  PubMed  Google Scholar 

  13. Li HM, Yang H, Wen DY, Luo YH, Liang CY, Pan DH, Ma W, Chen G, He Y, Chen JQ. Overexpression of LncRNA HOTAIR is associated with poor prognosis in thyroid carcinoma: a study based on TCGA and GEO data. Horm Metabol Res. 2017;49(5):388–99.

    CAS  Article  Google Scholar 

  14. Zhou M, Zhang XY, Yu X. Overexpression of the long non-coding RNA SPRY4-IT1 promotes tumor cell proliferation and invasion by activating EZH2 in hepatocellular carcinoma. Biomed Pharmacother. 2017;85:348–54.

    CAS  Article  PubMed  Google Scholar 

  15. Lv J, Fan HX, Zhao XP, Lv P, Fan JY, Zhang Y, Liu M, Tang H. Long non-coding RNA Unigene56159 promotes epithelial–mesenchymal transition by acting as a ceRNA of miR-140-5p in hepatocellular carcinoma cells. Cancer Lett. 2016;382(2):166–75.

    CAS  Article  PubMed  Google Scholar 

  16. Guo S, Chen W, Luo Y, Ren F, Zhong T, Rong M, Dang Y, Feng Z, Chen G. Clinical implication of long non-coding RNA NEAT1 expression in hepatocellular carcinoma patients. Int J Clin Exp Pathol. 2015;8(5):5395–402.

    CAS  PubMed  PubMed Central  Google Scholar 

  17. Pan LJ, Zhong TF, Tang RX, Li P, Dang YW, Huang SN, Chen G. Upregulation and clinicopathological significance of long non-coding NEAT1 RNA in NSCLC tissues. Asian Pac J Cancer Prev APJCP. 2015;16(7):2851–5.

    Article  PubMed  Google Scholar 

  18. Li J, Han L, Roebuck P, Diao L, Liu L, Yuan Y, Weinstein JN, Liang H. TANRIC: an interactive open platform to explore the function of lncRNAs in cancer. Cancer Res. 2015;75(18):3728–37.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  19. Adler P, Kolde R, Kull M, Tkachenko A, Peterson H, Reimand J, Vilo J. Mining for coexpression across hundreds of datasets using novel rank aggregation and visualization methods. Genome Biol. 2009;10(12):R139.

    Article  PubMed  PubMed Central  Google Scholar 

  20. Li D, Liu X, Zhou J, Hu J, Zhang D, Liu J, Qiao Y, Zhan Q. Long noncoding RNA HULC modulates the phosphorylation of YB-1 through serving as a scaffold of extracellular signal-regulated kinase and YB-1 to enhance hepatocarcinogenesis. Hepatology. 2017;65(5):1612–27.

    CAS  Article  PubMed  Google Scholar 

  21. Liu J, Lu C, Xiao M, Jiang F, Qu L, Ni R. Long non-coding RNA SNHG20 predicts a poor prognosis for HCC and promotes cell invasion by regulating the epithelial-to-mesenchymal transition. Biomed Pharmacother. 2017;89:857–63.

    CAS  Article  PubMed  Google Scholar 

  22. Su DN, Wu SP, Chen HT, He JH. HOTAIR, a long non-coding RNA driver of malignancy whose expression is activated by FOXC1, negatively regulates miRNA-1 in hepatocellular carcinoma. Oncol Lett. 2016;12(5):4061–7.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  23. Erdal H, Gul Utku O, Karatay E, Celik B, Elbeg S, Dogan I. Combination of DKK1 and AFP improves diagnostic accuracy of hepatocellular carcinoma compared with either marker alone. Turk J Gastroenterol. 2016;27(4):375–81.

    Article  PubMed  Google Scholar 

  24. Zheng C, Hao H, Chen L, Shao J. Long noncoding RNAs as novel serum biomarkers for the diagnosis of hepatocellular carcinoma: a systematic review and meta-analysis. Clin Transl Oncol. 2017;19(8):961–8.

    CAS  Article  PubMed  Google Scholar 

  25. Li J, Wang X, Tang J, Jiang R, Zhang W, Ji J, Sun B. HULC and Linc00152 act as novel biomarkers in predicting diagnosis of hepatocellular carcinoma. Cell Physiol Biochem. 2015;37(2):687–96.

    CAS  Article  PubMed  Google Scholar 

  26. Xie H, Ma H, Zhou D. Plasma HULC as a promising novel biomarker for the detection of hepatocellular carcinoma. Biomed Res Int. 2013;2013:136106.

    PubMed  PubMed Central  Google Scholar 

  27. Tamura S, Kato T, Berho M, Misiakos EP, O’Brien C, Reddy KR, Nery JR, Burke GW, Schiff ER, Miller J, et al. Impact of histological grade of hepatocellular carcinoma on the outcome of liver transplantation. Arch Surg. 2001;136(1):25–30 (discussion 31).

    CAS  Article  PubMed  Google Scholar 

  28. Dehm SM, Bonham K. SRC gene expression in human cancer: the role of transcriptional activation. Biochem Cell Biol. 2004;82(2):263–74.

    CAS  Article  PubMed  Google Scholar 

  29. Frame MC. Src in cancer: deregulation and consequences for cell behaviour. Biochem Biophys Acta. 2002;1602(2):114–30.

    CAS  PubMed  Google Scholar 

  30. Ku CY, Wang YR, Lin HY, Lu SC, Lin JY. Corosolic acid inhibits hepatocellular carcinoma cell migration by targeting the VEGFR2/Src/FAK pathway. PLoS ONE. 2015;10(5):e0126725.

    Article  PubMed  PubMed Central  Google Scholar 

  31. Sun CK, Man K, Ng KT, Ho JW, Lim ZX, Cheng Q, Lo CM, Poon RT, Fan ST. Proline-rich tyrosine kinase 2 (Pyk2) promotes proliferation and invasiveness of hepatocellular carcinoma cells through c-Src/ERK activation. Carcinogenesis. 2008;29(11):2096–105.

    CAS  Article  PubMed  Google Scholar 

  32. Zhao R, Wu Y, Wang T, Zhang Y, Kong D, Zhang L, Li X, Wang G, Jin Y, Jin X, et al. Elevated Src expression associated with hepatocellular carcinoma metastasis in northern Chinese patients. Oncol Lett. 2015;10(5):3026–34.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  33. Zhao S, Li H, Wang Q, Su C, Wang G, Song H, Zhao L, Luan Z, Su R. The role of c-Src in the invasion and metastasis of hepatocellular carcinoma cells induced by association of cell surface GRP78 with activated alpha2M. BMC Cancer. 2015;15:389.

    Article  PubMed  PubMed Central  Google Scholar 

  34. Zhao R, Tin L, Zhang Y, Wu Y, Jin Y, Jin X, Zhang F, Li X. EF24 suppresses invasion and migration of hepatocellular carcinoma cells in vitro via inhibiting the phosphorylation of Src. Biomed Res Int. 2016;2016:8569684.

    PubMed  PubMed Central  Google Scholar 

  35. Servillo G, Della Fazia MA, Sassone-Corsi P. Coupling cAMP signaling to transcription in the liver: pivotal role of CREB and CREM. Exp Cell Res. 2002;275(2):143–54.

    CAS  Article  PubMed  Google Scholar 

  36. Chen YL, Jian MH, Lin CC, Kang JC, Chen SP, Lin PC, Hung PJ, Chen JR, Chang WL, Lin SZ, et al. The induction of orphan nuclear receptor Nur77 expression by n-butylenephthalide as pharmaceuticals on hepatocellular carcinoma cell therapy. Mol Pharmacol. 2008;74(4):1046–58.

    CAS  Article  PubMed  Google Scholar 

  37. Abramovitch R, Tavor E, Jacob-Hirsch J, Zeira E, Amariglio N, Pappo O, Rechavi G, Galun E, Honigman A. A pivotal role of cyclic AMP-responsive element binding protein in tumor progression. Cancer Res. 2004;64(4):1338–46.

    CAS  Article  PubMed  Google Scholar 

  38. Contreras AV, Rangel-Escareno C, Torres N, Aleman-Escondrillas G, Ortiz V, Noriega LG, Torre-Villalvazo I, Granados O, Velazquez-Villegas LA, Tobon-Cornejo S, et al. PPARalpha via HNF4alpha regulates the expression of genes encoding hepatic amino acid catabolizing enzymes to maintain metabolic homeostasis. Genes Nutr. 2015;10(2):452.

    Article  PubMed  Google Scholar 

  39. Drakaki A, Hatziapostolou M, Polytarchou C, Vorvis C, Poultsides GA, Souglakos J, Georgoulias V, Iliopoulos D. Functional microRNA high throughput screening reveals miR-9 as a central regulator of liver oncogenesis by affecting the PPARA-CDH1 pathway. BMC Cancer. 2015;15:542.

    Article  PubMed  PubMed Central  Google Scholar 

  40. Yamasaki D, Kawabe N, Nakamura H, Tachibana K, Ishimoto K, Tanaka T, Aburatani H, Sakai J, Hamakubo T, Kodama T, et al. Fenofibrate suppresses growth of the human hepatocellular carcinoma cell via PPARalpha-independent mechanisms. Eur J Cell Biol. 2011;90(8):657–64.

    CAS  Article  PubMed  Google Scholar 

Download references

Authors’ contributions

DYW and PL designed and carried out the study. HWL, XY and HYL participated in experiments and statistical analysis. DYW, PL and YH wrote the manuscript. HY and GC revised the manuscript. All authors read and approved the final manuscript.


The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. Dong-yue Wen and Peng Lin contributed equally as co-first authors, and Hong Yang and Gang Chen contributed equally as co-corresponding authors of this paper.

Competing interests

The authors declare that they have no competing interests.

Availability of data and materials

All the data supporting our findings can be found in the “Results” section of the paper. Please contact authors for data request.

Consent for publication

Not applicable.

Ethics approval and consent to participate

The Ethics Committee of First Affiliated Hospital of Guangxi Medical University approved the protocol, and written informed consent was provided by HCC patients involved.


This work was funded by the Fund of National Natural Science Foundation of China (NSFC81060202, NSFC81260222), the Fund of Innovation Project of Guangxi Graduate Education (YCSW2018104), and the Fund of Guangxi Key R&D Project Plan (AB17195020).

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Author information

Authors and Affiliations


Corresponding authors

Correspondence to Hong Yang or Gang Chen.

Additional files

Additional file 1: Figure S1.

Flow diagram of the selection in three databases.

Additional file 2: Figure S2.

The expression level of CTD-2547G23.4 in HCC. (A) Forest plot of all eligible datasets evaluating CTD-2547G23.4 expression between HCC and adjacent non-tumour tissues. (B) The SROC curve for the differentiation of HCC from adjacent non-tumour tissues based upon TCGA, GSE27462, GSE49713 and qRT-PCR datasets. (C) Forest plot of TCGA and in-house PCR datasets evaluating CTD-2547G23.4 expression between HCC and adjacent non-tumour tissues. (D) The SROC curve for the differentiation of HCC from adjacent non-tumour tissues based upon TCGA, GSE27462 and qRT-PCR datasets.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (, which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Wen, Dy., Lin, P., Liang, Hw. et al. Up-regulation of CTD-2547G23.4 in hepatocellular carcinoma tissues and its prospective molecular regulatory mechanism: a novel qRT-PCR and bioinformatics analysis study. Cancer Cell Int 18, 74 (2018).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI:


  • CTD-2547G23.4
  • Hepatocellular carcinoma (HCC)
  • Bioinformatic analysis
  • qRT-PCR
  • TCGA
  • GEO