A Robust 8-Gene Prognostic Signature for Early-Stage Non-small Cell Lung Cancer

Background: The current staging system is imprecise for prognostic prediction of early-stage non–small cell lung cancer (NSCLC). This study aimed to develop a robust prognostic signature for early-stage NSCLC, allowing classification of patients with a high risk of poor outcome and specific treatmen...

Full description

Bibliographic Details
Main Authors: Ru He, Shuguang Zuo
Format: Article
Language:English
Published: Frontiers Media S.A. 2019-07-01
Series:Frontiers in Oncology
Subjects:
Online Access:https://www.frontiersin.org/article/10.3389/fonc.2019.00693/full
id doaj-de46f13507244a2da501f019694dfac5
record_format Article
spelling doaj-de46f13507244a2da501f019694dfac52020-11-25T00:37:03ZengFrontiers Media S.A.Frontiers in Oncology2234-943X2019-07-01910.3389/fonc.2019.00693469834A Robust 8-Gene Prognostic Signature for Early-Stage Non-small Cell Lung CancerRu He0Shuguang Zuo1Shuguang Zuo2Center for Translational Medicine, Huaihe Hospital of Henan University, Kaifeng, ChinaCenter for Translational Medicine, Huaihe Hospital of Henan University, Kaifeng, ChinaInstitute of Infection and Immunity, Huaihe Hospital of Henan University, Kaifeng, ChinaBackground: The current staging system is imprecise for prognostic prediction of early-stage non–small cell lung cancer (NSCLC). This study aimed to develop a robust prognostic signature for early-stage NSCLC, allowing classification of patients with a high risk of poor outcome and specific treatment decision.Method: In the present study, a comprehensive genome-wide profiling analysis was conducted using a retrospective pool of early-stage NSCLC patient data from the previous datasets of Gene Expression Omnibus (GEO) including GSE31210, GSE37745, and GSE50081 and The Cancer Genome Atlas (TCGA). Cox proportional hazards models were implemented to determine the association between gene expression levels and overall patient survival in each dataset. The common genes among all datasets were selected as candidate prognostic genes. A risk score model was developed and validated using four independent datasets and the entire cohort. The Kaplan-Meier with log-rank test was used to assess survival difference.Results: A univariate Cox proportional hazards regression analysis for each dataset showed that a total of 2280 genes in GSE31210, 762 genes in GSE37745, 871 genes in GSE50081, and 666 genes in TCGA were identified as candidate protective genes, while overall 2131 genes in GSE31210, 913 in GSE37745, 1107 in GSE50081, and 997 in TCGA were identified as candidate risky genes. There were 8 common genes associated with overall survival, including 7 mRNA and 1 lncRNA. By using the Step-wise multivariate Cox analysis, an 8-gene prognostic signature (CDCP1, HMMR, TPX2, CIRBP, HLF, KBTBD7, SEC24B-AS1, and SH2B1) for early-stage NSCLC was developed. Patients in the high-risk group had shorter overall survival than those in the low-risk group. Multivariate regression and stratified analysis suggested that the prognostic power of the 8-gene signature was independent of other clinical factors. Furthermore, the 8-gene signature achieved AUC values of 0.726, 0.701, 0.725 and 0.650 in GSE31210, GSE37745, GSE50081 and TCGA, respectively. Moreover, the combination of the 8-gene signature and the stage resulted to a better patient classification for survival prediction and treatment decision.Conclusion: This study developed a robust gene signature with great value for prognostic prediction in early-stage NSCLC, which may contribute to patient classification and personalized treatment decisions.https://www.frontiersin.org/article/10.3389/fonc.2019.00693/fullnon-small cell lung canceroverall survivalrisk scorestageprognostic signature
collection DOAJ
language English
format Article
sources DOAJ
author Ru He
Shuguang Zuo
Shuguang Zuo
spellingShingle Ru He
Shuguang Zuo
Shuguang Zuo
A Robust 8-Gene Prognostic Signature for Early-Stage Non-small Cell Lung Cancer
Frontiers in Oncology
non-small cell lung cancer
overall survival
risk score
stage
prognostic signature
author_facet Ru He
Shuguang Zuo
Shuguang Zuo
author_sort Ru He
title A Robust 8-Gene Prognostic Signature for Early-Stage Non-small Cell Lung Cancer
title_short A Robust 8-Gene Prognostic Signature for Early-Stage Non-small Cell Lung Cancer
title_full A Robust 8-Gene Prognostic Signature for Early-Stage Non-small Cell Lung Cancer
title_fullStr A Robust 8-Gene Prognostic Signature for Early-Stage Non-small Cell Lung Cancer
title_full_unstemmed A Robust 8-Gene Prognostic Signature for Early-Stage Non-small Cell Lung Cancer
title_sort robust 8-gene prognostic signature for early-stage non-small cell lung cancer
publisher Frontiers Media S.A.
series Frontiers in Oncology
issn 2234-943X
publishDate 2019-07-01
description Background: The current staging system is imprecise for prognostic prediction of early-stage non–small cell lung cancer (NSCLC). This study aimed to develop a robust prognostic signature for early-stage NSCLC, allowing classification of patients with a high risk of poor outcome and specific treatment decision.Method: In the present study, a comprehensive genome-wide profiling analysis was conducted using a retrospective pool of early-stage NSCLC patient data from the previous datasets of Gene Expression Omnibus (GEO) including GSE31210, GSE37745, and GSE50081 and The Cancer Genome Atlas (TCGA). Cox proportional hazards models were implemented to determine the association between gene expression levels and overall patient survival in each dataset. The common genes among all datasets were selected as candidate prognostic genes. A risk score model was developed and validated using four independent datasets and the entire cohort. The Kaplan-Meier with log-rank test was used to assess survival difference.Results: A univariate Cox proportional hazards regression analysis for each dataset showed that a total of 2280 genes in GSE31210, 762 genes in GSE37745, 871 genes in GSE50081, and 666 genes in TCGA were identified as candidate protective genes, while overall 2131 genes in GSE31210, 913 in GSE37745, 1107 in GSE50081, and 997 in TCGA were identified as candidate risky genes. There were 8 common genes associated with overall survival, including 7 mRNA and 1 lncRNA. By using the Step-wise multivariate Cox analysis, an 8-gene prognostic signature (CDCP1, HMMR, TPX2, CIRBP, HLF, KBTBD7, SEC24B-AS1, and SH2B1) for early-stage NSCLC was developed. Patients in the high-risk group had shorter overall survival than those in the low-risk group. Multivariate regression and stratified analysis suggested that the prognostic power of the 8-gene signature was independent of other clinical factors. Furthermore, the 8-gene signature achieved AUC values of 0.726, 0.701, 0.725 and 0.650 in GSE31210, GSE37745, GSE50081 and TCGA, respectively. Moreover, the combination of the 8-gene signature and the stage resulted to a better patient classification for survival prediction and treatment decision.Conclusion: This study developed a robust gene signature with great value for prognostic prediction in early-stage NSCLC, which may contribute to patient classification and personalized treatment decisions.
topic non-small cell lung cancer
overall survival
risk score
stage
prognostic signature
url https://www.frontiersin.org/article/10.3389/fonc.2019.00693/full
work_keys_str_mv AT ruhe arobust8geneprognosticsignatureforearlystagenonsmallcelllungcancer
AT shuguangzuo arobust8geneprognosticsignatureforearlystagenonsmallcelllungcancer
AT shuguangzuo arobust8geneprognosticsignatureforearlystagenonsmallcelllungcancer
AT ruhe robust8geneprognosticsignatureforearlystagenonsmallcelllungcancer
AT shuguangzuo robust8geneprognosticsignatureforearlystagenonsmallcelllungcancer
AT shuguangzuo robust8geneprognosticsignatureforearlystagenonsmallcelllungcancer
_version_ 1725302834629443584