Development and Validation of a 12-Gene Immune Relevant Prognostic Signature for Lung Adenocarcinoma Through Machine Learning Strategies

Background: Although immunotherapy with checkpoint inhibitors is changing the face of lung adenocarcinoma (LUAD) treatments, only limited patients could benefit from it. Therefore, we aimed to develop an immune-relevant-gene-based signature to predict LUAD patients' prognosis and to characteriz...

Full description

Bibliographic Details
Main Authors: Liang Xue, Guoshu Bi, Cheng Zhan, Yi Zhang, Yunfeng Yuan, Hong Fan
Format: Article
Language:English
Published: Frontiers Media S.A. 2020-05-01
Series:Frontiers in Oncology
Subjects:
Online Access:https://www.frontiersin.org/article/10.3389/fonc.2020.00835/full
id doaj-eda04626423f4b4cabd4abef862fe7b8
record_format Article
spelling doaj-eda04626423f4b4cabd4abef862fe7b82020-11-25T03:04:29ZengFrontiers Media S.A.Frontiers in Oncology2234-943X2020-05-011010.3389/fonc.2020.00835523788Development and Validation of a 12-Gene Immune Relevant Prognostic Signature for Lung Adenocarcinoma Through Machine Learning StrategiesLiang XueGuoshu BiCheng ZhanYi ZhangYunfeng YuanHong FanBackground: Although immunotherapy with checkpoint inhibitors is changing the face of lung adenocarcinoma (LUAD) treatments, only limited patients could benefit from it. Therefore, we aimed to develop an immune-relevant-gene-based signature to predict LUAD patients' prognosis and to characterize their tumor microenvironment thus guiding therapeutic strategy.Methods and Materials: Gene expression data of LUAD patients from Gene Expression Omnibus (GEO) and The Cancer Genome Atlas (TCGA) were systematically analyzed. We performed Cox regression and random survival forest algorithm to identify immune-relevant genes with potential prognostic value. A risk score formula was then established by integrating these selected genes and patients were classified into high- and low-risk score group. Differentially expressed genes, infiltration level of immune cells, and several immune-associated molecules were further compared across the two groups.Results: Nine hundred and fifty-four LUAD patients were enrolled in this study. After implementing the 2-steps machine learning screening methods, 12 immune-relevant genes were finally selected into the risk-score formula and the patients in high-risk group had significantly worse overall survival (HR = 10.6, 95%CI = 3.21–34.95, P < 0.001). We also found the distinct immune infiltration patterns in the two groups that several immune cells like cytotoxic cells and immune checkpoint molecules were significantly enriched and upregulated in patients from the high-risk group. These findings were further validated in two independent LUAD cohorts.Conclusion: Our risk score formula could serve as a powerful and accurate tool for predicting survival of LUAD patients and may facilitate clinicians to choose the optimal therapeutic regimen more precisely.https://www.frontiersin.org/article/10.3389/fonc.2020.00835/fulllung adenocarcinomarisk score formulaimmune infiltrationmachine learningsurvival
collection DOAJ
language English
format Article
sources DOAJ
author Liang Xue
Guoshu Bi
Cheng Zhan
Yi Zhang
Yunfeng Yuan
Hong Fan
spellingShingle Liang Xue
Guoshu Bi
Cheng Zhan
Yi Zhang
Yunfeng Yuan
Hong Fan
Development and Validation of a 12-Gene Immune Relevant Prognostic Signature for Lung Adenocarcinoma Through Machine Learning Strategies
Frontiers in Oncology
lung adenocarcinoma
risk score formula
immune infiltration
machine learning
survival
author_facet Liang Xue
Guoshu Bi
Cheng Zhan
Yi Zhang
Yunfeng Yuan
Hong Fan
author_sort Liang Xue
title Development and Validation of a 12-Gene Immune Relevant Prognostic Signature for Lung Adenocarcinoma Through Machine Learning Strategies
title_short Development and Validation of a 12-Gene Immune Relevant Prognostic Signature for Lung Adenocarcinoma Through Machine Learning Strategies
title_full Development and Validation of a 12-Gene Immune Relevant Prognostic Signature for Lung Adenocarcinoma Through Machine Learning Strategies
title_fullStr Development and Validation of a 12-Gene Immune Relevant Prognostic Signature for Lung Adenocarcinoma Through Machine Learning Strategies
title_full_unstemmed Development and Validation of a 12-Gene Immune Relevant Prognostic Signature for Lung Adenocarcinoma Through Machine Learning Strategies
title_sort development and validation of a 12-gene immune relevant prognostic signature for lung adenocarcinoma through machine learning strategies
publisher Frontiers Media S.A.
series Frontiers in Oncology
issn 2234-943X
publishDate 2020-05-01
description Background: Although immunotherapy with checkpoint inhibitors is changing the face of lung adenocarcinoma (LUAD) treatments, only limited patients could benefit from it. Therefore, we aimed to develop an immune-relevant-gene-based signature to predict LUAD patients' prognosis and to characterize their tumor microenvironment thus guiding therapeutic strategy.Methods and Materials: Gene expression data of LUAD patients from Gene Expression Omnibus (GEO) and The Cancer Genome Atlas (TCGA) were systematically analyzed. We performed Cox regression and random survival forest algorithm to identify immune-relevant genes with potential prognostic value. A risk score formula was then established by integrating these selected genes and patients were classified into high- and low-risk score group. Differentially expressed genes, infiltration level of immune cells, and several immune-associated molecules were further compared across the two groups.Results: Nine hundred and fifty-four LUAD patients were enrolled in this study. After implementing the 2-steps machine learning screening methods, 12 immune-relevant genes were finally selected into the risk-score formula and the patients in high-risk group had significantly worse overall survival (HR = 10.6, 95%CI = 3.21–34.95, P < 0.001). We also found the distinct immune infiltration patterns in the two groups that several immune cells like cytotoxic cells and immune checkpoint molecules were significantly enriched and upregulated in patients from the high-risk group. These findings were further validated in two independent LUAD cohorts.Conclusion: Our risk score formula could serve as a powerful and accurate tool for predicting survival of LUAD patients and may facilitate clinicians to choose the optimal therapeutic regimen more precisely.
topic lung adenocarcinoma
risk score formula
immune infiltration
machine learning
survival
url https://www.frontiersin.org/article/10.3389/fonc.2020.00835/full
work_keys_str_mv AT liangxue developmentandvalidationofa12geneimmunerelevantprognosticsignatureforlungadenocarcinomathroughmachinelearningstrategies
AT guoshubi developmentandvalidationofa12geneimmunerelevantprognosticsignatureforlungadenocarcinomathroughmachinelearningstrategies
AT chengzhan developmentandvalidationofa12geneimmunerelevantprognosticsignatureforlungadenocarcinomathroughmachinelearningstrategies
AT yizhang developmentandvalidationofa12geneimmunerelevantprognosticsignatureforlungadenocarcinomathroughmachinelearningstrategies
AT yunfengyuan developmentandvalidationofa12geneimmunerelevantprognosticsignatureforlungadenocarcinomathroughmachinelearningstrategies
AT hongfan developmentandvalidationofa12geneimmunerelevantprognosticsignatureforlungadenocarcinomathroughmachinelearningstrategies
_version_ 1724681504030720000