Deep CNN Model Using CT Radiomics Feature Mapping Recognizes EGFR Gene Mutation Status of Lung Adenocarcinoma

To recognize the epidermal growth factor receptor (EGFR) gene mutation status in lung adenocarcinoma (LADC) has become a prerequisite of deciding whether EGFR-tyrosine kinase inhibitor (EGFR-TKI) medicine can be used. Polymerase chain reaction assay or gene sequencing is for measuring EGFR status, h...

Full description

Bibliographic Details
Main Authors: Baihua Zhang, Shouliang Qi, Xiaohuan Pan, Chen Li, Yudong Yao, Wei Qian, Yubao Guan
Format: Article
Language:English
Published: Frontiers Media S.A. 2021-02-01
Series:Frontiers in Oncology
Subjects:
Online Access:https://www.frontiersin.org/articles/10.3389/fonc.2020.598721/full
id doaj-95c73a5a4ef64de0bfdcf95fc8330dd8
record_format Article
spelling doaj-95c73a5a4ef64de0bfdcf95fc8330dd82021-02-12T05:26:45ZengFrontiers Media S.A.Frontiers in Oncology2234-943X2021-02-011010.3389/fonc.2020.598721598721Deep CNN Model Using CT Radiomics Feature Mapping Recognizes EGFR Gene Mutation Status of Lung AdenocarcinomaBaihua Zhang0Shouliang Qi1Shouliang Qi2Xiaohuan Pan3Chen Li4Yudong Yao5Wei Qian6Yubao Guan7College of Medicine and Biological Information Engineering, Northeastern University, Shenyang, ChinaCollege of Medicine and Biological Information Engineering, Northeastern University, Shenyang, ChinaKey Laboratory of Intelligent Computing in Medical Image, Ministry of Education, Northeastern University, Shenyang, ChinaDepartment of Radiology, The First Affiliated Hospital of Guangzhou Medical University, Guangzhou, ChinaCollege of Medicine and Biological Information Engineering, Northeastern University, Shenyang, ChinaDepartment of Electrical and Computer Engineering, Stevens Institute of Technology, Hoboken, NJ, United StatesDepartment of Electrical and Computer Engineering, University of Texas at El Paso, El Paso, TX, United StatesDepartment of Radiology, The Fifth Affiliated Hospital of Guangzhou Medical University, Guangzhou, ChinaTo recognize the epidermal growth factor receptor (EGFR) gene mutation status in lung adenocarcinoma (LADC) has become a prerequisite of deciding whether EGFR-tyrosine kinase inhibitor (EGFR-TKI) medicine can be used. Polymerase chain reaction assay or gene sequencing is for measuring EGFR status, however, the tissue samples by surgery or biopsy are required. We propose to develop deep learning models to recognize EGFR status by using radiomics features extracted from non-invasive CT images. Preoperative CT images, EGFR mutation status and clinical data have been collected in a cohort of 709 patients (the primary cohort) and an independent cohort of 205 patients. After 1,037 CT-based radiomics features are extracted from each lesion region, 784 discriminative features are selected for analysis and construct a feature mapping. One Squeeze-and-Excitation (SE) Convolutional Neural Network (SE-CNN) has been designed and trained to recognize EGFR status from the radiomics feature mapping. SE-CNN model is trained and validated by using 638 patients from the primary cohort, tested by using the rest 71 patients (the internal test cohort), and further tested by using the independent 205 patients (the external test cohort). Furthermore, SE-CNN model is compared with machine learning (ML) models using radiomics features, clinical features, and both features. EGFR(-) patients show the smaller age, higher odds of female, larger lesion volumes, and lower odds of subtype of acinar predominant adenocarcinoma (APA), compared with EGFR(+). The most discriminative features are for texture (614, 78.3%) and the features of first order of intensity (158, 20.1%) and the shape features (12, 1.5%) follow. SE-CNN model can recognize EGFR mutation status with an AUC of 0.910 and 0.841 for the internal and external test cohorts, respectively. It outperforms the CNN model without SE, the fine-tuned VGG16 and VGG19, three ML models, and the state-of-art models. Utilizing radiomics feature mapping extracted from non-invasive CT images, SE-CNN can precisely recognize EGFR mutation status of LADC patients. The proposed method combining radiomics features and deep leaning is superior to ML methods and can be expanded to other medical applications. The proposed SE-CNN model may help make decision on usage of EGFR-TKI medicine.https://www.frontiersin.org/articles/10.3389/fonc.2020.598721/fulllung cancerepidermal growth factor receptor mutationdeep learningconvolutional neural networkfeature mapping
collection DOAJ
language English
format Article
sources DOAJ
author Baihua Zhang
Shouliang Qi
Shouliang Qi
Xiaohuan Pan
Chen Li
Yudong Yao
Wei Qian
Yubao Guan
spellingShingle Baihua Zhang
Shouliang Qi
Shouliang Qi
Xiaohuan Pan
Chen Li
Yudong Yao
Wei Qian
Yubao Guan
Deep CNN Model Using CT Radiomics Feature Mapping Recognizes EGFR Gene Mutation Status of Lung Adenocarcinoma
Frontiers in Oncology
lung cancer
epidermal growth factor receptor mutation
deep learning
convolutional neural network
feature mapping
author_facet Baihua Zhang
Shouliang Qi
Shouliang Qi
Xiaohuan Pan
Chen Li
Yudong Yao
Wei Qian
Yubao Guan
author_sort Baihua Zhang
title Deep CNN Model Using CT Radiomics Feature Mapping Recognizes EGFR Gene Mutation Status of Lung Adenocarcinoma
title_short Deep CNN Model Using CT Radiomics Feature Mapping Recognizes EGFR Gene Mutation Status of Lung Adenocarcinoma
title_full Deep CNN Model Using CT Radiomics Feature Mapping Recognizes EGFR Gene Mutation Status of Lung Adenocarcinoma
title_fullStr Deep CNN Model Using CT Radiomics Feature Mapping Recognizes EGFR Gene Mutation Status of Lung Adenocarcinoma
title_full_unstemmed Deep CNN Model Using CT Radiomics Feature Mapping Recognizes EGFR Gene Mutation Status of Lung Adenocarcinoma
title_sort deep cnn model using ct radiomics feature mapping recognizes egfr gene mutation status of lung adenocarcinoma
publisher Frontiers Media S.A.
series Frontiers in Oncology
issn 2234-943X
publishDate 2021-02-01
description To recognize the epidermal growth factor receptor (EGFR) gene mutation status in lung adenocarcinoma (LADC) has become a prerequisite of deciding whether EGFR-tyrosine kinase inhibitor (EGFR-TKI) medicine can be used. Polymerase chain reaction assay or gene sequencing is for measuring EGFR status, however, the tissue samples by surgery or biopsy are required. We propose to develop deep learning models to recognize EGFR status by using radiomics features extracted from non-invasive CT images. Preoperative CT images, EGFR mutation status and clinical data have been collected in a cohort of 709 patients (the primary cohort) and an independent cohort of 205 patients. After 1,037 CT-based radiomics features are extracted from each lesion region, 784 discriminative features are selected for analysis and construct a feature mapping. One Squeeze-and-Excitation (SE) Convolutional Neural Network (SE-CNN) has been designed and trained to recognize EGFR status from the radiomics feature mapping. SE-CNN model is trained and validated by using 638 patients from the primary cohort, tested by using the rest 71 patients (the internal test cohort), and further tested by using the independent 205 patients (the external test cohort). Furthermore, SE-CNN model is compared with machine learning (ML) models using radiomics features, clinical features, and both features. EGFR(-) patients show the smaller age, higher odds of female, larger lesion volumes, and lower odds of subtype of acinar predominant adenocarcinoma (APA), compared with EGFR(+). The most discriminative features are for texture (614, 78.3%) and the features of first order of intensity (158, 20.1%) and the shape features (12, 1.5%) follow. SE-CNN model can recognize EGFR mutation status with an AUC of 0.910 and 0.841 for the internal and external test cohorts, respectively. It outperforms the CNN model without SE, the fine-tuned VGG16 and VGG19, three ML models, and the state-of-art models. Utilizing radiomics feature mapping extracted from non-invasive CT images, SE-CNN can precisely recognize EGFR mutation status of LADC patients. The proposed method combining radiomics features and deep leaning is superior to ML methods and can be expanded to other medical applications. The proposed SE-CNN model may help make decision on usage of EGFR-TKI medicine.
topic lung cancer
epidermal growth factor receptor mutation
deep learning
convolutional neural network
feature mapping
url https://www.frontiersin.org/articles/10.3389/fonc.2020.598721/full
work_keys_str_mv AT baihuazhang deepcnnmodelusingctradiomicsfeaturemappingrecognizesegfrgenemutationstatusoflungadenocarcinoma
AT shouliangqi deepcnnmodelusingctradiomicsfeaturemappingrecognizesegfrgenemutationstatusoflungadenocarcinoma
AT shouliangqi deepcnnmodelusingctradiomicsfeaturemappingrecognizesegfrgenemutationstatusoflungadenocarcinoma
AT xiaohuanpan deepcnnmodelusingctradiomicsfeaturemappingrecognizesegfrgenemutationstatusoflungadenocarcinoma
AT chenli deepcnnmodelusingctradiomicsfeaturemappingrecognizesegfrgenemutationstatusoflungadenocarcinoma
AT yudongyao deepcnnmodelusingctradiomicsfeaturemappingrecognizesegfrgenemutationstatusoflungadenocarcinoma
AT weiqian deepcnnmodelusingctradiomicsfeaturemappingrecognizesegfrgenemutationstatusoflungadenocarcinoma
AT yubaoguan deepcnnmodelusingctradiomicsfeaturemappingrecognizesegfrgenemutationstatusoflungadenocarcinoma
_version_ 1724273725654695936