Cancer-Related Gene Signature Selection Based on Boosted Regression for Multilayer Perceptron

Gene expression profiling is a useful technique for analyzing cellular function, and gene expression profiles are widely studied in human cancer research. Gene expression data usually consist of a very large number of features and a relatively small number of samples, and extracting a small number o...

Full description

Bibliographic Details
Main Authors: Hyein Seo, Dong-Ho Cho
Format: Article
Language:English
Published: IEEE 2020-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9056524/
id doaj-8be301f4b55f4586befa1855de616bbb
record_format Article
spelling doaj-8be301f4b55f4586befa1855de616bbb2021-03-30T01:34:43ZengIEEEIEEE Access2169-35362020-01-018649926500410.1109/ACCESS.2020.29854149056524Cancer-Related Gene Signature Selection Based on Boosted Regression for Multilayer PerceptronHyein Seo0Dong-Ho Cho1https://orcid.org/0000-0001-9849-4392School of Electrical Engineering, Korea Advanced Institute of Science and Technology (KAIST), Daejeon, South KoreaSchool of Electrical Engineering, Korea Advanced Institute of Science and Technology (KAIST), Daejeon, South KoreaGene expression profiling is a useful technique for analyzing cellular function, and gene expression profiles are widely studied in human cancer research. Gene expression data usually consist of a very large number of features and a relatively small number of samples, and extracting a small number of important features from these data is a major challenge of gene expression-based analysis in cancer research. In this paper, we propose an embedded feature selection algorithm using boosted linear regression-based feature selection. The boosting technique is applied to derive the ensemble feature selector and improve the performance of linear regression-based feature selection. The proposed feature selection algorithm, called boosted regression-based feature selection for the multilayer perceptron (BREG-MLP), repeats the boosted feature selection process to extract the smallest feature subset while maintaining good classification performance. We apply the proposed BREG-MLP to some human cancer-related gene expression data sets for the purpose of extracting important features, and we confirm that BREG-MLP offers improved performance compared to single regression-based feature selection methods.https://ieeexplore.ieee.org/document/9056524/Boostingfeature selectionlinear regressiongene expression profile
collection DOAJ
language English
format Article
sources DOAJ
author Hyein Seo
Dong-Ho Cho
spellingShingle Hyein Seo
Dong-Ho Cho
Cancer-Related Gene Signature Selection Based on Boosted Regression for Multilayer Perceptron
IEEE Access
Boosting
feature selection
linear regression
gene expression profile
author_facet Hyein Seo
Dong-Ho Cho
author_sort Hyein Seo
title Cancer-Related Gene Signature Selection Based on Boosted Regression for Multilayer Perceptron
title_short Cancer-Related Gene Signature Selection Based on Boosted Regression for Multilayer Perceptron
title_full Cancer-Related Gene Signature Selection Based on Boosted Regression for Multilayer Perceptron
title_fullStr Cancer-Related Gene Signature Selection Based on Boosted Regression for Multilayer Perceptron
title_full_unstemmed Cancer-Related Gene Signature Selection Based on Boosted Regression for Multilayer Perceptron
title_sort cancer-related gene signature selection based on boosted regression for multilayer perceptron
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2020-01-01
description Gene expression profiling is a useful technique for analyzing cellular function, and gene expression profiles are widely studied in human cancer research. Gene expression data usually consist of a very large number of features and a relatively small number of samples, and extracting a small number of important features from these data is a major challenge of gene expression-based analysis in cancer research. In this paper, we propose an embedded feature selection algorithm using boosted linear regression-based feature selection. The boosting technique is applied to derive the ensemble feature selector and improve the performance of linear regression-based feature selection. The proposed feature selection algorithm, called boosted regression-based feature selection for the multilayer perceptron (BREG-MLP), repeats the boosted feature selection process to extract the smallest feature subset while maintaining good classification performance. We apply the proposed BREG-MLP to some human cancer-related gene expression data sets for the purpose of extracting important features, and we confirm that BREG-MLP offers improved performance compared to single regression-based feature selection methods.
topic Boosting
feature selection
linear regression
gene expression profile
url https://ieeexplore.ieee.org/document/9056524/
work_keys_str_mv AT hyeinseo cancerrelatedgenesignatureselectionbasedonboostedregressionformultilayerperceptron
AT donghocho cancerrelatedgenesignatureselectionbasedonboostedregressionformultilayerperceptron
_version_ 1724186806450126848