Identifying critical genes in esophageal squamous cell carcinoma using an ensemble approach

Esophageal Squamous Cell Carcinoma (ESCC) is considered as a deadly disease especially in the North-East, India. A series of differentially expressed genes (DEGs) are suspected to be involved in the progression of ESCC. To search the DEGs a good number of tools are available. To remove the biasness...

Full description

Bibliographic Details
Main Authors: Pallabi Patowary, Dhruba K. Bhattacharyya, Pankaj Barah
Format: Article
Language:English
Published: Elsevier 2020-01-01
Series:Informatics in Medicine Unlocked
Online Access:http://www.sciencedirect.com/science/article/pii/S2352914819301881
id doaj-bc8c721211864e0893ff8b9be7857678
record_format Article
spelling doaj-bc8c721211864e0893ff8b9be78576782020-11-25T02:30:47ZengElsevierInformatics in Medicine Unlocked2352-91482020-01-0118Identifying critical genes in esophageal squamous cell carcinoma using an ensemble approachPallabi Patowary0Dhruba K. Bhattacharyya1Pankaj Barah2Department of Computer Science and Engineering, Tezpur University, Assam, IndiaDepartment of Computer Science and Engineering, Tezpur University, Assam, India; Corresponding author.Department of Molecular Biology and Biotechnology, Tezpur University, Assam, IndiaEsophageal Squamous Cell Carcinoma (ESCC) is considered as a deadly disease especially in the North-East, India. A series of differentially expressed genes (DEGs) are suspected to be involved in the progression of ESCC. To search the DEGs a good number of tools are available. To remove the biasness of resulting DEGs given by all such tools, a consensus function is necessary on which user can rely on the output generated by differential expression analysis methods applied on multiple sources of data. In this study, we have considered two microarrays (of 34 and 106 samples, respectively) and one RNA-seq data (of 29 samples) to conduct an unbiased integrative analysis towards the identification of critical genes for ESCC. Initially, independent downstream analysis on each type of data using six differentially expressed gene identification tools followed by an integrative analysis supported by an effective consensus function is conducted to identify an unbiased set of differently expressed genes. The identified gene set includes common genes obtained (for P-value cut-off < 0.01) from the tools as well as some uncommon top-ranked genes (for P-value cut-off < 0.001). Next, further preservation analysis is performed and identified a set of low preserved modules. Finally, hub genes are identified from the selected low preserved modules and validated both topologically and biologically. A set of hub genes are identified such as SOX11, COL27A1, TOP3A, BAG6, CDC6, EZH2, COL7A1, G6PD, and AKR1C2 which have been established to be critical for ESCC. Keywords: ESCC, Microarray, RNA-seq, Differentially expressed gene, Module preservation, Differential expression analysis, Co-expression networkhttp://www.sciencedirect.com/science/article/pii/S2352914819301881
collection DOAJ
language English
format Article
sources DOAJ
author Pallabi Patowary
Dhruba K. Bhattacharyya
Pankaj Barah
spellingShingle Pallabi Patowary
Dhruba K. Bhattacharyya
Pankaj Barah
Identifying critical genes in esophageal squamous cell carcinoma using an ensemble approach
Informatics in Medicine Unlocked
author_facet Pallabi Patowary
Dhruba K. Bhattacharyya
Pankaj Barah
author_sort Pallabi Patowary
title Identifying critical genes in esophageal squamous cell carcinoma using an ensemble approach
title_short Identifying critical genes in esophageal squamous cell carcinoma using an ensemble approach
title_full Identifying critical genes in esophageal squamous cell carcinoma using an ensemble approach
title_fullStr Identifying critical genes in esophageal squamous cell carcinoma using an ensemble approach
title_full_unstemmed Identifying critical genes in esophageal squamous cell carcinoma using an ensemble approach
title_sort identifying critical genes in esophageal squamous cell carcinoma using an ensemble approach
publisher Elsevier
series Informatics in Medicine Unlocked
issn 2352-9148
publishDate 2020-01-01
description Esophageal Squamous Cell Carcinoma (ESCC) is considered as a deadly disease especially in the North-East, India. A series of differentially expressed genes (DEGs) are suspected to be involved in the progression of ESCC. To search the DEGs a good number of tools are available. To remove the biasness of resulting DEGs given by all such tools, a consensus function is necessary on which user can rely on the output generated by differential expression analysis methods applied on multiple sources of data. In this study, we have considered two microarrays (of 34 and 106 samples, respectively) and one RNA-seq data (of 29 samples) to conduct an unbiased integrative analysis towards the identification of critical genes for ESCC. Initially, independent downstream analysis on each type of data using six differentially expressed gene identification tools followed by an integrative analysis supported by an effective consensus function is conducted to identify an unbiased set of differently expressed genes. The identified gene set includes common genes obtained (for P-value cut-off < 0.01) from the tools as well as some uncommon top-ranked genes (for P-value cut-off < 0.001). Next, further preservation analysis is performed and identified a set of low preserved modules. Finally, hub genes are identified from the selected low preserved modules and validated both topologically and biologically. A set of hub genes are identified such as SOX11, COL27A1, TOP3A, BAG6, CDC6, EZH2, COL7A1, G6PD, and AKR1C2 which have been established to be critical for ESCC. Keywords: ESCC, Microarray, RNA-seq, Differentially expressed gene, Module preservation, Differential expression analysis, Co-expression network
url http://www.sciencedirect.com/science/article/pii/S2352914819301881
work_keys_str_mv AT pallabipatowary identifyingcriticalgenesinesophagealsquamouscellcarcinomausinganensembleapproach
AT dhrubakbhattacharyya identifyingcriticalgenesinesophagealsquamouscellcarcinomausinganensembleapproach
AT pankajbarah identifyingcriticalgenesinesophagealsquamouscellcarcinomausinganensembleapproach
_version_ 1724827963616133120