Identification of core genes in intervertebral disc degeneration using bioinformatics and machine learning algorithms

BackgroundIntervertebral Disc Degeneration (IDD) is a major cause of lower back pain and a significant global health issue. However, the specific mechanisms of IDD remain unclear. This study aims to identify key genes and pathways associated with IDD using bioinformatics and machine learning algorit...

وصف كامل

التفاصيل البيبلوغرافية
الحاوية / القاعدة:Frontiers in Immunology
المؤلفون الرئيسيون: Hao Zhang, Shengbo Shi, Xingxing Huang, Changsheng Gong, Zijing Zhang, Zetian Zhao, Junxiao Gao, Meng Zhang, Xiaobing Yu
التنسيق: مقال
اللغة:الإنجليزية
منشور في: Frontiers Media S.A. 2024-07-01
الموضوعات:
الوصول للمادة أونلاين:https://www.frontiersin.org/articles/10.3389/fimmu.2024.1401957/full
_version_ 1850038414852751360
author Hao Zhang
Shengbo Shi
Xingxing Huang
Changsheng Gong
Zijing Zhang
Zetian Zhao
Junxiao Gao
Meng Zhang
Xiaobing Yu
author_facet Hao Zhang
Shengbo Shi
Xingxing Huang
Changsheng Gong
Zijing Zhang
Zetian Zhao
Junxiao Gao
Meng Zhang
Xiaobing Yu
author_sort Hao Zhang
collection DOAJ
container_title Frontiers in Immunology
description BackgroundIntervertebral Disc Degeneration (IDD) is a major cause of lower back pain and a significant global health issue. However, the specific mechanisms of IDD remain unclear. This study aims to identify key genes and pathways associated with IDD using bioinformatics and machine learning algorithms.MethodsGene expression profiles, including those from 35 LDH patients and 43 healthy volunteers, were downloaded from the GEO database (GSE124272, GSE150408, GSE23130, GSE153761). After merging four microarray datasets, differentially expressed genes (DEGs) were selected for GO and KEGG pathway enrichment analysis. Weighted Gene Co-expression Network Analysis (WGCNA) was then applied to the merged dataset to identify relevant modules and intersect with DEGs to discover candidate genes with diagnostic value. A LASSO model was established to select appropriate genes, and ROC curves were drawn to elucidate the diagnostic value of genetic markers. A Protein-Protein Interaction (PPI) network was constructed and visualized to determine central genes, followed by external validation using qRT-PCR.ResultsDifferential analysis of the preprocessed dataset identified 244 genes, including 183 upregulated and 61 downregulated genes. WGCNA analysis revealed the most relevant module intersecting with DEGs, yielding 9 candidate genes. The lasso-cox method was used for regression analysis, ultimately identifying 6 genes: ASPH, CDC42EP3, FOSL2, IL1R1, NFKBIZ, TCF7L2. A Protein-Protein Interaction (PPI) network created with GENEMANIA identified IL1R1 and TCF7L2 as central genes.ConclusionOur study shows that IL1R1 and TCF7L2 are the core genes of IDD, offering new insights into the pathogenesis and therapeutic development of IDD.
format Article
id doaj-art-e3eae81376ab46dabd4948aaa2d50f2c
institution Directory of Open Access Journals
issn 1664-3224
language English
publishDate 2024-07-01
publisher Frontiers Media S.A.
record_format Article
spelling doaj-art-e3eae81376ab46dabd4948aaa2d50f2c2025-08-20T00:32:54ZengFrontiers Media S.A.Frontiers in Immunology1664-32242024-07-011510.3389/fimmu.2024.14019571401957Identification of core genes in intervertebral disc degeneration using bioinformatics and machine learning algorithmsHao ZhangShengbo ShiXingxing HuangChangsheng GongZijing ZhangZetian ZhaoJunxiao GaoMeng ZhangXiaobing YuBackgroundIntervertebral Disc Degeneration (IDD) is a major cause of lower back pain and a significant global health issue. However, the specific mechanisms of IDD remain unclear. This study aims to identify key genes and pathways associated with IDD using bioinformatics and machine learning algorithms.MethodsGene expression profiles, including those from 35 LDH patients and 43 healthy volunteers, were downloaded from the GEO database (GSE124272, GSE150408, GSE23130, GSE153761). After merging four microarray datasets, differentially expressed genes (DEGs) were selected for GO and KEGG pathway enrichment analysis. Weighted Gene Co-expression Network Analysis (WGCNA) was then applied to the merged dataset to identify relevant modules and intersect with DEGs to discover candidate genes with diagnostic value. A LASSO model was established to select appropriate genes, and ROC curves were drawn to elucidate the diagnostic value of genetic markers. A Protein-Protein Interaction (PPI) network was constructed and visualized to determine central genes, followed by external validation using qRT-PCR.ResultsDifferential analysis of the preprocessed dataset identified 244 genes, including 183 upregulated and 61 downregulated genes. WGCNA analysis revealed the most relevant module intersecting with DEGs, yielding 9 candidate genes. The lasso-cox method was used for regression analysis, ultimately identifying 6 genes: ASPH, CDC42EP3, FOSL2, IL1R1, NFKBIZ, TCF7L2. A Protein-Protein Interaction (PPI) network created with GENEMANIA identified IL1R1 and TCF7L2 as central genes.ConclusionOur study shows that IL1R1 and TCF7L2 are the core genes of IDD, offering new insights into the pathogenesis and therapeutic development of IDD.https://www.frontiersin.org/articles/10.3389/fimmu.2024.1401957/fullintervertebral disc degeneratio1bioinformaticsmachine learninggenesIL1R15TCF7L2
spellingShingle Hao Zhang
Shengbo Shi
Xingxing Huang
Changsheng Gong
Zijing Zhang
Zetian Zhao
Junxiao Gao
Meng Zhang
Xiaobing Yu
Identification of core genes in intervertebral disc degeneration using bioinformatics and machine learning algorithms
intervertebral disc degeneratio1
bioinformatics
machine learning
genes
IL1R15
TCF7L2
title Identification of core genes in intervertebral disc degeneration using bioinformatics and machine learning algorithms
title_full Identification of core genes in intervertebral disc degeneration using bioinformatics and machine learning algorithms
title_fullStr Identification of core genes in intervertebral disc degeneration using bioinformatics and machine learning algorithms
title_full_unstemmed Identification of core genes in intervertebral disc degeneration using bioinformatics and machine learning algorithms
title_short Identification of core genes in intervertebral disc degeneration using bioinformatics and machine learning algorithms
title_sort identification of core genes in intervertebral disc degeneration using bioinformatics and machine learning algorithms
topic intervertebral disc degeneratio1
bioinformatics
machine learning
genes
IL1R15
TCF7L2
url https://www.frontiersin.org/articles/10.3389/fimmu.2024.1401957/full
work_keys_str_mv AT haozhang identificationofcoregenesinintervertebraldiscdegenerationusingbioinformaticsandmachinelearningalgorithms
AT shengboshi identificationofcoregenesinintervertebraldiscdegenerationusingbioinformaticsandmachinelearningalgorithms
AT xingxinghuang identificationofcoregenesinintervertebraldiscdegenerationusingbioinformaticsandmachinelearningalgorithms
AT changshenggong identificationofcoregenesinintervertebraldiscdegenerationusingbioinformaticsandmachinelearningalgorithms
AT zijingzhang identificationofcoregenesinintervertebraldiscdegenerationusingbioinformaticsandmachinelearningalgorithms
AT zetianzhao identificationofcoregenesinintervertebraldiscdegenerationusingbioinformaticsandmachinelearningalgorithms
AT junxiaogao identificationofcoregenesinintervertebraldiscdegenerationusingbioinformaticsandmachinelearningalgorithms
AT mengzhang identificationofcoregenesinintervertebraldiscdegenerationusingbioinformaticsandmachinelearningalgorithms
AT xiaobingyu identificationofcoregenesinintervertebraldiscdegenerationusingbioinformaticsandmachinelearningalgorithms