Identification of core genes in intervertebral disc degeneration using bioinformatics and machine learning algorithms
BackgroundIntervertebral Disc Degeneration (IDD) is a major cause of lower back pain and a significant global health issue. However, the specific mechanisms of IDD remain unclear. This study aims to identify key genes and pathways associated with IDD using bioinformatics and machine learning algorit...
| الحاوية / القاعدة: | Frontiers in Immunology |
|---|---|
| المؤلفون الرئيسيون: | , , , , , , , , |
| التنسيق: | مقال |
| اللغة: | الإنجليزية |
| منشور في: |
Frontiers Media S.A.
2024-07-01
|
| الموضوعات: | |
| الوصول للمادة أونلاين: | https://www.frontiersin.org/articles/10.3389/fimmu.2024.1401957/full |
| _version_ | 1850038414852751360 |
|---|---|
| author | Hao Zhang Shengbo Shi Xingxing Huang Changsheng Gong Zijing Zhang Zetian Zhao Junxiao Gao Meng Zhang Xiaobing Yu |
| author_facet | Hao Zhang Shengbo Shi Xingxing Huang Changsheng Gong Zijing Zhang Zetian Zhao Junxiao Gao Meng Zhang Xiaobing Yu |
| author_sort | Hao Zhang |
| collection | DOAJ |
| container_title | Frontiers in Immunology |
| description | BackgroundIntervertebral Disc Degeneration (IDD) is a major cause of lower back pain and a significant global health issue. However, the specific mechanisms of IDD remain unclear. This study aims to identify key genes and pathways associated with IDD using bioinformatics and machine learning algorithms.MethodsGene expression profiles, including those from 35 LDH patients and 43 healthy volunteers, were downloaded from the GEO database (GSE124272, GSE150408, GSE23130, GSE153761). After merging four microarray datasets, differentially expressed genes (DEGs) were selected for GO and KEGG pathway enrichment analysis. Weighted Gene Co-expression Network Analysis (WGCNA) was then applied to the merged dataset to identify relevant modules and intersect with DEGs to discover candidate genes with diagnostic value. A LASSO model was established to select appropriate genes, and ROC curves were drawn to elucidate the diagnostic value of genetic markers. A Protein-Protein Interaction (PPI) network was constructed and visualized to determine central genes, followed by external validation using qRT-PCR.ResultsDifferential analysis of the preprocessed dataset identified 244 genes, including 183 upregulated and 61 downregulated genes. WGCNA analysis revealed the most relevant module intersecting with DEGs, yielding 9 candidate genes. The lasso-cox method was used for regression analysis, ultimately identifying 6 genes: ASPH, CDC42EP3, FOSL2, IL1R1, NFKBIZ, TCF7L2. A Protein-Protein Interaction (PPI) network created with GENEMANIA identified IL1R1 and TCF7L2 as central genes.ConclusionOur study shows that IL1R1 and TCF7L2 are the core genes of IDD, offering new insights into the pathogenesis and therapeutic development of IDD. |
| format | Article |
| id | doaj-art-e3eae81376ab46dabd4948aaa2d50f2c |
| institution | Directory of Open Access Journals |
| issn | 1664-3224 |
| language | English |
| publishDate | 2024-07-01 |
| publisher | Frontiers Media S.A. |
| record_format | Article |
| spelling | doaj-art-e3eae81376ab46dabd4948aaa2d50f2c2025-08-20T00:32:54ZengFrontiers Media S.A.Frontiers in Immunology1664-32242024-07-011510.3389/fimmu.2024.14019571401957Identification of core genes in intervertebral disc degeneration using bioinformatics and machine learning algorithmsHao ZhangShengbo ShiXingxing HuangChangsheng GongZijing ZhangZetian ZhaoJunxiao GaoMeng ZhangXiaobing YuBackgroundIntervertebral Disc Degeneration (IDD) is a major cause of lower back pain and a significant global health issue. However, the specific mechanisms of IDD remain unclear. This study aims to identify key genes and pathways associated with IDD using bioinformatics and machine learning algorithms.MethodsGene expression profiles, including those from 35 LDH patients and 43 healthy volunteers, were downloaded from the GEO database (GSE124272, GSE150408, GSE23130, GSE153761). After merging four microarray datasets, differentially expressed genes (DEGs) were selected for GO and KEGG pathway enrichment analysis. Weighted Gene Co-expression Network Analysis (WGCNA) was then applied to the merged dataset to identify relevant modules and intersect with DEGs to discover candidate genes with diagnostic value. A LASSO model was established to select appropriate genes, and ROC curves were drawn to elucidate the diagnostic value of genetic markers. A Protein-Protein Interaction (PPI) network was constructed and visualized to determine central genes, followed by external validation using qRT-PCR.ResultsDifferential analysis of the preprocessed dataset identified 244 genes, including 183 upregulated and 61 downregulated genes. WGCNA analysis revealed the most relevant module intersecting with DEGs, yielding 9 candidate genes. The lasso-cox method was used for regression analysis, ultimately identifying 6 genes: ASPH, CDC42EP3, FOSL2, IL1R1, NFKBIZ, TCF7L2. A Protein-Protein Interaction (PPI) network created with GENEMANIA identified IL1R1 and TCF7L2 as central genes.ConclusionOur study shows that IL1R1 and TCF7L2 are the core genes of IDD, offering new insights into the pathogenesis and therapeutic development of IDD.https://www.frontiersin.org/articles/10.3389/fimmu.2024.1401957/fullintervertebral disc degeneratio1bioinformaticsmachine learninggenesIL1R15TCF7L2 |
| spellingShingle | Hao Zhang Shengbo Shi Xingxing Huang Changsheng Gong Zijing Zhang Zetian Zhao Junxiao Gao Meng Zhang Xiaobing Yu Identification of core genes in intervertebral disc degeneration using bioinformatics and machine learning algorithms intervertebral disc degeneratio1 bioinformatics machine learning genes IL1R15 TCF7L2 |
| title | Identification of core genes in intervertebral disc degeneration using bioinformatics and machine learning algorithms |
| title_full | Identification of core genes in intervertebral disc degeneration using bioinformatics and machine learning algorithms |
| title_fullStr | Identification of core genes in intervertebral disc degeneration using bioinformatics and machine learning algorithms |
| title_full_unstemmed | Identification of core genes in intervertebral disc degeneration using bioinformatics and machine learning algorithms |
| title_short | Identification of core genes in intervertebral disc degeneration using bioinformatics and machine learning algorithms |
| title_sort | identification of core genes in intervertebral disc degeneration using bioinformatics and machine learning algorithms |
| topic | intervertebral disc degeneratio1 bioinformatics machine learning genes IL1R15 TCF7L2 |
| url | https://www.frontiersin.org/articles/10.3389/fimmu.2024.1401957/full |
| work_keys_str_mv | AT haozhang identificationofcoregenesinintervertebraldiscdegenerationusingbioinformaticsandmachinelearningalgorithms AT shengboshi identificationofcoregenesinintervertebraldiscdegenerationusingbioinformaticsandmachinelearningalgorithms AT xingxinghuang identificationofcoregenesinintervertebraldiscdegenerationusingbioinformaticsandmachinelearningalgorithms AT changshenggong identificationofcoregenesinintervertebraldiscdegenerationusingbioinformaticsandmachinelearningalgorithms AT zijingzhang identificationofcoregenesinintervertebraldiscdegenerationusingbioinformaticsandmachinelearningalgorithms AT zetianzhao identificationofcoregenesinintervertebraldiscdegenerationusingbioinformaticsandmachinelearningalgorithms AT junxiaogao identificationofcoregenesinintervertebraldiscdegenerationusingbioinformaticsandmachinelearningalgorithms AT mengzhang identificationofcoregenesinintervertebraldiscdegenerationusingbioinformaticsandmachinelearningalgorithms AT xiaobingyu identificationofcoregenesinintervertebraldiscdegenerationusingbioinformaticsandmachinelearningalgorithms |
