The Human Blood Transcriptome in a Large Population Cohort and Its Relation to Aging and Health

Background: The blood transcriptome is expected to provide a detailed picture of an organism's physiological state with potential outcomes for applications in medical diagnostics and molecular and epidemiological research. We here present the analysis of blood specimens of 3,388 adult individua...

Full description

Bibliographic Details
Main Authors: Maria Schmidt, Lydia Hopp, Arsen Arakelyan, Holger Kirsten, Christoph Engel, Kerstin Wirkner, Knut Krohn, Ralph Burkhardt, Joachim Thiery, Markus Loeffler, Henry Loeffler-Wirth, Hans Binder
Format: Article
Language:English
Published: Frontiers Media S.A. 2020-10-01
Series:Frontiers in Big Data
Subjects:
age
Online Access:https://www.frontiersin.org/articles/10.3389/fdata.2020.548873/full
id doaj-f74d5d749ea04c20b9586801a0143196
record_format Article
collection DOAJ
language English
format Article
sources DOAJ
author Maria Schmidt
Lydia Hopp
Arsen Arakelyan
Holger Kirsten
Holger Kirsten
Christoph Engel
Christoph Engel
Kerstin Wirkner
Kerstin Wirkner
Knut Krohn
Knut Krohn
Ralph Burkhardt
Ralph Burkhardt
Joachim Thiery
Joachim Thiery
Markus Loeffler
Markus Loeffler
Markus Loeffler
Henry Loeffler-Wirth
Hans Binder
Hans Binder
spellingShingle Maria Schmidt
Lydia Hopp
Arsen Arakelyan
Holger Kirsten
Holger Kirsten
Christoph Engel
Christoph Engel
Kerstin Wirkner
Kerstin Wirkner
Knut Krohn
Knut Krohn
Ralph Burkhardt
Ralph Burkhardt
Joachim Thiery
Joachim Thiery
Markus Loeffler
Markus Loeffler
Markus Loeffler
Henry Loeffler-Wirth
Hans Binder
Hans Binder
The Human Blood Transcriptome in a Large Population Cohort and Its Relation to Aging and Health
Frontiers in Big Data
self-organizing maps
omics and phenotype integration
age
lifestyle and obesity
gene expression
immune response
author_facet Maria Schmidt
Lydia Hopp
Arsen Arakelyan
Holger Kirsten
Holger Kirsten
Christoph Engel
Christoph Engel
Kerstin Wirkner
Kerstin Wirkner
Knut Krohn
Knut Krohn
Ralph Burkhardt
Ralph Burkhardt
Joachim Thiery
Joachim Thiery
Markus Loeffler
Markus Loeffler
Markus Loeffler
Henry Loeffler-Wirth
Hans Binder
Hans Binder
author_sort Maria Schmidt
title The Human Blood Transcriptome in a Large Population Cohort and Its Relation to Aging and Health
title_short The Human Blood Transcriptome in a Large Population Cohort and Its Relation to Aging and Health
title_full The Human Blood Transcriptome in a Large Population Cohort and Its Relation to Aging and Health
title_fullStr The Human Blood Transcriptome in a Large Population Cohort and Its Relation to Aging and Health
title_full_unstemmed The Human Blood Transcriptome in a Large Population Cohort and Its Relation to Aging and Health
title_sort human blood transcriptome in a large population cohort and its relation to aging and health
publisher Frontiers Media S.A.
series Frontiers in Big Data
issn 2624-909X
publishDate 2020-10-01
description Background: The blood transcriptome is expected to provide a detailed picture of an organism's physiological state with potential outcomes for applications in medical diagnostics and molecular and epidemiological research. We here present the analysis of blood specimens of 3,388 adult individuals, together with phenotype characteristics such as disease history, medication status, lifestyle factors, and body mass index (BMI). The size and heterogeneity of this data challenges analytics in terms of dimension reduction, knowledge mining, feature extraction, and data integration.Methods: Self-organizing maps (SOM)-machine learning was applied to study transcriptional states on a population-wide scale. This method permits a detailed description and visualization of the molecular heterogeneity of transcriptomes and of their association with different phenotypic features.Results: The diversity of transcriptomes is described by personalized SOM-portraits, which specify the samples in terms of modules of co-expressed genes of different functional context. We identified two major blood transcriptome types where type 1 was found more in men, the elderly, and overweight people and it upregulated genes associated with inflammation and increased heme metabolism, while type 2 was predominantly found in women, younger, and normal weight participants and it was associated with activated immune responses, transcriptional, ribosomal, mitochondrial, and telomere-maintenance cell-functions. We find a striking overlap of signatures shared by multiple diseases, aging, and obesity driven by an underlying common pattern, which was associated with the immune response and the increase of inflammatory processes.Conclusions: Machine learning applications for large and heterogeneous omics data provide a holistic view on the diversity of the human blood transcriptome. It provides a tool for comparative analyses of transcriptional signatures and of associated phenotypes in population studies and medical applications.
topic self-organizing maps
omics and phenotype integration
age
lifestyle and obesity
gene expression
immune response
url https://www.frontiersin.org/articles/10.3389/fdata.2020.548873/full
work_keys_str_mv AT mariaschmidt thehumanbloodtranscriptomeinalargepopulationcohortanditsrelationtoagingandhealth
AT lydiahopp thehumanbloodtranscriptomeinalargepopulationcohortanditsrelationtoagingandhealth
AT arsenarakelyan thehumanbloodtranscriptomeinalargepopulationcohortanditsrelationtoagingandhealth
AT holgerkirsten thehumanbloodtranscriptomeinalargepopulationcohortanditsrelationtoagingandhealth
AT holgerkirsten thehumanbloodtranscriptomeinalargepopulationcohortanditsrelationtoagingandhealth
AT christophengel thehumanbloodtranscriptomeinalargepopulationcohortanditsrelationtoagingandhealth
AT christophengel thehumanbloodtranscriptomeinalargepopulationcohortanditsrelationtoagingandhealth
AT kerstinwirkner thehumanbloodtranscriptomeinalargepopulationcohortanditsrelationtoagingandhealth
AT kerstinwirkner thehumanbloodtranscriptomeinalargepopulationcohortanditsrelationtoagingandhealth
AT knutkrohn thehumanbloodtranscriptomeinalargepopulationcohortanditsrelationtoagingandhealth
AT knutkrohn thehumanbloodtranscriptomeinalargepopulationcohortanditsrelationtoagingandhealth
AT ralphburkhardt thehumanbloodtranscriptomeinalargepopulationcohortanditsrelationtoagingandhealth
AT ralphburkhardt thehumanbloodtranscriptomeinalargepopulationcohortanditsrelationtoagingandhealth
AT joachimthiery thehumanbloodtranscriptomeinalargepopulationcohortanditsrelationtoagingandhealth
AT joachimthiery thehumanbloodtranscriptomeinalargepopulationcohortanditsrelationtoagingandhealth
AT markusloeffler thehumanbloodtranscriptomeinalargepopulationcohortanditsrelationtoagingandhealth
AT markusloeffler thehumanbloodtranscriptomeinalargepopulationcohortanditsrelationtoagingandhealth
AT markusloeffler thehumanbloodtranscriptomeinalargepopulationcohortanditsrelationtoagingandhealth
AT henryloefflerwirth thehumanbloodtranscriptomeinalargepopulationcohortanditsrelationtoagingandhealth
AT hansbinder thehumanbloodtranscriptomeinalargepopulationcohortanditsrelationtoagingandhealth
AT hansbinder thehumanbloodtranscriptomeinalargepopulationcohortanditsrelationtoagingandhealth
AT mariaschmidt humanbloodtranscriptomeinalargepopulationcohortanditsrelationtoagingandhealth
AT lydiahopp humanbloodtranscriptomeinalargepopulationcohortanditsrelationtoagingandhealth
AT arsenarakelyan humanbloodtranscriptomeinalargepopulationcohortanditsrelationtoagingandhealth
AT holgerkirsten humanbloodtranscriptomeinalargepopulationcohortanditsrelationtoagingandhealth
AT holgerkirsten humanbloodtranscriptomeinalargepopulationcohortanditsrelationtoagingandhealth
AT christophengel humanbloodtranscriptomeinalargepopulationcohortanditsrelationtoagingandhealth
AT christophengel humanbloodtranscriptomeinalargepopulationcohortanditsrelationtoagingandhealth
AT kerstinwirkner humanbloodtranscriptomeinalargepopulationcohortanditsrelationtoagingandhealth
AT kerstinwirkner humanbloodtranscriptomeinalargepopulationcohortanditsrelationtoagingandhealth
AT knutkrohn humanbloodtranscriptomeinalargepopulationcohortanditsrelationtoagingandhealth
AT knutkrohn humanbloodtranscriptomeinalargepopulationcohortanditsrelationtoagingandhealth
AT ralphburkhardt humanbloodtranscriptomeinalargepopulationcohortanditsrelationtoagingandhealth
AT ralphburkhardt humanbloodtranscriptomeinalargepopulationcohortanditsrelationtoagingandhealth
AT joachimthiery humanbloodtranscriptomeinalargepopulationcohortanditsrelationtoagingandhealth
AT joachimthiery humanbloodtranscriptomeinalargepopulationcohortanditsrelationtoagingandhealth
AT markusloeffler humanbloodtranscriptomeinalargepopulationcohortanditsrelationtoagingandhealth
AT markusloeffler humanbloodtranscriptomeinalargepopulationcohortanditsrelationtoagingandhealth
AT markusloeffler humanbloodtranscriptomeinalargepopulationcohortanditsrelationtoagingandhealth
AT henryloefflerwirth humanbloodtranscriptomeinalargepopulationcohortanditsrelationtoagingandhealth
AT hansbinder humanbloodtranscriptomeinalargepopulationcohortanditsrelationtoagingandhealth
AT hansbinder humanbloodtranscriptomeinalargepopulationcohortanditsrelationtoagingandhealth
_version_ 1724448310048063488
spelling doaj-f74d5d749ea04c20b9586801a01431962020-11-25T04:00:58ZengFrontiers Media S.A.Frontiers in Big Data2624-909X2020-10-01310.3389/fdata.2020.548873548873The Human Blood Transcriptome in a Large Population Cohort and Its Relation to Aging and HealthMaria Schmidt0Lydia Hopp1Arsen Arakelyan2Holger Kirsten3Holger Kirsten4Christoph Engel5Christoph Engel6Kerstin Wirkner7Kerstin Wirkner8Knut Krohn9Knut Krohn10Ralph Burkhardt11Ralph Burkhardt12Joachim Thiery13Joachim Thiery14Markus Loeffler15Markus Loeffler16Markus Loeffler17Henry Loeffler-Wirth18Hans Binder19Hans Binder20IZBI, Interdisciplinary Centre for Bioinformatics, Universität Leipzig, Leipzig, GermanyIZBI, Interdisciplinary Centre for Bioinformatics, Universität Leipzig, Leipzig, GermanyBIG, Group of Bioinformatics, Institute of Molecular Biology, National Academy of Sciences, Yerevan, ArmeniaIMISE, Institute for Medical Informatics, Statistics and Epidemiology, University of Leipzig, Leipzig, GermanyLeipzig Research Centre for Civilization Diseases, University of Leipzig, Leipzig, GermanyIMISE, Institute for Medical Informatics, Statistics and Epidemiology, University of Leipzig, Leipzig, GermanyLeipzig Research Centre for Civilization Diseases, University of Leipzig, Leipzig, GermanyIMISE, Institute for Medical Informatics, Statistics and Epidemiology, University of Leipzig, Leipzig, GermanyLeipzig Research Centre for Civilization Diseases, University of Leipzig, Leipzig, GermanyLeipzig Research Centre for Civilization Diseases, University of Leipzig, Leipzig, GermanyInstitute of Laboratory Medicine, Clinical Chemistry and Molecular Diagnostics, University of Leipzig, Leipzig, GermanyLeipzig Research Centre for Civilization Diseases, University of Leipzig, Leipzig, GermanyInstitute of Laboratory Medicine, Clinical Chemistry and Molecular Diagnostics, University of Leipzig, Leipzig, GermanyLeipzig Research Centre for Civilization Diseases, University of Leipzig, Leipzig, GermanyInstitute of Laboratory Medicine, Clinical Chemistry and Molecular Diagnostics, University of Leipzig, Leipzig, GermanyIZBI, Interdisciplinary Centre for Bioinformatics, Universität Leipzig, Leipzig, GermanyIMISE, Institute for Medical Informatics, Statistics and Epidemiology, University of Leipzig, Leipzig, GermanyLeipzig Research Centre for Civilization Diseases, University of Leipzig, Leipzig, GermanyIZBI, Interdisciplinary Centre for Bioinformatics, Universität Leipzig, Leipzig, GermanyIZBI, Interdisciplinary Centre for Bioinformatics, Universität Leipzig, Leipzig, GermanyLeipzig Research Centre for Civilization Diseases, University of Leipzig, Leipzig, GermanyBackground: The blood transcriptome is expected to provide a detailed picture of an organism's physiological state with potential outcomes for applications in medical diagnostics and molecular and epidemiological research. We here present the analysis of blood specimens of 3,388 adult individuals, together with phenotype characteristics such as disease history, medication status, lifestyle factors, and body mass index (BMI). The size and heterogeneity of this data challenges analytics in terms of dimension reduction, knowledge mining, feature extraction, and data integration.Methods: Self-organizing maps (SOM)-machine learning was applied to study transcriptional states on a population-wide scale. This method permits a detailed description and visualization of the molecular heterogeneity of transcriptomes and of their association with different phenotypic features.Results: The diversity of transcriptomes is described by personalized SOM-portraits, which specify the samples in terms of modules of co-expressed genes of different functional context. We identified two major blood transcriptome types where type 1 was found more in men, the elderly, and overweight people and it upregulated genes associated with inflammation and increased heme metabolism, while type 2 was predominantly found in women, younger, and normal weight participants and it was associated with activated immune responses, transcriptional, ribosomal, mitochondrial, and telomere-maintenance cell-functions. We find a striking overlap of signatures shared by multiple diseases, aging, and obesity driven by an underlying common pattern, which was associated with the immune response and the increase of inflammatory processes.Conclusions: Machine learning applications for large and heterogeneous omics data provide a holistic view on the diversity of the human blood transcriptome. It provides a tool for comparative analyses of transcriptional signatures and of associated phenotypes in population studies and medical applications.https://www.frontiersin.org/articles/10.3389/fdata.2020.548873/fullself-organizing mapsomics and phenotype integrationagelifestyle and obesitygene expressionimmune response