Regularization and grouping -omics data by GCA method: A transcriptomic case.

The paper presents the application of Grade Correspondence Analysis (GCA) and Grade Correspondence Cluster Analysis (GCCA) for ordering and grouping -omics datasets, using transcriptomic data as an example. Based on gene expression data describing 256 patients with Multiple Myeloma it was shown that...

Full description

Bibliographic Details
Main Authors:	Monika Piwowar, Kinga A Kocemba-Pilarczyk, Piotr Piwowar
Format:	Article
Language:	English
Published:	Public Library of Science (PLoS) 2018-01-01
Series:	PLoS ONE
Online Access:	http://europepmc.org/articles/PMC6211732?pdf=render

id	doaj-4bbc2887b55e4ebb80d31391751c819e
record_format	Article
spelling	doaj-4bbc2887b55e4ebb80d31391751c819e2020-11-25T01:26:05ZengPublic Library of Science (PLoS)PLoS ONE1932-62032018-01-011311e020660810.1371/journal.pone.0206608Regularization and grouping -omics data by GCA method: A transcriptomic case.Monika PiwowarKinga A Kocemba-PilarczykPiotr PiwowarThe paper presents the application of Grade Correspondence Analysis (GCA) and Grade Correspondence Cluster Analysis (GCCA) for ordering and grouping -omics datasets, using transcriptomic data as an example. Based on gene expression data describing 256 patients with Multiple Myeloma it was shown that the GCA method could be used to find regularities in the analyzed collections and to create characteristic gene expression profiles for individual groups of patients. GCA iteratively permutes rows and columns to maximize the tau-Kendall or rho-Spearman coefficients, which makes it possible to arrange rows and columns in such a way that the most similar ones remain in each other's neighbourhood. In this way, the GCA algorithm highlights regularities in the data matrix. The ranked data can then be grouped using the GCCA method, and after that aggregated in clusters, providing a representation that is easier to analyze-especially in the case of large sets of gene expression profiles. Regularization of transcriptomic data, which is presented in this manuscript, has enabled division of the data set into column clusters (representing genes) and row clusters (representing patients). Subsequently, rows were aggregated (based on medians) to visualise the gene expression profiles for patients with Multiple Myeloma in each collection. The presented analysis became the starting point for characterisation of differentiated genes and biochemical processes in which they are involved. GCA analysis may provide an alternative analytical method to support differentiation and analysis of gene expression profiles characterising individual groups of patients.http://europepmc.org/articles/PMC6211732?pdf=render
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Monika Piwowar Kinga A Kocemba-Pilarczyk Piotr Piwowar
spellingShingle	Monika Piwowar Kinga A Kocemba-Pilarczyk Piotr Piwowar Regularization and grouping -omics data by GCA method: A transcriptomic case. PLoS ONE
author_facet	Monika Piwowar Kinga A Kocemba-Pilarczyk Piotr Piwowar
author_sort	Monika Piwowar
title	Regularization and grouping -omics data by GCA method: A transcriptomic case.
title_short	Regularization and grouping -omics data by GCA method: A transcriptomic case.
title_full	Regularization and grouping -omics data by GCA method: A transcriptomic case.
title_fullStr	Regularization and grouping -omics data by GCA method: A transcriptomic case.
title_full_unstemmed	Regularization and grouping -omics data by GCA method: A transcriptomic case.
title_sort	regularization and grouping -omics data by gca method: a transcriptomic case.
publisher	Public Library of Science (PLoS)
series	PLoS ONE
issn	1932-6203
publishDate	2018-01-01
description	The paper presents the application of Grade Correspondence Analysis (GCA) and Grade Correspondence Cluster Analysis (GCCA) for ordering and grouping -omics datasets, using transcriptomic data as an example. Based on gene expression data describing 256 patients with Multiple Myeloma it was shown that the GCA method could be used to find regularities in the analyzed collections and to create characteristic gene expression profiles for individual groups of patients. GCA iteratively permutes rows and columns to maximize the tau-Kendall or rho-Spearman coefficients, which makes it possible to arrange rows and columns in such a way that the most similar ones remain in each other's neighbourhood. In this way, the GCA algorithm highlights regularities in the data matrix. The ranked data can then be grouped using the GCCA method, and after that aggregated in clusters, providing a representation that is easier to analyze-especially in the case of large sets of gene expression profiles. Regularization of transcriptomic data, which is presented in this manuscript, has enabled division of the data set into column clusters (representing genes) and row clusters (representing patients). Subsequently, rows were aggregated (based on medians) to visualise the gene expression profiles for patients with Multiple Myeloma in each collection. The presented analysis became the starting point for characterisation of differentiated genes and biochemical processes in which they are involved. GCA analysis may provide an alternative analytical method to support differentiation and analysis of gene expression profiles characterising individual groups of patients.
url	http://europepmc.org/articles/PMC6211732?pdf=render
work_keys_str_mv	AT monikapiwowar regularizationandgroupingomicsdatabygcamethodatranscriptomiccase AT kingaakocembapilarczyk regularizationandgroupingomicsdatabygcamethodatranscriptomiccase AT piotrpiwowar regularizationandgroupingomicsdatabygcamethodatranscriptomiccase
_version_	1725110846630133760

Regularization and grouping -omics data by GCA method: A transcriptomic case.

Similar Items