Non-linear mapping for exploratory data analysis in functional genomics

<p>Abstract</p> <p>Background</p> <p>Several supervised and unsupervised learning tools are available to classify functional genomics data. However, relatively less attention has been given to exploratory, visualisation-driven approaches. Such approaches should satisfy...

Full description

Bibliographic Details
Main Authors: Chesneau Alban, Wang Haiying, Azuaje Francisco
Format: Article
Language:English
Published: BMC 2005-01-01
Series:BMC Bioinformatics
Online Access:http://www.biomedcentral.com/1471-2105/6/13
id doaj-9e904b0db80046cf9c4ac4fc909835f8
record_format Article
spelling doaj-9e904b0db80046cf9c4ac4fc909835f82020-11-24T21:15:34ZengBMCBMC Bioinformatics1471-21052005-01-01611310.1186/1471-2105-6-13Non-linear mapping for exploratory data analysis in functional genomicsChesneau AlbanWang HaiyingAzuaje Francisco<p>Abstract</p> <p>Background</p> <p>Several supervised and unsupervised learning tools are available to classify functional genomics data. However, relatively less attention has been given to exploratory, visualisation-driven approaches. Such approaches should satisfy the following factors: Support for intuitive cluster visualisation, user-friendly and robust application, computational efficiency and generation of biologically meaningful outcomes. This research assesses a relaxation method for non-linear mapping that addresses these concerns. Its applications to gene expression and protein-protein interaction data analyses are investigated</p> <p>Results</p> <p>Publicly available expression data originating from leukaemia, round blue-cell tumours and Parkinson disease studies were analysed. The method distinguished relevant clusters and critical analysis areas. The system does not require assumptions about the inherent class structure of the data, its mapping process is controlled by only one parameter and the resulting transformations offer intuitive, meaningful visual displays. Comparisons with traditional mapping models are presented. As a way of promoting potential, alternative applications of the methodology presented, an example of exploratory data analysis of interactome networks is illustrated. Data from the <it>C. elegans </it>interactome were analysed. Results suggest that this method might represent an effective solution for detecting key network hubs and for clustering biologically meaningful groups of proteins.</p> <p>Conclusion</p> <p>A relaxation method for non-linear mapping provided the basis for visualisation-driven analyses using different types of data. This study indicates that such a system may represent a user-friendly and robust approach to exploratory data analysis. It may allow users to gain better insights into the underlying data structure, detect potential outliers and assess assumptions about the cluster composition of the data.</p> http://www.biomedcentral.com/1471-2105/6/13
collection DOAJ
language English
format Article
sources DOAJ
author Chesneau Alban
Wang Haiying
Azuaje Francisco
spellingShingle Chesneau Alban
Wang Haiying
Azuaje Francisco
Non-linear mapping for exploratory data analysis in functional genomics
BMC Bioinformatics
author_facet Chesneau Alban
Wang Haiying
Azuaje Francisco
author_sort Chesneau Alban
title Non-linear mapping for exploratory data analysis in functional genomics
title_short Non-linear mapping for exploratory data analysis in functional genomics
title_full Non-linear mapping for exploratory data analysis in functional genomics
title_fullStr Non-linear mapping for exploratory data analysis in functional genomics
title_full_unstemmed Non-linear mapping for exploratory data analysis in functional genomics
title_sort non-linear mapping for exploratory data analysis in functional genomics
publisher BMC
series BMC Bioinformatics
issn 1471-2105
publishDate 2005-01-01
description <p>Abstract</p> <p>Background</p> <p>Several supervised and unsupervised learning tools are available to classify functional genomics data. However, relatively less attention has been given to exploratory, visualisation-driven approaches. Such approaches should satisfy the following factors: Support for intuitive cluster visualisation, user-friendly and robust application, computational efficiency and generation of biologically meaningful outcomes. This research assesses a relaxation method for non-linear mapping that addresses these concerns. Its applications to gene expression and protein-protein interaction data analyses are investigated</p> <p>Results</p> <p>Publicly available expression data originating from leukaemia, round blue-cell tumours and Parkinson disease studies were analysed. The method distinguished relevant clusters and critical analysis areas. The system does not require assumptions about the inherent class structure of the data, its mapping process is controlled by only one parameter and the resulting transformations offer intuitive, meaningful visual displays. Comparisons with traditional mapping models are presented. As a way of promoting potential, alternative applications of the methodology presented, an example of exploratory data analysis of interactome networks is illustrated. Data from the <it>C. elegans </it>interactome were analysed. Results suggest that this method might represent an effective solution for detecting key network hubs and for clustering biologically meaningful groups of proteins.</p> <p>Conclusion</p> <p>A relaxation method for non-linear mapping provided the basis for visualisation-driven analyses using different types of data. This study indicates that such a system may represent a user-friendly and robust approach to exploratory data analysis. It may allow users to gain better insights into the underlying data structure, detect potential outliers and assess assumptions about the cluster composition of the data.</p>
url http://www.biomedcentral.com/1471-2105/6/13
work_keys_str_mv AT chesneaualban nonlinearmappingforexploratorydataanalysisinfunctionalgenomics
AT wanghaiying nonlinearmappingforexploratorydataanalysisinfunctionalgenomics
AT azuajefrancisco nonlinearmappingforexploratorydataanalysisinfunctionalgenomics
_version_ 1716744797101752320