A statistical approach for array CGH data analysis

<p>Abstract</p> <p>Background</p> <p>Microarray-CGH experiments are used to detect and map chromosomal imbalances, by hybridizing targets of genomic DNA from a test and a reference sample to sequences immobilized on a slide. These probes are genomic DNA sequences (BACs)...

Full description

Bibliographic Details
Main Authors: Vaisse Christian, Lavielle Marc, Robin Stephane, Picard Franck, Daudin Jean-Jacques
Format: Article
Language:English
Published: BMC 2005-02-01
Series:BMC Bioinformatics
Online Access:http://www.biomedcentral.com/1471-2105/6/27
id doaj-f612e3e9ebc8461e814bc610bf47df6f
record_format Article
spelling doaj-f612e3e9ebc8461e814bc610bf47df6f2020-11-25T02:43:19ZengBMCBMC Bioinformatics1471-21052005-02-01612710.1186/1471-2105-6-27A statistical approach for array CGH data analysisVaisse ChristianLavielle MarcRobin StephanePicard FranckDaudin Jean-Jacques<p>Abstract</p> <p>Background</p> <p>Microarray-CGH experiments are used to detect and map chromosomal imbalances, by hybridizing targets of genomic DNA from a test and a reference sample to sequences immobilized on a slide. These probes are genomic DNA sequences (BACs) that are mapped on the genome. The signal has a spatial coherence that can be handled by specific statistical tools. Segmentation methods seem to be a natural framework for this purpose. A CGH profile can be viewed as a succession of segments that represent homogeneous regions in the genome whose BACs share the same relative copy number on average. We model a CGH profile by a random Gaussian process whose distribution parameters are affected by abrupt changes at unknown coordinates. Two major problems arise : to determine which parameters are affected by the abrupt changes (the mean and the variance, or the mean only), and the selection of the number of segments in the profile.</p> <p>Results</p> <p>We demonstrate that existing methods for estimating the number of segments are not well adapted in the case of array CGH data, and we propose an adaptive criterion that detects previously mapped chromosomal aberrations. The performances of this method are discussed based on simulations and publicly available data sets. Then we discuss the choice of modeling for array CGH data and show that the model with a homogeneous variance is adapted to this context.</p> <p>Conclusions</p> <p>Array CGH data analysis is an emerging field that needs appropriate statistical tools. Process segmentation and model selection provide a theoretical framework that allows precise biological interpretations. Adaptive methods for model selection give promising results concerning the estimation of the number of altered regions on the genome.</p> http://www.biomedcentral.com/1471-2105/6/27
collection DOAJ
language English
format Article
sources DOAJ
author Vaisse Christian
Lavielle Marc
Robin Stephane
Picard Franck
Daudin Jean-Jacques
spellingShingle Vaisse Christian
Lavielle Marc
Robin Stephane
Picard Franck
Daudin Jean-Jacques
A statistical approach for array CGH data analysis
BMC Bioinformatics
author_facet Vaisse Christian
Lavielle Marc
Robin Stephane
Picard Franck
Daudin Jean-Jacques
author_sort Vaisse Christian
title A statistical approach for array CGH data analysis
title_short A statistical approach for array CGH data analysis
title_full A statistical approach for array CGH data analysis
title_fullStr A statistical approach for array CGH data analysis
title_full_unstemmed A statistical approach for array CGH data analysis
title_sort statistical approach for array cgh data analysis
publisher BMC
series BMC Bioinformatics
issn 1471-2105
publishDate 2005-02-01
description <p>Abstract</p> <p>Background</p> <p>Microarray-CGH experiments are used to detect and map chromosomal imbalances, by hybridizing targets of genomic DNA from a test and a reference sample to sequences immobilized on a slide. These probes are genomic DNA sequences (BACs) that are mapped on the genome. The signal has a spatial coherence that can be handled by specific statistical tools. Segmentation methods seem to be a natural framework for this purpose. A CGH profile can be viewed as a succession of segments that represent homogeneous regions in the genome whose BACs share the same relative copy number on average. We model a CGH profile by a random Gaussian process whose distribution parameters are affected by abrupt changes at unknown coordinates. Two major problems arise : to determine which parameters are affected by the abrupt changes (the mean and the variance, or the mean only), and the selection of the number of segments in the profile.</p> <p>Results</p> <p>We demonstrate that existing methods for estimating the number of segments are not well adapted in the case of array CGH data, and we propose an adaptive criterion that detects previously mapped chromosomal aberrations. The performances of this method are discussed based on simulations and publicly available data sets. Then we discuss the choice of modeling for array CGH data and show that the model with a homogeneous variance is adapted to this context.</p> <p>Conclusions</p> <p>Array CGH data analysis is an emerging field that needs appropriate statistical tools. Process segmentation and model selection provide a theoretical framework that allows precise biological interpretations. Adaptive methods for model selection give promising results concerning the estimation of the number of altered regions on the genome.</p>
url http://www.biomedcentral.com/1471-2105/6/27
work_keys_str_mv AT vaissechristian astatisticalapproachforarraycghdataanalysis
AT laviellemarc astatisticalapproachforarraycghdataanalysis
AT robinstephane astatisticalapproachforarraycghdataanalysis
AT picardfranck astatisticalapproachforarraycghdataanalysis
AT daudinjeanjacques astatisticalapproachforarraycghdataanalysis
AT vaissechristian statisticalapproachforarraycghdataanalysis
AT laviellemarc statisticalapproachforarraycghdataanalysis
AT robinstephane statisticalapproachforarraycghdataanalysis
AT picardfranck statisticalapproachforarraycghdataanalysis
AT daudinjeanjacques statisticalapproachforarraycghdataanalysis
_version_ 1724770101328084992