Array CGH Phylogeny: How accurate are Comparative Genomic Hybridization-based trees?

<p>Abstract</p> <p>Background</p> <p>Array-based Comparative Genomic Hybridization (CGH) data have been used to infer phylogenetic relationships. However, the reliability of array CGH analysis to determine evolutionary relationships has not been well established. In mos...

Full description

Bibliographic Details
Main Authors: Glass N Louise, Kasuga Takao, Gilbert Luz B, Taylor John W
Format: Article
Language:English
Published: BMC 2011-10-01
Series:BMC Genomics
Online Access:http://www.biomedcentral.com/1471-2164/12/487
id doaj-f810db4941a54e5ab72f03485cc27c39
record_format Article
spelling doaj-f810db4941a54e5ab72f03485cc27c392020-11-25T00:13:46ZengBMCBMC Genomics1471-21642011-10-0112148710.1186/1471-2164-12-487Array CGH Phylogeny: How accurate are Comparative Genomic Hybridization-based trees?Glass N LouiseKasuga TakaoGilbert Luz BTaylor John W<p>Abstract</p> <p>Background</p> <p>Array-based Comparative Genomic Hybridization (CGH) data have been used to infer phylogenetic relationships. However, the reliability of array CGH analysis to determine evolutionary relationships has not been well established. In most CGH work, all species and strains are compared to a single reference species, whose genome was used to design the array. In the accompanying work, we critically evaluated CGH-based phylogeny using simulated competitive hybridization data. This work showed that a limited number of conditions, principally the tree topology and placement of the reference taxon in the tree, had a strong effect on the ability to recover the correct tree topology. Here, we add to our simulation study by testing the use of CGH as a phylogenetic tool with experimental CGH data from competitive hybridizations between <it>N. crassa </it>and other <it>Neurospora </it>species. In the discussion, we add to our empirical study of <it>Neurospora </it>by reanalyzing of data from a previous CGH phylogenetic analysis of the yeast <it>sensu stricto </it>complex.</p> <p>Results</p> <p>Array ratio data for <it>Neurospora </it>and related species were normalized with loess, robust spline, and linear ratio based methods, and then used to construct Neighbor-Joining and parsimony trees. These trees were compared to published phylogenetic analyses for <it>Neurospora </it>based on multilocus sequence analysis (MLSA). For the <it>Neurospora </it>dataset, the best combination of methods resulted in recovery of the MLSA tree topology less than half the time. Our reanalysis of a yeast dataset found that trees identical to established phylogeny were recovered only by pruning taxa - including the reference taxon - from the analysis.</p> <p>Conclusion</p> <p>Our results indicate that CGH data can be problematic for phylogenetic analysis. Success fluctuates based on the methods utilized to construct the tree and the taxa included. Selective pruning of the taxa improves the results - an impractical approach for normal phylogenetic analysis. From the more successful methods we make suggestions on the normalization and post-normalization methods that work best in estimating genetic distance between taxa.</p> http://www.biomedcentral.com/1471-2164/12/487
collection DOAJ
language English
format Article
sources DOAJ
author Glass N Louise
Kasuga Takao
Gilbert Luz B
Taylor John W
spellingShingle Glass N Louise
Kasuga Takao
Gilbert Luz B
Taylor John W
Array CGH Phylogeny: How accurate are Comparative Genomic Hybridization-based trees?
BMC Genomics
author_facet Glass N Louise
Kasuga Takao
Gilbert Luz B
Taylor John W
author_sort Glass N Louise
title Array CGH Phylogeny: How accurate are Comparative Genomic Hybridization-based trees?
title_short Array CGH Phylogeny: How accurate are Comparative Genomic Hybridization-based trees?
title_full Array CGH Phylogeny: How accurate are Comparative Genomic Hybridization-based trees?
title_fullStr Array CGH Phylogeny: How accurate are Comparative Genomic Hybridization-based trees?
title_full_unstemmed Array CGH Phylogeny: How accurate are Comparative Genomic Hybridization-based trees?
title_sort array cgh phylogeny: how accurate are comparative genomic hybridization-based trees?
publisher BMC
series BMC Genomics
issn 1471-2164
publishDate 2011-10-01
description <p>Abstract</p> <p>Background</p> <p>Array-based Comparative Genomic Hybridization (CGH) data have been used to infer phylogenetic relationships. However, the reliability of array CGH analysis to determine evolutionary relationships has not been well established. In most CGH work, all species and strains are compared to a single reference species, whose genome was used to design the array. In the accompanying work, we critically evaluated CGH-based phylogeny using simulated competitive hybridization data. This work showed that a limited number of conditions, principally the tree topology and placement of the reference taxon in the tree, had a strong effect on the ability to recover the correct tree topology. Here, we add to our simulation study by testing the use of CGH as a phylogenetic tool with experimental CGH data from competitive hybridizations between <it>N. crassa </it>and other <it>Neurospora </it>species. In the discussion, we add to our empirical study of <it>Neurospora </it>by reanalyzing of data from a previous CGH phylogenetic analysis of the yeast <it>sensu stricto </it>complex.</p> <p>Results</p> <p>Array ratio data for <it>Neurospora </it>and related species were normalized with loess, robust spline, and linear ratio based methods, and then used to construct Neighbor-Joining and parsimony trees. These trees were compared to published phylogenetic analyses for <it>Neurospora </it>based on multilocus sequence analysis (MLSA). For the <it>Neurospora </it>dataset, the best combination of methods resulted in recovery of the MLSA tree topology less than half the time. Our reanalysis of a yeast dataset found that trees identical to established phylogeny were recovered only by pruning taxa - including the reference taxon - from the analysis.</p> <p>Conclusion</p> <p>Our results indicate that CGH data can be problematic for phylogenetic analysis. Success fluctuates based on the methods utilized to construct the tree and the taxa included. Selective pruning of the taxa improves the results - an impractical approach for normal phylogenetic analysis. From the more successful methods we make suggestions on the normalization and post-normalization methods that work best in estimating genetic distance between taxa.</p>
url http://www.biomedcentral.com/1471-2164/12/487
work_keys_str_mv AT glassnlouise arraycghphylogenyhowaccuratearecomparativegenomichybridizationbasedtrees
AT kasugatakao arraycghphylogenyhowaccuratearecomparativegenomichybridizationbasedtrees
AT gilbertluzb arraycghphylogenyhowaccuratearecomparativegenomichybridizationbasedtrees
AT taylorjohnw arraycghphylogenyhowaccuratearecomparativegenomichybridizationbasedtrees
_version_ 1725393184251445248