Accuracy of CNV Detection from GWAS Data.

Several computer programs are available for detecting copy number variants (CNVs) using genome-wide SNP arrays. We evaluated the performance of four CNV detection software suites--Birdsuite, Partek, HelixTree, and PennCNV-Affy--in the identification of both rare and common CNVs. Each program's...

Full description

Bibliographic Details
Main Authors:	Dandan Zhang, Yudong Qian, Nirmala Akula, Ney Alliey-Rodriguez, Jinsong Tang, Bipolar Genome Study, Elliot S Gershon, Chunyu Liu
Format:	Article
Language:	English
Published:	Public Library of Science (PLoS) 2011-01-01
Series:	PLoS ONE
Online Access:	http://europepmc.org/articles/PMC3020939?pdf=render

id	doaj-3736a14edb9a4bd780030d74e4653f19
record_format	Article
spelling	doaj-3736a14edb9a4bd780030d74e4653f192020-11-24T21:35:12ZengPublic Library of Science (PLoS)PLoS ONE1932-62032011-01-0161e1451110.1371/journal.pone.0014511Accuracy of CNV Detection from GWAS Data.Dandan ZhangYudong QianNirmala AkulaNey Alliey-RodriguezJinsong TangBipolar Genome StudyElliot S GershonChunyu LiuSeveral computer programs are available for detecting copy number variants (CNVs) using genome-wide SNP arrays. We evaluated the performance of four CNV detection software suites--Birdsuite, Partek, HelixTree, and PennCNV-Affy--in the identification of both rare and common CNVs. Each program's performance was assessed in two ways. The first was its recovery rate, i.e., its ability to call 893 CNVs previously identified in eight HapMap samples by paired-end sequencing of whole-genome fosmid clones, and 51,440 CNVs identified by array Comparative Genome Hybridization (aCGH) followed by validation procedures, in 90 HapMap CEU samples. The second evaluation was program performance calling rare and common CNVs in the Bipolar Genome Study (BiGS) data set (1001 bipolar cases and 1033 controls, all of European ancestry) as measured by the Affymetrix SNP 6.0 array. Accuracy in calling rare CNVs was assessed by positive predictive value, based on the proportion of rare CNVs validated by quantitative real-time PCR (qPCR), while accuracy in calling common CNVs was assessed by false positive/false negative rates based on qPCR validation results from a subset of common CNVs. Birdsuite recovered the highest percentages of known HapMap CNVs containing >20 markers in two reference CNV datasets. The recovery rate increased with decreased CNV frequency. In the tested rare CNV data, Birdsuite and Partek had higher positive predictive values than the other software suites. In a test of three common CNVs in the BiGS dataset, Birdsuite's call was 98.8% consistent with qPCR quantification in one CNV region, but the other two regions showed an unacceptable degree of accuracy. We found relatively poor consistency between the two "gold standards," the sequence data of Kidd et al., and aCGH data of Conrad et al. Algorithms for calling CNVs especially common ones need substantial improvement, and a "gold standard" for detection of CNVs remains to be established.http://europepmc.org/articles/PMC3020939?pdf=render
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Dandan Zhang Yudong Qian Nirmala Akula Ney Alliey-Rodriguez Jinsong Tang Bipolar Genome Study Elliot S Gershon Chunyu Liu
spellingShingle	Dandan Zhang Yudong Qian Nirmala Akula Ney Alliey-Rodriguez Jinsong Tang Bipolar Genome Study Elliot S Gershon Chunyu Liu Accuracy of CNV Detection from GWAS Data. PLoS ONE
author_facet	Dandan Zhang Yudong Qian Nirmala Akula Ney Alliey-Rodriguez Jinsong Tang Bipolar Genome Study Elliot S Gershon Chunyu Liu
author_sort	Dandan Zhang
title	Accuracy of CNV Detection from GWAS Data.
title_short	Accuracy of CNV Detection from GWAS Data.
title_full	Accuracy of CNV Detection from GWAS Data.
title_fullStr	Accuracy of CNV Detection from GWAS Data.
title_full_unstemmed	Accuracy of CNV Detection from GWAS Data.
title_sort	accuracy of cnv detection from gwas data.
publisher	Public Library of Science (PLoS)
series	PLoS ONE
issn	1932-6203
publishDate	2011-01-01
description	Several computer programs are available for detecting copy number variants (CNVs) using genome-wide SNP arrays. We evaluated the performance of four CNV detection software suites--Birdsuite, Partek, HelixTree, and PennCNV-Affy--in the identification of both rare and common CNVs. Each program's performance was assessed in two ways. The first was its recovery rate, i.e., its ability to call 893 CNVs previously identified in eight HapMap samples by paired-end sequencing of whole-genome fosmid clones, and 51,440 CNVs identified by array Comparative Genome Hybridization (aCGH) followed by validation procedures, in 90 HapMap CEU samples. The second evaluation was program performance calling rare and common CNVs in the Bipolar Genome Study (BiGS) data set (1001 bipolar cases and 1033 controls, all of European ancestry) as measured by the Affymetrix SNP 6.0 array. Accuracy in calling rare CNVs was assessed by positive predictive value, based on the proportion of rare CNVs validated by quantitative real-time PCR (qPCR), while accuracy in calling common CNVs was assessed by false positive/false negative rates based on qPCR validation results from a subset of common CNVs. Birdsuite recovered the highest percentages of known HapMap CNVs containing >20 markers in two reference CNV datasets. The recovery rate increased with decreased CNV frequency. In the tested rare CNV data, Birdsuite and Partek had higher positive predictive values than the other software suites. In a test of three common CNVs in the BiGS dataset, Birdsuite's call was 98.8% consistent with qPCR quantification in one CNV region, but the other two regions showed an unacceptable degree of accuracy. We found relatively poor consistency between the two "gold standards," the sequence data of Kidd et al., and aCGH data of Conrad et al. Algorithms for calling CNVs especially common ones need substantial improvement, and a "gold standard" for detection of CNVs remains to be established.
url	http://europepmc.org/articles/PMC3020939?pdf=render
work_keys_str_mv	AT dandanzhang accuracyofcnvdetectionfromgwasdata AT yudongqian accuracyofcnvdetectionfromgwasdata AT nirmalaakula accuracyofcnvdetectionfromgwasdata AT neyallieyrodriguez accuracyofcnvdetectionfromgwasdata AT jinsongtang accuracyofcnvdetectionfromgwasdata AT bipolargenomestudy accuracyofcnvdetectionfromgwasdata AT elliotsgershon accuracyofcnvdetectionfromgwasdata AT chunyuliu accuracyofcnvdetectionfromgwasdata
_version_	1725946123211767808

Accuracy of CNV Detection from GWAS Data.

Similar Items