The diploid genome sequence of an individual human.

Presented here is a genome sequence of an individual human. It was produced from approximately 32 million random DNA fragments, sequenced by Sanger dideoxy technology and assembled into 4,528 scaffolds, comprising 2,810 million bases (Mb) of contiguous sequence with approximately 7.5-fold coverage f...


Bibliographic Details
Main Authors: Samuel Levy, Granger Sutton, Pauline C Ng, Lars Feuk, Aaron L Halpern, Brian P Walenz, Nelson Axelrod, Jiaqi Huang, Ewen F Kirkness, Gennady Denisov, Yuan Lin, Jeffrey R MacDonald, Andy Wing Chun Pang, Mary Shago, Timothy B Stockwell, Alexia Tsiamouri, Vineet Bafna, Vikas Bansal, Saul A Kravitz, Dana A Busam, Karen Y Beeson, Tina C McIntosh, Karin A Remington, Josep F Abril, John Gill, Jon Borman, Yu-Hui Rogers, Marvin E Frazier, Stephen W Scherer, Robert L Strausberg, J Craig Venter
Format: Article
Language: English
Published: Public Library of Science (PLoS) 2007-09-01
Series: PLoS Biology
Online Access: https://doi.org/10.1371/journal.pbio.0050254
id doaj-1ebc1b9d3b354a3c848c72f1b19274b6
record_format Article
spelling doaj-1ebc1b9d3b354a3c848c72f1b19274b6 2021-07-02T21:22:23Z eng Public Library of Science (PLoS) PLoS Biology 1544-9173 1545-7885 2007-09-01 5 10 e254 10.1371/journal.pbio.0050254 The diploid genome sequence of an individual human. https://doi.org/10.1371/journal.pbio.0050254
collection DOAJ
language English
format Article
sources DOAJ
author Samuel Levy
Granger Sutton
Pauline C Ng
Lars Feuk
Aaron L Halpern
Brian P Walenz
Nelson Axelrod
Jiaqi Huang
Ewen F Kirkness
Gennady Denisov
Yuan Lin
Jeffrey R MacDonald
Andy Wing Chun Pang
Mary Shago
Timothy B Stockwell
Alexia Tsiamouri
Vineet Bafna
Vikas Bansal
Saul A Kravitz
Dana A Busam
Karen Y Beeson
Tina C McIntosh
Karin A Remington
Josep F Abril
John Gill
Jon Borman
Yu-Hui Rogers
Marvin E Frazier
Stephen W Scherer
Robert L Strausberg
J Craig Venter
author_sort Samuel Levy
title The diploid genome sequence of an individual human.
publisher Public Library of Science (PLoS)
series PLoS Biology
issn 1544-9173
1545-7885
publishDate 2007-09-01
description Presented here is a genome sequence of an individual human. It was produced from approximately 32 million random DNA fragments, sequenced by Sanger dideoxy technology and assembled into 4,528 scaffolds, comprising 2,810 million bases (Mb) of contiguous sequence with approximately 7.5-fold coverage for any given region. We developed a modified version of the Celera assembler to facilitate the identification and comparison of alternate alleles within this individual diploid genome. Comparison of this genome and the National Center for Biotechnology Information human reference assembly revealed more than 4.1 million DNA variants, encompassing 12.3 Mb. These variants (of which 1,288,319 were novel) included 3,213,401 single nucleotide polymorphisms (SNPs), 53,823 block substitutions (2-206 bp), 292,102 heterozygous insertion/deletion events (indels) (1-571 bp), 559,473 homozygous indels (1-82,711 bp), 90 inversions, as well as numerous segmental duplications and copy number variation regions. Non-SNP DNA variation accounts for 22% of all events identified in the donor; however, it involves 74% of all variant bases. This suggests an important role for non-SNP genetic alterations in defining the diploid genome structure. Moreover, 44% of genes were heterozygous for one or more variants. Using a novel haplotype assembly strategy, we were able to span 1.5 Gb of genome sequence in segments >200 kb, providing further precision to the diploid nature of the genome. These data depict a definitive molecular portrait of a diploid human genome that provides a starting point for future genome comparisons and enables an era of individualized genomic information.
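The event counts quoted in the description can be cross-checked against its 22% non-SNP figure. The following is a minimal sketch, not part of the catalogued article: the dictionary keys and the function name `non_snp_fraction` are illustrative, and the tally assumes that only the five enumerated variant classes are counted as events (the segmental duplications and copy number variation regions are given no counts in the description and are left out).

```python
# Variant event counts as quoted in the description (events, not bases).
# Assumption: only these five enumerated classes enter the event total.
variant_counts = {
    "snp": 3_213_401,
    "block_substitution": 53_823,
    "heterozygous_indel": 292_102,
    "homozygous_indel": 559_473,
    "inversion": 90,
}


def non_snp_fraction(counts):
    """Return the share of variant events that are not SNPs."""
    total = sum(counts.values())
    non_snp = total - counts["snp"]
    return non_snp / total


total_events = sum(variant_counts.values())
fraction = non_snp_fraction(variant_counts)

print(f"total events: {total_events:,}")
print(f"non-SNP share of events: {fraction:.1%}")
```

Under that assumption the total is consistent with "more than 4.1 million DNA variants" and the non-SNP share rounds to the stated 22%. The 74% share of variant bases cannot be reproduced this way, because the description gives size ranges rather than per-class base totals.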
url https://doi.org/10.1371/journal.pbio.0050254