The diploid genome sequence of an individual human.
Presented here is a genome sequence of an individual human. It was produced from approximately 32 million random DNA fragments, sequenced by Sanger dideoxy technology and assembled into 4,528 scaffolds, comprising 2,810 million bases (Mb) of contiguous sequence with approximately 7.5-fold coverage f...
Main Authors: | , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Public Library of Science (PLoS)
2007-09-01
|
Series: | PLoS Biology |
Online Access: | https://doi.org/10.1371/journal.pbio.0050254 |
id |
doaj-1ebc1b9d3b354a3c848c72f1b19274b6 |
---|---|
record_format |
Article |
spelling |
doaj-1ebc1b9d3b354a3c848c72f1b19274b62021-07-02T21:22:23ZengPublic Library of Science (PLoS)PLoS Biology1544-91731545-78852007-09-01510e25410.1371/journal.pbio.0050254The diploid genome sequence of an individual human.Samuel LevyGranger SuttonPauline C NgLars FeukAaron L HalpernBrian P WalenzNelson AxelrodJiaqi HuangEwen F KirknessGennady DenisovYuan LinJeffrey R MacDonaldAndy Wing Chun PangMary ShagoTimothy B StockwellAlexia TsiamouriVineet BafnaVikas BansalSaul A KravitzDana A BusamKaren Y BeesonTina C McIntoshKarin A RemingtonJosep F AbrilJohn GillJon BormanYu-Hui RogersMarvin E FrazierStephen W SchererRobert L StrausbergJ Craig VenterPresented here is a genome sequence of an individual human. It was produced from approximately 32 million random DNA fragments, sequenced by Sanger dideoxy technology and assembled into 4,528 scaffolds, comprising 2,810 million bases (Mb) of contiguous sequence with approximately 7.5-fold coverage for any given region. We developed a modified version of the Celera assembler to facilitate the identification and comparison of alternate alleles within this individual diploid genome. Comparison of this genome and the National Center for Biotechnology Information human reference assembly revealed more than 4.1 million DNA variants, encompassing 12.3 Mb. These variants (of which 1,288,319 were novel) included 3,213,401 single nucleotide polymorphisms (SNPs), 53,823 block substitutions (2-206 bp), 292,102 heterozygous insertion/deletion events (indels)(1-571 bp), 559,473 homozygous indels (1-82,711 bp), 90 inversions, as well as numerous segmental duplications and copy number variation regions. Non-SNP DNA variation accounts for 22% of all events identified in the donor, however they involve 74% of all variant bases. This suggests an important role for non-SNP genetic alterations in defining the diploid genome structure. Moreover, 44% of genes were heterozygous for one or more variants. Using a novel haplotype assembly strategy, we were able to span 1.5 Gb of genome sequence in segments >200 kb, providing further precision to the diploid nature of the genome. These data depict a definitive molecular portrait of a diploid human genome that provides a starting point for future genome comparisons and enables an era of individualized genomic information.https://doi.org/10.1371/journal.pbio.0050254 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Samuel Levy Granger Sutton Pauline C Ng Lars Feuk Aaron L Halpern Brian P Walenz Nelson Axelrod Jiaqi Huang Ewen F Kirkness Gennady Denisov Yuan Lin Jeffrey R MacDonald Andy Wing Chun Pang Mary Shago Timothy B Stockwell Alexia Tsiamouri Vineet Bafna Vikas Bansal Saul A Kravitz Dana A Busam Karen Y Beeson Tina C McIntosh Karin A Remington Josep F Abril John Gill Jon Borman Yu-Hui Rogers Marvin E Frazier Stephen W Scherer Robert L Strausberg J Craig Venter |
spellingShingle |
Samuel Levy Granger Sutton Pauline C Ng Lars Feuk Aaron L Halpern Brian P Walenz Nelson Axelrod Jiaqi Huang Ewen F Kirkness Gennady Denisov Yuan Lin Jeffrey R MacDonald Andy Wing Chun Pang Mary Shago Timothy B Stockwell Alexia Tsiamouri Vineet Bafna Vikas Bansal Saul A Kravitz Dana A Busam Karen Y Beeson Tina C McIntosh Karin A Remington Josep F Abril John Gill Jon Borman Yu-Hui Rogers Marvin E Frazier Stephen W Scherer Robert L Strausberg J Craig Venter The diploid genome sequence of an individual human. PLoS Biology |
author_facet |
Samuel Levy Granger Sutton Pauline C Ng Lars Feuk Aaron L Halpern Brian P Walenz Nelson Axelrod Jiaqi Huang Ewen F Kirkness Gennady Denisov Yuan Lin Jeffrey R MacDonald Andy Wing Chun Pang Mary Shago Timothy B Stockwell Alexia Tsiamouri Vineet Bafna Vikas Bansal Saul A Kravitz Dana A Busam Karen Y Beeson Tina C McIntosh Karin A Remington Josep F Abril John Gill Jon Borman Yu-Hui Rogers Marvin E Frazier Stephen W Scherer Robert L Strausberg J Craig Venter |
author_sort |
Samuel Levy |
title |
The diploid genome sequence of an individual human. |
title_short |
The diploid genome sequence of an individual human. |
title_full |
The diploid genome sequence of an individual human. |
title_fullStr |
The diploid genome sequence of an individual human. |
title_full_unstemmed |
The diploid genome sequence of an individual human. |
title_sort |
diploid genome sequence of an individual human. |
publisher |
Public Library of Science (PLoS) |
series |
PLoS Biology |
issn |
1544-9173 1545-7885 |
publishDate |
2007-09-01 |
description |
Presented here is a genome sequence of an individual human. It was produced from approximately 32 million random DNA fragments, sequenced by Sanger dideoxy technology and assembled into 4,528 scaffolds, comprising 2,810 million bases (Mb) of contiguous sequence with approximately 7.5-fold coverage for any given region. We developed a modified version of the Celera assembler to facilitate the identification and comparison of alternate alleles within this individual diploid genome. Comparison of this genome and the National Center for Biotechnology Information human reference assembly revealed more than 4.1 million DNA variants, encompassing 12.3 Mb. These variants (of which 1,288,319 were novel) included 3,213,401 single nucleotide polymorphisms (SNPs), 53,823 block substitutions (2-206 bp), 292,102 heterozygous insertion/deletion events (indels)(1-571 bp), 559,473 homozygous indels (1-82,711 bp), 90 inversions, as well as numerous segmental duplications and copy number variation regions. Non-SNP DNA variation accounts for 22% of all events identified in the donor, however they involve 74% of all variant bases. This suggests an important role for non-SNP genetic alterations in defining the diploid genome structure. Moreover, 44% of genes were heterozygous for one or more variants. Using a novel haplotype assembly strategy, we were able to span 1.5 Gb of genome sequence in segments >200 kb, providing further precision to the diploid nature of the genome. These data depict a definitive molecular portrait of a diploid human genome that provides a starting point for future genome comparisons and enables an era of individualized genomic information. |
url |
https://doi.org/10.1371/journal.pbio.0050254 |
work_keys_str_mv |
AT samuellevy thediploidgenomesequenceofanindividualhuman AT grangersutton thediploidgenomesequenceofanindividualhuman AT paulinecng thediploidgenomesequenceofanindividualhuman AT larsfeuk thediploidgenomesequenceofanindividualhuman AT aaronlhalpern thediploidgenomesequenceofanindividualhuman AT brianpwalenz thediploidgenomesequenceofanindividualhuman AT nelsonaxelrod thediploidgenomesequenceofanindividualhuman AT jiaqihuang thediploidgenomesequenceofanindividualhuman AT ewenfkirkness thediploidgenomesequenceofanindividualhuman AT gennadydenisov thediploidgenomesequenceofanindividualhuman AT yuanlin thediploidgenomesequenceofanindividualhuman AT jeffreyrmacdonald thediploidgenomesequenceofanindividualhuman AT andywingchunpang thediploidgenomesequenceofanindividualhuman AT maryshago thediploidgenomesequenceofanindividualhuman AT timothybstockwell thediploidgenomesequenceofanindividualhuman AT alexiatsiamouri thediploidgenomesequenceofanindividualhuman AT vineetbafna thediploidgenomesequenceofanindividualhuman AT vikasbansal thediploidgenomesequenceofanindividualhuman AT saulakravitz thediploidgenomesequenceofanindividualhuman AT danaabusam thediploidgenomesequenceofanindividualhuman AT karenybeeson thediploidgenomesequenceofanindividualhuman AT tinacmcintosh thediploidgenomesequenceofanindividualhuman AT karinaremington thediploidgenomesequenceofanindividualhuman AT josepfabril thediploidgenomesequenceofanindividualhuman AT johngill thediploidgenomesequenceofanindividualhuman AT jonborman thediploidgenomesequenceofanindividualhuman AT yuhuirogers thediploidgenomesequenceofanindividualhuman AT marvinefrazier thediploidgenomesequenceofanindividualhuman AT stephenwscherer thediploidgenomesequenceofanindividualhuman AT robertlstrausberg thediploidgenomesequenceofanindividualhuman AT jcraigventer thediploidgenomesequenceofanindividualhuman AT samuellevy diploidgenomesequenceofanindividualhuman AT grangersutton diploidgenomesequenceofanindividualhuman AT paulinecng diploidgenomesequenceofanindividualhuman AT larsfeuk diploidgenomesequenceofanindividualhuman AT aaronlhalpern diploidgenomesequenceofanindividualhuman AT brianpwalenz diploidgenomesequenceofanindividualhuman AT nelsonaxelrod diploidgenomesequenceofanindividualhuman AT jiaqihuang diploidgenomesequenceofanindividualhuman AT ewenfkirkness diploidgenomesequenceofanindividualhuman AT gennadydenisov diploidgenomesequenceofanindividualhuman AT yuanlin diploidgenomesequenceofanindividualhuman AT jeffreyrmacdonald diploidgenomesequenceofanindividualhuman AT andywingchunpang diploidgenomesequenceofanindividualhuman AT maryshago diploidgenomesequenceofanindividualhuman AT timothybstockwell diploidgenomesequenceofanindividualhuman AT alexiatsiamouri diploidgenomesequenceofanindividualhuman AT vineetbafna diploidgenomesequenceofanindividualhuman AT vikasbansal diploidgenomesequenceofanindividualhuman AT saulakravitz diploidgenomesequenceofanindividualhuman AT danaabusam diploidgenomesequenceofanindividualhuman AT karenybeeson diploidgenomesequenceofanindividualhuman AT tinacmcintosh diploidgenomesequenceofanindividualhuman AT karinaremington diploidgenomesequenceofanindividualhuman AT josepfabril diploidgenomesequenceofanindividualhuman AT johngill diploidgenomesequenceofanindividualhuman AT jonborman diploidgenomesequenceofanindividualhuman AT yuhuirogers diploidgenomesequenceofanindividualhuman AT marvinefrazier diploidgenomesequenceofanindividualhuman AT stephenwscherer diploidgenomesequenceofanindividualhuman AT robertlstrausberg diploidgenomesequenceofanindividualhuman AT jcraigventer diploidgenomesequenceofanindividualhuman |
_version_ |
1721321911758094336 |