Comparative Scaffolding and Gap Closure Using Next Generation Sequencing

碩士 === 國立中正大學 === 資訊工程研究所 === 99 === Next Generation Sequencing (NGS) technologies have been widely used to assemble the genomes of unstudied species in the biosphere. In practice, the assembled genomes are very fragmented due to the complexity of the genome and relatively short length of reads. In...

Full description

Bibliographic Details
Main Authors: Wang, Hsiaowen, 王筱文
Other Authors: Huang, Yaoting
Format: Others
Language:en_US
Published: 2011
Online Access:http://ndltd.ncl.edu.tw/handle/34573992629587433153
id ndltd-TW-099CCU00392140
record_format oai_dc
spelling ndltd-TW-099CCU003921402015-10-23T06:50:18Z http://ndltd.ncl.edu.tw/handle/34573992629587433153 Comparative Scaffolding and Gap Closure Using Next Generation Sequencing 以次世代定序平台進行比較式連續序列之串接與重組缺口之填補 Wang, Hsiaowen 王筱文 碩士 國立中正大學 資訊工程研究所 99 Next Generation Sequencing (NGS) technologies have been widely used to assemble the genomes of unstudied species in the biosphere. In practice, the assembled genomes are very fragmented due to the complexity of the genome and relatively short length of reads. In this thesis, we design and implement a comparative Scaffolding And Gap closure Assembler (called SAGA). By comparatively analyzing a related genome, SAGA carefully refines the boundary of each contig mapped on the related genome and links the contigs with no rearrangement events into scaffolds. For each gap within the scaffold, SAGA further used a jumping assembly approach to assemble isolated islands of reads in the gap, which overcomes the limitations of assembling low or no coverage regions. SAGA has been tested and compared with other methods using a variety of simulated and real data sets. The experimental results indicated that SAGA significantly produced a more contiguous genome with larger N50 compared with other programs. It is worth mentioning that SAGA is able to assist scaffolding the assembly using related genome with similarity as low as 80%. Compared with physical or optical map approaches, SAGA is very cost-effective toward the genome finishing. Huang, Yaoting 黃耀廷 2011 學位論文 ; thesis 34 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 碩士 === 國立中正大學 === 資訊工程研究所 === 99 === Next Generation Sequencing (NGS) technologies have been widely used to assemble the genomes of unstudied species in the biosphere. In practice, the assembled genomes are very fragmented due to the complexity of the genome and relatively short length of reads. In this thesis, we design and implement a comparative Scaffolding And Gap closure Assembler (called SAGA). By comparatively analyzing a related genome, SAGA carefully refines the boundary of each contig mapped on the related genome and links the contigs with no rearrangement events into scaffolds. For each gap within the scaffold, SAGA further used a jumping assembly approach to assemble isolated islands of reads in the gap, which overcomes the limitations of assembling low or no coverage regions. SAGA has been tested and compared with other methods using a variety of simulated and real data sets. The experimental results indicated that SAGA significantly produced a more contiguous genome with larger N50 compared with other programs. It is worth mentioning that SAGA is able to assist scaffolding the assembly using related genome with similarity as low as 80%. Compared with physical or optical map approaches, SAGA is very cost-effective toward the genome finishing.
author2 Huang, Yaoting
author_facet Huang, Yaoting
Wang, Hsiaowen
王筱文
author Wang, Hsiaowen
王筱文
spellingShingle Wang, Hsiaowen
王筱文
Comparative Scaffolding and Gap Closure Using Next Generation Sequencing
author_sort Wang, Hsiaowen
title Comparative Scaffolding and Gap Closure Using Next Generation Sequencing
title_short Comparative Scaffolding and Gap Closure Using Next Generation Sequencing
title_full Comparative Scaffolding and Gap Closure Using Next Generation Sequencing
title_fullStr Comparative Scaffolding and Gap Closure Using Next Generation Sequencing
title_full_unstemmed Comparative Scaffolding and Gap Closure Using Next Generation Sequencing
title_sort comparative scaffolding and gap closure using next generation sequencing
publishDate 2011
url http://ndltd.ncl.edu.tw/handle/34573992629587433153
work_keys_str_mv AT wanghsiaowen comparativescaffoldingandgapclosureusingnextgenerationsequencing
AT wángxiǎowén comparativescaffoldingandgapclosureusingnextgenerationsequencing
AT wanghsiaowen yǐcìshìdàidìngxùpíngtáijìnxíngbǐjiàoshìliánxùxùlièzhīchuànjiēyǔzhòngzǔquēkǒuzhītiánbǔ
AT wángxiǎowén yǐcìshìdàidìngxùpíngtáijìnxíngbǐjiàoshìliánxùxùlièzhīchuànjiēyǔzhòngzǔquēkǒuzhītiánbǔ
_version_ 1718109743630778368