Eliminating Redundancies during Conversion of Paired-End Reads to Long Sequences

碩士 === 國立中正大學 === 資訊工程研究所 === 104 === Whole-genome shotgun sequences are now routinely generated by next generation sequencing technologies. However, a substantial amount of redundant sequences are included in the generated sequences. These redundant sequences greatly reduces the efficiency of subse...

Full description

Bibliographic Details
Main Authors: PAN, SHIH-HONG, 潘世洪
Other Authors: HUANG, YAO-TING
Format: Others
Language:zh-TW
Published: 2016
Online Access:http://ndltd.ncl.edu.tw/handle/95817813903901728969
Description
Summary:碩士 === 國立中正大學 === 資訊工程研究所 === 104 === Whole-genome shotgun sequences are now routinely generated by next generation sequencing technologies. However, a substantial amount of redundant sequences are included in the generated sequences. These redundant sequences greatly reduces the efficiency of subsequent genome assembly. This paper presents a method for removing redundant sequences during conversion of paired-end reads to long sequence via a hash table, which aims to boost the speed of genome assembly. The experiemntal results indicate that this new method indeed improves the computational efficiency of previous method using a number of real data sets. key word: NGS, genome, long sequence, hash table, redundancy, eliminate