A Genomic Sequence Matching Algorithm with Edge-base Quadtree Pattern Matching
碩士 === 國立臺灣海洋大學 === 電機工程學系 === 93 === Since the inception of Human Genome Project back in 1990s researchers in multiple disciplines have been seeking for highly efficient sequence alignment algorithms. Because a genome sequence can be regarded as a text string, many string matching algorithms can in...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
2005
|
Online Access: | http://ndltd.ncl.edu.tw/handle/65851832614229881825 |
id |
ndltd-TW-093NTOU5442059 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-093NTOU54420592016-06-01T04:25:07Z http://ndltd.ncl.edu.tw/handle/65851832614229881825 A Genomic Sequence Matching Algorithm with Edge-base Quadtree Pattern Matching 基於四元樹結構比對的基因字串比對演算法 Chih-Yuan Chen 陳志遠 碩士 國立臺灣海洋大學 電機工程學系 93 Since the inception of Human Genome Project back in 1990s researchers in multiple disciplines have been seeking for highly efficient sequence alignment algorithms. Because a genome sequence can be regarded as a text string, many string matching algorithms can indeed be use for the genome sequencing tasks. However, many well-known, traditional string matching algorithms such as the Boyer More and the Knuth-Morris-Pratt (KMP), when used to match the genome sequences are often plagued by inefficiency, computational complexity, and consumption of huge memory space. Therefore, algorithms tailored for genomic sequencing have attracted a new generation of algorithm designers. In this thesis, we focus our attention on improving the existing algorithms for genome sequence matching. We have successfully reduced the huge memory consumption of the suffix tree method. Also, by introducing a new quadtree pattern matching we are able to avoid the highly complicated matching process of the ACGT-words tree algorithm. Last but not least, our method performs multi-pattern searching, which is beyond the capability of the algorithms we set out to improve. Show-Wei Leu 呂紹偉 2005 學位論文 ; thesis 51 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立臺灣海洋大學 === 電機工程學系 === 93 === Since the inception of Human Genome Project back in 1990s researchers in multiple disciplines have been seeking for highly efficient sequence alignment algorithms. Because a genome sequence can be regarded as a text string, many string matching algorithms can indeed be use for the genome sequencing tasks. However, many well-known, traditional string matching algorithms such as the Boyer More and the Knuth-Morris-Pratt (KMP), when used to match the genome sequences are often plagued by inefficiency, computational complexity, and consumption of huge memory space. Therefore, algorithms tailored for genomic sequencing have attracted a new generation of algorithm designers.
In this thesis, we focus our attention on improving the existing algorithms for genome sequence matching. We have successfully reduced the huge memory consumption of the suffix tree method. Also, by introducing a new quadtree pattern matching we are able to avoid the highly complicated matching process of the ACGT-words tree algorithm. Last but not least, our method performs multi-pattern searching, which is beyond the capability of the algorithms we set out to improve.
|
author2 |
Show-Wei Leu |
author_facet |
Show-Wei Leu Chih-Yuan Chen 陳志遠 |
author |
Chih-Yuan Chen 陳志遠 |
spellingShingle |
Chih-Yuan Chen 陳志遠 A Genomic Sequence Matching Algorithm with Edge-base Quadtree Pattern Matching |
author_sort |
Chih-Yuan Chen |
title |
A Genomic Sequence Matching Algorithm with Edge-base Quadtree Pattern Matching |
title_short |
A Genomic Sequence Matching Algorithm with Edge-base Quadtree Pattern Matching |
title_full |
A Genomic Sequence Matching Algorithm with Edge-base Quadtree Pattern Matching |
title_fullStr |
A Genomic Sequence Matching Algorithm with Edge-base Quadtree Pattern Matching |
title_full_unstemmed |
A Genomic Sequence Matching Algorithm with Edge-base Quadtree Pattern Matching |
title_sort |
genomic sequence matching algorithm with edge-base quadtree pattern matching |
publishDate |
2005 |
url |
http://ndltd.ncl.edu.tw/handle/65851832614229881825 |
work_keys_str_mv |
AT chihyuanchen agenomicsequencematchingalgorithmwithedgebasequadtreepatternmatching AT chénzhìyuǎn agenomicsequencematchingalgorithmwithedgebasequadtreepatternmatching AT chihyuanchen jīyúsìyuánshùjiégòubǐduìdejīyīnzìchuànbǐduìyǎnsuànfǎ AT chénzhìyuǎn jīyúsìyuánshùjiégòubǐduìdejīyīnzìchuànbǐduìyǎnsuànfǎ AT chihyuanchen genomicsequencematchingalgorithmwithedgebasequadtreepatternmatching AT chénzhìyuǎn genomicsequencematchingalgorithmwithedgebasequadtreepatternmatching |
_version_ |
1718291253198585856 |