A Genomic Sequence Matching Algorithm with Edge-base Quadtree Pattern Matching

碩士 === 國立臺灣海洋大學 === 電機工程學系 === 93 === Since the inception of Human Genome Project back in 1990s researchers in multiple disciplines have been seeking for highly efficient sequence alignment algorithms. Because a genome sequence can be regarded as a text string, many string matching algorithms can in...

Full description

Bibliographic Details
Main Authors: Chih-Yuan Chen, 陳志遠
Other Authors: Show-Wei Leu
Format: Others
Language:zh-TW
Published: 2005
Online Access:http://ndltd.ncl.edu.tw/handle/65851832614229881825
id ndltd-TW-093NTOU5442059
record_format oai_dc
spelling ndltd-TW-093NTOU54420592016-06-01T04:25:07Z http://ndltd.ncl.edu.tw/handle/65851832614229881825 A Genomic Sequence Matching Algorithm with Edge-base Quadtree Pattern Matching 基於四元樹結構比對的基因字串比對演算法 Chih-Yuan Chen 陳志遠 碩士 國立臺灣海洋大學 電機工程學系 93 Since the inception of Human Genome Project back in 1990s researchers in multiple disciplines have been seeking for highly efficient sequence alignment algorithms. Because a genome sequence can be regarded as a text string, many string matching algorithms can indeed be use for the genome sequencing tasks. However, many well-known, traditional string matching algorithms such as the Boyer More and the Knuth-Morris-Pratt (KMP), when used to match the genome sequences are often plagued by inefficiency, computational complexity, and consumption of huge memory space. Therefore, algorithms tailored for genomic sequencing have attracted a new generation of algorithm designers. In this thesis, we focus our attention on improving the existing algorithms for genome sequence matching. We have successfully reduced the huge memory consumption of the suffix tree method. Also, by introducing a new quadtree pattern matching we are able to avoid the highly complicated matching process of the ACGT-words tree algorithm. Last but not least, our method performs multi-pattern searching, which is beyond the capability of the algorithms we set out to improve. Show-Wei Leu 呂紹偉 2005 學位論文 ; thesis 51 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 國立臺灣海洋大學 === 電機工程學系 === 93 === Since the inception of Human Genome Project back in 1990s researchers in multiple disciplines have been seeking for highly efficient sequence alignment algorithms. Because a genome sequence can be regarded as a text string, many string matching algorithms can indeed be use for the genome sequencing tasks. However, many well-known, traditional string matching algorithms such as the Boyer More and the Knuth-Morris-Pratt (KMP), when used to match the genome sequences are often plagued by inefficiency, computational complexity, and consumption of huge memory space. Therefore, algorithms tailored for genomic sequencing have attracted a new generation of algorithm designers. In this thesis, we focus our attention on improving the existing algorithms for genome sequence matching. We have successfully reduced the huge memory consumption of the suffix tree method. Also, by introducing a new quadtree pattern matching we are able to avoid the highly complicated matching process of the ACGT-words tree algorithm. Last but not least, our method performs multi-pattern searching, which is beyond the capability of the algorithms we set out to improve.
author2 Show-Wei Leu
author_facet Show-Wei Leu
Chih-Yuan Chen
陳志遠
author Chih-Yuan Chen
陳志遠
spellingShingle Chih-Yuan Chen
陳志遠
A Genomic Sequence Matching Algorithm with Edge-base Quadtree Pattern Matching
author_sort Chih-Yuan Chen
title A Genomic Sequence Matching Algorithm with Edge-base Quadtree Pattern Matching
title_short A Genomic Sequence Matching Algorithm with Edge-base Quadtree Pattern Matching
title_full A Genomic Sequence Matching Algorithm with Edge-base Quadtree Pattern Matching
title_fullStr A Genomic Sequence Matching Algorithm with Edge-base Quadtree Pattern Matching
title_full_unstemmed A Genomic Sequence Matching Algorithm with Edge-base Quadtree Pattern Matching
title_sort genomic sequence matching algorithm with edge-base quadtree pattern matching
publishDate 2005
url http://ndltd.ncl.edu.tw/handle/65851832614229881825
work_keys_str_mv AT chihyuanchen agenomicsequencematchingalgorithmwithedgebasequadtreepatternmatching
AT chénzhìyuǎn agenomicsequencematchingalgorithmwithedgebasequadtreepatternmatching
AT chihyuanchen jīyúsìyuánshùjiégòubǐduìdejīyīnzìchuànbǐduìyǎnsuànfǎ
AT chénzhìyuǎn jīyúsìyuánshùjiégòubǐduìdejīyīnzìchuànbǐduìyǎnsuànfǎ
AT chihyuanchen genomicsequencematchingalgorithmwithedgebasequadtreepatternmatching
AT chénzhìyuǎn genomicsequencematchingalgorithmwithedgebasequadtreepatternmatching
_version_ 1718291253198585856