High-Performance Parallel Location-Aware Algorithms for Approximate String Matching on GPUs

碩士 === 國立臺灣師範大學 === 電機工程學系 === 105 === Approximate string matching has been widely used in many applications, including deoxyribonucleic acid sequence searching, spell checking, text mining, and spam filters. The method is designed to find all locations of strings that approximately match a pattern...

Full description

Bibliographic Details
Main Authors: Huang, Chun-cheng, 黃俊程
Other Authors: Lin, Cheng-Hung
Format: Others
Language:zh-TW
Published: 2017
Online Access:http://ndltd.ncl.edu.tw/handle/5b596z
id ndltd-TW-105NTNU5442009
record_format oai_dc
spelling ndltd-TW-105NTNU54420092019-05-15T23:46:58Z http://ndltd.ncl.edu.tw/handle/5b596z High-Performance Parallel Location-Aware Algorithms for Approximate String Matching on GPUs 實現於圖形處理器的高性能平行位置檢知近似字串比對演算法 Huang, Chun-cheng 黃俊程 碩士 國立臺灣師範大學 電機工程學系 105 Approximate string matching has been widely used in many applications, including deoxyribonucleic acid sequence searching, spell checking, text mining, and spam filters. The method is designed to find all locations of strings that approximately match a pattern in accordance with the number of insertion, deletion, and substitution operations. Among the proposed algorithms, the bit-parallel algorithms are considered to be the best and highly efficient algorithms. However, the traditional bit-parallel algorithms lacks the ability of identifying the start and end positions of a matched pattern. Furthermore, acceleration of the bit-parallel algorithms has become a crucial issue for processing big data nowadays. In this paper, we propose two kinds of parallel location-aware algorithms called data-segmented parallelism and high-degree parallelism as means to accelerate approximate string matching using graphic processing units. Experimental results show that the high-degree parallelism on GPUs achieves significant improvement in system and kernel throughputs compared to CPU counterparts. Compared to state-of-the-art approaches, the proposed high-degree parallelism achieves 11 to 105 times improvement. Finally, we develop a web service which allows users to perform approximate string matching on line and deliver all matched substrings with the start and end positions. Lin, Cheng-Hung 林政宏 2017 學位論文 ; thesis 44 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 國立臺灣師範大學 === 電機工程學系 === 105 === Approximate string matching has been widely used in many applications, including deoxyribonucleic acid sequence searching, spell checking, text mining, and spam filters. The method is designed to find all locations of strings that approximately match a pattern in accordance with the number of insertion, deletion, and substitution operations. Among the proposed algorithms, the bit-parallel algorithms are considered to be the best and highly efficient algorithms. However, the traditional bit-parallel algorithms lacks the ability of identifying the start and end positions of a matched pattern. Furthermore, acceleration of the bit-parallel algorithms has become a crucial issue for processing big data nowadays. In this paper, we propose two kinds of parallel location-aware algorithms called data-segmented parallelism and high-degree parallelism as means to accelerate approximate string matching using graphic processing units. Experimental results show that the high-degree parallelism on GPUs achieves significant improvement in system and kernel throughputs compared to CPU counterparts. Compared to state-of-the-art approaches, the proposed high-degree parallelism achieves 11 to 105 times improvement. Finally, we develop a web service which allows users to perform approximate string matching on line and deliver all matched substrings with the start and end positions.
author2 Lin, Cheng-Hung
author_facet Lin, Cheng-Hung
Huang, Chun-cheng
黃俊程
author Huang, Chun-cheng
黃俊程
spellingShingle Huang, Chun-cheng
黃俊程
High-Performance Parallel Location-Aware Algorithms for Approximate String Matching on GPUs
author_sort Huang, Chun-cheng
title High-Performance Parallel Location-Aware Algorithms for Approximate String Matching on GPUs
title_short High-Performance Parallel Location-Aware Algorithms for Approximate String Matching on GPUs
title_full High-Performance Parallel Location-Aware Algorithms for Approximate String Matching on GPUs
title_fullStr High-Performance Parallel Location-Aware Algorithms for Approximate String Matching on GPUs
title_full_unstemmed High-Performance Parallel Location-Aware Algorithms for Approximate String Matching on GPUs
title_sort high-performance parallel location-aware algorithms for approximate string matching on gpus
publishDate 2017
url http://ndltd.ncl.edu.tw/handle/5b596z
work_keys_str_mv AT huangchuncheng highperformanceparallellocationawarealgorithmsforapproximatestringmatchingongpus
AT huángjùnchéng highperformanceparallellocationawarealgorithmsforapproximatestringmatchingongpus
AT huangchuncheng shíxiànyútúxíngchùlǐqìdegāoxìngnéngpíngxíngwèizhìjiǎnzhījìnshìzìchuànbǐduìyǎnsuànfǎ
AT huángjùnchéng shíxiànyútúxíngchùlǐqìdegāoxìngnéngpíngxíngwèizhìjiǎnzhījìnshìzìchuànbǐduìyǎnsuànfǎ
_version_ 1719153885059219456