Pipeline Architecture for Segmented Pattern Matching

碩士 === 國立成功大學 === 資訊工程學系 === 102 === In recent years, due to the rapid growth of network traffic, the demand of deep packet inspection (DPI) system to ensure the network security is becoming more and more important. The DPI systems rely on pattern matching to detect the payload of the packet, in ord...

Full description

Bibliographic Details
Main Authors: Chia-YiChu, 朱家毅
Other Authors: Yeim-Kuau Chang
Format: Others
Language:en_US
Published: 2014
Online Access:http://ndltd.ncl.edu.tw/handle/81190732293244439418
id ndltd-TW-102NCKU5392045
record_format oai_dc
spelling ndltd-TW-102NCKU53920452016-03-07T04:11:05Z http://ndltd.ncl.edu.tw/handle/81190732293244439418 Pipeline Architecture for Segmented Pattern Matching 以切割字串為基礎之字串比對的管線化架構 Chia-YiChu 朱家毅 碩士 國立成功大學 資訊工程學系 102 In recent years, due to the rapid growth of network traffic, the demand of deep packet inspection (DPI) system to ensure the network security is becoming more and more important. The DPI systems rely on pattern matching to detect the payload of the packet, in order to find possible threats in the packet. Therefore, the performance of pattern matching is the key point of the DPI systems. We have to figure out a solution that provides stable throughput and has low memory requirement to improve the performance of pattern matching. In order to let the well-known Aho-Corasick (AC) algorithm work on the pipeline architecture, we segment the patterns into pattern segments for decreasing the number of threads that tracks the input stream. In this thesis, we propose a technique to find out all the common subpatterns in the pattern set and use them to divide the patterns. After dividing the patterns, we use the pattern segments to build up the AC-DFA that eliminates all the failure transitions that are not needed in the pipeline architecture. In addition, we have to combine the subpatterns into their original pattern. Here, a tree-like structure is used for AC-DFA without failure transitions to find the original patterns. Finally, we use the transition tables of these two structures to construct the pipeline architecture. Our implementation result shows that on Xilinx Virtex-7 XC7V2000T, when group threshold K = 5, which handles 58k characters, we utilize 5.9% of Block RAM, 0.2% of Slice Registers, and 16% of Slice LUTs. The memory cost of our scheme is lower than Split AC and Bit-split by 29% and 75%. Yeim-Kuau Chang 張燕光 2014 學位論文 ; thesis 49 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 碩士 === 國立成功大學 === 資訊工程學系 === 102 === In recent years, due to the rapid growth of network traffic, the demand of deep packet inspection (DPI) system to ensure the network security is becoming more and more important. The DPI systems rely on pattern matching to detect the payload of the packet, in order to find possible threats in the packet. Therefore, the performance of pattern matching is the key point of the DPI systems. We have to figure out a solution that provides stable throughput and has low memory requirement to improve the performance of pattern matching. In order to let the well-known Aho-Corasick (AC) algorithm work on the pipeline architecture, we segment the patterns into pattern segments for decreasing the number of threads that tracks the input stream. In this thesis, we propose a technique to find out all the common subpatterns in the pattern set and use them to divide the patterns. After dividing the patterns, we use the pattern segments to build up the AC-DFA that eliminates all the failure transitions that are not needed in the pipeline architecture. In addition, we have to combine the subpatterns into their original pattern. Here, a tree-like structure is used for AC-DFA without failure transitions to find the original patterns. Finally, we use the transition tables of these two structures to construct the pipeline architecture. Our implementation result shows that on Xilinx Virtex-7 XC7V2000T, when group threshold K = 5, which handles 58k characters, we utilize 5.9% of Block RAM, 0.2% of Slice Registers, and 16% of Slice LUTs. The memory cost of our scheme is lower than Split AC and Bit-split by 29% and 75%.
author2 Yeim-Kuau Chang
author_facet Yeim-Kuau Chang
Chia-YiChu
朱家毅
author Chia-YiChu
朱家毅
spellingShingle Chia-YiChu
朱家毅
Pipeline Architecture for Segmented Pattern Matching
author_sort Chia-YiChu
title Pipeline Architecture for Segmented Pattern Matching
title_short Pipeline Architecture for Segmented Pattern Matching
title_full Pipeline Architecture for Segmented Pattern Matching
title_fullStr Pipeline Architecture for Segmented Pattern Matching
title_full_unstemmed Pipeline Architecture for Segmented Pattern Matching
title_sort pipeline architecture for segmented pattern matching
publishDate 2014
url http://ndltd.ncl.edu.tw/handle/81190732293244439418
work_keys_str_mv AT chiayichu pipelinearchitectureforsegmentedpatternmatching
AT zhūjiāyì pipelinearchitectureforsegmentedpatternmatching
AT chiayichu yǐqiègēzìchuànwèijīchǔzhīzìchuànbǐduìdeguǎnxiànhuàjiàgòu
AT zhūjiāyì yǐqiègēzìchuànwèijīchǔzhīzìchuànbǐduìdeguǎnxiànhuàjiàgòu
_version_ 1718199536631939072