Study of Hybrid Data Mining Techniques Applied for Filtering Spam Mail
碩士 === 華梵大學 === 資訊管理學系碩士班 === 97 === The network has been established and developed since 1970; people have generally used the network. People artificially delivered mail before, but this tendency was transferred to E-mail. The time and distance of communication were decreased by E-mail, and E-mail...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
2009
|
Online Access: | http://ndltd.ncl.edu.tw/handle/00992363360301001047 |
id |
ndltd-TW-097HCHT0396036 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-097HCHT03960362016-05-06T04:11:31Z http://ndltd.ncl.edu.tw/handle/00992363360301001047 Study of Hybrid Data Mining Techniques Applied for Filtering Spam Mail 混合式資料探勘技術應用於垃圾郵件過濾之研究 I-Ping Shu 徐一平 碩士 華梵大學 資訊管理學系碩士班 97 The network has been established and developed since 1970; people have generally used the network. People artificially delivered mail before, but this tendency was transferred to E-mail. The time and distance of communication were decreased by E-mail, and E-mail gradually changed our live and working way. At this moment, some beneficial people use the malicious programs or collect the email boxes in many ways, then send email arbitrarily. It has been perplexed to the receiver. This study (GA/DT) adopts the genetic algorithm (Genetic Algorithms, GA) and decision tree (Decision Tree, DT) of data mining techniques to select Minimum Case and Pruning CF parameters. Experiment results indicate that the accuracy of the hybrid GA/DT algorithm is 95.0%. In other algorithms, Logistic Algorithm has a better accuracy of 5.0% and ANN has an accuracy of 2.337 % and SVM has an accuracy of 4.0 %, and it shows that the GA/DT algorithm can accurately select the Minimum Case and Pruning CF parameters in the DT algorithm and effectively enhance the performance of identifying spam mails. Zne-Jung Lee 李仁鐘 2009 學位論文 ; thesis 82 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 華梵大學 === 資訊管理學系碩士班 === 97 === The network has been established and developed since 1970; people have generally used the network. People artificially delivered mail before, but this tendency was transferred to E-mail. The time and distance of communication were decreased by E-mail, and E-mail gradually changed our live and working way. At this moment, some beneficial people use the malicious programs or collect the email boxes in many ways, then send email arbitrarily. It has been perplexed to the receiver. This study (GA/DT) adopts the genetic algorithm (Genetic Algorithms, GA) and decision tree (Decision Tree, DT) of data mining techniques to select Minimum Case and Pruning CF parameters. Experiment results indicate that the accuracy of the hybrid GA/DT algorithm is 95.0%. In other algorithms, Logistic Algorithm has a better accuracy of 5.0% and ANN has an accuracy of 2.337 % and SVM has an accuracy of 4.0 %, and it shows that the GA/DT algorithm can accurately select the Minimum Case and Pruning CF parameters in the DT algorithm and effectively enhance the performance of identifying spam mails.
|
author2 |
Zne-Jung Lee |
author_facet |
Zne-Jung Lee I-Ping Shu 徐一平 |
author |
I-Ping Shu 徐一平 |
spellingShingle |
I-Ping Shu 徐一平 Study of Hybrid Data Mining Techniques Applied for Filtering Spam Mail |
author_sort |
I-Ping Shu |
title |
Study of Hybrid Data Mining Techniques Applied for Filtering Spam Mail |
title_short |
Study of Hybrid Data Mining Techniques Applied for Filtering Spam Mail |
title_full |
Study of Hybrid Data Mining Techniques Applied for Filtering Spam Mail |
title_fullStr |
Study of Hybrid Data Mining Techniques Applied for Filtering Spam Mail |
title_full_unstemmed |
Study of Hybrid Data Mining Techniques Applied for Filtering Spam Mail |
title_sort |
study of hybrid data mining techniques applied for filtering spam mail |
publishDate |
2009 |
url |
http://ndltd.ncl.edu.tw/handle/00992363360301001047 |
work_keys_str_mv |
AT ipingshu studyofhybriddataminingtechniquesappliedforfilteringspammail AT xúyīpíng studyofhybriddataminingtechniquesappliedforfilteringspammail AT ipingshu hùnhéshìzīliàotànkānjìshùyīngyòngyúlājīyóujiànguòlǜzhīyánjiū AT xúyīpíng hùnhéshìzīliàotànkānjìshùyīngyòngyúlājīyóujiànguòlǜzhīyánjiū |
_version_ |
1718261199689220096 |