Study of Hybrid Data Mining Techniques Applied for Filtering Spam Mail

碩士 === 華梵大學 === 資訊管理學系碩士班 === 97 === The network has been established and developed since 1970; people have generally used the network. People artificially delivered mail before, but this tendency was transferred to E-mail. The time and distance of communication were decreased by E-mail, and E-mail...

Full description

Bibliographic Details
Main Authors: I-Ping Shu, 徐一平
Other Authors: Zne-Jung Lee
Format: Others
Language:zh-TW
Published: 2009
Online Access:http://ndltd.ncl.edu.tw/handle/00992363360301001047
id ndltd-TW-097HCHT0396036
record_format oai_dc
spelling ndltd-TW-097HCHT03960362016-05-06T04:11:31Z http://ndltd.ncl.edu.tw/handle/00992363360301001047 Study of Hybrid Data Mining Techniques Applied for Filtering Spam Mail 混合式資料探勘技術應用於垃圾郵件過濾之研究 I-Ping Shu 徐一平 碩士 華梵大學 資訊管理學系碩士班 97 The network has been established and developed since 1970; people have generally used the network. People artificially delivered mail before, but this tendency was transferred to E-mail. The time and distance of communication were decreased by E-mail, and E-mail gradually changed our live and working way. At this moment, some beneficial people use the malicious programs or collect the email boxes in many ways, then send email arbitrarily. It has been perplexed to the receiver. This study (GA/DT) adopts the genetic algorithm (Genetic Algorithms, GA) and decision tree (Decision Tree, DT) of data mining techniques to select Minimum Case and Pruning CF parameters. Experiment results indicate that the accuracy of the hybrid GA/DT algorithm is 95.0%. In other algorithms, Logistic Algorithm has a better accuracy of 5.0% and ANN has an accuracy of 2.337 % and SVM has an accuracy of 4.0 %, and it shows that the GA/DT algorithm can accurately select the Minimum Case and Pruning CF parameters in the DT algorithm and effectively enhance the performance of identifying spam mails. Zne-Jung Lee 李仁鐘 2009 學位論文 ; thesis 82 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 華梵大學 === 資訊管理學系碩士班 === 97 === The network has been established and developed since 1970; people have generally used the network. People artificially delivered mail before, but this tendency was transferred to E-mail. The time and distance of communication were decreased by E-mail, and E-mail gradually changed our live and working way. At this moment, some beneficial people use the malicious programs or collect the email boxes in many ways, then send email arbitrarily. It has been perplexed to the receiver. This study (GA/DT) adopts the genetic algorithm (Genetic Algorithms, GA) and decision tree (Decision Tree, DT) of data mining techniques to select Minimum Case and Pruning CF parameters. Experiment results indicate that the accuracy of the hybrid GA/DT algorithm is 95.0%. In other algorithms, Logistic Algorithm has a better accuracy of 5.0% and ANN has an accuracy of 2.337 % and SVM has an accuracy of 4.0 %, and it shows that the GA/DT algorithm can accurately select the Minimum Case and Pruning CF parameters in the DT algorithm and effectively enhance the performance of identifying spam mails.
author2 Zne-Jung Lee
author_facet Zne-Jung Lee
I-Ping Shu
徐一平
author I-Ping Shu
徐一平
spellingShingle I-Ping Shu
徐一平
Study of Hybrid Data Mining Techniques Applied for Filtering Spam Mail
author_sort I-Ping Shu
title Study of Hybrid Data Mining Techniques Applied for Filtering Spam Mail
title_short Study of Hybrid Data Mining Techniques Applied for Filtering Spam Mail
title_full Study of Hybrid Data Mining Techniques Applied for Filtering Spam Mail
title_fullStr Study of Hybrid Data Mining Techniques Applied for Filtering Spam Mail
title_full_unstemmed Study of Hybrid Data Mining Techniques Applied for Filtering Spam Mail
title_sort study of hybrid data mining techniques applied for filtering spam mail
publishDate 2009
url http://ndltd.ncl.edu.tw/handle/00992363360301001047
work_keys_str_mv AT ipingshu studyofhybriddataminingtechniquesappliedforfilteringspammail
AT xúyīpíng studyofhybriddataminingtechniquesappliedforfilteringspammail
AT ipingshu hùnhéshìzīliàotànkānjìshùyīngyòngyúlājīyóujiànguòlǜzhīyánjiū
AT xúyīpíng hùnhéshìzīliàotànkānjìshùyīngyòngyúlājīyóujiànguòlǜzhīyánjiū
_version_ 1718261199689220096