A Chinese spam detector using text and image features

碩士 === 中華大學 === 資訊工程學系(所) === 95 === With the internet growing popular rapidly, the issue of e-mail spam becomes more and more important. In this thesis, we proposed some methods for spam mail detection by content analyzing. We take some important features for email classification. First, we take th...

Full description

Bibliographic Details
Main Author: 楊僑友
Other Authors: 韓欽銓
Format: Others
Language:zh-TW
Published: 2007
Online Access:http://ndltd.ncl.edu.tw/handle/56500321880459820137
Description
Summary:碩士 === 中華大學 === 資訊工程學系(所) === 95 === With the internet growing popular rapidly, the issue of e-mail spam becomes more and more important. In this thesis, we proposed some methods for spam mail detection by content analyzing. We take some important features for email classification. First, we take the keywords for the feature of text content of emails. With a new method for keyword selecting and nearest neighbor classifier training, we proposed a nearest neighbors classifier with evolutionary algorithm. Be due to keyword selecting, it not only can improve the accuracy of email classification via text-analyzing, but also reduces the number of features and references. For the new type of spam-image spam, we proposed a new method for image analyzing. By taking some special features of image, the spam image classifier can get high accuracy of 82%. We expect the two ways for spam mail will be able to improve the efficiency of spam mail detection.