A Chinese text classification system based on Naive Bayes algorithm
Targeting the characteristics of Chinese text classification, this paper uses ICTCLAS (the Chinese lexical analysis system of the Chinese Academy of Sciences) for document segmentation, cleans the data and filters stop words, and uses information gain and document frequency feature sele...
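
As a rough illustration of the kind of pipeline the abstract names (stop-word filtering, document-frequency feature selection, Naive Bayes classification), here is a minimal Python sketch. It is not the paper's implementation. It assumes the documents are already segmented into tokens (ICTCLAS is not called), it omits the information gain step, and it uses a multinomial model with add-one smoothing as one common variant. The stop-word list, the toy documents, the labels, and all function names are hypothetical.

```python
import math
from collections import Counter, defaultdict

# Hypothetical toy stop-word list; a real system would load a much larger one.
STOP_WORDS = {"的", "了", "是"}


def remove_stop_words(tokens):
    """Drop tokens that appear in the stop-word list."""
    return [t for t in tokens if t not in STOP_WORDS]


def select_by_document_frequency(docs, min_df=2):
    """Keep terms that occur in at least min_df documents."""
    df = Counter()
    for tokens in docs:
        df.update(set(tokens))  # count each term once per document
    return {term for term, n in df.items() if n >= min_df}


def train_naive_bayes(docs, labels, vocab):
    """Estimate log priors and add-one-smoothed log likelihoods per class."""
    class_docs = Counter(labels)
    term_counts = defaultdict(Counter)
    for tokens, label in zip(docs, labels):
        term_counts[label].update(t for t in tokens if t in vocab)
    log_prior = {c: math.log(n / len(docs)) for c, n in class_docs.items()}
    log_likelihood = {}
    for c in class_docs:
        total = sum(term_counts[c].values()) + len(vocab)
        log_likelihood[c] = {
            t: math.log((term_counts[c][t] + 1) / total) for t in vocab
        }
    return log_prior, log_likelihood


def classify(tokens, model):
    """Return the class with the highest posterior log score."""
    log_prior, log_likelihood = model
    tokens = remove_stop_words(tokens)
    scores = {}
    for c in log_prior:
        # Terms outside the selected vocabulary are ignored.
        scores[c] = log_prior[c] + sum(
            log_likelihood[c][t] for t in tokens if t in log_likelihood[c]
        )
    return max(scores, key=scores.get)


# Hypothetical, pre-segmented training documents.
raw_docs = [
    ["球队", "赢", "了", "比赛"],
    ["球员", "的", "比赛", "进球"],
    ["球队", "比赛", "是", "冠军"],
    ["股票", "市场", "的", "上涨"],
    ["银行", "股票", "了", "利率"],
    ["市场", "利率", "是", "下跌"],
]
labels = ["sports", "sports", "sports", "finance", "finance", "finance"]

docs = [remove_stop_words(d) for d in raw_docs]
vocab = select_by_document_frequency(docs, min_df=2)
model = train_naive_bayes(docs, labels, vocab)

print(classify(["球队", "比赛"], model))
print(classify(["股票", "利率", "的"], model))
```

Working in log space avoids numeric underflow when many term probabilities are multiplied, and add-one smoothing keeps a term unseen in one class from driving that class's score to negative infinity.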
Main Author: | |
---|---|
Format: | Article |
Language: | English |
Published: | EDP Sciences, 2016-01-01 |
Series: | MATEC Web of Conferences |
Subjects: | |
Online Access: | http://dx.doi.org/10.1051/matecconf/20164401015 |