A Chinese text classification system based on Naive Bayes algorithm

In this paper, aiming at the characteristics of Chinese text classification, using the ICTCLAS(Chinese lexical analysis system of Chinese academy of sciences) for document segmentation, and for data cleaning and filtering the Stop words, using the information gain and document frequency feature sele...

Full description

Bibliographic Details
Main Author: Cui Wei
Format: Article
Language:English
Published: EDP Sciences 2016-01-01
Series:MATEC Web of Conferences
Subjects:
Online Access:http://dx.doi.org/10.1051/matecconf/20164401015