Text Classification using String Kernels

We propose a novel approach for categorizing text documents based on the use of a special kernel. The kernel is an inner product in the feature space generated by all subsequences of length k. A subsequence is any ordered sequence of k characters occurring in the text though not necessarily contiguo...

Full description

Bibliographic Details
Main Authors: Lodhi, H. (Author), Saunders, C. (Author), Shawe-Taylor, J. (Author), Cristianini, N. (Author), Watkins, C. (Author)
Format: Article
Language:English
Published: 2002.
Subjects:
Online Access:Get fulltext