Text Classification using String Kernels
We propose a novel approach for categorizing text documents based on the use of a special kernel. The kernel is an inner product in the feature space generated by all subsequences of length k. A subsequence is any ordered sequence of k characters occurring in the text though not necessarily contiguo...
Main Authors: | Lodhi, H. (Author), Saunders, C. (Author), Shawe-Taylor, J. (Author), Cristianini, N. (Author), Watkins, C. (Author) |
---|---|
Format: | Article |
Language: | English |
Published: |
2002.
|
Subjects: | |
Online Access: | Get fulltext |
Similar Items
-
Latent Semantic Kernels
by: Cristianini, N., et al.
Published: (2002) -
Kernel-Based Learning of Hierarchical Multilabel Classification Models
by: Rousu, J., et al.
Published: (2006) -
Text Clustering with String Kernels in R
by: Karatzoglou, Alexandros, et al.
Published: (2006) -
On the Eigenspectrum of the Gram matrix and the generalisation error of kernel PCA
by: Shawe-Taylor, John, et al.
Published: (2004) -
String to String Correction Kernelization
by: Watt, Nathaniel
Published: (2013)