Vector representation based on a supervised codebook for Nepali documents classification

Document representation with outlier tokens degrades classification performance because of the uncertain orientation of such tokens. Most existing document representation methods, in Nepali and in other languages, ignore strategies to filter these tokens out of documents before learning...

Bibliographic Details
Main Authors: Chiranjibi Sitaula, Anish Basnet, Sunil Aryal
Format: Article
Language: English
Published: PeerJ Inc. 2021-03-01
Series: PeerJ Computer Science
Online Access: https://peerj.com/articles/cs-412.pdf