Text Categorization with Latent Dirichlet Allocation

This paper focuses on the text categorization of Slovak text corpora using latent Dirichlet allocation. Our goal is to build text subcorpora that contain similar text documents. We want to use these better organized text subcorpora to build more robust language models that can be used in the area of...

Full description

Bibliographic Details
Main Authors: ZLACKÝ Daniel, STAŠ Ján, JUHÁR Jozef, CIŽMÁR Anton
Format: Article
Language:English
Published: Editura Universităţii din Oradea 2014-05-01
Series:Journal of Electrical and Electronics Engineering
Subjects:
Online Access:http://electroinf.uoradea.ro/images/articles/CERCETARE/Reviste/JEEE/JEEE_V7_N1_MAY_2014/Zlacky_may2014.pdf