Text Categorization with Latent Dirichlet Allocation
This paper focuses on the text categorization of Slovak text corpora using latent Dirichlet allocation. Our goal is to build text subcorpora that contain similar text documents. We want to use these better organized text subcorpora to build more robust language models that can be used in the area of...
Main Authors: | ZLACKÝ Daniel, STAŠ Ján, JUHÁR Jozef, CIŽMÁR Anton |
---|---|
Format: | Article |
Language: | English |
Published: |
Editura Universităţii din Oradea
2014-05-01
|
Series: | Journal of Electrical and Electronics Engineering |
Subjects: | |
Online Access: | http://electroinf.uoradea.ro/images/articles/CERCETARE/Reviste/JEEE/JEEE_V7_N1_MAY_2014/Zlacky_may2014.pdf |
Similar Items
-
Categorization of Unorganized Text Corpora for better Domain-Specific Language Modeling
by: Jan Stas, et al.
Published: (2013-01-01) -
Comparing Latent Dirichlet Allocation and Latent Semantic Analysis as Classifiers
by: Anaya, Leticia H.
Published: (2011) -
Latent Dirichlet Allocation in R
by: Ponweiser, Martin
Published: (2012) -
Tag recommendation using Latent Dirichlet Allocation.
by: Choubey, Rahul
Published: (2011) -
Topic modeling using latent dirichlet allocation on disaster tweets
by: Patel, Virashree Hrushikesh
Published: (2018)