Combining available datasets for building named entity recognition models of Croatian and Slovene

The paper presents efforts to develop freely available models for named entity recognition and classification in Croatian and Slovene text. Our experiments focus on the most informative set of linguistic features, taking into account the availability of language tools and resources for the languag...

Bibliographic Details
Main Authors: Nikola Ljubešić, Marija Stupar, Tereza Jurić, Željko Agić
Format: Article
Language: English
Published: Znanstvena založba Filozofske fakultete Univerze v Ljubljani (Ljubljana University Press, Faculty of Arts), 2013-12-01
Series: Slovenščina 2.0: Empirične, aplikativne in interdisciplinarne raziskave
Online Access: http://www.trojina.org/slovenscina2.0/arhiv/2013/2/Slo2.0_2013_2_03.pdf