A Hybrid Approach for NER System for Scarce Resourced Language-URDU: Integrating n-gram with Rules and Gazetteers

We present a hybrid NER (Name Entity Recognition) system for Urdu script by integration of n-gram model (unigram and bigram), rules and gazetteers. We used prefix and suffix characters for rule construction instead of first name and last name lists or potential terms on the output list that is produ...

Full description

Bibliographic Details
Main Authors: Saeeda Naz, Arif Iqbal Umar, Imran Razzak
Format: Article
Language:English
Published: Mehran University of Engineering and Technology 2015-10-01
Series:Mehran University Research Journal of Engineering and Technology
Subjects:
Online Access:http://publications.muet.edu.pk/research_papers/pdf/pdf1145.pdf