EAI Endorsed Transactions on Scalable Information Systems (Apr 2018)
Big Data and Named Entity Recognition Approaches for Urdu Language
Abstract
Nowadays data is stored in digital form and Terabyte of data is generated on daily basis. It is difficult task to extract useful information from Big data efficiently. From unstructured text Information extraction is a technique which used to extract information. Named Entity Recognition (NER) is an essential component of information extraction in the field of Natural Language Processing (NLP). Further, Urdu language has various challenges to NER due to its agglutinative, inflectional nature and rich morphology. Therefore, NER systems for Urdu language are not mature yet due to lack of resources and ambiguities. This paper specifically addresses the different approaches to NER and explore the existing work for NER in Urdu language.
Keywords