Journal of Applied Computer Science & Mathematics (Apr 2016)

Framework for Urdu News Headlines Classification

  • Kashif AHMED,
  • Mubashir ALI,
  • Shehzad KHALID,
  • Muhammad KAMRAN

DOI
https://doi.org/10.4316/JACSM.201601002
Journal volume & issue
Vol. 10, no. 1
pp. 17 – 21

Abstract

Automatic text classification is of great significance in text mining and plays a pivotal role in areas such as spam filtering, news classification, and noise reduction. The literature contains ample research on classifying text documents, for example English news classification and Persian text classification, but comparatively little work on short Urdu text or Urdu news headline classification. Therefore, after examining various existing news classification methodologies, we propose in this paper an SVM-based framework for classifying Urdu news headlines. The approach assigns Urdu news items, based on their headlines, to their respective predefined categories by utilizing the maximum indexes of their feature vectors. The proposed system is compared with existing state-of-the-art techniques.

Keywords