Journal of Universal Computer Science (Jan 2020)
Detecting Epidemic Diseases Using Sentiment Analysis of Arabic Tweets
Abstract
Read online Read online Read online
Opinion mining is an important step towards facilitating information in health data. Several studies have demonstrated the possibility of tracking diseases using public tweets. However, most studies were applied to English language tweets. Influenza is currently one of the world's greatest infectious disease challenges. In this study, a new approach is proposed in order to detect Influenza using machine learning techniques from Arabic tweets in Arab countries. This paper is the first study of epidemic diseases based on Arabic language tweets. In this work, we have collected, labeled, filtered and analyzed the influenza-related tweets written in the Arabic language. Several classifiers were used to measure the quality and the performance of the approach, which are: Naive Bayes, Support Vector Machines, Decision Trees, and K-Nearest Neighbor. The classifiers which achieved the best accuracy results for the three experiments were: Naïve Bayes with 89.06%, and K-Nearest Neighbor with 86.43%, respectively.
Keywords