Investigating the Impact of Utilizing the K-Nearest Neighbor and Levenshtein Distance Algorithms for Arabic Sentiment Analysis on Mobile Applications
الكلمات المفتاحية:
Arabic Sentiment Analysis، K-Nearest Neighbor، Levenshtein Distance، Google Playالملخص
The field of sentiment analysis of Arabic scripts remains a major challenge due to the features, characteristics, and complexity of the Arabic language. Few studies on Arabic Sentiment Analysis (ASA) have been conducted to the best of our knowledge when compared to English or other Latin languages. In addition, for ASA, very few studies have been conducted regarding the comments of the users (customers) of the apps available on the Google Play Store within various mobile application reviews. Most of the current studies have been conducted on datasets collected from Twitter user comments. In this paper, we propose a new approach to the analysis of sentiment in Arabic script based on the comments dataset of users of some mobile applications available on the Google Play Store. The proposed approach involves improving algorithms such as the Levenshtein distance (LD) algorithm for data preprocessing and then combining it with the K-Nearest Neighbor (K-NN) algorithm. Through the proposed approach, the results of the experiment were shown, investigating the impact of utilizing the K-NN and LD algorithms for ASA on mobile applications effectively. The experiments were carried out. The K-NN with LD algorithm has gained a better level of evaluation compared to the K-NN algorithm. K-NN with LD algorithm has achieved the highest accuracy, recall, precision, and F-score, which were 83.11% in accuracy, 66.30% in recall, 85.10% in precision, and 74.53% in f-score evaluation measure when =3.
التنزيلات
منشور
كيفية الاقتباس
إصدار
القسم
الحقوق الفكرية (c) 2023 Salah Alhagree, Ghaleb Al-Gaphari, Ahmed A. Al-Shalabi (Author)
هذا العمل مرخص بموجب Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.