fbpx A hybrid composite features based sentence level sentiment analyzer |ARAB AMERICAN UNIVERSITY
Contact information for Technical Support and Student Assistance ... Click here

A hybrid composite features based sentence level sentiment analyzer

Authors: 
Mohammed Maree
Mujahed Eleyat
Shatha Rabayah
Mohammed Belkhatir
ISSN: 
20894872
Journal Name: 
International Journal of Artificial Intelligence
Volume: 
12
Issue: 
1
Pages From: 
284
To: 
294
Date: 
Sunday, January 1, 2023
Keywords: 
Composite features; Experimental evaluation; Extrinsic semantic resources; Natural language processing pipelines; Sentiment classification
Abstract: 
Current lexica and machine learning based sentiment analysis approaches still suffer from a two-fold limitation. First, manual lexicon construction and machine training is time consuming and error-prone. Second, the prediction’s accuracy entails sentences and their corresponding training text should fall under the same domain. In this article, we experimentally evaluate four sentiment classifiers, namely support vector machines (SVMs), Naive Bayes (NB), logistic regression (LR) and random forest (RF). We quantify the quality of each of these models using three real-world datasets that comprise 50,000 movie reviews, 10,662 sentences, and 300 generic movie reviews. Specifically, we study the impact of a variety of natural language processing (NLP) pipelines on the quality of the predicted sentiment orientations. Additionally, we measure the impact of incorporating lexical semantic knowledge captured by WordNet on expanding original words in sentences. Findings demonstrate that the utilizing different NLP pipelines and semantic relationships impacts the quality of the sentiment analyzers. In particular, results indicate that coupling lemmatization and knowledge-based n-gram features proved to produce higher accuracy results. With this coupling, the accuracy of the SVM classifier has improved to 90.43%, while it was 86.83%, 90.11%, 86.20%, respectively using the three other classifiers.