Page 59 - IJOCTA-15-1

P. 59

BSO: Binary Sailfish Optimization for feature selection in sentiment analysis

results, the SVM-based and RF-based SA mod- Cai et. al. 46 proposed a recommendation model
els outperformed the others which are DT-based, based on DL (i.e., DeepCGSR) which uses users’
KNN-based, and LR-based models by consider- comment sets and items as a collection to do
ing F-score. For negative reviews, SVM-based cross-grained SA that consubstantiates textual
and RF-based SA algorithms were computed as critique sentiments and rating matrix. This
0.87 and 0.80 in terms of F-score, respectively. method was used on the e-commerce datasets
For neutral reviews, SVM-based and RF-based from Amazon. According to the results of
SA algorithms were computed as 0.77 and 0.64 the experiments, DeepCGSR model outperformed
in terms of F-score, respectively. For positive re- LFM 47 by 14.113%, SVD++ 48 by 13.786%,
views, SVM-based and RF-based SA algorithms TOPICMF 49 by 9.92%, DeepCoNN 50 by 5.122%
were calculated as 0.94 and 0.90 in terms of F- and NARRE 51 by 2.765%.
score, respectively.

2.2.3. SA based on nature inspired algorithms

2.2.2. SA based on DL There are not many studies where nature-inspired
languages are used as feature selectors in SA. For
this reason, since this study is thought to add
The Attention-Centred Bidirectional CNN-RNN
innovation and richness to the literature, similar
Deep Model (ABCDM) model, which removes
studies are presented in this section.
both past and future contexts, using 2 indepen-
dent bidirectional LSTM together with GRU lay- 52
ers was created to perform SA of product re- Sumathi et al. proposed to use Artificial Bee
views on Twitter in the study of Basiri et al. 43 Colony (ABC) as feature selector to classify opin-
The proposed ABCDM model was studied on five ion classification on Internet Movie Database
reviews and three Twitter datasets in order to (IMDb). Na¨ıve Bayes (NB), Fuzzy Unordered
recognize sentiment polarity. According to the Rule Induction Algorithm (FURIA) and Rip-
results, despite focusing only on document-level ple Down Rule Learner (RIDOR) are used as
sentence analysis, ABCDM, which aims to obtain classifiers with ABC. Experimental results show
both long reviews, accomplished its purpose by that ABC-RIDOR provides the best result with
using short tweet polarity classification. 93.75% accuracy.

In the study of Li et al., 44 DL-based SA algo- In the study Wahyudi and Kristiyanti, 53 SVM
rithms called lexicon integrated 2-channel Convo- Algorithm-Based Particle Swarm Optimization
lutional Neural Network–Long Short Term Mem- (PSO) was used for SA of smartphone product
ory family models were developed. To formu- review from the web site www.gsmarena.com. Ac-
late sample data entered, the sentiment padding cording to the experimental results, an increasing
methodology was used. This method was also was obtained in accuracy with SVM-based PSO
used to develop the sentiment information’s pro- as from 82.00% to 94.50%.
portion in every review and was formulated Sen-
timent Analysis’ premium lexicon components. Yuvaraj and Sabari 54 applied the Binary Shuf-
fled Frog algorithm (BSFA) as a feature selec-
In the study of Bilen and Horasan, 45 the Long- tor in sentiment analysis on the Twitter dataset
Short Term Memory Network and different ML al- provided by Stanford University. Radial Basis
gorithms were used on the Turkish Customer Re- Function (RBF), k-NN, NB and Logistic Model
views dataset which includes 8.500 customer re- Tree (LMT) were used as classifiers. The results
views collected from various electronic stores and showed that the BSFA-RBF model outperformed
Stanford Large Movie Reviews (IMDB) dataset others based on accuracy score.
which contains 50.000 movie reviews (25.000 posi-
tives and 25.000 negatives). Also, Zemberek NLP In the study of Naz et al., 55 for classification
Library for Turkish Language and Regular Ex- of sentiments using ensemble-based classifiers, to
pression models were used for the preprocessing solve the feature subset selection problem, a hy-
phase. The proposed model outperformed the brid strategy of minimum redundancy and max-
other ML models both on the Turkish dataset and imum relevance (mRMR) and Forest Optimiza-
IMDB dataset as 90.59% and 89.02% in terms of tion Algorithm (FOA)-based feature selection was
accuracy. proposed. This study employed to the Blitzer
53

54 55 56 57 58 59 60 61 62 63 64