Interactions Between Term Weighting and Feature Selection Methods on the Sentiment Analysis of Turkish Reviews
Yükleniyor...
Tarih
2018
Yazarlar
Dergi Başlığı
Dergi ISSN
Cilt Başlığı
Yayıncı
Springer International Publishing Ag
Erişim Hakkı
info:eu-repo/semantics/closedAccess
Özet
Term weighting methods assign appropriate weights to the terms in a document so that more important terms receive higher weights for the text representation. In this study, we consider four term weighting and three feature selection methods and investigate how these term weighting methods respond to the reduced text representation. We conduct experiments on five Turkish review datasets so that we can establish baselines and compare the performance of these term weighting methods. We test these methods on the English reviews so that we can identify their differences with the Turkish reviews. We show that both tf and tp weighting methods are the best for the Turkish, while tp is the best for the English reviews. When feature selection is applied, tf * idf method with DFD and chi(2) has the highest accuracies for the Turkish, while tf * idf and tp methods with chi(2) have the best performance for the English reviews.
Açıklama
17th International Conference on Intelligent Text Processing and Computational Linguistics (CICLing) -- APR 03-09, 2016 -- Mevlana Univ, Konya, TURKEY
Anahtar Kelimeler
Sentiment analysis, Feature selection, Term weighting
Kaynak
Computational Linguistics and Intelligent Text Processing, (Cicling 2016), Pt Ii
WoS Q Değeri
N/A
Scopus Q Değeri
Q3
Cilt
9624