An investigation of term weighting and feature selection methods for sentiment analysis

dc.authorscopusid57191611053
dc.authorscopusid6603978393
dc.contributor.authorParlar, Tuba
dc.contributor.authorÖzel, Selma Ayse
dc.date.accessioned2024-09-19T15:47:14Z
dc.date.available2024-09-19T15:47:14Z
dc.date.issued2018
dc.departmentHatay Mustafa Kemal Üniversitesien_US
dc.description.abstractSentiment analysis automatically classifies the opinions, which are expressed in a document, usually as positive or negative. A review document in general, reflects its author's opinion about the objects mentioned in the text. Therefore, it can have many useful applications such as opinionated web search and automatic analysis of reviews. Although sentiment analysis is a kind of text classification problem, structures of review documents are different from texts like news, articles, or web pages; so that techniques applied for text classification are needed to be re-experimented for the sentiment analysis. Assigning appropriate weights to features is important to the performance of sentiment analysis so that important features can receive higher weights for the feature vectors. Feature selection reduces feature vector size by eliminating redundant or irrelevant features to improve classification accuracy. In this study, our aim is to examine the effects of term weighting methods on newly proposed Query Expansion Ranking (QER) feature selection method and also compare the classification results with one of the well-known feature selection method namely Chi-square statistic. We use three popular term weighting methods (i.e., term presence, term frequency, term frequency and inverse document frequency-tf*idf) and perform experiments using multinomial Naïve Bayes classifier. The experimental results show that when QER feature selection method is used with tf*idf term weighting method, the classification performance improves in terms of F-score. © 2017 Majlesi Branch, Islamic Azad University.en_US
dc.identifier.endpage68en_US
dc.identifier.issn2345-377X
dc.identifier.issue2en_US
dc.identifier.scopus2-s2.0-85048693882en_US
dc.identifier.scopusqualityQ4en_US
dc.identifier.startpage63en_US
dc.identifier.urihttps://hdl.handle.net/20.500.12483/15057
dc.identifier.volume12en_US
dc.indekslendigikaynakScopusen_US
dc.language.isoenen_US
dc.publisherIslamic Azad Universityen_US
dc.relation.ispartofMajlesi Journal of Electrical Engineeringen_US
dc.relation.publicationcategoryMakale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanıen_US
dc.rightsinfo:eu-repo/semantics/closedAccessen_US
dc.subjectFeature selectionen_US
dc.subjectSentiment analysisen_US
dc.subjectTerm weightingen_US
dc.subjectText classificationen_US
dc.titleAn investigation of term weighting and feature selection methods for sentiment analysisen_US
dc.typeArticleen_US

Dosyalar