Please use this identifier to cite or link to this item: http://hdl.handle.net/10525/3871

Title: SMS Sentiment Classification based on Lexical Features, Emoticons and Informal Abbreviations
Authors: Šandrih, Branislava
Keywords: Computer Application in Arts and Humanities; Web-Based Services; Document Analysis
Issue Date: 2019
Publisher: Institute of Mathematics and Informatics Bulgarian Academy of Sciences
Citation: Serdica Journal of Computing, Vol. 13, No. 1-2 (2019), pp. 81-96
Abstract: In this paper we investigate the influence of emoticons, informal speech, lexical and other linguistic features on the sentiment expressed in SMS messages. Using a dataset of approximately 6,000 samples, we trained a linear SVM classifier that distinguishes positive, negative and neutral sentiment. The dataset consists mostly of messages in Serbian, but also contains messages in English and German. The classifier achieved an average accuracy of 92.3% in a 5-fold cross-validation setting, and F1-scores of 92.1%, 74.0% and 93.3% for the positive, negative and neutral classes, respectively.
URI: http://hdl.handle.net/10525/3871
ISSN: 1312-6555
Appears in Collections: Volume 13, Number 1-2
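
To illustrate the kind of features and per-class scores the abstract mentions, a minimal Python sketch might look like the following. The emoticon and abbreviation lists, the feature names, and both function names are hypothetical and are not taken from the paper. The classifier itself (a linear SVM evaluated with 5-fold cross-validation) is not reproduced here; the sketch only shows how emoticon and abbreviation counts could be extracted from a message and how an F1-score per class is computed from predictions.

```python
import re

# Hypothetical lexicons, for illustration only; the paper's actual lists are not given here.
POSITIVE_EMOTICONS = {":)", ":-)", ":D", ";)"}
NEGATIVE_EMOTICONS = {":(", ":-(", ":'("}
ABBREVIATIONS = {"lol", "thx", "omg", "btw"}


def extract_features(message):
    """Count emoticons, informal abbreviations and simple surface cues."""
    tokens = message.split()
    # Strip punctuation so that "thx!" still matches the abbreviation list.
    words = [re.sub(r"\W+", "", t).lower() for t in tokens]
    return {
        "pos_emoticons": sum(t in POSITIVE_EMOTICONS for t in tokens),
        "neg_emoticons": sum(t in NEGATIVE_EMOTICONS for t in tokens),
        "abbreviations": sum(w in ABBREVIATIONS for w in words),
        "exclamations": message.count("!"),
        "n_tokens": len(tokens),
    }


def f1_per_class(y_true, y_pred):
    """Per-class F1, computed as 2*TP / (2*TP + FP + FN)."""
    scores = {}
    for label in sorted(set(y_true) | set(y_pred)):
        pairs = list(zip(y_true, y_pred))
        tp = sum(t == label and p == label for t, p in pairs)
        fp = sum(t != label and p == label for t, p in pairs)
        fn = sum(t == label and p != label for t, p in pairs)
        denom = 2 * tp + fp + fn
        scores[label] = 2 * tp / denom if denom else 0.0
    return scores
```

Feature dictionaries of this shape could then be vectorized and passed to any linear classifier; reporting F1 per class, as the abstract does, makes a weaker class (here, the negative one) visible even when overall accuracy is high.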

Files in This Item:

File: sjc-vol13-num1-2-2019-p081-p096.pdf
Size: 426.19 kB
Format: Adobe PDF

 



Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

 
