Optimization of Sentiment Analysis on Twitter for the Tokopedia Online Shop Using Text Mining with the Naïve Bayes & AdaBoost Algorithms

H Hartati(1*), Deni Hermawan(2), M. Akhsanal(3), Zailani Wahyudi(4), Angga Ariyanto(5), Dedi Dwi Saputra(6)

(1) Universitas Nusa Mandiri
(2) Universitas Nusa Mandiri
(3) Universitas Nusa Mandiri
(4) Universitas Nusa Mandiri
(5) Universitas Nusa Mandiri
(6) Universitas Nusa Mandiri
(*) Corresponding Author

Abstract


Sentiment analysis, also called opinion mining, is the process of automatically understanding, extracting, and processing textual data to obtain the sentiment expressed in an opinion about a problem or object, that is, whether the opinion tends to be positive or negative. This study aims to classify tweets into two classes, positive and negative. The data are Indonesian-language tweets about Tokopedia posted on Twitter. The public opinion contained in these tweets can be used to determine whether tweets about Tokopedia are positive or negative. The dataset consists of 1,000 tweets written by Tokopedia customers to the Tokopedia Twitter account. The text mining steps “transform case”, “tokenize”, “token filter by length”, and “stemming” are used to build the classifiers. Gata Framework is used for preprocessing and cleansing. RapidMiner is used to build the sentiment analysis and to compare three classification methods on the Tokopedia tweet data: the Naïve Bayes algorithm; Naïve Bayes with the Synthetic Minority Over-sampling Technique (SMOTE); and Naïve Bayes with SMOTE, optimized with AdaBoost. Naïve Bayes with SMOTE, optimized with AdaBoost, obtained the best score, with 94.95% accuracy, 90.86% precision, 100.00% recall, and an AUC of 0.950.
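The preprocessing steps named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' Gata Framework or RapidMiner setup. The function name `preprocess`, the length thresholds, the removal of URLs, mentions, and hashtags, and the sample tweet are all assumptions made for illustration. Stemming is left out because it would need an Indonesian stemmer library.

```python
import re

# Hypothetical length limits; the abstract does not state the values used.
MIN_LEN = 3
MAX_LEN = 25


def preprocess(tweet, min_len=MIN_LEN, max_len=MAX_LEN):
    """Apply transform case, cleansing, tokenize, and token filter by length."""
    # Transform case: lowercase the whole tweet.
    text = tweet.lower()
    # Cleansing (assumed rule): drop URLs, @mentions, and #hashtags.
    text = re.sub(r"https?://\S+|[@#]\w+", " ", text)
    # Tokenize: keep runs of ASCII letters as tokens.
    tokens = re.findall(r"[a-z]+", text)
    # Token filter by length: keep tokens within the configured bounds.
    return [t for t in tokens if min_len <= len(t) <= max_len]


# Made-up example tweet, not taken from the study's dataset.
sample = "@tokopedia Pesanan saya BELUM sampai, tolong dicek ya! https://t.co/abc"
print(preprocess(sample))
# → ['pesanan', 'saya', 'belum', 'sampai', 'tolong', 'dicek']
```

The resulting token lists would then be vectorized and passed to the classifiers being compared; that stage is not shown here.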






DOI: http://dx.doi.org/10.30645/j-sakti.v6i2.493




J-SAKTI (Jurnal Sains Komputer & Informatika)