Selecting the Best Algorithm for E-Mail Type Classification Using the TF-IDF Method

Denisa Fitria(1*), Yana Cahyana(2), Dwi Sulistya(3), Kiki Ahmad Baihaqi(4)

(1) Universitas Buana Perjuangan Karawang, Indonesia
(2) Universitas Buana Perjuangan Karawang, Indonesia
(3) Universitas Buana Perjuangan Karawang, Indonesia
(4) Universitas Buana Perjuangan Karawang, Indonesia
(*) Corresponding Author

Abstract


Spam emails, sent in bulk to large numbers of addresses, are a persistent nuisance. Filtering them requires a classifier that separates spam from non-spam, which can be built as an anti-spam model on text-mining features such as TF-IDF. Following the KDD (Knowledge Discovery in Databases) process, this study analyzed a dataset of 6046 entries, of which 77.2% were non-spam and 22.8% spam. Logistic Regression achieved the highest accuracy at 98%, ahead of Support Vector Machine (95%) and Decision Tree (59%). Logistic Regression was therefore the best-performing of the three algorithms for email classification on this dataset.
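As an illustration of the TF-IDF weighting the abstract refers to, here is a minimal sketch in plain Python. The abstract does not state which TF-IDF variant or library the study used, so the formula here (relative term frequency times log(N / df)), the whitespace tokenizer, the function names, and the toy messages are all assumptions made for illustration; library implementations often apply smoothing or normalization that this sketch omits.

```python
import math
from collections import Counter


def tokenize(text):
    # Lowercase and split on whitespace (assumed, simplistic tokenizer).
    return text.lower().split()


def tf_idf(documents):
    """Return one {term: weight} dict per document, using tf * log(N / df)."""
    tokenized = [tokenize(doc) for doc in documents]
    n_docs = len(tokenized)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    weights = []
    for tokens in tokenized:
        counts = Counter(tokens)
        total = len(tokens)
        weights.append({
            term: (count / total) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return weights


# Hypothetical toy messages, not taken from the study's dataset.
emails = [
    "win a free prize now",
    "meeting agenda for monday",
    "free prize waiting claim now",
]
vectors = tf_idf(emails)
```

In this toy corpus, a term that appears in only one message (such as "win") receives a higher weight than one shared by two messages (such as "free"), which is the property that lets a downstream classifier such as Logistic Regression focus on distinctive words.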






DOI: http://dx.doi.org/10.30645/jurasik.v9i1.747

DOI (PDF): http://dx.doi.org/10.30645/jurasik.v9i1.747.g722




JURASIK (Jurnal Riset Sistem Informasi dan Teknik Informatika)