Pengembangan Deteksi Pesan Spam pada Website Inti Everspring Indonesia Menggunakan Algoritma Support Vector Machine

Authors

  • Syafaat Akbar Prodi Teknik Informatika, Jurusan Teknologi Informasi, Politeknik Negeri Malang
  • Mamluatul Hani'ah Prodi Teknik Informatika, Jurusan Teknologi Informasi, Politeknik Negeri Malang
  • Imam Fahrur Rozi Prodi Teknik Informatika, Jurusan Teknologi Informasi, Politeknik Negeri Malang

DOI:

https://doi.org/10.35746/jtim.v8i2.872

Keywords:

Deteksi Spam, Support Vector Machine, klasifikasi email, Pengolahan Teks, Inti Everspring Indonesia

Abstract

The development of information technology has driven the growth of email-based communica-tion in business environments, including at Inti Everspring Indonesia. However, the high volume of incoming emails increases the potential for spam messages that may disrupt work effectiveness and data security. This study develops a spam detection system on the company’s website by ap-plying the Support Vector Machine (SVM) algorithm. SVM was selected because of its ability to perform text classification efficiently. The dataset used in this research comes from the company’s internal emails, consisting of labeled spam and non-spam messages. Since the dataset is imbal-anced, an oversampling process was applied, followed by text preprocessing steps including case folding, tokenization, removal of stop words, symbols, numbers, and stemming. The model was then trained using the SVM algorithm, and its performance was evaluated using several metrics: accuracy, recall, precision, and F1-score. Based on the experiments, the SVM-based spam detec-tion model achieved 100% precision, 100% recall, and a 100% F1-score. To validate the reliabil-ity of the algorithm, SVM performance was compared with BERT and Naïve Bayes. BERT achieved 96% accuracy, and Naïve Bayes achieved 97% accuracy. These results indicate that SVM is capable of classifying messages accurately, and SVM outperforms both algorithms.

Downloads

Download data is not yet available.

References

M. R. Ningsih, J. Unjung, H. al Farih, and M. A. Muslim, "Classification email spam using Naive Bayes algorithm and Chi-squared feature selection," J. Appl. Intell. Syst., vol. 9, no. 1, pp. 74–87, 2024, https://doi.org/10.62411/jais.v9i1.9695

Z. A. Diekson, M. R. B. Prakoso, M. S. Q. Putra, M. S. A. F. Syaputra, S. Achmad, and R. Sutoyo, “Sentiment analysis for customer review: Case study of Traveloka,” Procedia Computer Science, vol. 216, pp. 682–690, 2023, https://doi.org/10.1016/j.procs.2022.12.184

R. R. Salam, M. F. Jamil, Y. Ibrahim, R. Rahmaddeni, S. Soni, and H. Herianto, “Analisis sentimen terhadap Bantuan Langsung Tunai (BLT) Bahan Bakar Minyak (BBM) menggunakan Support Vector Machine: Sentiment analysis of cash direct assistance distribution for fuel oil using Support Vector Machine,” MALCOM: Indonesian Journal of Machine Learning and Computer Science, vol. 3, no. 1, pp. 27–35, 2023, https://doi.org/10.57152/malcom.v3i1.590

R. Fatima, M. M. S. Fareed, S. Ullah, G. Ahmad, and S. Mahmood, "An optimized approach for detection and classifica-tion of spam email's using ensemble methods," Wireless Pers. Commun., pp. 1–27, 2024, https://doi.org/10.1007/s11277-024-11628-9

Y. R. Hutagaol and Y. Arifin, "Klasifikasi spam email berbasis semantik menggunakan metode BERT," INTECOMS J. Inf. Technol. Comput. Sci., vol. 7, no. 5, pp. 1823–1836, 2024, https://doi.org/10.31539/intecoms.v7i5.12515

T. Sahmoud and M. Mikki, "Spam detection using BERT," arXiv preprint arXiv:2206.02443, 2022. https://doi.org/10.48550/arXiv.2206.02443

R. Meléndez, M. Ptaszynski, and F. Masui, "Comparative investigation of traditional machine-learning models and transformer models for phishing email detection," Electronics, vol. 13, no. 24, p. 4877, 2024. https://doi.org/10.3390/electronics13244877

A. Wantoro and D. Saputra, "Perbandingan metode Naive Bayes dan Support Vector Machine (SVM) pada analisis kendaraan listrik pada media sosial 'X'," J. Inform. Polinema, vol. 11, no. 2, pp. 227–234, 2025, https://doi.org/10.33795/jip.v11i2.6458

R. Apriani and D. Gustian, "Analisis sentimen dengan Naïve Bayes terhadap komentar aplikasi Tokopedia," J. Rekayasa Teknol. Nusa Putra, vol. 6, no. 1, pp. 54–62, 2019, https://doi.org/10.52005/rekayasa.v6i1.86

M. Jaiswal and S. Das, "Detecting spam e-mails using stop word TF-IDF and stemming algorithm with Naïve Bayes classifier on the multicore GPU," Int. J. Electr. Comput. Eng. (IJECE), vol. 11, no. 4, pp. 3168–3175, Aug. 2021, https://doi.org/10.11591/ijece.v11i4.pp3168-3175

R. R. Sani, Y. A. Pratiwi, S. Winarno, E. D. Udayanti, and F. Alzami, "Analisis perbandingan algoritma Naive Bayes classifier dan Support Vector Machine untuk klasifikasi berita hoax pada berita online Indonesia," J. Masy. Informatika, vol. 13, no. 2, pp. 85–98, Nov. 2022, https://doi.org/10.14710/jmasif.13.2.47983

D. Wahyuningtyas, "Spam detection on Twitter using Naive Bayes algorithm," J. Ilmu Komput. Apl. (JIKA), vol. 7, no. 1, pp. 31–40, 2020, https://doi.org/10.29244/jika.7.1.31-40

I. Sitohang, T. H. Saragih, D. Kartini, R. A. Nugroho, and M. R. Faisal, "Implementasi SMOTE dan extreme learning machines pada klasifikasi dataset microarray," J. Inform. Polinema, vol. 8, no. 4, pp. 9–16, 2022, https://doi.org/10.33795/jip.v8i4.1029

M. Rahayu, A. Luthfiarta, L. Cahyaningrum, and A. N. Azzahra, "Pengaruh oversampling dan cross validation pada model machine learning untuk sentimen analisis kebijakan luaran kelulusan mahasiswa," J. Media Inform. Budidarma, vol. 8, no. 1, pp. 163–172, Jan. 2024, https://doi.org/10.30865/mib.v8i1.7012

S. Vijayarani and R. Janani, "String matching algorithms for retrieving information from desktop—comparative analy-sis," in Proc. Int. Conf. Inventive Comput. Technol. (ICICT), vol. 3, pp. 1–6, Aug. 2016, https://doi.org/10.1109/INVENTIVE.2016.7830233

R. Wongso, F. A. Luwinda, B. C. Trisnajaya, and O. Rusli, "News article text classification in Indonesian language," Pro-cedia Comput. Sci., vol. 116, pp. 137–143, 2017, https://doi.org/10.1016/j.procs.2017.10.039

A. Subadi and K. Kusrini, "Diagnosa stunting berdasarkan gejala medis menggunakan algoritma Naive Bayes, SVM dan K-NN," J. Inform. Polinema, vol. 10, no. 4, pp. 501–510, 2024, https://doi.org/10.33795/jip.v10i4.5628

H. Apriyani and K. Kurniati, "Perbandingan metode Naive Bayes dan Support Vector Machine dalam klasifikasi pen-yakit diabetes melitus," J. Inf. Technol. Ampera, vol. 1, no. 3, pp. 133–143, 2020, https://doi.org/10.51519/journalita.volume1.isssue3.year2020.page133-143

A. Fadli, T. Limbong, R. Priskila, and V. H. Pranatawijaya, "Penggunaan algoritma Naive Bayes untuk memprediksi kelulusan mahasiswa," J. Mahas. Tek. Inform. (JATI), vol. 8, no. 3, pp. 3773–3779, Jun. 2024, https://doi.org/10.36040/jati.v8i3.9791

N. Capuano, L. Greco, P. Ritrovato, and M. Vento, “Sentiment analysis for customer relationship management: an incre-mental learning approach,” Applied Intelligence 2020 51:6, vol. 51, no. 6, pp. 3339–3352, Nov. 2020, https://doi.org/10.1007/S10489-020-01984-X

Downloads

Published

2026-03-19

Issue

Section

Articles

How to Cite

[1]
S. Akbar, M. Hani'ah, and I. F. Rozi, “Pengembangan Deteksi Pesan Spam pada Website Inti Everspring Indonesia Menggunakan Algoritma Support Vector Machine”, jtim, vol. 8, no. 2, pp. 209–219, Mar. 2026, doi: 10.35746/jtim.v8i2.872.