Exploring the Application of Machine Learning for Automatic Inbound Email Classification in CRM System at XYZ Company

Authors

  • Lukman Arif Sanjani Institut Teknologi Sepuluh Nopember
  • Raden Bimo Mandala Putra Institut Teknologi Sepuluh Nopember
  • Umi Laili Yuhana Institut Teknologi Sepuluh Nopember

DOI:

https://doi.org/10.37802/joti.v6i1.715

Keywords:

classification, Customer Service, Machine Learning, Random Forest, SVM, XGBoost

Abstract

Customer service has become increasingly crucial in today's business landscape, necessitating companies to provide fast, responsive, and personalized assistance to their clientele. However, amidst the challenges posed by surges in email volume, manual categorization and response strategies often lead to performance declines. To address this, we propose a system leveraging Machine Learning techniques for automated email classification. Our evaluation reveals promising results, with SVM achieving the highest accuracy of 96.59%, followed by XGB (96.02%) and RF (95.27%). These models exhibit commendable precision, recall, F1 scores, and Matthews Correlation Coefficient (MCC), showcasing their effectiveness in improving customer service efficiency and responsiveness. This integration of technology not only enhances operational efficiency but also fosters harmonious customer relationships, ultimately leading to increased loyalty and profitability for companies.

Downloads

Download data is not yet available.

References

H. Drucker, D. Wu, and Y. N. Yapnik, “Support Vector Machines for spam categorization,” IEEE Trans Neural Netw, vol. 10, no. xx, 1999.

Md. R. Islam, M. U. Chowdhury, and Wanlei Zhou, “An Innovative Spam Filtering Model Based on Support Vector Machine,” in International Conference on Computational Intelligence for Modelling, Control and Automation and International Conference on Intelligent Agents, Web Technologies and Internet Commerce (CIMCA-IAWTIC’06), IEEE, pp. 348–353. doi: 10.1109/CIMCA.2005.1631493.

C. Anilkumar, A. Karrothu, N. S. Mouli, and C. B. Tej, “Recognition and Processing of phishing Emails Using NLP: A Survey,” in 2023 International Conference on Computer Communication and Informatics (ICCCI), Coimbatore, India, Jan. 2023.

R. Suprayoga, S. Zega, Muhathir, and S. Mardiana, “Classification of Mango Leaf Diseases Using XGBoost Method and HoG Feature Extraction,” in 2023 International Conference on Modeling & E-Information Research, Artificial Learning and Digital Applications (ICMERALDA), 2023.

K. Schneider, “A comparison of event models for Naive Bayes antispam e-mail filtering,” in Proceedings of the II th Conference of the European Chapter of the Association for Computational Linguistics (EACL’03), 2003.

K. Iqbal and M. S. Khan, “Email classification analysis using machine learning techniques,” Applied Computing and Informatics, May 2022, doi: 10.1108/ACI-01-2022-0012.

J. D. Brutiag and C. Meek, “Challenges of the email domain for text classification,” in ICML, 2000, pp. 103–110.

A. Putu Candra et al., “Sistem Informasi Penjualan Online Thrift Shop Berbasis Web,” Journal of Technology and Informatics (JoTI), vol. 5, no. 2, pp. 116–124, Apr. 2024, doi: 10.37802/joti.v5i2.586.

A. Hidayatur, A. Yuana, A. Wafi, R. Harjo, T. Maulana, and A. Terza, “Pengembangan Proses Bisnis Pelayanan Statistik Terpadu Badan Pusat Statistik Kota Surabaya Menggunakan Metode Prototyping,” Journal of Technology and Informatics (JoTI), vol. 5, no. 2, pp. 70–79, Apr. 2024, doi: 10.37802/joti.v5i2.548.

F. Koto, A. Rahimi, J. H. Lau, and T. Baldwin, “IndoLEM and IndoBERT: A Benchmark Dataset and Pre-trained Language Model for Indonesian NLP,” in Proceedings of the 28th International Conference on Computational Linguistics, D. Scott, N. Bel, and C. Zong, Eds., Barcelona, Spain (Online): International Committee on Computational Linguistics, Dec. 2020, pp. 757–770. doi: 10.18653/v1/2020.coling-main.66.

H. Lucky, Roslynlia, and D. Suhartono, “Towards Classification of Personality Prediction Model: A Combination of BERT Word Embedding and MLSMOTE,” in 2021 1st International Conference on Computer Science and Artificial Intelligence (ICCSAI), 2021, pp. 346–350. doi: 10.1109/ICCSAI53272.2021.9609750.

Pavithra and Savitha, “Topic Modeling for Evolving Textual Data Using LDA, HDP, NMF, BERTOPIC, and DTM With a Focus on Research Papers,” Journal of Technology and Informatics (JoTI), vol. 5, no. 2, pp. 53–63, Apr. 2024, doi: 10.37802/joti.v5i2.618.

J. Devlin, M.-W. Chang, K. Lee, and K. Toutanova, “BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding”.

S. Ravichandiran, Getting Started with Google BERT: Build and train state-of-the-art natural language processing models using BERT. Packt Publishing Ltd, 2021.

L. A. Sanjani, S. J. Hartati, and P. Sudarmaningtyas, “Rancang Bangun Sistem Informasi Penggajian Pegawai dan Remunerasi Jasa Medis Pada Rumah Sakit Bedah Surabaya,” Jurnal Sistem Informasi dan Komputer Akuntansi (JSIKA), vol. 3, no. 1, pp. 87–93, 2014.

Downloads