Performance Analysis of Machine Learning Algorithm in Spam Email Classification

Adamu Suliaman Usman; Oluwasegun Ishaya Adelaiye; Musa Yusuf

International Journal of Advanced Multidisciplinary Research and Studies

Volume 5, Issue 4, 2025

Performance Analysis of Machine Learning Algorithm in Spam Email Classification

Author(s): Adamu Suliaman Usman, Oluwasegun Ishaya Adelaiye, Musa Yusuf

DOI: https://doi.org/10.62225/2583049X.2025.5.4.4718

Abstract:

The proliferation of unsolicited spam emails poses a growing threat to cybersecurity and user productivity. This study investigates the effectiveness of machine learning algorithms which includes Naïve Bayes, Support Vector Machines (SVM), and Random Forest, in accurately detecting spam emails. Using two labeled dataset of spam and legitimate (ham) messages sourced from kaggle, the models were trained and evaluated using standard performance metrics. The Naïve Bayes classifier achieved an accuracy of 94.6%, with a precision of 93.3% and an F1-score of 94.0%. The SVM model demonstrated slightly improved performance with an accuracy of 95.2%, precision of 94.5%, and F1-score of 94.8%. Random Forest outperformed both with the highest accuracy of 96.8%, precision of 95.7%, and F1-score of 96.2%. These findings affirm the viability of machine learning for robust spam detection and highlight the Random Forest algorithm as particularly effective for real-world applications in email filtering systems.

Keywords: Spam Detection, Machine Learning, Naive Bayes, Support Vector Machine (SVM), Random Forest, Email Filtering

Pages: 1037-1045

Download Full Article: Click Here

Volume 5, Issue 4, 2025

Performance Analysis of Machine Learning Algorithm in Spam Email Classification

Contact Information