E ISSN: 2583-049X
logo

International Journal of Advanced Multidisciplinary Research and Studies

Volume 5, Issue 5, 2025

Detecting Email Phishing Using Machine Learning Algorithms



Author(s): Samuel M Orokpo, Oluwasegun Ishaya Adelaiye, Adamu Suliaman Usman

Abstract:

Email phishing remains a persistent cybersecurity threat, undermining communication systems and exploiting unsuspecting users. This study proposes a machine learning based approach to detect phishing emails using the CRISP-DM methodology. A dataset of 18,650 emails obtained from Kaggle was analyzed with multiple algorithms, including Naïve Bayes and Random Forest, to evaluate their effectiveness in distinguishing between legitimate and malicious messages. Performance was assessed using accuracy, precision, recall, and F1-score. The Random Forest model achieved the highest performance with 98.30% accuracy, 0.96 precision, 1.00 recall, and an F1-score of 0.90. In contrast, Naïve Bayes produced a lower accuracy of 53.80% but achieved perfect recall, highlighting the importance of comparing algorithms to uncover their respective strengths and limitations. Future work will focus on refining the proposed solution and extending its applicability to address region-specific challenges, particularly in Nigeria, where phishing scams such as “yahoo yahoo” are increasingly prevalent. This research contributes to the advancement of robust email security strategies against evolving phishing threats.


Keywords: Phishing Email, Anomaly Detection, Machine Learning, Naive Bayes, Random Forest, Email Filtering, CRISP-DM, Email Classification

Pages: 664-672

Download Full Article: Click Here