An Intelligent Machine Learning-based System for Detection of Phishing Domains Using AI/ML

Authors

  • Ashish Vats
  • Aaditya Pareek
  • Vinay Soni
  • Priyal Toshniwal
  • Vinod Kataria

Keywords:

Cybersecurity, Domain features, Machine learning, Phishing detection, Random forest, URL analysis

Abstract

Phishing attacks continue to pose a serious challenge to cybersecurity by exploiting deceptive domain names that closely resemble legitimate websites. Phishing detection techniques mainly depend on static features that are chosen using traditional feature selection or ranking techniques. Existing detection techniques, including blacklist-based and rule-driven systems, are often ineffective against newly created phishing domains and zero-day attacks. In this work, an intelligent machine learning-based system is developed to identify phishing domains by analysing URL and domain-level characteristics. The proposed approach focuses on extracting lexical, structural, and statistical features that capture common patterns observed in malicious domains. Machine learning classifiers, including Random Forest and Support Vector Machine, are trained and evaluated using a publicly available dataset containing both phishing and legitimate URLs. Random Forest shows strong performance with faster execution, making it suitable for real-time applications where fast and reliable decisions are essential. Experimental evaluation shows that the Random Forest model achieves higher accuracy, precision, recall, and F1-score, and better overall performance, compared to SVM. The results indicate that domain-level feature analysis combined with machine learning provides an effective and scalable solution for real-time phishing domain detection. The findings confirm that feature-driven intelligent systems can successfully detect phishing domains that imitate the look and feel of genuine websites. The proposed system offers a scalable, accurate, and real-time solution for phishing domain detection and contributes to strengthening cybersecurity defences against evolving phishing strategies. The purpose is to give users a reliable tool that warns them instantly about suspicious websites, reduces the risk of financial and data loss, and strengthens overall trust in online activities.

References

Y. Ari Kustiawan and K. I. Ghauth, “Evaluating the impact of feature engineering in phishing URL detection: A comparative study of URL, HTML, and derived features,” in IEEE Access, vol. 13, pp. 126756–126768, 2025.

A. Khan, M. Ahmed, and A. Fathima, “Enhanced phishing detection using machine learning algorithms: A comparative study of random forest, SVM, and logistic regression models,” in Proceedings of the International Conference on Innovative Computing & Communication (ICICC 2024), Mar. 2025.

G. A. Rahmat, R. Sarno, K. R. Sungkono, A. Tri Haryono, A. F. Septiyanto and Sholiq, “Detecting phising website using machine learning methods,” 2025 International Conference on Computer Sciences, Engineering, and Technology Innovation (ICoCSETI), Jakarta, Indonesia, 2025, pp. 351–356.

H. Saleem, M. Fuzail, A. Naeem, N. Aslam, and A. Faiz, “Model for the detection of web phishing attacks by using machine learning algorithms,” Journal of Multidisciplinary Research, vol. 2, no. 6, pp. 223–239, Jun. 2025.

L. R. R. Kandula, J. L. Tangirala, K. Alla, and R. Chivukula, “An intelligent prediction of phishing URLs using ML algorithms,” International Journal of Safety and Security Engineering, vol. 12, no. 3, pp. 381–386, 2022.

S. P and S. M, “Intelligent system for identification of phishing emails, fake websites and sand-boxing targeted entity,” International Journal of Sciences and Innovation Engineering, vol. 2, no. 5, pp. 268–271, May 2025

A. Mandadi, S. Boppana, V. Ravella and R. Kavitha, “Phishing website detection using machine learning,” 2022 IEEE 7th International conference for Convergence in Technology (I2CT), Mumbai, India, 2022, pp. 1–4.

I. Kara, M. Ok and A. Ozaday, “Characteristics of understanding URLs and domain names features: The detection of phishing websites with machine learning methods,” in IEEE Access, vol. 10, pp. 124420–124428, 2022.

S. S. Beevi, V. S. Nithin, K. Venkatesh, C. S. Rohit, and V. S. Teja, “Machine learning-based fraudulent and harmful link detection system,” International Journal of Innovative Research in Science Engineering and Technology, vol. 14, no. 4, pp. 9506–9510, Apr. 2025.

F. F. Tampinongkol, A. R. Kamila, A. C. Wardhana, A. W. C. Kusuma, and D. Revaldo, “Implementation of random forest classification and support vector machine algorithms for phishing link detection,” Journal of Informatics Information System Software Engineering and Applications (INISTA), vol. 7, no. 1, pp. 127–137, 2024.

H. B. Pandya and Khyati Zalawadia, “A comprehensive review of phishing detection techniques based on machine learning,” Metallurgical and Materials Engineering, vol. 31, no. 5, pp. 302–310, May 2025.

O. Ablayeva, “Comparative analysis of random forest, SVM, and LSTM algorithms for threat detection in internet domains”, International Journal of Artificial Intelligence, vol. 1, no. 4, pp. 1499–1504, Jun. 2025.

P. Tuli, A. Verma, A. Shah, and Vaishnavi, “Phishing Url detection using machine learning technique,” in Proceedings of the International Conference on Advances and Applications in Artificial Intelligence (ICAAAI 2025), Atlantis Press, Jun. 2025, pp. 935–944.

J. Lamas Piñeiro and L. Wong Portillo, “Web architecture for URL-based phishing detection based on random forest, classification trees, and support vector machine”, Inteligencia Artificial, vol. 25, no. 69, pp. 107–121, May 2022.

A. S. Araujo Arévalo, G. A. Felix-Díaz Monzon, and J. P. J. Mansilla López, “Intelligent system for phishing detection on web pages using random forest,” in Leadership in Education and Innovation in Engineering in the Framework of Global Transformations: Integration and Alliances for Integral Development, Proceeding of 21st LACCEI International Multi-Conference for Engineering, Education and Technology (LACCEI 2023), Buenos Aires, Argentina, Jul. 19–21, 2023.

S. S. Patil, N. M. Shekokar, and S. C. Iyer, “Design of intelligent feature selection technique for phishing detection”, International Islamic University Malaysia Engineering Journal, vol. 26, no. 1, pp. 254–277, Jan. 2025.

Published

2026-02-28