Comparative Analysis of Classical and Deep Learning Methods for Robust Face Detection Systems
Keywords:
Computer vision, CNN, Deep learning, Face detection, MTCNN, Viola-JonesAbstract
Face detection is a fundamental task in computer vision and artificial intelligence, playing a critical role in applications such as surveillance, biometric authentication, and human-computer interaction. This study presents a comparative analysis of classical and deep learning-based face detection methods to evaluate their effectiveness under real-world conditions. The classical Viola-Jones algorithm is implemented as a baseline due to its computational efficiency and real-time performance capabilities. In contrast, a deep learning-based approach using the multi-task cascaded convolutional neural network (MTCNN) is employed to demonstrate the advantages of data-driven feature learning. The performance of both methods is evaluated using standard benchmark datasets, including WIDER FACE and FDDB, under challenging conditions such as occlusion, illumination variation, and scale diversity. Experimental results indicate that while the Viola-Jones method offers faster processing time, it suffers from reduced accuracy in complex environments. Conversely, the MTCNN model achieves higher detection accuracy, precision, and robustness, albeit at a higher computational cost. The findings highlight the trade-off between efficiency and accuracy and emphasize the importance of selecting appropriate detection methods based on application requirements. This study contributes to the development of more reliable and efficient face detection systems for real-world deployment.
References
I. Goodfellow, Y. Bengio, and A. Courville, Deep Learning. Cambridge, MA, USA: MIT Press, 2016.
R. Szeliski, Computer Vision: Algorithms and Applications. London, U.K.: Springer, 2010.
H. Li, Z. Lin, X. Shen, J. Brandt, and G. Hua, “A convolutional neural network cascade for face detection,” in Proc. IEEE Conf. Comput. Vis. Pattern Recognit. (CVPR), 2015, pp. 5325–5334.
D. Saez-Trigueros, L. Meng, and M. Hartnett, “Face recognition: From traditional to deep learning methods,” arXiv (Cornell University), Jan. 2018.
X. Zhou and T. Zhu, “Survey of research on face recognition methods based on depth learning,” Journal of Physics Conference Series, vol. 2717, no. 1, pp. 012027–012027, Mar. 2024.
P. Viola and M. Jones, “Rapid object detection using a boosted cascade of simple features,” Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001, vol. 1, Kauai, HI, USA, 2001.
V. Jain and E. Learned-Miller, FDDB: A Benchmark for Face Detection in Unconstrained Settings. Technical Report, University of Massachusetts Amherst, 2010.
S. Yang, P. Luo, Chen Change Loy, and X. Tang, “WIDER FACE: A face detection benchmark,” arXiv (Cornell University), Nov. 2015.
K. Zhang, Z. Zhang, Z. Li, and Y. Qiao, “Joint face detection and alignment using multitask cascaded convolutional networks,” IEEE Signal Processing Letters, vol. 23, no. 10, pp. 1499–1503, Oct. 2016.
J. Deng, J. Guo, E. Ververas, I. Kotsia, and S. Zafeiriou, “RetinaFace: Single-shot multi-level face localisation in the wild,” 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA, Jun. 2020, pp. 5202–5211.