Real-Time Object Detection for Autonomous Drone Navigation Using YOLOv8
Abstract
This paper presents an original implementation and evaluation of YOLOv8-based real-time object detection for autonomous drone navigation. While previous works have primarily focused on earlier YOLO versions or simulation environments, this study deploys and benchmarks multiple YOLOv8 variants (YOLOv8n to YOLOv8x) on real-world, drone-compatible embedded platforms, such as the NVIDIA Jetson AGX Xavier, and Xavier NX. We propose an optimized object detection pipeline integrated with autonomous navigation algorithms, which have been tested in various environments and weather conditions. YOLOv8s demonstrated the best trade-off between detection accuracy (mAP@0.5 = 89.3%) and real-time inference speed (45.2 FPS) on Jetson AGX Xavier. The system consistently achieved over 90% successful autonomous flight performance in urban, rural, and mixed scenarios. These results confirm the feasibility of YOLOv8s for reliable real-time object detection in autonomous UAVs, paving the way for robust deployment in real-world applications.
References
J. Redmon, S. Divvala, R. Girshick and A. Farhadi, “You only look once: Unified, real-time object detection,” 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 2016, pp. 779-788, doi: https://doi.org/10.1109/CVPR.2016.91
A. Bochkovskiy, C.-Y. Wang, and H.-Y. M. Liao, “YOLOv4: Optimal speed and accuracy of object detection,” arXiv.org, Apr. 22, 2020. Available: https://www.arxiv.org/abs/2004.10934v1
G. Jocher, A. Chaurasia, and J. Qiu, YOLO by Ultralytics. GitHub repository, 2023. Available: https://github.com/ultralytics/ultralytics
H. Shakhatreh et al., “Unmanned aerial vehicles (UAVs): A survey on civil applications and key research challenges,” in IEEE Access, vol. 7, pp. 48572-48634, 2019, doi: https://doi.org/10.1109/ACCESS.2019.2909530
T. -Y. Lin, P. Dollár, R. Girshick, K. He, B. Hariharan and S. Belongie, “Feature pyramid networks for object detection,” 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA, 2017, pp. 936-944, doi: https://doi.org/10.1109/CVPR.2017.106
S. Liu, L. Qi, H. Qin, J. Shi and J. Jia, “Path aggregation network for instance segmentation,” 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, 2018, pp. 8759-8768, doi: https://doi.org/10.1109/CVPR.2018.00913
Z. Tian, C. Shen, H. Chen and T. He, “FCOS: Fully convolutional one-stage object detection,” 2019 IEEE/CVF International Conference on Computer Vision (ICCV), Seoul, Korea (South), 2019, pp. 9626-9635, doi: https://doi.org/10.1109/ICCV.2019.00972
A. Howard et al., “Searching for MobileNetV3,” 2019 IEEE/CVF International Conference on Computer Vision (ICCV), Seoul, Korea (South), 2019, pp. 1314-1324, doi: https://doi.org/10.1109/ICCV.2019.00140
M. Tan, R. Pang, and Q. V. Le, “EfficientDet: Scalable and efficient object detection,” Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 10781-10790, Available: https://openaccess.thecvf.com/content_CVPR_2020/papers/Tan_EfficientDet_Scalable_and_Efficient_Object_Detection_CVPR_2020_paper.pdf
M. Shah Alam and J. Oluoch, “A survey of safe landing zone detection techniques for autonomous unmanned aerial vehicles (UAVs),” Expert Systems with Applications, vol. 179, p. 115091, Oct. 2021, doi: https://doi.org/10.1016/j.eswa.2021.115091
A. Carrio, C. Sampedro, A. Rodriguez-Ramos, and P. Campoy, “A review of deep learning methods and applications for unmanned aerial vehicles,” Journal of Sensors, vol. 2017, pp. 1–13, 2017, doi: https://doi.org/10.1155/2017/3296874
A. Al-Kaff, M. J. Gómez-Silva, F. M. Moreno, A. de la Escalera, and J. M. Armingol, “An appearance-based tracking algorithm for aerial search and rescue purposes,” Sensors, vol. 19, no. 3, p. 652, Jan. 2019, doi: https://doi.org/10.3390/s19030652
Y. Lyu, Q. Pan, C. Zhao, Y. Zhang, and J. Hu, “Vision-based UAV collision avoidance with 2D dynamic safety envelope,” in IEEE Aerospace and Electronic Systems Magazine, vol. 31, no. 7, pp. 16-26, July 2016, doi: https://doi.org/10.1109/MAES.2016.150155
M. Radovic, O. Adarkwa, and Q. Wang, “Object recognition in aerial images using convolutional neural networks,” Journal of Imaging, vol. 3, no. 2, p. 21, Jun. 2017, doi: https://doi.org/10.3390/jimaging3020021
B. Benjdira, T. Khursheed, A. Koubaa, A. Ammar and K. Ouni, “Car detection using unmanned aerial vehicles: comparison between faster R-CNN and YOLOv3,” 2019 1st International Conference on Unmanned Vehicle Systems-Oman (UVS), Muscat, Oman, 2019, pp. 1-6, doi: https://doi.org/10.1109/UVS.2019.8658300
V. Godase and A. Jagadale. “Three element control using PLC, PID & SCADA Interface.” IJSRD - International Journal for Scientific Research & Development, vol. 7, no. 2, 2019, Available: https://www.ijsrd.com/articles/IJSRDV7I21002.pdf
V. Godase, A. Mulani, A. Pawar and K. Sahani, “A comprehensive review on PIR sensor-based light automation systems,” International Journal of Image Processing and Smart Sensors, vol. 1, no. 1, pp. 22-29, May 2025, Available: https://matjournals.net/engineering/index.php/IJIPSS/article/view/1846
V. V. Godase, “A comprehensive study of revolutionizing EV charging with solar-powered wireless solutions,” Advance Research in Power Electronics and Devices, vol. 2, no. 1, pp. 23-37, Apr. 2025, Available: https://matjournals.net/engineering/index.php/ARPED/article/view/1752
V. Godase, “Optimized algorithm for face recognition using Deepface and multi-task cascaded convolutional network (MTCNN),” Optimum Science Journal, vol. 3, pp. 66-74, 2025, doi: https://doi.org/10.5281/zenodo.15341560
V. K. Jamadade, M. G. Ghodke, S. S. Katakdhond and V. Godase, “A comprehensive review on scalable Arduino radar platform for real-time object detection and mapping,” Journal of Microprocessor and Microcontroller Research, vol. 2, no. 2, pp. 1-12, May 2025, Available: https://matjournals.net/engineering/index.php/JoMMR/article/view/1888
S. K. A., A. Kathiravan, P. Mannam, D. Akila, B. S. Kumar, and V. Vilas Godase, “Advanced neural network models for optimal energy management in microgrids with integrated electric vehicles,” 2025 5th International Conference on Trends in Material Science and Inventive Materials (ICTMIM), Kanyakumari, India, 2025, pp. 1869-1874, doi: https://doi.org/10.1109/ICTMIM65579.2025.10988248
S. Karve, S. Gangonda, G. Birajadar, V. Godase, R. Ghodake, and A. O. Mulani, “Optimized neural network for prediction of neurological disorders,” Communications in Computer and Information Science, pp. 183–191, 2025, doi: https://doi.org/10.1007/978-3-031-88762-8_18