From Cloud to Edge: Advancing Real-Time AI Applications
Abstract
Edge Computing together with Real-Time Artificial Intelligence (AI) is gradually changing how modern computing system works. Instead of relying completely on centralized cloud servers, this approach focuses on processing close to where it is created. In simple terms, devices like sensors, smart devices, and embedded systems can analyse data directly on the device. This reduces communication delay, lowers bandwidth usage, and helps systems respond more quickly. When methods like Machine Learning and Deep Learning are used in these edge devices, they can analyze data in real time and make fast decisions without waiting for cloud support. The rapid growth of the Internet of Things has resulted in a large volume of real-time data being generated every second. Sending all this data to distant cloud servers is not always efficient, especially in situations where time is critical. For example, in autonomous vehicles or industrial automation systems, even a slight delay can affect performance or safety. Because of this, edge AI becomes important. Local data processing also improves privacy since sensitive information does not always need to travel across networks. At the same time, cloud platforms still play an important role in training complex AI models and storing large datasets, while edge devices mainly handle real-time predictions. Recent improvements in hardware technology, lightweight AI models, and energy-efficient processors have made it easier to deploy AI at the edge. These technological advancements improve scalability and reliability in distributed systems. Overall, the combination of edge computing and AI supports distributed systems that can provide faster and more secure services in many industries. As technology advances, this integration will become even more important in supporting future real-time applications.
References
E. Li, L. Zeng, Z. Zhou, and X. Chen, “Edge AI: On-demand accelerating deep neural network inference via edge computing,” IEEE Transactions on Wireless Communications, vol. 19, no. 1, pp. 447–457, Jan. 2020.
S. S. Gill et al., “Edge AI: A taxonomy, systematic review and future directions,” Cluster Computing, vol. 28, no. 1, p. 18, Feb. 2025.
L. Wang et al., “Shoggoth: Towards efficient edge-cloud collaborative real-time video inference via adaptive online learning,” in Proceedings of the 60th ACM/IEEE Design Automation Conference (DAC), Jul. 2023, pp. 1–6.
H. Rafi, A. Mohammed, and R. Kumar, “Integrating edge computing and AI for real-time data processing in. NET-based applications: A scalable solution for low-latency systems,” International Journal of Social Sciences Bulletin, vol. 3, no. 1, pp. 97–101, 2025.
A. B. Negi, “Edge computing for real-time IoT applications: Architectures and case studies,” International Journal of Advanced Research in Computer Science and Technology, vol. 7, no. 5, pp. 10911–10914, Sep. 2024.
M. T. Aslam, “The future of edge AI: Combining artificial intelligence with edge computing,” Journal of Social and Technological Advanced Research (J-STAR), vol. 1, no. 1, pp. 1–7, Feb. 2025.
P. C. Kinf, “AI and edge computing: Enhancing real-time decision-making in IoT systems,” British Journal of Multidisciplinary Research, vol. 6, no. 6, 2024.
T. Al-Momani and M. Al-Hussein, “Real-time decision making with edge AI technologies: Advanced techniques for optimizing performance, scalability, and low-latency processing,” Journal of Artificial Intelligence and Machine Learning in Management, vol. 8, no. 2, pp. 71–91, 2024.
E. Petrov, “Edge computing for real-time data analytics: Exploring the use of edge computing to enable real-time data analytics in IoT applications,” IoT and Edge Computing Journal, vol. 4, no. 1, pp. 1–13, Feb. 2024.
A. Singh, “Edge computing for real-time data analytics in IoT,” International Journal of Engineering and Extended Technologies Research, vol. 6, no. 6, pp. 9023–9025, Nov. 2024.