Generative Al: A Survey of Recent Advances and Applications
Keywords:
Deep learning, Diffusion models, Ethical AI, GANs, Generative AI, Image synthesis, Multimodal AI, Text generation, Transformers, VAEsAbstract
One of the latest subfields of artificial intelligence that has become one of the most disruptive ones is generative artificial intelligence (Generative AI), which can generate very realistic and contextually relevant content that is multimodal, i.e., it can be generated in the form of text, images, music, and video. In contrast to the classic AI-based solutions, whose core task was to divide and analyze data, generative AI is aimed at producing completely new content, which in many cases can hardly be distinguished from human content. This is going to be a thorough survey of how this idea of generative AI has been progressing, beginning with the theoretical principles of symbolic AI and early work on machine learning, to propose relatively modern techniques of deep learning systems. Within the framework of the paper, the basic generative models, including generative adversarial networks (GANs), variational autoencoders (VAEs), Transformer-based models, and Diffusion Models, are discussed thoroughly with the emphasis on the specificity and peculiarities of their structures as well as their advantages and drawbacks. We go deeper into the newest technologies of the training methodology and scalability factors, multimodal learning, as well as into the open-source technologies, which have provided a chance to democratize access to the most powerful tools. The survey spans a wide variety of applications, including natural language processing (NLP), computer vision, creative arts, games, healthcare, and scientific research. It also provides a weathervane indicating how geographically widespread the field of generative AI applications is. The mentioned critical ethical considerations are bias, fairness, intellectual property issues, deepfakes, and misinformation, privacy concerns, and environmental footprint. Major limitations like data needs, computational complexity, assessment challenges, and text hallucination are addressed together with the future expectations of cross-disciplinary incorporation, robotics usage, model explanations, regulating methodologies, and democratization endeavors. The present survey is targeted at helping researchers, practitioners, and policymakers obtain a complete picture of the generative AI ecosystem and its capabilities of transforming human-computer collaboration.
References
G. Yenduri et al., “GPT (generative pre-trained transformer)—A comprehensive review on enabling technologies, potential applications, emerging challenges, and future directions,” in IEEE Access, vol. 12, pp. 54608–54649, 2024, doi: https://doi.org/10.1109/ACCESS.2024.3389497
L. Yang et al., “Diffusion models: A comprehensive survey of methods and applications,” ACM Computing Surveys, vol. 56, no. 4, pp. 1–39, Nov. 2023, doi: https://doi.org/10.1145/3626235
P. Gupta, B. Ding, C. Guan, and D. Ding, “Generative AI: A systematic review using topic modelling techniques,” Data and Information Management, vol. 8, no. 2, Jun. 2024, doi: https://doi.org/10.1016/j.dim.2024.100066
P. Dhariwal and A. Nichol, “Diffusion models beat GANs on image synthesis,” in NIPS’21: Proceedings of the 35th International Conference on Neural Information Processing Systems, Red Hook, NY, United States: Curran Associates Inc., Dec. 2021, pp. 8780–8794. Available: https://dl.acm.org/doi/10.5555/3540261.3540933
M. Chen, S. Mei, J. Fan, and M. Wang, “Opportunities and challenges of diffusion models for generative AI,” National Science Review, vol. 11, no. 12, Dec. 2024, doi: https://doi.org/10.1093/nsr/nwae348
N. Khan, Z. Khan, A. Koubaa, M. K. Khan, and R. bin Salleh, “Global insights and the impact of generative AI-ChatGPT on multidisciplinary: A systematic review and bibliometric analysis,” Connection science, vol. 36, no. 1, May 2024, doi: https://doi.org/10.1080/09540091.2024.2353630
X. Wang, Z. He, and X. Peng, “Artificial-intelligence-generated content with diffusion models: A literature review,” Mathematics, vol. 12, no. 7, 2024, doi: https://doi.org/10.3390/math12070977
K. S. Kaswan, J. S. Dhatterwal, K. Malik and A. Baliyan, “Generative AI: A review on models and applications,” 2023 International Conference on Communication, Security and Artificial Intelligence (ICCSAI), Greater Noida, India, 2023, pp. 699–704, doi: https://doi.org/10.1109/ICCSAI59793.2023.10421601
Z. B. Akhtar, “Unveiling the evolution of generative AI (GAI): A comprehensive and investigative analysis toward LLM models (2021–2024) and beyond,” Journal of Electrical Systems and Information Technology, vol. 11, no. 1, Jun. 2024, doi: https://doi.org/10.1186/s43067-024-00145-1
Z. Sordo et al., “Synthetic scientific image generation with VAE, GAN, and diffusion model architectures,” Journal of Imaging, vol. 11, no. 8, Jul. 2025, doi: https://doi.org/10.3390/jimaging11080252
E. Y. Zhang, A. D. Cheok, Z. Pan, J. Cai, and Y. Yan, “From turing to transformers: A comprehensive review and tutorial on the evolution and applications of generative transformer models,” Sci, vol. 5, no. 4, Dec. 2023, doi: https://doi.org/10.3390/sci5040046
N. Aydın and O. A. Erdem, “A research on the new generation artificial intelligence technology generative pretraining transformer 3,” 2022 3rd International Informatics and Software Engineering Conference (IISEC), Ankara, Turkey, 2022, pp. 1–6, doi: https://doi.org/10.1109/IISEC56263.2022.9998298
J.-H. Park, Y.-J. Ju, and S.-W. Lee, “Explaining generative diffusion models via visual analysis for interpretable decision-making process,” Expert Systems with Applications, vol. 248, Aug. 2024, doi: https://doi.org/10.1016/j.eswa.2024.123231
M. N. Yeğin and M. F. Amasyalı, “Generative diffusion models: A survey of current theoretical developments,” Neurocomputing, vol. 608, Dec. 2024, doi: https://doi.org/10.1016/j.neucom.2024.128373
S. Bengesi, H. El-Sayed, M. K. Sarker, Y. Houkpati, J. Irungu and T. Oladunni, “Advancements in generative AI: A comprehensive review of GANs, GPT, autoencoders, diffusion model, and transformers,” in IEEE Access, vol. 12, pp. 69812–69837, 2024, doi: https://doi.org/10.1109/ACCESS.2024.3397775