Architecting Intelligence: A New Era of Generative AI and Deep Learning Models

R. Naveenkumar; Rubi Sarkar; Nitin Kumar

Authors

R. Naveenkumar
Rubi Sarkar
Nitin Kumar

Keywords:

Deep learning architectures, Diffusion models, Generative AI, Large language models (LLMs), Transformer models

Abstract

Generative AI, or GenAI, has been a highly transformative element in the realm of AI, shaped by advancements in architectures of deep learning. Transformer-based models such as GPT, BERT, and their successors have taken the NLP domain into an unprecedented sphere by infusing contextual and generative capabilities. Large language models (LLMs) are in the process of revolutionising human-computer interaction through applications involving content generation, summarisation, and translation. Similarly, small language models (SLMs) are emerging for resource-constrained settings, balancing efficiency with performance. In computer vision, CNNs are still at the core of things, but new architectures like EfficientNet and Vision Transformers (ViTs) have expanded the scope of applicability in multiple tasks such as object detection and image classification. GANs continue to push the creative boundaries of what can be synthesized into hyper-realistic images, videos, and even music. Despite these successes, difficulties such as mode collapse led researchers to develop other models, like diffusion models, that proved highly robust for the production of high-quality output. Encoders are commonly combined with decoders in transformer models. Underlying most successful state-of-the-art models are tasks in machine translation and representation learning. Diffusion models, inspired by physical processes, have recently proven to be a breakthrough in generative modeling in terms of coherent and diverse data samples. This paper surveys the evolution and synergy of these deep learning architectures and their roles in advancing AI across domains. Through exploring innovations and their interdisciplinary applications, we investigate how the convergence of these technologies is shaping the future of AI. The discussion then touches on scalability, efficiency, and ethical considerations as it moves towards emerging trends, such as hybrid models that blend the strengths of multiple architectures, catapulting AI to new frontiers of creativity and utility.

References

R. L. Ackoff, “Towards a system of systems concepts,” Management Science, vol. 17, no. 11, pp. 661–671, Jul. 1971, doi: https://doi.org/10.1287/mnsc.17.11.661

R. L. Ackoff and F. E. Emery, On purposeful systems: An interdisciplinary analysis of individual and social behavior as a system of purposeful events, 1st ed. Abingdon, Oxon: Routledge, Taylor & Francis Group, 2005.

S. Agrawal, “Are LLMs the master of all trades?: Exploring domain-agnostic reasoning skills of LLMs,” ar5iv, 2023, Available: https://ar5iv.labs.arxiv.org/html/2303.12810

M. Alavi, E. D. Leidner, and R. Mousavi, “A knowledge management perspective of generative artificial intelligence,” Journal of the Association for Information Systems, vol. 25, no. 1, pp. 1–12, Jan. 2024, doi: https://doi.org/10.2139/ssrn.4782875

M. Alavi and G. Westerman, “How generative AI will transform knowledge work,” Harvard Business Review, Nov. 07, 2023, Available: https://hbr.org/2023/11/how-generative-ai-will-transform-knowledge-work

A.-S. Mayer, R. M. Baygi, and R. Buwalda, “Generation AI: Job crafting by entry-level professionals in the age of generative AI,” Business & Information Systems Engineering, vol. 67, pp. 595–613, Aug. 2025, doi: https://doi.org/10.1007/s12599-025-00959-x

S. Alter, “Sociotechnical systems through a work system lens: A possible path for reconciling system conceptualizations, business realities, and humanist values in IS development,” in STPIS 2015 (1st International Workshop on Socio-Technical Perspective in IS Development) associated with CAISE 2015 (Conference on Advanced Information System Engineering), Stockholm, Sweden, 2015, Available: https://repository.usfca.edu/cgi/viewcontent.cgi?article=1053&context=at

S. M. Padmaja et al., “Deep learning in remote sensing for climate-induced disaster resilience: A comprehensive interdisciplinary approach,” Remote Sensing in Earth Systems Sciences, vol. 8, pp. 145–160, Dec. 2024, doi: https://doi.org/10.1007/s41976-024-00178-0

R. N. Kumar and M. A. Kumar, “Enhanced fuzzy K-NN approach for handling missing values in medical data mining,” Indian Journal of Science and Technology, vol. 9, no. S1, pp. 1–7, 2016, doi: https://doi.org/10.17485/ijst/2016/v9is1/94094

S. Bhadra, A. Goon, R. Naveenkumar, S. Roy, Tanu, and J. Aich, “Fortifying the resilience and integrity of cyber-physical systems through meticulous assessment,” Nanotechnology Perceptions, vol. 20, no. S16, pp. 1193–1201, 2024. Available: https://www.researchgate.net/publication/387711575_Fortifying_the_Resilience_and_Integrity_of_Cyber-Physical_Systems_through_Meticulous_Assessment

N. Kalavani, R. Naveenkumar, S. Bhattacharjee, R. Sharkar, and N. Kumar, “Evaluating the performance of machine learning models in cancer prediction through ROC and PRC metrics,” Nanotechnology Perceptions, vol. 20, no. S16, pp. 553–567, 2024, doi: https://doi.org/10.62441/nano-ntp.vi.3966

G. Vani, R. Naveenkumar, R. Singha, R. Sarkar, N. Kumar, “Advancing predictive data analytics in IoT and AI leveraging real-time data for proactive operations and system resilience,” Nanotechnology Perceptions, vol. 20, no. S16, 2024, doi: https://doi.org/10.62441/nano-ntp.vi.3968

S. Bhattacharjee, R. Naveenkumar, R. Singha, S. Mullick, and R. Sarkar, “The impact of machine learning on enhancing diversity and inclusion through advanced recommence screening techniques,” Journal of Informatics Education and Research, vol. 4, no. 2, pp. 3141–3159, 2024, Available: https://www.researchgate.net/publication/385817100_The_Impact_of_Machine_Learning_on_Enhancing_Diversity_and_Inclusion_through_Advanced_Recommence_Screening_Techniques

A. Burton-Jones, J. Recker, M. Indulska, P. Green, and R. Weber, “Assessing representation theory with a framework for pursuing success and failure,” MIS Quarterly, vol. 41, no. 4, pp. 1307–1334, Dec. 2017, Available: https://www.jstor.org/stable/26630295

V. G. Cerf, “AI is not an excuse!,” Communications of the ACM, vol. 62, no. 10, pp. 7–7, Sep. 2019, doi: https://doi.org/10.1145/3359332

Y. B. Chang and V. Gurbaxani, “Information technology outsourcing, knowledge transfer, and firm productivity: An empirical analysis,” MIS Quarterly, vol. 36, no. 4, pp. 1043–1063, Dec. 2012, Available: https://www.jstor.org/stable/41703497

Y. Chang et al., “A survey on evaluation of large language models,” ACM Transactions on Intelligent Systems and Technology, vol. 15, no. 3, pp. 1–45, Jan. 2024, doi: https://doi.org/10.1145/3641289

S. Chatterjee, S. Sarker, M. J. Lee, X. Xiao, and A. Elbanna, “A possible conceptualization of the information systems (IS) artifact: A general systems theory perspective,” Information Systems Journal, vol. 31, no. 4, pp. 550–578, Dec. 2020, doi: https://doi.org/10.1111/isj.12320

Architecting Intelligence: A New Era of Generative AI and Deep Learning Models

Authors

Keywords:

Abstract

References

Downloads

Published

How to Cite

Issue

Section

Current Issue