The Role of Regression Analysis in Data-Driven Research: Foundation and Real-Life Applications

Authors

  • Dhanashree Pawgi
  • Mansi Mulik
  • Surbhi Zope
  • Anshika Yadav
  • Pragati Rana
  • Arshiya Shaikh

Abstract

Regression analysis is one of the most fundamental and widely applied statistical techniques in engineering and data science. This literature review synthesizes findings from ten research papers spanning theoretical foundations, methodological developments, and practical applications across diverse domains, including healthcare, education, real estate, transportation safety, and environmental science. The review demonstrates that regression methods—ranging from simple linear regression to advanced logistic and multivariate techniques—serve as essential tools for prediction, optimisation, and decision-making in engineering practice. Key findings reveal that while linear regression remains highly effective for continuous outcome prediction, logistic regression extends these capabilities to classification problems, and advanced techniques like regularisation and ensemble methods address challenges such as overfitting and multicollinearity. This review is structured to provide first-year engineering students with a clear understanding of regression fundamentals, awareness of common pitfalls, and appreciation for the breadth of real-world applications. The evidence base demonstrates that mastery of regression analysis is critical for modern engineering practice, enabling data-driven solutions to complex problems across multiple disciplines.

References

A. O. Sykes, “Regression Analysis Fundamentals,” Research Paper on Labor Market Earnings, 1993.

R. T. Dagdagui, “Predicting students’ academic performance using regression analysis,” American Journal of Educational Research, vol. 10, no. 11, pp. 640–646, Nov. 2022.

S. Rong and Z. Bao-Wen, “The research of regression model in machine learning field,” in MATEC Web of Conferences, vol. 176, p. 01033, 2018.

J. Parekh, “House Price Prediction Using Linear Regression Model,” International Journal for Multidisciplinary Research, vol. 5, no. 6, pp. 1–7, 2023.

D. Maulud and A. M. Abdulazeez, “A review on linear regression comprehensive in machine learning,” Journal of Applied Science and Technology Trends, vol. 1, no. 2, pp. 140–147, Dec. 2020.

A. Almalki, B. Gokaraju, Y. Acquaah, and A. Turlapaty, “Regression analysis for COVID-19 infections and deaths based on food access and health issues,” Healthcare, vol. 10, no. 2, p. 324, Feb. 2022.

K. J. Olowe, N. L. Edoh, S. J. Zouo, and J. Olamijuwon, “Comprehensive review of logistic regression techniques in predicting health outcomes and trends,” World Journal of Advanced Pharmaceutical and Life Sciences, vol. 7, no. 2, pp. 16–26, Nov. 2024.

Y. Sun, X. Wang, C. Zhang, and M. Zuo, “Multiple regression: Methodology and applications,” Highlights in Science, Engineering and Technology, vol. 49, pp. 542–548, May 2023.

B. Popović, “The optimal solution to the problems by regression analysis,” Journal of Engineering Management and Competitiveness, vol. 9, no. 1, pp. 25–37, Jan. 2019.

Y. T. Hsu, S. C. Chang, and T. H. Hsu, “Analysis of traffic accident severity at intersection using logistic regression model,” Journal of Engineering Research and Reports, vol. 13, no. 4, pp. 1–9, Jun. 2020.

Published

2026-04-13