Diabetes Identification and Forecasting through Data Analysis
DOI:
https://doi.org/10.46610/JoDEKD.2024.v01i01.005Keywords:
Data mining, Diabetes, Logistic regression, Random forest, Risk assessmentAbstract
This project endeavours to revolutionize diabetes care through an innovative approach aimed at empowering clinicians with a robust toolset for early detection, risk assessment, and personalized treatment strategies. Diabetes, a rapidly proliferating chronic ailment, has impacted millions worldwide, underscoring the critical need for enhanced diagnosis, prediction, and management protocols. Leveraging cutting-edge data mining techniques, our research focuses on forecasting methods to anticipate diabetes onset and associated critical events like hypo/hyperglycemia. Our proposed diabetes prediction model integrates four state-of-the-art data mining methodologies Random Forest, Support Vector Machine (SVM), Logistic Regression, and Naive Bayes trained and evaluated using Python on a real dataset sourced from Kaggle. Performance evaluation metrics including confusion matrix, sensitivity, and accuracy underscore the efficacy of our approach. Early diabetes prediction holds the promise of significantly improving treatment outcomes, marking a pivotal advancement in diabetes care treatment outcomes. As technology continues to evolve and more data becomes available, ongoing research in this field will play a pivotal role in addressing the growing burden of diabetes worldwide.