Deep Lungs: Revolutionizing Cancer Detection with Neural Networks
DOI:
https://doi.org/10.46610/JoAESP.2025.v02i02.004Keywords:
CT imaging, Deep learning, Lung cancer detection, MATLAB, Radiology assistanceAbstract
The increasing importance or concern of health-related risks due to rapidly changing environmental conditions, climate patterns, and lifestyle habits has raised significant concern in recent years. Among these, lung-related diseases have emerged as a leading cause of mortality in India, ranking second only to cardiovascular diseases. In 2022, globally, lung ailments accounted for approximately 2.48 million new cases and 1.8 million deaths, underscoring the urgency for timely detection and management. Early diagnosis remains critical in mitigating severe complications and reducing mortality rates associated with such conditions. Chest X-rays continue to be a primary and reliable tool for clinical diagnosis; however, the integration of advanced technologies, such as deep learning, with medical imaging like chest CT scans has demonstrated transformative potential. This paper proposes a deep learning-based diagnostic framework aimed at facilitating the accurate and timely identification of lung cancer from chest CT images. The system is designed to assist radiologists by providing high-accuracy predictions in real-time, thereby improving diagnostic capabilities, particularly in rural and underserved regions where access to expert radiological consultation is limited. A user-friendly Graphical user interface (GUI) developed using MATLAB enables seamless image loading and direct output generation, allowing for swift deployment in clinical and laboratory settings for real-time diagnosis.
References
E. M. F. El Houby, “A survey on applying machine learning techniques for management of diseases,” Journal of Applied Biomedicine, vol. 16, no. 3, pp. 165–174, Aug. 2018, doi: https://doi.org/10.1016/j.jab.2018.01.002
M. Naghavi , A. A. Abajobir, C. Abbafati, “Global, regional, and national age-sex specific mortality for 264 causes of death, 1980–2016: a systematic analysis for the Global Burden of Disease Study 2016,” The Lancet, vol. 390, no. 10100, pp. 1151–1210, Sep. 2017, doi: https://doi.org/10.1016/s0140-6736(17)32152-9
T. B. Murdoch and A. S. Detsky, “The Inevitable Application of Big Data to Health Care,” JAMA, vol. 309, no. 13, pp. 1351–1352, Apr. 2013, doi: https://doi.org/10.1001/jama.2013.393
S. Brin, “Extracting Patterns and Relations from the World Wide Web,” Springer eBooks, vol. 1590, pp. 172–183, Mar. 1998, doi: https://doi.org/10.1007/10704656_11.
M. Huang, X. Zhu, S. Ding, H. Yu, and M. Li, “ONBIRES: Ontology-Based Biological Relation Extraction System,” Proceedings of the 4th Asia-Pacific Bioinformatics Conference, pp. 327–336, Dec. 2005, doi: https://doi.org/10.1142/9781860947292_0036
S. Dattani, V. Samborska, H. Ritchie, and M. Roser, “Cancer,” Our World in Data, Nov. 2019. https://ourworldindata.org/cancer
M. Vidyasagar, “Machine learning methods in computational cancer biology,” Annual Reviews in Control, vol. 43, pp. 107–127, Jan. 2017, doi: https://doi.org/10.1016/j.arcontrol.2017.03.007
A. Callahan and N. H. Shah, “Machine Learning in Healthcare,” Key Advances in Clinical Informatics, pp. 279–291, Jan. 2017, doi: https://doi.org/10.1016/B978-0-12-809523-2.00019-4
G. D. Magoulas and A. Prentza, “Machine Learning in Medical Applications,” Machine Learning and Its Applications, pp. 300–307, Jan. 2001, doi: https://doi.org/10.1007/3-540-44673-7_19
N. Jothi, N. A. Rashid, and W. Husain, “Data Mining in Healthcare – A Review,” Procedia Computer Science, vol. 72, pp. 306–313, 2015, doi: https://doi.org/10.1016/j.procs.2015.12.145
M. Kantardzic, Data Mining: Concepts, Models, Methods, and Algorithms, NJ, USA: John Wiley & Sons, Inc., 2011. doi: https://doi.org/10.1002/9781118029145
R. Chauhan, H. Kaur, and M. A. Alam, "Data Clustering Method for Discovering Clusters in Spatial Cancer Databases," International Journal of Computer Applications, vol. 10, no. 6, 2010, pp. 9-14, Nov. 2010, https://www.ijcaonline.org/volume10/number6/pxc3872004.pdf
L. Kaufman and P. J. Rousseeuw, Finding Groups in Data: An Introduction to Cluster Analysis, Hoboken, NJ, USA: John Wiley & Sons, Inc., 1990. doi: https://doi.org/10.1002/9780470316801
M.-S. Chen, J. Han, and P. S. Yu, “Data mining: an overview from a database perspective,” IEEE Transactions on Knowledge and Data Engineering, vol. 8, no. 6, pp. 866–883, 1996, doi: https://doi.org/10.1109/69.553155.
Y. Zhao and G. Karypis, “Evaluation of hierarchical clustering algorithms for document datasets,” Proceedings of the eleventh international conference on Information and knowledge management - CIKM ’02, 2002, doi: https://doi.org/10.1145/584792.584877.
National Cancer Institute, “Surveillance, Epidemiology, and End Results Program,” SEER, 2018. https://seer.cancer.gov/
M. S. Santos, P. H. Abreu, P. J. García-Laencina, A. Simão, and A. Carvalho, “A new cluster-based oversampling method for improving survival prediction of hepatocellular carcinoma patients,” Journal of Biomedical Informatics, vol. 58, pp. 49–59, Dec. 2015, doi: https://doi.org/10.1016/j.jbi.2015.09.012
G. E. A. P. A. Batista and M. C. Monard, “An analysis of four missing data treatment methods for supervised learning,” Applied Artificial Intelligence, vol. 17, no. 5–6, pp. 519–533, May 2003, doi: https://doi.org/10.1080/713827181
N. V. Chawla, K. W. Bowyer, L. O. Hall, and W. P. Kegelmeyer, “SMOTE: Synthetic Minority Over-sampling Technique,” Journal of Artificial Intelligence Research, vol. 16, no. 16, pp. 321–357, Jun. 2002, doi: https://doi.org/10.1613/jair.953
H. Yang and Y.-P. P. Chen, “Data mining in lung cancer pathologic staging diagnosis: Correlation between clinical and pathology information,” Expert Systems with Applications, vol. 42, no. 15–16, pp. 6168–6176, Sep. 2015, doi: https://doi.org/10.1016/j.eswa.2015.03.019
S. Chai, J. Yang and Y. Cheng, "The Research of Improved Apriori Algorithm for Mining Association Rules," 2007 International Conference on Service Systems and Service Management, Chengdu, China, 2007, pp. 1-4, doi: https://doi.org/10.1109/ICSSSM.2007.4280173
J. Hipp, U. Güntzer, and G. Nakhaeizadeh, “Algorithms for association rule mining --- a general survey and comparison,” ACM SIGKDD Explorations Newsletter, vol. 2, no. 1, pp. 58–64, Jun. 2000, doi: https://doi.org/10.1145/360402.360421
N. Shukla, M. Hagenbuchner, K. T. Win, and J. Yang, “Breast cancer data analysis for survivability studies and prediction,” Computer Methods and Programs in Biomedicine, vol. 155, pp. 199–208, Mar. 2018, doi: https://doi.org/10.1016/j.cmpb.2017.12.011
T. Kohonen, Self-Organizing Maps. Berlin, Heidelberg: Springer Berlin Heidelberg, 2001. doi: https://doi.org/10.1007/978-3-642-56927-2
M. Ester, H. Kriegel, J. Sander, and X. Xu, "A density-based algorithm for discovering clusters in large spatial databases with noise," in Proceedings of the 2nd International Conference on Knowledge Discovery and Data Mining (KDD'96), pp. 226–231, 1996, Available: https://file.biolab.si/papers/1996-DBSCAN-KDD.pdf
T. Ma and A. Zhang, “Affinity network fusion and semi-supervised learning for cancer patient clustering,” Methods, vol. 145, pp. 16–24, Aug. 2018, doi: https://doi.org/10.1016/j.ymeth.2018.05.020
B. Wang , A. M. Mezlini, F. Demir, M. Fiume, “Similarity network fusion for aggregating data types on a genomic scale,” Nature Methods, vol. 11, no. 3, pp. 333–337, Jan. 2014, doi: https://doi.org/10.1038/nmeth.2810
Mazin Abed Mohammed, Mohd, R. I. Hamed, Dheyaa Ahmed Ibrahim, and Mohamad Khir Abdullah, “Artificial neural networks for automatic segmentation and identification of nasopharyngeal carcinoma,” Journal of Computational Science, vol. 21, pp. 263–274, Jul. 2017, doi: https://doi.org/10.1016/j.jocs.2017.03.026
C. F. Wu, Y.-J. Wu, P. C. Liang, C. H. Wu, S. F. Peng, and H.W. Chiu, “Disease-free survival assessment by artificial neural networks for hepatocellular carcinoma patients after radiofrequency ablation,” Journal of the Formosan Medical Association, vol. 116, no. 10, pp. 765–773, Oct. 2017, doi: https://doi.org/10.1016/j.jfma.2016.12.006
M. S. Iraji, “Prediction of post-operative survival expectancy in thoracic lung cancer surgery with soft computing,” Journal of Applied Biomedicine, vol. 15, no. 2, pp. 151–159, Jan. 2017, doi: https://doi.org/10.1016/j.jab.2016.12.001
J. Guo, F. Iqbal, W. E. Mesker, “Revealing determinant factors for early breast cancer recurrence by decision tree,” Information Systems Frontiers, vol. 19, no. 6, pp. 1233–1241, Jun. 2017, doi: https://doi.org/10.1007/s10796-017-9764-0
C. J. Tseng, C. J. Lu, C.-C. Chang, G.-D. Chen, and C. Cheewakriangkrai, “Integration of data mining classification techniques and ensemble learning to identify risk factors and diagnose ovarian cancer recurrence,” Artificial Intelligence in Medicine, vol. 78, pp. 47–54, May 2017, doi: https://doi.org/10.1016/j.artmed.2017.06.003
I. H, Data Mining: Practical Machine Learning Tools and Techniques. Elsevier, 2011. doi: https://doi.org/10.1016/c2009-0-19715-5.
S. Patel and H. Patel, "Survey of data mining techniques used in healthcare domain," International Journal of Information Sciences and Techniques, vol. 6, no. 1, pp. 53–60, Mar. 2016. Available: https://www.aircconline.com/ijist/V6N2/6216ijist06.pdf