TY - GEN
T1 - Deep Learning for Lung Cancer Prediction
T2 - 6th International Conference on Artificial Intelligence, Robotics, and Control, AIRC 2025
AU - Coffie, Lord
AU - Kim, Jongyeop
AU - Chen, Lei
N1 - Publisher Copyright:
© 2025 IEEE.
PY - 2025/5/7
Y1 - 2025/5/7
N2 - Globally, lung cancer continues to be among the common causes of cancer death; therefore, the importance of early patient survival increases in its detection. This study examines how deep learning can predict lung cancer using structured clinical and lifestyle survey data. A comparison was made between a multi-layer perceptron (MLP) and the random forest, XGBoost, and support vector machine (SVM) methods, as well as logistic regression machine learning models. The results demonstrate that ensemble-based models (Random Forest & XGBoost) outperformed the deep learning model, achieving an accuracy of 99.7 % compared to MLP's 94.7 %. We employed SHAP and LIME explainability techniques to improve model interpretability, identifying allergy, swallowing diffculty, and coughing as the most significant predictors of lung cancer. These results show how important it is to make AI-based medical predictions more interpretable in order to increase their use in clinical settings. While deep learning performed well, traditional machine learning models proved to be more effective for structured survey-based datasets. The study shows that machine learning could be useful for screening for lung cancer. It also stresses the need for more research on data augmentation biases, external validation on real-world clinical datasets, and advanced deep learning architectures to make the models more accurate.
AB - Globally, lung cancer continues to be among the common causes of cancer death; therefore, the importance of early patient survival increases in its detection. This study examines how deep learning can predict lung cancer using structured clinical and lifestyle survey data. A comparison was made between a multi-layer perceptron (MLP) and the random forest, XGBoost, and support vector machine (SVM) methods, as well as logistic regression machine learning models. The results demonstrate that ensemble-based models (Random Forest & XGBoost) outperformed the deep learning model, achieving an accuracy of 99.7 % compared to MLP's 94.7 %. We employed SHAP and LIME explainability techniques to improve model interpretability, identifying allergy, swallowing diffculty, and coughing as the most significant predictors of lung cancer. These results show how important it is to make AI-based medical predictions more interpretable in order to increase their use in clinical settings. While deep learning performed well, traditional machine learning models proved to be more effective for structured survey-based datasets. The study shows that machine learning could be useful for screening for lung cancer. It also stresses the need for more research on data augmentation biases, external validation on real-world clinical datasets, and advanced deep learning architectures to make the models more accurate.
KW - Deep Learning
KW - Explainability
KW - Lung Cancer Prediction
KW - Machine Learning
UR - https://www.scopus.com/pages/publications/105012173786
U2 - 10.1109/AIRC64931.2025.11077551
DO - 10.1109/AIRC64931.2025.11077551
M3 - Conference article
AN - SCOPUS:105012173786
SN - 9798331543488
T3 - 2025 6th International Conference on Artificial Intelligence, Robotics and Control (AIRC)
SP - 354
EP - 360
BT - 2025 6th International Conference on Artificial Intelligence, Robotics, and Control, AIRC 2025
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 7 May 2025 through 9 May 2025
ER -