Scopus Indexed Publications

Paper Details


Title
A comparative study of machine learning models with LASSO and SHAP feature selection for breast cancer prediction
Author
Md. Shazzad Hossain Shaon, MD. SHAHRIAR SHAKIL, MD. ZAHID HASAN, TASMIN KARIM,
Email
Abstract

In recent decades, breast cancer has become the most prevalent type of cancer that impacts women in the world, which shows a significant risk to the death rates of women. Early identification of breast cancer might drastically decrease patient mortality and greatly improve the chance of an effective treatment. In modern times, machine learning models have become crucial for classifying cancer and strengthening both the accuracy and efficiency of diagnostic and medical treatment strategies. Therefore, this study is focused on early detection of breast cancer using a variety of machine learning algorithms and desires to identify the most effective feature selection process with an amalgamated dataset. Initially, we evaluated five traditional models and two meta-models on separate datasets. To find the most valuable features, the study used the Least Absolute Shrinkage and Selection Operator (LASSO) as well as SHapley Additive exPlanations (SHAP) selection methods and analyzed them through a wide range of performance regulations. Additionally, we applied these models to the combined dataset and observed that the mergeddataset was significantly beneficial for breast cancer diagnosis. After analyzing the feature selection strategies, it was demonstrated that the majority of models performed more accurately when utilizing SHAP methodologies. Notably, three traditional models and two meta-classifiers obtained an accuracy of 99.82%, demonstrating superior performance compared to state-of-the-art methods. This advancement holds a crucial role as it lays the foundation for refining diagnostic tools and enhancing the progression of medical science in this field.


Keywords
Breast cancerMachine learningDiagnostic toolsFeature selectionMeta-modelsSHapley additive exPlanations (SHAP)
Journal or Conference Name
Healthcare Analytics
Publication Year
2024
Indexing
scopus