Open Access Open Access  Restricted Access Subscription Access
Open Access Open Access Open Access  Restricted Access Restricted Access Subscription Access

Logistic Regression for Breast Cancer Analysis


Affiliations
1 Department of Computer Science, The Northcap University, India
     

   Subscribe/Renew Journal


In this study, logistic regression on mammograms is used to diagnose breast cancer. The aim of using logistic regression is to obtain the significant clinical factors contributing more towards higher probability of breast cancer. The sample data set is taken from UC Irvine repository and modeled using the regression model. A 10-fold cross validation is applied on the training data set to avoid the over fitting problem. The sample data set contains mammograms samples collected by a survey conducted by the Radiologist. The classification table of 450 samples illustrations the correct classification percentage for mammogram as 96.6%. The result is then compared with 30 validated samples, correct classification 68.9%.The simulation results claims that the used linear regression model is able to map relationships among attributes by giving more accurate classification

Keywords

Breast Cancer, Mammograms, Prediction, Logistic Regression, Factors and Accuracy.
User
Subscription Login to verify subscription
Notifications
Font Size


  • Logistic Regression for Breast Cancer Analysis

Abstract Views: 404  |  PDF Views: 4

Authors

Bhoomi Sharma
Department of Computer Science, The Northcap University, India
Abhimanyu Abhimanyu
Department of Computer Science, The Northcap University, India
Anuradha Anuradha
Department of Computer Science, The Northcap University, India
Yogita Gigras
Department of Computer Science, The Northcap University, India

Abstract


In this study, logistic regression on mammograms is used to diagnose breast cancer. The aim of using logistic regression is to obtain the significant clinical factors contributing more towards higher probability of breast cancer. The sample data set is taken from UC Irvine repository and modeled using the regression model. A 10-fold cross validation is applied on the training data set to avoid the over fitting problem. The sample data set contains mammograms samples collected by a survey conducted by the Radiologist. The classification table of 450 samples illustrations the correct classification percentage for mammogram as 96.6%. The result is then compared with 30 validated samples, correct classification 68.9%.The simulation results claims that the used linear regression model is able to map relationships among attributes by giving more accurate classification

Keywords


Breast Cancer, Mammograms, Prediction, Logistic Regression, Factors and Accuracy.

References