Open Access Open Access  Restricted Access Subscription Access
Open Access Open Access Open Access  Restricted Access Restricted Access Subscription Access

SMO, IBK and Logistic Classifiers for Classification of Breast Cancer


Affiliations
1 Department of ECE, Bannari Amman Institute of Technology, India
     

   Subscribe/Renew Journal


Breast cancer has become common cancer among various ages of women in world. The symptoms of breast cancer include the lumps, change in shape, change in color of skin, liquid oozing out from nipple. The occurrence of breast cancer is increasing every year by year and increased incidence is not surprising since there has been increase in numbers of women with high breast cancer risk factors, including lower age of menarche, late age of first pregnancy, fewer pregnancies, shorter or no periods of breastfeeding, later menopause, prolactin hormone levels, use of birth control pills and radiation exposure in youth. In order to reduce the mortality of breast cancer effective early classification detection techniques are required for long term survival of the women. Classification is one of the most commonly used tool analyze and classify the data. This paper analyzes the classifier algorithms such as SMO, Logistic and MLP for seer breast cancer dataset using WEKA software. The performance of the classifiers are evaluated against the parameters like classification accuracy, RMSE, TP Rate, FP Rate, Precision, Recall, F-Measure, ROC, Specificity, Sensitivity and so on. MLP Classifier is superior to SMO and Logistic and it has the highest classification accuracy of 93.5%.


Keywords

Classification, SMO, IBK, Logistic Regression, Perceptron, Accuracy, RMSE, Confusion Matrix.
User
Subscription Login to verify subscription
Notifications
Font Size

  • Aruna S, Rajagopalan SP, Nandakishore LV, "Knowledge based analysis of various statistical tools in detecting breast cancer,” Computer Science & Information Technology, 37–45, 2011.
  • Vaidehi K and Subashini TS, “Breast tissue characterization using combined K-NN classifier,” Indian Journal of Science and Technology, 6-23, 2015
  • Williams K, Idowu PA, Balogun JA, and Oluwaranti A, “Breast cancer risk prediction using data mining classification techniques,” IEEE Transactions on Networks and Communications, Vol. 3, No. 2, 1–11, 2015.
  • http://seer.cancer.gov/popdata/popdic.html-SEER dictionary
  • T. M. Cover, “Geometrical and Statistical Properties of Systems of Linear with Applications in Pattern Recognition,” IEEE Transactions on Electronic Computers EC-14, pp. 326-334, 1965
  • Ramnath Takiar et al, “Projections of Number of Cancer Cases in India (2010-2020) by Cancer Groups,” Asian Pacific Journal of Cancer Prevention, Vol.11, 2010
  • R. W. Brause, “Medical analysis and diagnosis by neural networks,” Lecture notes In Computer Science, vol. 2199, pp. 1-13, 2001
  • Evanthia E. Tripoliti et al, “Automated Diagnosis of Diseases Based on Classification: Dynamic Determination of the Number of Trees in Random Forests Algorithm,” IEEE Transactions On Information Technology In Bio medicine,Vol. 16,No.4, July 2012
  • Xindog Wu, Vipin Kumar et al, “Top 10 Algorithms in Data Mining,” Knowledge and Information Systems, Vol. 14, 1-13, 2008
  • H.L. Chen, B. Yang, G. Wang, S.J. Wang, J. Liu, D.Y. Liu, “Support vector machine based diagnostic system for breast cancer using swarm intelligence,” J. Med. Syst. Vol.36 No.4,pp 2505–2519, 2012.
  • G. Salama, M.B. Abdelhalim, and Magdy Abd-elghany Zeid. Son,” Breast Cancer Diagnosis on Three Different Datasets Using Multiclassifiers,” International Journal of Computer and Information Technology, Vol. 01, pp. 36-43, September 2012
  • Sharma AK, Sahni S.A,”Comparative study of classification algorithms for spam email data analysis,” International Journal on Computer Science and Engineering, Vol. 3, No.5, 2011.
  • Kaur G, Chhabra A,” Improved J48 classification algorithm for the prediction of diabetes,” International Journal of Computer Applications, Vol.98, No.22, pp 7-13, 2014
  • Aloraini A,” Different machine learning algorithms for breast cancer diagnosis,” International Journal of Artificial Intelligence and Applications, Vol.3, No.6, pp 21-30, 2012
  • Saabith ALS, Elankovan S, Abu Bakar A, “Comparative study on different classification techniques for breast cancer dataset”, Vol.3, No.10, 2010
  • Poomani N, Porkodi. R, “A comparison of Data Mining classification algorithms using breast cancer microarray dataset: A study”, International Journal for Scientific Research and Development, Vol.2, No.12, 2015

Abstract Views: 241

PDF Views: 4




  • SMO, IBK and Logistic Classifiers for Classification of Breast Cancer

Abstract Views: 241  |  PDF Views: 4

Authors

P. Hamsagayathri
Department of ECE, Bannari Amman Institute of Technology, India
P. Sampath
Department of ECE, Bannari Amman Institute of Technology, India

Abstract


Breast cancer has become common cancer among various ages of women in world. The symptoms of breast cancer include the lumps, change in shape, change in color of skin, liquid oozing out from nipple. The occurrence of breast cancer is increasing every year by year and increased incidence is not surprising since there has been increase in numbers of women with high breast cancer risk factors, including lower age of menarche, late age of first pregnancy, fewer pregnancies, shorter or no periods of breastfeeding, later menopause, prolactin hormone levels, use of birth control pills and radiation exposure in youth. In order to reduce the mortality of breast cancer effective early classification detection techniques are required for long term survival of the women. Classification is one of the most commonly used tool analyze and classify the data. This paper analyzes the classifier algorithms such as SMO, Logistic and MLP for seer breast cancer dataset using WEKA software. The performance of the classifiers are evaluated against the parameters like classification accuracy, RMSE, TP Rate, FP Rate, Precision, Recall, F-Measure, ROC, Specificity, Sensitivity and so on. MLP Classifier is superior to SMO and Logistic and it has the highest classification accuracy of 93.5%.


Keywords


Classification, SMO, IBK, Logistic Regression, Perceptron, Accuracy, RMSE, Confusion Matrix.

References