Open Access Open Access  Restricted Access Subscription Access
Open Access Open Access Open Access  Restricted Access Restricted Access Subscription Access

Comparative Analysis on Classification Data Mining Techniques Through WEKA


Affiliations
1 Department of Information Technology, Datta Meghe Institute of Engineering, Technology & Research, Wardha, India
2 Department of Information Technology, SGMCOE, Shegao, India
3 Cluebix Software Pvt. Ltd, Nagpur, India
     

   Subscribe/Renew Journal


Data mining refers to extraction or mining of information/knowledge from huge amounts of data. Data mining is also called as Knowledge Discovery from Database (KDD). There are number of data mining techniques such as classification and regression; the mining of frequent patterns, associations, and correlations; clustering analysis; and outlier analysis.
Classification is a major technique in data mining which is widely used in various fields.
Classification can be defined as the process of finding a model (or function) that describes and distinguishes data classes or concepts, for the purpose of being able to use the function/model to guess the class of objects whose class label is unknown. The derived model is based on the analysis of a set of training data (i.e., data objects whose class label is known). Classification models predict categorical class labels. Classification may also called as supervised Learning.
In this paper, we are going to discuss the various techniques of classification. Different kinds of classification techniques includes if-then rule, decision tree, Neural Network. Bayesian networks, k-nearest neighbor classifier, and support vector machine (SVM), and the aim of this study is to provide a comprehensive review of different classification techniques in data mining.

Keywords

Classification, Data Mining, Decision Tree, Supervised Learning, WEKA, Pattern Evaluation, Neural Network, KDD.
User
Subscription Login to verify subscription
Notifications
Font Size

  • Jiawei Han,”Data Mining: Concepts and Techniques”, Second Edition, Morgan Kaufmann, 2006.
  • Ritu Sharma et al “Comparative Analysis of Classification Techniques in Data Mining Using Different Datasets” International Journal of Computer Science and Mobile Computing, Vol.4 Issue.12, December-2015, pg. 125-134.
  • A Tutorial on Support Vector Machines for Pattern Recognition CHRISTOPHER J.C. BURGES.
  • S.Archana1, Dr. K. Elangovan, “Survey of Classification Techniques in Data Mining”, International Journal of Computer Science and Mobile Applications, Vol.2 Issue. 2, February-2014.
  • S.Neelamegam, Dr.E.Ramaraj, “Classification algorithm in Data mining: An Overview”, International Journal of P2P Network Trends and Technology, Volume 4 Issue 8-Sep 2013.
  • Megha Gupta, Naveen Aggarwal, “CLASSIFICATION TECHNIQUES ANALYSIS”, National Conference on Computational Instrumentation, March 2010.
  • Dr. Sudhir B. Jagtap, Dr. Kodge B. G, “Census Data Mining and Data Analysis using WEKA”, International Conference in “Emerging Trends in Science, Technology and Management, 2013.
  • Dr.A.Bharathi, E.Deepankumar,” Survey on Classification Techniques in Data Mining”, International Journal on Recent and Innovation Trends in Computing and Communication, Volume. 2 Issue. 7, July 2014.
  • A tutorial on support vector machines for pattern recognition,” Data Mining and Knowledge Discovery, vol. 2, no. 2, pp. 121-167, 1998 by C. Burges.
  • N. Abirami, T. Kamalakannan, Dr. A. Muthukumaravel, “A Study on Analysis of Various Data mining Classification Techniques on Healthcare Data”, International Journal of Emerging Technology and Advanced Engineering, Volume 3, Issue 7, July 2013.
  • SAGAR S. NIkAM,” A Comparative Study of Classification Techniques in Data Mining Algorithms, ORIENTAL JOURNAL OF COMPUTER SCIENCE & TECHNOLOGY, ISSN: 0974-6471, April 2015, Vol. 8, No. (1): Pgs. 13-19.
  • E. K. Onyari and F. M. Ilunga,” Application of MLP Neural Network and M5P Model Tree in Predicting Streamflow: A Case Study of Luvuvhu Catchment”, International Journal of Innovation, Management and Technology, Vol. 4, No. 1, February 2013.
  • M. Thangaraj, C. R. Vijayalakshmi, “Performance Study on Rule-based Classification Techniques across Multiple Database Relations, International Journal of Applied Information Systems (IJAIS) –ISSN: 2249-0868,Foundation of Computer Science FCS, New York, USA Volume 5–No.4, March2013–www.ijais.org
  • Overview of Decision Trees by H.Hamilton. E. Gurak, L. Findlater W. Olive.
  • Decision tree Lior Rokach Department of Industrial Engineering Tel-Aviv University, Oded Maimon Department of Industrial Engineering Tel-Aviv University maimon@eng.tau.ac.il
  • http://www.cs.ubbcluj.ro/~gabis/DocDiplome/DT/DecisionTrees.pdf
  • S. P. Shinde, V.P. Deshmukh, “Implementation of Pattern Recognition Techniques and Overview of Its Applications in Various Areas of Artificial Intelligence, International Journal of Advances in Engineering & Technology, Sept 2011. ©IJAET ISSN: 2231-1963.
  • http://www.brighthubpm.com/project-planning/106000-advantages-of-decision-tree-analysis/
  • Dr. Neeraj Bhargava, Girja Sharma, Dr. Ritu Bhargava Manish Mathuria, “Decision Tree Analysis on J48 Algorithm for Data Mining, International Journal of Advanced Research in Computer Science and Software Engineering Volume 3, Issue 6,June 2013.
  • G.P. Zhang, “Neural networks for classification: a survey, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews) (Volume: 30, Issue: 4, Nov 2000).
  • Hassan Ramchoun, Mohammed Amine Janati Idrissi, Youssef Ghanou, Mohamed Ettaouil, “Multilayer Perceptron: Architecture Optimization and Training, International Journal of Interactive Multimedia and Artificial Intelligence, Vol. 4, N°1.
  • Dayana C. Tejera Hernandez, “An Experimental Study of K* Algorithm, I.J. Information Engineering and Electronic Business, 2015, 2, 14-19 Published Online March 2015 in MECS (http://www.mecs-press.org/) DOI: 10.5815/ijieeb.2015.02.03.
  • Overview of Bayesian network approaches to model gene-environment interactions and cancer susceptibility Chengwei Su, Angeline Andrew, Margaret Karagas, Mark E. Borsuk.
  • Sayali D. Jadhav, H.P. Channe, “Comparative Study of K-NN, Naive Bayes and Decision Tree Classification Techniques, International Journal of Science and Research (IJSR) ISSN (Online): 2319-7064 Index Copernicus Value (2013): 6.14 | Impact Factor (2014): 5.611.
  • Data Mining: practical machine learning tools and techniques with java implementation” Morgan Kaufmann Publishers, San Francisco, USA, 2000. http://www.cs.waikato.ac.nz/ml/weka/. By I. H. Witten and E.farnk.

Abstract Views: 272

PDF Views: 4




  • Comparative Analysis on Classification Data Mining Techniques Through WEKA

Abstract Views: 272  |  PDF Views: 4

Authors

Lukeshkumar Barapatre
Department of Information Technology, Datta Meghe Institute of Engineering, Technology & Research, Wardha, India
Anand Sharma
Department of Information Technology, SGMCOE, Shegao, India
Hemant Barapatre
Cluebix Software Pvt. Ltd, Nagpur, India

Abstract


Data mining refers to extraction or mining of information/knowledge from huge amounts of data. Data mining is also called as Knowledge Discovery from Database (KDD). There are number of data mining techniques such as classification and regression; the mining of frequent patterns, associations, and correlations; clustering analysis; and outlier analysis.
Classification is a major technique in data mining which is widely used in various fields.
Classification can be defined as the process of finding a model (or function) that describes and distinguishes data classes or concepts, for the purpose of being able to use the function/model to guess the class of objects whose class label is unknown. The derived model is based on the analysis of a set of training data (i.e., data objects whose class label is known). Classification models predict categorical class labels. Classification may also called as supervised Learning.
In this paper, we are going to discuss the various techniques of classification. Different kinds of classification techniques includes if-then rule, decision tree, Neural Network. Bayesian networks, k-nearest neighbor classifier, and support vector machine (SVM), and the aim of this study is to provide a comprehensive review of different classification techniques in data mining.

Keywords


Classification, Data Mining, Decision Tree, Supervised Learning, WEKA, Pattern Evaluation, Neural Network, KDD.

References