Open Access Open Access  Restricted Access Subscription Access

A Comparative Analysis of Classification Algorithms on Weather Dataset Using Data Mining Tool


Affiliations
1 Department of CSE, S R Engineering College, Warangal 506371, Telangana, India
 

Data mining has become one of the emerging fields in research because of its vast contents. Data mining is used for finding hidden patterns in the database or any other information repository. This information is necessary to generate knowledge from the patterns. The main task is to extract knowledge out of the information. In this paper we use a data mining technique called classification to determine the playing condition based on the current temperature values. Classification technique is a powerful way to classify the attributes of the dataset into different classes. In our approach we use classification algorithms like Decision Tree (J48), REP Tree and Random Tree. Then we compare the efficiencies of these classification algorithms. The tool we use for this approach is WEKA (Waikato Environment for Knowledge Analysis) a collection of open source machine learning algorithms.

Keywords

Data Mining, Data Set, Classification.
User
Notifications
Font Size

  • Germano C. Vasconcelos, Paulo J. L. Adeodato and Domingos S. M. P. Monteiro. 1999. A Neural Network Based Solution for the Credit Risk Assessment Problem. Proceedings of the IV Brazilian Conference on Neural Networks - IV Congresso Brasileiro de Redes Neurais, (July 1999), 269-274.
  • Tian-Shyug Lee, Chih-Chou Chiu, Chi-Jie Lu and I-Fei Chen. 2002. Credit scoring using the hybrid neural discriminant technique. Expert Systems with Applications (Elsevier) 23, 245–254.
  • Dr. Sudhir B. Jagtap, Dr. Kodge B. G, “Census Data Mining and Data Analysis using WEKA”, International Conference in “Emerging Trends in Science, Technology and Management,2013
  • S.Archana1, Dr. K.Elangovan, “Survey of Classification Techniques in Data Mining”, International Journal of Computer Science and Mobile Applications, Vol.2 Issue. 2, February- 2014
  • Zan Huang, Hsinchun Chena, Chia-Jung Hsu, Wun-Hwa Chen and Soushan Wu. 2004. Credit rating analysis with support vector machines and neural networks: a market comparative study,” Decision Support Systems (Elsevier) 37, 543– 558.
  • Kin Keung Lai, Lean Yu, Shouyang Wang, and Ligang Zhou. 2006. Credit Risk Analysis Using a ReliabilityBased Neural Network Ensemble Model. S. Kollias et al. (Eds.): ICANN 2006, Part II, Springer LNCS 4132, 682 – 690.
  • Eliana Angelini, Giacomo di Tollo, and Andrea Roli. 2006. A Neural Network Approach for Credit Risk Evaluation,” Kluwer Academic Publishers, 1 – 22.
  • S. Kotsiantis. 2007. Credit risk analysis using a hybrid data mining model. Int. J. Intelligent Systems Technologies and Applications, Vol. 2, No. 4, 345 – 356.
  • Hamadi Matoussi and Aida Krichene. 2007. Credit risk assessment using Multilayer Neural Network Models - Case of a Tunisian bank.
  • Lean Yu, Shouyang Wang, and Kin Keung Lai. 2008. Credit risk assessment with a multistage neural network ensemble learning approach. Expert Systems with Applications (Elsevier) 34, pp.1434–1444.
  • Sanaz Pourdarab, Ahmad Nadali and Hamid Eslami Nosratabadi. 2011. A Hybrid Method for Credit Risk Assessment of Bank Customers. International Journal of Trade, Economics and Finance, Vol. 2, No. 2, (April 2011)
  • UCI Machine Learning Data Repository – http://archive.ics.uci.edu/ml/datasets.
  • Tina R. Patil, and S. S. Sherekar. 2013. Performance Analysis of Naive Bayes and J48 Classification Algorithm for Data Classification. International Journal Of Computer Science And Applications Vol. 6, No.2, (Apr 2013), 256 - 261.
  • Witten IH, and Frank E. 2005. Data mining: practical machine learning tools and techniques – 2nd ed. the United States of America, Morgan Kaufmann series in data management systems.
  • Quinlan J (1987) Simplifying decision trees, International Journal of Man Machine Studies, 27(3), 221–234.
  • S.K. Jayanthi and S.Sasikala. 2013. REPTree Classifier for indentifying Link Spam in Web Search Engines. IJSC, Volume 3, Issue 2, (Jan 2013), 498 – 505.
  • Leo Breiman. 2001. Random Forests. Machine Learning. 45(1): 5-32.
  • Margaret H. Danham, and S. Sridhar. 2006. Data mining, Introductory and Advanced Topics. Person education, 1st Edition
  • Lakshmi Devasena, C. 2014. Efficiency Comparison of Multilayer Perceptron and SMO Classifier for Credit Risk Prediction. International Journal of Advanced Research in Computer and Communication Engineering, Vol. 3, Issue 4, 6156 – 6162
  • Bhavani M, Vinod Kumar S “A data mining approach for precise diagnosis of dengue fever”, International journal of latest trends in engineering and technology,vol. 7, issue 4. 2016

Abstract Views: 403

PDF Views: 0




  • A Comparative Analysis of Classification Algorithms on Weather Dataset Using Data Mining Tool

Abstract Views: 403  |  PDF Views: 0

Authors

D. Ramesh
Department of CSE, S R Engineering College, Warangal 506371, Telangana, India
Syed Nawaz Pasha
Department of CSE, S R Engineering College, Warangal 506371, Telangana, India
G. Roopa
Department of CSE, S R Engineering College, Warangal 506371, Telangana, India

Abstract


Data mining has become one of the emerging fields in research because of its vast contents. Data mining is used for finding hidden patterns in the database or any other information repository. This information is necessary to generate knowledge from the patterns. The main task is to extract knowledge out of the information. In this paper we use a data mining technique called classification to determine the playing condition based on the current temperature values. Classification technique is a powerful way to classify the attributes of the dataset into different classes. In our approach we use classification algorithms like Decision Tree (J48), REP Tree and Random Tree. Then we compare the efficiencies of these classification algorithms. The tool we use for this approach is WEKA (Waikato Environment for Knowledge Analysis) a collection of open source machine learning algorithms.

Keywords


Data Mining, Data Set, Classification.

References





DOI: https://doi.org/10.13005/ojcst%2F10.04.13