Open Access Open Access  Restricted Access Subscription Access
Open Access Open Access Open Access  Restricted Access Restricted Access Subscription Access

Heart Disease Prediction using Data Mining Techniques


Affiliations
1 Department of Computer Science and Engineering, Manonmaniam Sundaranar University, India
     

   Subscribe/Renew Journal


Data mining is a technique that is performed on large databases for extracting hidden patterns by using combinational strategy from statistical analysis, machine learning and database technology. Further, the medical data mining is an extremely important research field due to its importance in the development of various applications in flourishing healthcare domain. While summarizing the deaths occurring worldwide, the heart disease appears to be the leading cause. The identification of the possibility of heart disease in a person is complicated task for medical practitioners because it requires years of experience and intense medical tests to be conducted. In this work, three data mining classification algorithms like Random Forest, Decision Tree and Naïve Bayes are addressed and used to develop a prediction system in order to analyse and predict the possibility of heart disease. The main objective of this significant research work is to identify the best classification algorithm suitable for providing maximum accuracy when classification of normal and abnormal person is carried out. Thus prevention of the loss of lives at an earlier stage is possible. The experimental setup has been made for the evaluation of the performance of algorithms with the help of heart disease benchmark dataset retrieved from UCI machine learning repository. It is found that Random Forest algorithm performs best with 81% precision when compared to other algorithms for heart disease prediction.

Keywords

Data Mining, Classification, Prediction, Heart Disease.
Subscription Login to verify subscription
User
Notifications
Font Size

  • Carlos Ordonez, “Improving Heart Disease Prediction using Constrained Association Rules”, Technical Seminar Presentation, University of Tokyo, 2004.
  • Franck Le Duff, CristianMunteanb, Marc Cuggiaa and Philippe Mabob, “Predicting Survival Causes After Out of Hospital Cardiac Arrest using Data Mining Method”, Studies in Health Technology and Informatics, Vol. 107, No. 2, pp. 1256-1259, 2004.
  • W.J. Frawley and G. Piatetsky-Shapiro, “Knowledge Discovery in Databases: An Overview”, AI Magazine, Vol. 13, No. 3, pp. 57-70, 1996.
  • Heon Gyu Lee, Ki Yong Noh and Keun Ho Ryu, “Mining Bio Signal Data: Coronary Artery Disease Diagnosis using Linear and Nonlinear Features of HRV”, Proceedings of International Conference on Emerging Technologies in Knowledge Discovery and Data Mining, pp. 56-66, 2007.
  • Kiyong Noh, HeonGyu Lee, Ho-Sun Shon, Bum Ju Lee and Keun Ho Ryu, “Associative Classification Approach for Diagnosing Cardiovascular Disease”, Intelligent Computing in Signal Processing and Pattern Recognition, Vol. 345, pp. 721-727, 2006.
  • Latha Parthiban and R. Subramanian, “Intelligent Heart Disease Prediction System using CANFIS and Genetic Algorithm”, International Journal of Biological, Biomedical and Medical Sciences, Vol. 3, No. 3, pp. 1-8, 2008.
  • Niti Guru, Anil Dahiya and Navin Rajpal, “Decision Support System for Heart Disease Diagnosis using Neural Network”, Delhi Business Review, Vol. 8, No. 1, pp. 1-6, 2007.
  • Sellappan Palaniappan and Rafiah Awang, “Intelligent Heart Disease Prediction System using Data Mining Techniques”, International Journal of Computer Science and Network Security, Vol. 8, No. 8, pp. 1-6, 2008.
  • Shantakumar B. Patil and Y.S. Kumaraswamy, “Intelligent and Effective Heart Attack Prediction System using Data Mining and Artificial Neural Network”, European Journal of Scientific Research, Vol. 31, No. 4, pp. 642-656, 2009.
  • X. Yanwei et al., “Combination Data Mining Models with New Medical Data to Predict Outcome of Coronary Heart Disease”, Proceedings of International Conference on Convergence Information Technology, pp. 868-872, 2007.
  • Ersen Yilmaz and Caglar Kilikcier, “Determination of Patient State from Cardiotocogram using LS-SVM with Particle Swarm Optimization and Binary Decision Tree”, Master Thesis, Department of Electrical Electronic Engineering, Uludag University, 2013.
  • Nidhi Singh and Divakar Singh, “Performance Evaluation of K-Means and Hierarchal Clustering in Terms of Accuracy and Running Time”, Ph.D Dissertation, Department of Computer Science and Engineering, Barkatullah University Institute of Technology, 2012.
  • A. Liaw and Matthew Wiener, “Classification and Regression by Random Forest”, R News, Vol. 2, No. 3, pp. 18-22, 2002.
  • J. Ross Quinlan, “Induction of Decision Trees”, Machine Learning, Vol. 1, No. 1, pp. 81-106, 1986.
  • K. Ming Leung, “Naive Bayesian Classifier”, Master Thesis, Department of Computer Science and Engineering, Polytechnic University, 2007.

Abstract Views: 185

PDF Views: 1




  • Heart Disease Prediction using Data Mining Techniques

Abstract Views: 185  |  PDF Views: 1

Authors

H. Benjamin Fredrick David
Department of Computer Science and Engineering, Manonmaniam Sundaranar University, India
S. Antony Belcy
Department of Computer Science and Engineering, Manonmaniam Sundaranar University, India

Abstract


Data mining is a technique that is performed on large databases for extracting hidden patterns by using combinational strategy from statistical analysis, machine learning and database technology. Further, the medical data mining is an extremely important research field due to its importance in the development of various applications in flourishing healthcare domain. While summarizing the deaths occurring worldwide, the heart disease appears to be the leading cause. The identification of the possibility of heart disease in a person is complicated task for medical practitioners because it requires years of experience and intense medical tests to be conducted. In this work, three data mining classification algorithms like Random Forest, Decision Tree and Naïve Bayes are addressed and used to develop a prediction system in order to analyse and predict the possibility of heart disease. The main objective of this significant research work is to identify the best classification algorithm suitable for providing maximum accuracy when classification of normal and abnormal person is carried out. Thus prevention of the loss of lives at an earlier stage is possible. The experimental setup has been made for the evaluation of the performance of algorithms with the help of heart disease benchmark dataset retrieved from UCI machine learning repository. It is found that Random Forest algorithm performs best with 81% precision when compared to other algorithms for heart disease prediction.

Keywords


Data Mining, Classification, Prediction, Heart Disease.

References