
An Improved Prediction of Kidney Disease using SMOTE


Affiliations
1 K L University, Guntur - 522502, Andhra Pradesh, India
2 Department of Computer Science and Engineering, K L University, Guntur - 522502, Andhra Pradesh, India
 

Objectives: This article presents a framework to improve the accuracy of rule induction and decision tree models.

Analysis: We used a rebalancing algorithm called SMOTE (Synthetic Minority Over-sampling Technique) to improve the accuracy of several rule induction and decision tree models for predicting kidney disease in patients. The data analysed for this prediction were collected from Apollo Hospitals, Tamil Nadu, India.

Findings: The initial dataset is imbalanced, i.e. most of the instances belong to the same class. When a dataset is imbalanced, traditional models cannot produce accurate results. The proposed framework therefore improves model accuracy by balancing the dataset: SMOTE, a technique that oversamples the minority class, is applied to the existing dataset so that the percentage difference between the classes is minimized. Results obtained with various classification algorithms, compared on the imbalanced and the balanced dataset, show that accuracy increases when the oversampling algorithm is used. In particular, this method attains an average accuracy of 98.73%.

Applications: This method can be applied in other areas to improve accuracy when the dataset is imbalanced. For Big Data, SMOTE can also be applied using the Hadoop framework and the MapReduce programming model with a new algorithmic approach.
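The oversampling step named in the abstract can be illustrated with a minimal sketch. It shows only the core idea behind SMOTE, namely creating synthetic minority instances by interpolating between a minority sample and one of its nearest minority neighbours. The function name `smote`, its parameters, and the toy data are assumptions made for illustration; they are not the authors' implementation, their settings, or the Apollo Hospitals dataset. For simplicity, this sketch picks base samples at random rather than generating a fixed number of synthetic points per minority sample.

```python
import math
import random


def smote(minority, n_synthetic, k=5, seed=0):
    """Generate n_synthetic points by interpolating between minority samples.

    minority: list of equal-length numeric feature vectors (minority class only).
    k: number of nearest minority neighbours considered for each base sample.
    """
    if len(minority) < 2:
        raise ValueError("need at least two minority samples to interpolate")
    rng = random.Random(seed)
    k = min(k, len(minority) - 1)
    synthetic = []
    for _ in range(n_synthetic):
        i = rng.randrange(len(minority))
        base = minority[i]
        # k nearest neighbours of the base sample among the other minority samples
        others = [p for j, p in enumerate(minority) if j != i]
        neighbours = sorted(others, key=lambda p: math.dist(base, p))[:k]
        neighbour = rng.choice(neighbours)
        # The new point lies on the segment between base and the chosen neighbour
        gap = rng.random()
        synthetic.append([b + gap * (n - b) for b, n in zip(base, neighbour)])
    return synthetic


# Toy data (hypothetical values, not the dataset used in the study)
majority = [[float(i), float(i % 3)] for i in range(10)]
minority = [[1.0, 1.0], [1.5, 1.2], [2.0, 0.8]]

# Add enough synthetic minority points to match the majority class size
new_points = smote(minority, len(majority) - len(minority), k=2)
balanced_minority = minority + new_points
```

After this step both classes contain the same number of instances, so a classifier trained on the result no longer sees a dominant class. In practice, oversampling would normally be applied to the training split only, so that synthetic points do not leak into the evaluation data.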

 


Keywords

Classification, Data Mining, Health Informatics, Kidney Failure, SMOTE.

Authors

Sai Prasad Potharaju
K L University, Guntur - 522502, Andhra Pradesh, India
M. Sreedevi
Department of Computer Science and Engineering, K L University, Guntur - 522502, Andhra Pradesh, India


DOI: https://doi.org/10.17485/ijst%2F2016%2Fv9i31%2F130639