Open Access Open Access  Restricted Access Subscription Access
Open Access Open Access Open Access  Restricted Access Restricted Access Subscription Access

An Application of Naive Bayes Classifier to Explore Big Data Using XLSTAT.


Affiliations
1 Department of Statistics, New Arts Commerce and Science College, Ahmednagar (M.S.), India
2 Institute (M.P.K.V.), Rahuri, Ahmednagar (M.S.), India
     

   Subscribe/Renew Journal


The present ICT era has changed the scenario of multivariate data or information usage altogether. Organizations treat data as an asset and they try to employ various methodology to come up with organization progress oriented conclusions. A wide range of database tools to manage the huge data and equally number of software’s are also developed to visualize, present and analyses the big data. XLSTAT with diversified data analyzing utilities, is one such tool that can be appended to usual Excel software. The present paper gives an application of Naïve Bayes Classifier applied to a data on Global Super Store Orders-2016 (Source: secondary data obtain from data.world platform). This application will give the insight of understanding the concept of Naive Bayes Classifier. It will also show the effect of continuous data monitoring and maintenance on derived results of Naïve Bayes Classification. The summary of derived output will facilitate the comparison and will also give an idea about the overall trend of the factors under study. The step based analysis of big and diverse data shows that global accuracy of Naïve Bayes Classifier increases with increase in data size.

Keywords

Big Data, Confusion Matrix, Global Accuracy of the Model, Posterior Probability, Regression.
Subscription Login to verify subscription
User
Notifications
Font Size


  • Mai, Shouman, Tim, Turner and Rob, Stocker (2012). Appling KNearest Neighbor in diagnosing heart disease patient. Internat. J. Inform. & Edu. Technol., 2 (3): 220-223.
  • Rajakumar, R., Vishwanath, P. and Bindu, C. Shobha (2017). Nearest Neighbor Classifier: A review. Internat. J. Computational Inteligence Res., 13(2): 303-311.
  • Sadegh, Bafandeh, Imandousd and Mohammad, Bolandraftar (2013). Application of KNN approach for predicting economic events: Theoretical background. Internat. J. Engg. Res. & Applic., 3 (5): 605-610.

Abstract Views: 399

PDF Views: 0




  • An Application of Naive Bayes Classifier to Explore Big Data Using XLSTAT.

Abstract Views: 399  |  PDF Views: 0

Authors

M. S. Kasture
Department of Statistics, New Arts Commerce and Science College, Ahmednagar (M.S.), India
A. J. Shivagaje
Institute (M.P.K.V.), Rahuri, Ahmednagar (M.S.), India
C. G. Shelake
Department of Statistics, New Arts Commerce and Science College, Ahmednagar (M.S.), India
A. J. Nalavade
Institute (M.P.K.V.), Rahuri, Ahmednagar (M.S.), India

Abstract


The present ICT era has changed the scenario of multivariate data or information usage altogether. Organizations treat data as an asset and they try to employ various methodology to come up with organization progress oriented conclusions. A wide range of database tools to manage the huge data and equally number of software’s are also developed to visualize, present and analyses the big data. XLSTAT with diversified data analyzing utilities, is one such tool that can be appended to usual Excel software. The present paper gives an application of Naïve Bayes Classifier applied to a data on Global Super Store Orders-2016 (Source: secondary data obtain from data.world platform). This application will give the insight of understanding the concept of Naive Bayes Classifier. It will also show the effect of continuous data monitoring and maintenance on derived results of Naïve Bayes Classification. The summary of derived output will facilitate the comparison and will also give an idea about the overall trend of the factors under study. The step based analysis of big and diverse data shows that global accuracy of Naïve Bayes Classifier increases with increase in data size.

Keywords


Big Data, Confusion Matrix, Global Accuracy of the Model, Posterior Probability, Regression.

References