Open Access
Subscription Access
Data Preprocessing: the Techniques for Preparing Clean and Quality Data for Data Analytics Process
The model and pattern for real time data mining have an important role for decision making. The meaningful real time data mining is basically depends on the quality of data while row or rough data available at warehouse. The data available at warehouse can be in any format, it may huge or it may unstructured. These kinds of data require some process to enhance the efficiency of data analysis. The process to make it ready to use is called data preprocessing. There can be many activities for data preprocessing such as data transformation, data cleaning, data integration, data optimization and data conversion which are use to converting the rough data to quality data. The data preprocessing techniques are the vital step for the data mining. The analyzed result will be good as far as data quality is good. This paper is about the different data preprocessing techniques which can be use for preparing the quality data for the data analysis for the available rough data.
Keywords
Data Preprocessing; Data Cleaning; Data Transformation; Data integration; Data Optimization; Data Conversion.
User
Font Size
Information
- Suad A. Alasadi and Wesam S. Bhaya, “Review of Data Preprocessing techniques in data mining” – Journal of Engineering and Apllied Sciences, 1816-949X
- Dharmarajan R and R.Vijayashanthi, “An overview on data preprocessing methods in data mining” - International journal of Science and Research, 3544-3546
- Tomar D. and S. Agarwal, “A survey on preprocessing and post processing techniques in data mining” – International Journal of database theory application, 99128.
- S. B. Kotsiantis, D. Kanellopoulos and P. E. Pintelas, “Data Preprocessing for Supervised Leaning” - International Journal Of Computer Science Issn 1306-4428
- Vivek Agarwal, “Research on Data Preprocessing and Categorization Technique for Smartphone Review Analysis” International Journal of Computer Applications (0975 – 8887)
- Rima Houari, Ahcène Bounceur , Tahar Kechadi, “A New Method for Estimation of Missing Data Based on Sampling Methods for Data Mining” - https://www.researchgate.net/publication/259007815_A_New_Method_for_Estimation_of_Missing_Data_Based_on_Sampling_Methods_for_Data_Mining - (23/04/2020)
- DeSarbo, W.S, Green, P.E, Carroll, J.D, Missing data in product-concept testing. Decision Sciences 17,163-185,1986
- J.W, Some simple procedures for handling missing data in multivariate analysis. Psychometrika 41, 409-415,1976.
Abstract Views: 214
PDF Views: 2