Open Access Open Access  Restricted Access Subscription Access

Discovery of Outlier from Database using Different Clustering Algorithms


Affiliations
1 Computer Science, B.R.C.M.
2 Department of Computer Science, B.R.C.M
 

Data mining is the process of extracting valuable and important data from a large or massive set of data. There are several techniques existing for data extraction. Clustering is one of the techniques amongst them. In clustering technique, we form the group of similar objects (similarity in terms of distance or there may be any other factor). Outlier detection is one of the major issues in data mining. Outlier represents that data which possess different behavior from others. Therefore, it is important to detect outlier from the extracted data. There are so many techniques existing to detect outlier but Clustering is one of the efficient techniques. In this paper, I have compared the result of different techniques with the result of Clustering techniques in terms of time complexity and proposed a new solution by adding fuzziness to already existing Clustering techniques.

Keywords

Data Mining, Clustering, Outlier Detection
User
Notifications

  • Ajay Challagalla, Shivaji Dhiraj SS, Somayajulu, DVLN, Toms Shaji Mathew, Saurav Tiwari, Syed Sharique Ahmad (2010) Privacy Preserving Outlier Detection Using Hierarchical Clustering Methods, 34th Annual IEEE Computer Software and Applications Conference Workshops.
  • Al-Zoubi M. (2009) An Effective Clustering-Based Approach for Outlier Detection, European Journal of Scientific Research.
  • Jiang S & An, Q (2008) Clustering Based Outlier Detection Method, Fifth International Conference on Fuzzy Systems and Knowledge Discovery.
  • John Peter S (2011) Department of computer science and research center St. Xavier‟s College, Palayamkottai, An Efficient Algorithm for Local Outlier Detection Using Minimum Spanning Tree, International Journal of Research and Reviews in Computer Science (IJRRCS), March 2011.
  • Knorr E & Ng R (1997) A unified approach for mining outliers. In Proc. KDD, pp. 219–222.Knorr E, & Ng R (1998) Algorithms for mining distance-based outliers in large datasets. In Proc. 24th Int. Conf. Very Large Data Bases, VLDB, pages 392–403, 24–27.
  • Loureiro A, Torgo L, & Soares C. (2004) Outlier Detection using Clustering Methods: A Data Cleaning Application, in Proceedings of KDNet Symposium on Knowledge-Based Systems.
  • Vijayarani S and Nithya S, An Efficient Clustering Algorithm for Outlier Detection”, International Journal of Computer Applications (0975 – 8887)

Abstract Views: 318

PDF Views: 0




  • Discovery of Outlier from Database using Different Clustering Algorithms

Abstract Views: 318  |  PDF Views: 0

Authors

Deepak Soni
Computer Science, B.R.C.M.
Naveen Jha
Department of Computer Science, B.R.C.M
Deepak Sinwar
Department of Computer Science, B.R.C.M

Abstract


Data mining is the process of extracting valuable and important data from a large or massive set of data. There are several techniques existing for data extraction. Clustering is one of the techniques amongst them. In clustering technique, we form the group of similar objects (similarity in terms of distance or there may be any other factor). Outlier detection is one of the major issues in data mining. Outlier represents that data which possess different behavior from others. Therefore, it is important to detect outlier from the extracted data. There are so many techniques existing to detect outlier but Clustering is one of the efficient techniques. In this paper, I have compared the result of different techniques with the result of Clustering techniques in terms of time complexity and proposed a new solution by adding fuzziness to already existing Clustering techniques.

Keywords


Data Mining, Clustering, Outlier Detection

References