Enhancing the Performance of Hybrid Clustering of Documents Using Artificial Neural Network Based Approach
Subscribe/Renew Journal
Clustering and classification have been useful and active areas of machine learning research that promise to help us cope with the problem of information overload on the Internet. BIRCH is a clustering algorithm designed to operate under the assumption "the amount of memory available is limited, whereas the dataset can be arbitrary large". The algorithm generates "a compact dataset summary" minimizing the I/O cost involved .An application of k-means requires an initial partition to be supplied as an input. To generate a "good" initial partition of the "summaries" a clustering algorithm, PDDP can be used. Also we compare the performance of traditional K-Means algorithm with a new artificial neural network based clustering method. Experimental results show that the new method works more accurately than K-Means.
Keywords
Abstract Views: 212
PDF Views: 1