The PDF file you selected should load here if your Web browser has a PDF reader plug-in installed (for example, a recent version of Adobe Acrobat Reader).

If you would like more information about how to print, save, and work with PDFs, Highwire Press provides a helpful Frequently Asked Questions about PDFs.

Alternatively, you can download the PDF file directly to your computer, from where it can be opened using a PDF reader. To download the PDF, click the Download link above.

Fullscreen Fullscreen Off


The researchers of the data mining domain presume that the study of traditional clustering techniques is saturating day by day. But, a deep insight into those techniques unfolds many silhouettes which could lead to many more applications in diverged domains. In clustering, the attributes of the data provide the information needed for data segregation. There may exist some real world data with less number of attributes but more information contained in them and may be of interest for some applications. Because of less number of attributes, the data may not be well separated by any of the clustering techniques. Data expansion techniques are methods for constructing more number of attributes from less number of attributes. With the application of these techniques, an expanded data set may be reconstructed from a given data set during data preprocessing. The current work pronounces the fact that, the expanded data at times yield better clustering results than the real data. This paper is an attempt to empirically evaluate and analyze the effects of data expansion on clustering results where validity of the results are established through internal indexing techniques and probabilistic validation measures.

Keywords

Cluster Analysis, Cluster Validity, Data Expansion, Internal Indexing, Probabilistic Measures
User