Open Access
Subscription Access
Open Access
Subscription Access
Feature Selection for Text Clustering and Classification
Subscribe/Renew Journal
The quality of the data is one of the most important factors influencing the performance of any classification or clustering algorithm. The attributes defining the feature space of a given data set can often be inadequate, which make it difficult to discover useful information or desired output. However, even when the original attributes are individually inadequate, it is often possible to combine such attributes in order to construct new ones with greater predictive power. Feature selection, as a preprocessing step to machine learning, has been very effective in reducing dimensionality, removing irrelevant data, and noise from data to improving result comprehensibility. This paper addresses the task of feature selection for clustering and classification. Here we give a comparative study of variety of classification methods, including Naive Bayes, J48 etc.
Keywords
Classification, Clustering, Feature Selection, Machine Learning.
User
Subscription
Login to verify subscription
Font Size
Information
Abstract Views: 234
PDF Views: 2