Open Access
Subscription Access
Open Access
Subscription Access
Cancer Gene Expression Analysis Using Class Chunk
Subscribe/Renew Journal
Classification consists of assigning a class label to a set of unclassified cases. Supervised and unsupervised classification methods are used to assign class labels. Classification is performed in two steps learning or training and testing. Learning process is used to identify the class patterns from the labeled transactions. In training phase unlabeled transactions are assigned with the class values with reference to the learned class patterns. Bayesian classification and decision tree classification methods are used for the category assignment process. An outlier is a comment that varies so much from other comments as to produce suspicions. Distance based outlier detection methods are used to find records that are differ from the rest of the data set.
Critical nuggets are collections of records that have domain-specific with essential information. Nuggets are referred as class chunks. Nuggets are used to perform label or category assignment to transactions. Domains independent method is used to measure criticality and reduce the find space for identify critical nuggets. Criticality measure is the records that detached together from the data set or from values of attributes. Criticality Score (CR-score) indicates the outcome of removing a nearby data's on a classification model. Here we are using three kinds of algorithms. One is Get Nugget Score algorithm is used to calculate the CR-score value. Second is Find boundary algorithm is used to identify the class boundary values. Third is Find critical nuggets algorithm. We can split Find critical nuggets algorithm into two phases to detect critical nuggets for two classes. The centroid neighborhood relationship is used to find the nuggets for the significant classes.
To support this critical nugget for multiple classes we use identification and classification scheme under cancer gene expression environment. The scheme can be accepted to handle mixed attribute data values. To reduce the detection complexity we use the boundary approximation algorithm. With the help of Post processing operations we can able to identify class in multiple data environment.
Critical nuggets are collections of records that have domain-specific with essential information. Nuggets are referred as class chunks. Nuggets are used to perform label or category assignment to transactions. Domains independent method is used to measure criticality and reduce the find space for identify critical nuggets. Criticality measure is the records that detached together from the data set or from values of attributes. Criticality Score (CR-score) indicates the outcome of removing a nearby data's on a classification model. Here we are using three kinds of algorithms. One is Get Nugget Score algorithm is used to calculate the CR-score value. Second is Find boundary algorithm is used to identify the class boundary values. Third is Find critical nuggets algorithm. We can split Find critical nuggets algorithm into two phases to detect critical nuggets for two classes. The centroid neighborhood relationship is used to find the nuggets for the significant classes.
To support this critical nugget for multiple classes we use identification and classification scheme under cancer gene expression environment. The scheme can be accepted to handle mixed attribute data values. To reduce the detection complexity we use the boundary approximation algorithm. With the help of Post processing operations we can able to identify class in multiple data environment.
Keywords
Classification, Classification Accuracy, Class Boundary, Critical Nuggets, Outliers.
User
Subscription
Login to verify subscription
Font Size
Information
Abstract Views: 272
PDF Views: 2