
Cancer Gene Expression Analysis Using Class Chunk


Affiliations
1 Park's College, Chinnakkarai, India
2 Dept. of Computer Science, Park's College, Chinnakkarai, India
     



Classification assigns a class label to each case in a set of unclassified cases. Both supervised and unsupervised methods are used to assign class labels. Classification is performed in two steps: learning (training) and testing. The learning step identifies class patterns from labeled transactions. In the testing step, unlabeled transactions are assigned class values by reference to the learned class patterns. Bayesian classification and decision tree classification are used for the category assignment process. An outlier is an observation that deviates so much from the other observations as to arouse suspicion. Distance-based outlier detection methods are used to find records that differ from the rest of the data set.
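As an illustration of the distance-based idea mentioned above, the following is a minimal sketch and not the method used in this work. It assumes numeric records, Euclidean distance, and a simple neighbor-count rule; the function name `distance_outliers` and the parameters `radius` and `min_neighbors` are hypothetical.

```python
import math


def distance_outliers(records, radius, min_neighbors):
    """Return indices of records that have fewer than `min_neighbors`
    other records within `radius` (Euclidean distance).

    Assumed rule for illustration only: a record with too few close
    neighbors is treated as an outlier.
    """
    outliers = []
    for i, p in enumerate(records):
        # Count the other records that lie within the given radius of p.
        neighbors = sum(
            1
            for j, q in enumerate(records)
            if j != i and math.dist(p, q) <= radius
        )
        if neighbors < min_neighbors:
            outliers.append(i)
    return outliers


# Four records close together and one far away (made-up values).
points = [(1.0, 1.0), (1.2, 0.9), (0.9, 1.1), (1.1, 1.2), (8.0, 8.0)]
print(distance_outliers(points, radius=1.0, min_neighbors=2))  # prints [4]
```

The pairwise loop is quadratic in the number of records, which is acceptable for a small example but would likely need an index structure on larger data.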
Critical nuggets are collections of records that carry domain-specific, essential information. Nuggets are also referred to as class chunks, and they are used to assign labels or categories to transactions. A domain-independent method is used to measure criticality and to reduce the search space for identifying critical nuggets. Criticality is measured over records taken together as a group from the data set or selected by attribute values. The criticality score (CR-score) indicates the effect that removing a set of nearby records has on a classification model. Three algorithms are used. The Get Nugget Score algorithm calculates the CR-score. The Find Boundary algorithm identifies the class boundary values. The Find Critical Nuggets algorithm is split into two phases to detect critical nuggets for two classes. The centroid neighborhood relationship is used to find the nuggets for the significant classes.
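The idea of scoring a group of records by how much its removal changes a classification model can be sketched as follows. This is an assumption-laden illustration, not the Get Nugget Score algorithm itself: it uses a nearest-centroid classifier as a stand-in model, and the names `centroids`, `predict`, and `removal_score` are hypothetical. The score here is simply the fraction of remaining records whose predicted label changes after the candidate group is removed from training.

```python
import math


def centroids(records, labels):
    """Compute the mean vector of each class (a nearest-centroid model)."""
    sums, counts = {}, {}
    for x, y in zip(records, labels):
        acc = sums.setdefault(y, [0.0] * len(x))
        for d, v in enumerate(x):
            acc[d] += v
        counts[y] = counts.get(y, 0) + 1
    return {y: tuple(v / counts[y] for v in acc) for y, acc in sums.items()}


def predict(model, x):
    """Return the label of the class centroid closest to x."""
    return min(model, key=lambda y: math.dist(model[y], x))


def removal_score(records, labels, candidate):
    """Fraction of the remaining records whose predicted label changes
    when the records at the `candidate` indices are removed from training.

    Illustrative stand-in for a criticality score, not the CR-score itself.
    """
    candidate = set(candidate)
    kept = [
        (x, y)
        for i, (x, y) in enumerate(zip(records, labels))
        if i not in candidate
    ]
    if not kept:
        raise ValueError("candidate set must leave at least one record")
    full = centroids(records, labels)
    reduced = centroids([x for x, _ in kept], [y for _, y in kept])
    changed = sum(1 for x, _ in kept if predict(full, x) != predict(reduced, x))
    return changed / len(kept)


# Made-up one-dimensional data with two classes.
records = [(0.0,), (1.0,), (2.0,), (10.0,), (6.0,), (7.0,), (8.0,), (4.5,)]
labels = ["A", "A", "A", "A", "B", "B", "B", "B"]

# Removing the record at 10.0 shifts the class "A" centroid and flips one prediction.
print(removal_score(records, labels, [3]))
# Removing the record at 0.0 leaves every remaining prediction unchanged.
print(removal_score(records, labels, [0]))
```

In this toy data the record at index 3 has more influence on the model than the record at index 0, which is the kind of contrast a criticality score is meant to expose.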
To support critical nuggets for multiple classes, we use an identification and classification scheme in a cancer gene expression environment. The scheme can be adapted to handle mixed attribute data values. A boundary approximation algorithm is used to reduce the detection complexity. Post-processing operations make it possible to identify classes in a multiple-data environment.

Keywords

Classification, Classification Accuracy, Class Boundary, Critical Nuggets, Outliers.

Authors

E. Kalaiarasi
Park's College, Chinnakkarai, India
A. Boopathybabu
Dept. of Computer Science, Park's College, Chinnakkarai, India
