
An Improved Clustering Technique Based on Statistical Model Preprocessing Using Gene Expression Data


Affiliations
1 Sri Ramakrishna College of Arts and Science for Women, Coimbatore-44, India
2 Department of Computer Science, Cherran College for Women, India
     



Microarrays have become effective, widely used tools in biological and medical research, addressing a wide range of problems including the classification of disease subtypes and tumors. Many statistical methods are available for analyzing and organizing these complex data into meaningful information, and one of the main goals in analyzing gene expression data is to detect samples or genes with similar expression patterns. This work compares the performance of several feature selection methods based on data preprocessing, including normalization and data reduction strategies, and proposes a new classical statistical technique for preprocessing. A clustering technique is then applied to the preprocessed data, with promising results. The work also shows that choosing a good preprocessing technique prior to clustering improves clustering performance. In comparison with previous work, the reported results were the best.
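The preprocess-then-cluster workflow described above can be illustrated with a minimal sketch. The abstract does not name the specific methods used, so everything here is an assumption made for illustration: a log2 transform stands in for normalization, a top-variance gene filter stands in for feature selection, and plain k-means stands in for the clustering step. It is not the statistical technique the authors propose, and the function names and toy data are hypothetical.

```python
import math


def log_transform(matrix):
    """Apply log2(x + 1) to every expression value (assumed normalization)."""
    return [[math.log2(v + 1.0) for v in row] for row in matrix]


def select_top_variance_genes(matrix, n_keep):
    """Return sorted column indices of the n_keep highest-variance genes."""
    n_samples = len(matrix)
    variances = []
    for g in range(len(matrix[0])):
        col = [row[g] for row in matrix]
        mean = sum(col) / n_samples
        variances.append(sum((v - mean) ** 2 for v in col) / n_samples)
    ranked = sorted(range(len(variances)), key=lambda g: variances[g], reverse=True)
    return sorted(ranked[:n_keep])


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans(points, k, n_iter=50):
    """Lloyd's k-means with deterministic farthest-first initialization."""
    centroids = [list(points[0])]
    while len(centroids) < k:
        far = max(points, key=lambda p: min(sq_dist(p, c) for c in centroids))
        centroids.append(list(far))
    labels = None
    for _ in range(n_iter):
        new_labels = [
            min(range(k), key=lambda c: sq_dist(p, centroids[c])) for p in points
        ]
        if new_labels == labels:  # assignments stable: converged
            break
        labels = new_labels
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:  # keep the old centroid if a cluster empties
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels


def preprocess_and_cluster(matrix, n_genes, k):
    """Normalize, keep the most variable genes, then cluster the samples."""
    normalized = log_transform(matrix)
    keep = select_top_variance_genes(normalized, n_genes)
    reduced = [[row[g] for g in keep] for row in normalized]
    return kmeans(reduced, k)


# Toy data (hypothetical): rows are samples, columns are genes.
samples = [
    [10, 200, 5, 50],
    [12, 210, 5, 52],
    [300, 8, 5, 49],
    [290, 9, 5, 51],
]
labels = preprocess_and_cluster(samples, n_genes=2, k=2)
```

In this toy matrix the first two genes separate the samples into two groups while the last two are nearly constant, so the variance filter discards the uninformative genes before clustering. That is the kind of effect the abstract attributes to good preprocessing, though the sketch says nothing about how large the effect is on real data.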

Keywords

Clustering, Feature Selection, Gene Expression.

Authors

R. Mallika
Sri Ramakrishna College of Arts and Science for Women, Coimbatore-44, India
G. Selvanayaki
Department of Computer Science, Cherran College for Women, India
