Open Access Open Access  Restricted Access Subscription Access
Open Access Open Access Open Access  Restricted Access Restricted Access Subscription Access

Particle Swarm Optimization for Triclustering High Dimensional Microarray Gene Expression Data


Affiliations
1 Anna University, Chennai, India
2 Department of Computer Science and Engineering, Bannari Amman Institute of Technology, Sathyamangalam, India
     

   Subscribe/Renew Journal


The study of high dimensional microarray gene expression data represents the large computational challenge due to its huge volume of the data. Many clustering techniques are applied to extract the coexpressed genes over the samples. Biclustering improved the traditional clustering by grouping the genes that similarly expressed over only a subset of samples. However, to cluster the high dimensional data with three dimensions such as genes, samples and time points, Triclustering technique is employed for grouping the coexpressed genes over a subset of samples under a subset of time points which imposes huge computational burden. In this paper, Particle Swarm Optimization technique is applied to extract the triclusters from the high dimensional data with objective function as Mean Square Residue. The algorithm is applied to three real life microarray gene expression data and the performance of the work is analyzed using the objective function. The biological significances of the extracted triclusters from all the three datasets are also analyzed. The biological significance analysis are also compared with other triclustering algorithms and the proposed work outperforms the other algorithms.

Keywords

Particle Swarm Optimization, Triclustering, High Dimensional Data, Microarray Gene Expression Data, Mean Square Residue.
Subscription Login to verify subscription
User
Notifications
Font Size


  • Gunavathi C, Premalatha K, Sivasubramanian K. A Survey on Feature Selection Methods in Microarray Gene Expression Data for Cancer Classification. Research J. Pharm. and Tech. 2017; 10(5): 1395-1401.
  • Nallakaruppan M. K, P. Ilango, N. Deepa, Anand Muthukumarappan. Clustering of Wireless Sensor Network Data. Research J. Pharm. and Tech. 2017; 10(1): 73-82.
  • Senthil Kumar T, Boselin Prabhu S R, S. Sophia. The Impact of Clustering Mechanism in Dense Wireless Sensor Network. Research J. Engineering and Tech. 7(1): 19-23.
  • Meenakshi K, Safa M, Karthick T, Sivaranjani N. A Novel Study of Machine Learning Algorithms for Classifying Health Care Data. Research J. Pharm. and Tech. 2017; 10(5): 1429-1432.
  • Rakesh Kumar Soni, Shrikant B. Burje. Tumor Detection Algorithm Using Boundary Based Approach. Int. J. Tech. 1(2): 101-106.
  • B. Lokapriya Nandan, Velin Marita Sequeira, Kanika Verma, K. Ramanathan. Exploring beta-tubulin-protein interactome using computational techniques. Research J. Pharm. and Tech. 8(12): 1679-1684.
  • Rajeshri Lanjewar, Tripti Sharma. A Determining Approach for Customer Behavior Analysis Using K-Mean Clustering. Int. J. Tech. 2011; 1(2): 125-129.
  • Santanu Halder, Abul Hasnat, Debotosh Bhattacharjee, Mita Nasipuri. A Novel Low Space Image Storing and Reconstruction Method by K-Means Clustering Algorithm. Int. J. Tech. 2014; 4(1): 186-196.
  • Cheng Y and Church GM. Biclustering of expression data. Proceedings of International Conference on Intelligent Systems Molecular Biology, 2000; 93-103.
  • Kumaran U, Neelu Khare, A Sai Suraj. Privacy Preserving in Data Mining Technical: A Review. Research J. Pharm. and Tech 2016; 9(11): 2023-2026.
  • Zhao L and Zaki MJ. TRICLUSTER: an effective algorithm for mining coherent clusters in 3D microarray data. Proceedings of the 2005 ACM SIGMOD international conference on Management of data. 2015; 694–705.
  • Feng J, Barbano PE and Mishra B. Time-frequency feature detection for timecourse microarray data. Proceedings of the ACM Symposium on Applied Computing. 2004; 128-132.
  • Zhao L and Zaki MJ. TRICLUSTER: an effective algorithm for mining coherent clusters in 3D microarray data. Proceedings of the 2005 ACM SIGMOD international conference on Management of data. 2015; 694–705.
  • Jiang H. Zhou S, Guan J and Zheng Y. gTRICLUSTER: A More General and Effective 3D Clustering Algorithm for Gene-Sample-Time Microarray Data. Data Mining for Biomedical Applications.2006; 3916: 48–59.
  • Bhar A, Haubrock M, Mukhopadhyay A, Maulik U, Bandyopadhyay S and Wingender E. δ-TRIMAX: Extracting triclusters and analyzing coregulation in time series gene expression data. Springer. 2012; 7534: 165–177.
  • Aviles DG, Escudero CR, Alvarez FM and Riquelme JC. TriGen: A genetic algorithm to mine triclusters in temporal gene expression data. Neurocomputing. 2014; 132: 42–53.
  • Bhar A, Haubrock M, Mukhopadhyay A, Maulik U, Bandyopadhyay S and Wingender E. Multiobjective triclustering of time-series transcriptome data reveals key genes of biological processes. BMC Bioinformatics. 2015; 16.
  • G. Manikandan. A Comparative Analysis on the Applicability of Various Mutation Types for Achieving Privacy in medical Data Mining. Research J. Pharm. and Tech. 2017; 10(8): 2451-2455.
  • Kennedy J and Eberhart RC. Particle Swarm Optimization. Proceedings of the 1995 IEEE International Conference on Neural Networks. 1995; 1942-1948.
  • Kennedy J and Eberhart RC. Swarm Intelligence. Academic Press, London. 2001.
  • Eberhart RC, Simpson P and Dobbins R. Computational Intelligence PC Tools, Academic Press. 1996.
  • Eberhart, R., Kennedy, J., “A new optimizer using particle swarm theory”, MHS’95. Proceedings of the Sixth International Symposium on MicroMachine and Human Science. pp. 39–43, 1995.
  • Shi Y, Eberhart R. A modified particle swarm optimizer. 1998 IEEE International Conference on Evolutionary Computation Proceedings, IEEE World Congress on Computational Intelligence. 1998; 69–73.
  • Swathypriyadharsini P and Premalatha k. TrioCuckoo: A Multi Objective Cuckoo Search Algorithm for Triclustering Microarray Gene Expression Data. Journal of Information Science and Engineering. 2018; 34(6): 1617-1631.
  • Bhar, A., Haubrock, M., Mukhopadhyay, A., Maulik, U., Bandyopadhyay, S. and Wingender, E., “Coexpression and coregulation analysis of time-series gene expression data in estrogen-induced breast cancer cell,” Algorithms Mol Biol., Vol. 8, 2013.
  • Huang DW, Sherman BT and Lempicki RA. Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nature Protocols. 2009; 4(1): 44-57.

Abstract Views: 233

PDF Views: 0




  • Particle Swarm Optimization for Triclustering High Dimensional Microarray Gene Expression Data

Abstract Views: 233  |  PDF Views: 0

Authors

P. Swathypriyadharsini
Anna University, Chennai, India
K. Premalatha
Department of Computer Science and Engineering, Bannari Amman Institute of Technology, Sathyamangalam, India

Abstract


The study of high dimensional microarray gene expression data represents the large computational challenge due to its huge volume of the data. Many clustering techniques are applied to extract the coexpressed genes over the samples. Biclustering improved the traditional clustering by grouping the genes that similarly expressed over only a subset of samples. However, to cluster the high dimensional data with three dimensions such as genes, samples and time points, Triclustering technique is employed for grouping the coexpressed genes over a subset of samples under a subset of time points which imposes huge computational burden. In this paper, Particle Swarm Optimization technique is applied to extract the triclusters from the high dimensional data with objective function as Mean Square Residue. The algorithm is applied to three real life microarray gene expression data and the performance of the work is analyzed using the objective function. The biological significances of the extracted triclusters from all the three datasets are also analyzed. The biological significance analysis are also compared with other triclustering algorithms and the proposed work outperforms the other algorithms.

Keywords


Particle Swarm Optimization, Triclustering, High Dimensional Data, Microarray Gene Expression Data, Mean Square Residue.

References