Open Access Open Access  Restricted Access Subscription Access

Metabolic Pathway of Hereditary Cancer Disease from PPI-network of DEGS Detected using Mean-of-Mean Method


Affiliations
1 Department of Electronics and Communication Engineering, Kalyani Government Engineering College, Kalyani-741235, Nadia, West Bengal, India
 

Uncontrolled growth of cells often results Cancer disease in human body. When it is eventually transmitted throughout the generations in a family, it is referred to as hereditary diseases. Metabolic pathway explains several chemical reactions occurred for growth of a disease and KEGG pathway analysis identifies key genes involved with that disease. At first, differentially expressed genes (DEGs) are detected from cancer gene microarray time series datasets using a new and simple method, Mean of Mean (MoM). The MoM concept is developed from Gregor Johann Mendel's First Law of Heredity or Segregation rule of heredity. Highly expressed (HG) and lowly expressed (LG) genes in two different groups of microarray dataset are identified first using MoM. Then DEGs are found by implementing intersection operation between HG and LG genes. Performance of MoM method is analyzed by Support Vector Machine classifiers (SVMs) on some binary class cancer microarray data samples. Then all results are compared with the performance of other statistical parametric and nonparametric hypothetic tests. It is noticed that performance ofMoMis better than other statistical methods in almost all data sets. Finally, Protein-Protein Interaction Networks (PPINs) are constructed within identified DEGs using web based tool. Lastly, KEGG pathway analysis is performed for all proteins involved in PPINs to obtain list of key genes for growth of cancer disease.

Keywords

Hereditary Disease, Cancer, Mean of Mean (MoM), DEGs, SVM Classifier, PPI-Networks, Metabolic Pathway, KEGG Pathway.
User
Notifications
Font Size


  • Bandyopadhyay, S., Mallik, S. and Mukhopadhyay, A., A Survey and Comparative Study of Statistical Tests for Identifying Differential Expression from Microarray Data, IEEE/ACM Transactions on Computational Biology and Bioinformatics, IEEE ACM, Vol. 11, No.1, 2013. DOI: 10.1109/TCBB.2013.147.
  • Burges, C.J.C., A Tutorial on Support Vector Machines for Pattern Recognition, Data Mining and Knowledge Discovery, Vol. 2, pp.121–167, 1998.
  • Chen, Z., Liu, J., Ng, H.K.T., Nadarajah, S., Kaufman, H.L., Yang, J.Y. and Deng, Y., Statistical Methods on Detecting Differentially Expressed Genes for RNA-Seq Data, BMC Systems Biology, Vol. 5, No.S1, 2011. DOI: 10.1186/1752-0509-5-S3-S1.
  • Dudoit, S., Yang, Y.H. Callow, M.J. and Speed, T.P. , Stat ist ical Methods for Identifying Differentially Expressed Genes in Replicated cDNA Microarray Experiments, Statistica Sinica, Vol. 12, No.1, pp.111-139, 2002.
  • Ma, P., Zhong, W. and Liu, J.S., Identifying Differentially Expressed Genes in Time Course Microarray Data, Statistics in Biosciences, Vol. 1, No.144, 2009.
  • Heydebreck,A.V., Statistical Tests for Differential Gene Expression,Technical Report.
  • Lin, C.J., Support Vector Machines for Data Classification, Technical Report, National Taiwan University, February 9, 2004.
  • Markowetz, F., Classification by Support Vector Machines, Technical Report, Max-Planck-Institute for Molecular Genetics, Berlin, 2003.
  • Moore, A.W., Cross-Validation for Detecting and Preventing Overfitting, Technical report, Carnegie Mellon University. http://www.cs.cmu.edu/~awm/tutorials.
  • Mukherjee, S., Classifying Microarray Data Using Support Vector Machines, In: Berrar, D.P., Dubitzky,W. and Granzow, M. (Eds.) A Practical Approach to Microarray Data Analysis, Springer. DOI: 10.1007/0-306-47815-3_9
  • Ospina, L. and Kleine, L.L., Identification of Differentially Expressed Genes in Microarray Data in a Principal Component Space, Springer Plus, Vol. 2, No.60, 2013.
  • Rakotomamonjy, A., Support Vector Machines andArea under Roc Curve, September 1, 2004.
  • Smyth, G.K., Linear Models and Empirical Bayes Methods for Assessing Differential Expression in Microarray Experiments, Statistical Applications in Genetics and Molecular Biology,Vol. 3, 2004.
  • Soneson, C. and Delorenzi, M.,AComparison of Methods for Differential Expression Analysis of RNA-Seq Data, BMC Bioinformatics, Vol. 14, 2013.
  • Tan, Y.D., Fornage, M. and Fu Y.X., Ranking Analysis of Microarray Data: A Powerful Method for Identifying Differentially Expressed Genes, Genomics, Vol. 88, No.6, pp.846-854, 2006.
  • Vapnik, V.N., An Overview of Statistical Learning Theory, IEEE Transactions on Neural Networks, Vol. 10, 1999.
  • Wang, J., Data Warehousing and Mining: Concepts, Methodologies, Tools, and Applications, Technical Report, Montclair State University, USA.
  • Zhu, W., Zeng, N. and Wang, N., Sensitivity, Specificity, Accuracy, Associated Confidence Interval and Roc Analysis with Practical Implementat ions, Heal th Care and Life Sciences, Proceedings ofNESUG2010, 2010.
  • Barman, B. and Mukhopadhyay, A., Extracting Biological Significant Subnetworks from ProteinProtein Interactions Induced by Differentially Expressed Genes of HIV-1 Vpr Variants, International Journal of System Dynamics Applications, Vol. 4, No.4, pp. 35-51, 2015.
  • Biswas, P. and Barman, B., An Approach to Identify Gene Markers Relating to Viral Carcinogenesis Using Data Mining Tools, Indian Science Cruiser, Vol. 33, No.2, pp.24-32, 2019. DOI: 10.24906/isc/2019/v33/i2/183890.
  • Barman, B. and Mukhopadhyay, A., Detection of Differentially Expressed Genes in Wild Type HIV-1 Vpr and Two HIV-1 Mutant Vprs, FICTA, Vol. 1, pp.597-604, 2014

Abstract Views: 569

PDF Views: 213




  • Metabolic Pathway of Hereditary Cancer Disease from PPI-network of DEGS Detected using Mean-of-Mean Method

Abstract Views: 569  |  PDF Views: 213

Authors

Bandana Barman
Department of Electronics and Communication Engineering, Kalyani Government Engineering College, Kalyani-741235, Nadia, West Bengal, India

Abstract


Uncontrolled growth of cells often results Cancer disease in human body. When it is eventually transmitted throughout the generations in a family, it is referred to as hereditary diseases. Metabolic pathway explains several chemical reactions occurred for growth of a disease and KEGG pathway analysis identifies key genes involved with that disease. At first, differentially expressed genes (DEGs) are detected from cancer gene microarray time series datasets using a new and simple method, Mean of Mean (MoM). The MoM concept is developed from Gregor Johann Mendel's First Law of Heredity or Segregation rule of heredity. Highly expressed (HG) and lowly expressed (LG) genes in two different groups of microarray dataset are identified first using MoM. Then DEGs are found by implementing intersection operation between HG and LG genes. Performance of MoM method is analyzed by Support Vector Machine classifiers (SVMs) on some binary class cancer microarray data samples. Then all results are compared with the performance of other statistical parametric and nonparametric hypothetic tests. It is noticed that performance ofMoMis better than other statistical methods in almost all data sets. Finally, Protein-Protein Interaction Networks (PPINs) are constructed within identified DEGs using web based tool. Lastly, KEGG pathway analysis is performed for all proteins involved in PPINs to obtain list of key genes for growth of cancer disease.

Keywords


Hereditary Disease, Cancer, Mean of Mean (MoM), DEGs, SVM Classifier, PPI-Networks, Metabolic Pathway, KEGG Pathway.

References





DOI: https://doi.org/10.21843/reas%2F2020%2F64-80%2F209273