Open Access Open Access  Restricted Access Subscription Access

Autism Gene Subset Selection from Microarray data – A Wrapper Approach


Affiliations
1 Department of Computer Science and Engineering, Vel Tech Rangarajan Dr. Sagunthala R&D Institute of Science and Technology, Chennai 600 062, Tamil Nadu, India
2 Department of Information Science and Technology, College of Engineering, Anna University, Guindy, Chennai 600 025, Tamil Nadu, India
 

Autism spectrum disorder is a complex neurodevelopment disorder that affects an individual's social behavior. Microarray analysis is an extensively used technique to detect autism. Microarray data can provide additional insight into the etiology of the disorder. Identifying the specific set of genes associated with autism from complex microarray data poses a significant research challenge due to its high dimensionality. However, Gene subset selection is classified as an np-hard problem that can be handled by the meta-heuristic algorithm. In this paper, a novel meta-heuristic Game Theory Based Whale Optimization Algorithm is proposed. The proposed algorithm uses a two-person zero-sum game theory and convergence parameter to increase convergence rate and avoid local optima. The performance of the proposed algorithm is tested with 23 mathematical benchmark functions and compared with other state-of-the-art algorithms. Further, the proposed algorithm is employed as a wrapper-based gene subset selection model with a support vector machine. Furthermore, the outcomes demonstrate that the gene selection model utilizing a wrapper-based approach is capable of effectively identifying a subset of autism-related genes with desirable accuracy.

Keywords

Autism spectrum disorder, Dimensionality reduction, Feature selection, Meta-heuristic, Whale optimization algorithm.
User
Notifications
Font Size

  • Selvaraj S & Natarajan J, Microarray data analysis and mining tools microarray data analysis and mining tools, Bioinformation, 6(3) (2011) 95.
  • Latkowski T & Osowski S, Data mining for feature selection in gene expression autism data, Expert Syst Appl, 42(2) (2015) 864–872, DOI:10.1016/j.eswa.2014.08.043.
  • Ke L, Li M, Wang L, Deng S, Ye J, & Yu X, Improved swarm-optimization-based filter-wrapper gene selection from microarray data for gene expression tumor classification, Pattern Anal Appl, 26(2) (2023) 455–472, DOI: 10.1007/s10044-022-01117-9.
  • Azadifar S, Rostami M, Berahmand K, Moradi P & Oussalah M, Graph-based relevancy-redundancy gene selection method for cancer diagnosis, Comput Biol Med, 147(1) (2022) 105766, DOI: 10.1016/j.compbiomed.2022.105766.
  • Aziz R M, Application of nature inspired soft computing techniques for gene selection: a novel frame work for classification of cancer, Soft Comput, 26(22) (2022) 12179–96, DOI: 10.1007/s00500-022-07032-9 6.
  • Chen B, Hong J & Wang Y, The minimum feature subset selection problem, J Comput Sci Technol, 12(2) (1997) 145–153.
  • García-Torres M, García-López F C, Melián Batista B, Moreno Pérez J A & Moreno Vega J M, Solving feature subset selection problem by a hybrid metaheuristic, Proc First Int Work hybrid Metaheuristics as part 16th Eur Conf Artif Intell, 2004, 59–68.
  • Mirjalili S & Lewis A, The whale optimization algorithm, Adv Eng Softw, 95 (2016) 51–67. 9 Ling Y, Zhou Y & Luo Q, Lévy flight trajectory-based whale optimization algorithm for global optimization, IEEE Access, 5 (2017) 6168–6186.
  • Oliva D, Abd El A M & Ella H A, Parameter estimation of photovoltaic cells using an improved chaotic whale optimization algorithm, Appl Energy, 200 (2017) 141–54, DOI: 10.1016/j.apenergy.2017.05.029.
  • Sivalingam R, Chinnamuthu S & Dash S S, A modified whale optimization algorithm-based adaptive fuzzy logic PID controller for load frequency control of autonomous power generation systems, Automatika, 58(4) (2017) 410–421, DOI: 10.1080/00051144.2018.1465688.
  • Kaur G & Arora S, Chaotic whale optimization algorithm, J Comput Des Eng, 5(3) (2018) 275–284, DOI: 10.1016/j.jcde.2017.12.006
  • Luo J & Shi B, A hybrid whale optimization algorithm based on modified differential evolution for global optimization problems, Appl Intell, 49(5) (2019) 1982–2000.
  • Alzubi R, Ramzan N & Alzoubi H, Hybrid feature selection method for autism spectrum disorder SNPs, 2017 I EEE Conf Comput Intell Bioinforma Comput Biol (IEEE) 2017, 1–7.
  • Antovski A, Kostadinovska S, Simjanoska M, Eftimov T, Ackovska N & Bogdanova A M, Data-driven autism biomarkers selection by using signal processing and machine learning techniques, Bioinformat 2019 - 10th Int Conf Bioinformat Model Methods Algorithms, Proceedings; Part 12th Int Jt Conf Biomed Eng Syst Technol (BIOSTEC)2019, 201–208.
  • Feng X, Hao X, Xin R, Gao X, Liu M, Li F, Wang Y, Shi R, Zhao S & Zhou F, Detecting methylomic biomarkers of pediatric autism in the peripheral blood leukocytes, Interdiscip Sci Comput Life Sci, 11(2) (2019) 237–246, DOI: 10.1007/s12539-019-00328-9
  • Bonilla Huerta E, Duval B & Hao J K, A hybrid LDA and genetic algorithm for gene selection and classification of microarray data, Neurocomputing, 73(13–15) (2010) 2375–2383.
  • Ferguson T S, A course in game theory, World Scientific 2020.
  • Bauso D, Game theory: Models, numerical methods and applications. Vol. 1, Foundations and Trends in Systems and Control (Now Publishers) 2014, 379–522.
  • Mafarja M, Jaber I, Ahmed S & Thaher T, Whale optimisation algorithm for high-dimensional small-instance feature selection, Int J Parallel Emergent Distrib Syst, 36(2) (2019) 1–17, DOI: 17445760.2019.1617866
  • Xue B, Zhang M & Browne W N, New fitness functions in binary particle swarm optimisation for feature selection, 2012 IEEE Congr Evol Comput (IEEE) 2012, 1–8.
  • He Q & Wang L, An effective co-evolutionary particle swarm optimization for constrained engineering design problems, Eng Appl Artif Intell, 20(1) (2007) 89–99.
  • Rashedi E, Nezamabadi-pour H & Saryazdi S, GSA: A gravitational search algorithm, Inf Sci (Ny), 179(13) (2009) 2232–2248, DOI: 10.1016/j.ins.2009.03.004.
  • Storn R & Price K, Differential Evolution - A simple evolution strategy for fast optimization, Dr Dobb’s J, 22(4) (1997) 18–24.
  • Alter M D, Kharkar R, Ramsey K E, Craig D W, Melmed R D, Grebe T A, Bay R C, Ober-Reynolds S, Kirwan J, Jones J J & Turner J B, Autism and increased paternal age related changes in global levels of gene expression regulation, PLoS One, 6(2) (2011) e16715.
  • Kuwano Y, Kamio Y, Kawai T, Katsuura S, Inada N, Takaki A & Rokutan K, Autism-associated gene expression in peripheral leucocytes commonly observed between subjects with autism and healthy women having autistic children, PLoS One, 6(9) (2011) 24732.
  • Alisch R S, Barwick B G, Chopra P, Myrick L K, Satten G A, Conneely K N & Warren S T, Age-associated DNA methylation in pediatric populations, Genome Res, 22(4) (2012) 623–632.
  • Kim S H, Kim I B, Oh D H & Ahn D H, Predicting autism spectrum disorder using blood-based gene expression signatures and machine learning, Eur Neuropsychopharmacol, 27(1) (2017) S1090.
  • Sekaran K and Sudha M, Predicting autism spectrum disorder from associative genetic markers of phenotypic groups using machine learning, J Ambient Intell Humaniz Comput, 12(3) (2021) 3257–3270.

Abstract Views: 39

PDF Views: 27




  • Autism Gene Subset Selection from Microarray data – A Wrapper Approach

Abstract Views: 39  |  PDF Views: 27

Authors

Anurekha G.
Department of Computer Science and Engineering, Vel Tech Rangarajan Dr. Sagunthala R&D Institute of Science and Technology, Chennai 600 062, Tamil Nadu, India
Geetha P.
Department of Information Science and Technology, College of Engineering, Anna University, Guindy, Chennai 600 025, Tamil Nadu, India

Abstract


Autism spectrum disorder is a complex neurodevelopment disorder that affects an individual's social behavior. Microarray analysis is an extensively used technique to detect autism. Microarray data can provide additional insight into the etiology of the disorder. Identifying the specific set of genes associated with autism from complex microarray data poses a significant research challenge due to its high dimensionality. However, Gene subset selection is classified as an np-hard problem that can be handled by the meta-heuristic algorithm. In this paper, a novel meta-heuristic Game Theory Based Whale Optimization Algorithm is proposed. The proposed algorithm uses a two-person zero-sum game theory and convergence parameter to increase convergence rate and avoid local optima. The performance of the proposed algorithm is tested with 23 mathematical benchmark functions and compared with other state-of-the-art algorithms. Further, the proposed algorithm is employed as a wrapper-based gene subset selection model with a support vector machine. Furthermore, the outcomes demonstrate that the gene selection model utilizing a wrapper-based approach is capable of effectively identifying a subset of autism-related genes with desirable accuracy.

Keywords


Autism spectrum disorder, Dimensionality reduction, Feature selection, Meta-heuristic, Whale optimization algorithm.

References