Open Access Open Access  Restricted Access Subscription Access
Open Access Open Access Open Access  Restricted Access Restricted Access Subscription Access

Gene Selection And Modified Long Short Term Memorynetworkbased Lung Cancer Classification Using Gene Expression Data


Affiliations
1 School of Computer Studies, Rathnavel Subramaniam College of Arts and Science, India
     

   Subscribe/Renew Journal


Lung cancer is one of the fatal forms of cancer worldwide. Genetic variability has been identified as influencing a person vulnerability to lung cancer in epidemiologic research. A new study undertaken by a team of experts from the United States National Cancer Institute on 14,000 Asian women discovered that Asian women, regardless of whether they smoke or not, are more likely to acquire cancer owing to genetic abnormalities. Early detection of this lethal disease is a novel clinical application of microarray data. Recent research establishes a model for the early diagnosis of lung cancer. Additionally, multilayer perceptron, random subspace, and Sequential Minimal Optimization (SMO) approaches are used for classification. While information acquisition is typically a good indicator of an attribute significance, it is not perfect. A noticeable issue develops when knowledge gain is applied to qualities that might take on many distinct values. This paper provides an efficient gene selection model based on the Improved Whale Optimization Algorithm (IWOA) to address these concerns. It saves time and identifies relevant genes from gene expression data, increasing lung cancer categorization accuracy. Then, a Modified Long Short-Term Memory (MLSTM) Network is used to classify lung cancer. It accepts specified genes as inputs and determines which class they belong to, such as lung cancer or normal subjects. As demonstrated by empirical observations, the suggested model is effective in precision, recall, accuracy, and f–measure.

Keywords

Lung Cancer, Early Stage, Developing Cancer, Genetic Variations, Feature Selection, Information Gain Attribute, Whale Optimization, Long Short Term Memory
Subscription Login to verify subscription
User
Notifications
Font Size

  • J.X. Liu, Y. Xu, C.H. Zheng and Z.H. Lai, “RPCA-Based Tumor Classification using Gene Expression Data”, IEEE/ACM Transactions on Computational Biology and Bioinformatics, Vol. 12, No. 4, pp. 964-970, 2014.
  • M. Hosni, J.M. Carrillo-De-Gea, A. Idri and J.L. FernandezAleman, “A Mapping Study of Ensemble Classification Methods in Lung Cancer Decision Support Systems”, Proceedings of International Conference on Medical and Biological Engineering and Computing, pp.1-17, 2020.
  • P. Tripathi, S. Tyagi and M. Nath, “A Comparative Analysis of Segmentation Techniques for Lung Cancer Detection”, Pattern Recognition and Image Analysis, Vol. 29, No. 1, pp. 167-173, 2019.
  • H. Azzawi, J. Hou and R. Alanni, “Multiclass Lung Cancer Diagnosis by Gene Expression Programming and Microarray Datasets, Proceedings of International Conference on Advanced Data Mining and Applications, pp. 541-553, 2017.
  • S. Vanjimalar, D. Ramyachitra and P. Manikandan, “A Review on Feature Selection Techniques for Gene Expression Data”, Proceedings of IEEE International Conference on Computational Intelligence and Computing Research, pp. 1-4, 2018.
  • J.C. Ang, A. Mirzal, H. Haron and H.N.A. Hamed, “Supervised, Unsupervised, and Semi-Supervised Feature Selection: A Review on Gene Selection”, IEEE/ACM Transactions on Computational Biology and Bioinformatics, Vol. 13, No. 5, pp. 971-989, 2015.
  • C.S. Seah, S. Kasim, M.F.M. Fudzee and M.A. Ismail, “An Enhanced Topologically Significant Directed Random Walk in Cancer Classification using Gene Expression Datasets”, Saudi Journal of Biological Sciences, Vol. 24, No. 8, pp. 1828-1841, 2017.
  • H. Salem, G. Attiya and N. El-Fishawy, “Classification of Human Cancer Diseases by Gene Expression Profiles”, Applied Soft Computing, Vol. 50, pp. 124-134, 2017.
  • H. Motieghader, A. Najafi, B. Sadeghi and A. MasoudiNejad, “A Hybrid Gene Selection Algorithm for Microarray Cancer Classification using Genetic Algorithm and Learning Automata”, Informatics in Medicine Unlocked, Vol. 9, pp. 246-254, 2017.
  • H. Lu, J. Chen, K. Yan and Z. Gao, “A Hybrid Feature Selection Algorithm for Gene Expression Data Classification”, Neurocomputing, Vol. 256, pp. 56-62, 2017.
  • A.H. Berger, A.N. Brooks and X. Wu, “High-Throughput Phenotyping of Lung Cancer Somatic Mutations”, Cancer Cell, Vol. 30, No. 2, pp. 214-228, 2016.
  • H.M. Alshamlan and G.H. Badr, “Genetic Bee Colony (GBC) Algorithm: A New Gene Selection Method for Microarray Cancer Classification”, Computational Biology and Chemistry, Vol. 56, pp. 49-60, 2015.
  • B. Ghaddar and J. Naoum-Sawaya, “High Dimensional Data Classification and Feature Selection using Support Vector Machines”, European Journal of Operational Research, Vol. 265, No. 3, pp. 993-1004, 2018.
  • I. Jain, V.K. Jain and R. Jain, “Correlation Feature Selection based Improved-Binary Particle Swarm Optimization for Gene Selection and Cancer Classification”, Applied Soft Computing, Vol. 62, pp. 203-215, 2018.
  • M.M. Ahmed, E.H. Houssein and E. Hassanien, “Maximizing Lifetime of Wireless Sensor Networks based on Whale Optimization Algorithm”, Proceedings of International Conference on Advanced Intelligent Systems and Informatics, pp. 724-733, 2017.
  • M. Sharawi, H.M. Zawbaa and E. Emary, “Feature Selection Approach based on Whale Optimization Algorithm”, Proceedings of International Conference on Advanced Computational Intelligence, pp. 163-168, 2017.
  • F.S. Gharehchopogh and H. Gholizadeh, “A Comprehensive Survey: Whale Optimization Algorithm and its Applications”, Swarm and Evolutionary Computation, Vol. 48, pp. 1-24, 2019.
  • Y. Ling, Y. Zhou and Q. Luo, “Levy Flight TrajectoryBased Whale Optimization Algorithm for Global Optimization”, IEEE Access, Vol. 5, pp. 6168-6186, 2017.
  • M.M. Mafarja and S. Mirjalili, “Hybrid Whale Optimization Algorithm with Simulated Annealing for Feature Selection”, Neurocomputing, Vol. 260, pp. 302-312, 2017.
  • A.G. Hussien, A.E. Hassanien and E.H. Houssein, “SShaped Binary Whale Optimization Algorithm for Feature Selection”, Proceedings of Recent Trends in Signal and Image Processing, pp. 79-87, 2019.
  • P. Rodriguez, G. Cucurull, T.B. Moeslund and F.X. Roca, “Deep Pain: Exploiting Long Short-Term Memory Networks for Facial Expression Classification”, IEEE Transactions on Cybernetics, Vol. 23, No. 1, pp. 1-14, 2017.
  • L. Yu, J. Chen, G. Ding and J. Sun, “Spectrum Prediction based on Taguchi Method in Deep Learning with Long Short-Term Memory”, IEEE Access, Vol. 6, pp. 4592345933, 2018.
  • Y. Tian and L. Pan, “Predicting Short-Term Traffic Flow by Long Short-Term Memory Recurrent Neural Network”, Proceedings of IEEE International Conference on Smart City/SocialCom/SustainCom, pp. 153-158, 2015.
  • X. Yuan, L. Li and Y. Wang, “Nonlinear Dynamic Soft Sensor Modeling with Supervised Long Short-Term Memory Network”, IEEE Transactions on Industrial Informatics, Vol. 16, No. 5, pp. 3168-3176, 2019.
  • W. Lee, K. Kim, J. Park and Y. Kim, “Forecasting Solar Power using Long-Short Term Memory and Convolutional Neural Networks”, IEEE Access, Vol. 6, pp. 73068-73080, 2018.
  • J. Kim and H. Kim, “An Effective Intrusion Detection Classifier using Long Short-Term Memory with Gradient Descent Optimization”, Proceedings of International Conference on Platform Technology and Service, pp. 1-6, 2017.
  • X. Li and X. Wu, “Constructing Long Short-Term Memory based Deep Recurrent Neural Networks for Large Vocabulary Speech Recognition”, Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 4520-4524, 2015.

Abstract Views: 128

PDF Views: 1




  • Gene Selection And Modified Long Short Term Memorynetworkbased Lung Cancer Classification Using Gene Expression Data

Abstract Views: 128  |  PDF Views: 1

Authors

V. Yuvaraj
School of Computer Studies, Rathnavel Subramaniam College of Arts and Science, India
G. Pandiyan
School of Computer Studies, Rathnavel Subramaniam College of Arts and Science, India
G. Purusothaman
School of Computer Studies, Rathnavel Subramaniam College of Arts and Science, India

Abstract


Lung cancer is one of the fatal forms of cancer worldwide. Genetic variability has been identified as influencing a person vulnerability to lung cancer in epidemiologic research. A new study undertaken by a team of experts from the United States National Cancer Institute on 14,000 Asian women discovered that Asian women, regardless of whether they smoke or not, are more likely to acquire cancer owing to genetic abnormalities. Early detection of this lethal disease is a novel clinical application of microarray data. Recent research establishes a model for the early diagnosis of lung cancer. Additionally, multilayer perceptron, random subspace, and Sequential Minimal Optimization (SMO) approaches are used for classification. While information acquisition is typically a good indicator of an attribute significance, it is not perfect. A noticeable issue develops when knowledge gain is applied to qualities that might take on many distinct values. This paper provides an efficient gene selection model based on the Improved Whale Optimization Algorithm (IWOA) to address these concerns. It saves time and identifies relevant genes from gene expression data, increasing lung cancer categorization accuracy. Then, a Modified Long Short-Term Memory (MLSTM) Network is used to classify lung cancer. It accepts specified genes as inputs and determines which class they belong to, such as lung cancer or normal subjects. As demonstrated by empirical observations, the suggested model is effective in precision, recall, accuracy, and f–measure.

Keywords


Lung Cancer, Early Stage, Developing Cancer, Genetic Variations, Feature Selection, Information Gain Attribute, Whale Optimization, Long Short Term Memory

References