Open Access Open Access  Restricted Access Subscription Access

Optimal Feature Subset Selection Using Cuckoo Search on IoT Network


Affiliations
1 Department of Informatics Research, Electronics Research Institute, Cairo, Egypt
2 Department of InformaticsResearch, ElectronicsResearch Institute, Cairo, Egypt
 

The Internet of Things (IoT) became the basic axis in the information and network technology to create a smart environment. To build such an environment; it needs to use some IoT simulators such as Cooja Simulator. Cooja simulator creates an IoT environment and produces an IoT routing dataset that contains normal and malicious motes. The IoT routing dataset may have redundant and noisy features. The feature selection can affect on the performance metrics of the learning model. The feature selection can reduce complexity and over-fitting problem. There are many approaches for feature selection especially meta-heuristic algorithms such as Cuckoo search (CS). This paper presented a proposed model for feature selection that is built on using a standard cuckoo search algorithm to select near-optimal or optimal features. A proposed model may modify the CS algorithm which has implemented using Dagging with base learner Bayesian Logistic Regression (BLR). It increases the speed of the CS algorithm and improves the performance of BLR. Support Vector Machine (SVM), Deep learning, and FURIA algorithms are used as classification techniques used to evaluate the performance metrics. The results have demonstrated that the algorithm proposed is more effective and competitive in terms of performance of classification and dimensionality reduction. It achieved high accuracy that is near to 98 % and low error that is about 1.5%.

Keywords

IoT, Feature Selection, Meta-heuristic, Cuckoo Search Algorithm, Deep Learning.
User
Notifications
Font Size

  • G. Al-Rawashdeh, R. Mamat, and N. HafhizahBintiAbd Rahim, Hybrid Water Cycle Optimization Algorithm with Simulated Annealing for Spam E-mail Detection, IEEE Access, 7, 2019, 143721 – 143734. DOI: 10.1109/ACCESS.2019.2944089
  • A. Verma and V. Ranga, Analysis of Routing Attacks on RPL based 6LoWPAN Networks, International Journal of Grid and Distributed Computing, 2018, 11(8), 2018, 43-56. DOI: 10.14257/ijgdc.2018.11.8.05.
  • A. Mayzaud, R. Badonnel, and I. Chrisment, A taxonomy of attacks in RPL-based internet of things, International Journal of Network Security, 18(3), 2016, 459-473.
  • J. C. Ang, A. Mirzal, H. Haron, and H. N. A. Hamed, Supervised, unsupervised, and semi-supervised feature selection: A review on gene selection, IEEE/ACM Transactions on Computational Biology and Bioinformatics, 13(5), 2016. DOI: 10.1109/TCBB.2015.2478454.
  • I. Boussaïd, J. Lepagnot, and P. Siarry, A survey on optimization meta-heuristic s, Information Sciences, 237 2013, 82–117. DOI: 10.1016/j.ins.2013.02.041.
  • I. Kahvazadeh and M. SanieeAbadeh, MOCANAR: A Multi-Objective Cuckoo Search Algorithm for Numeric Association Rule Discovery, 2015. DOI: 10.5121/csit.2015.51509.99–113
  • K. G. Dhal, S. Ray, A. Das, and S. Das, A Survey on Nature-Inspired Optimization Algorithms and Their Application in Image Enhancement Domain, Archives of Computational Methods in Engineering, 2019. DOI: 10.1007/s11831-018-9289-9.
  • X. S. Yang, Nature-Inspired Optimization Algorithms, (Elsevier, 2014).
  • Hira, Z. M. and Gillies, D. F, A Review of Feature Selection and Feature Extraction Methods Applied on Microarray Data, Advances in Bioinformatics, 2015, 1–13. DOI:10.1155/2015/198363
  • C. Blum, J. Puchinger, G. R. Raidl, and A. Roli, Hybrid meta-heuristic s in combinatorial optimization: A survey, Applied Soft Computing Journal, 11, 2011, 4135-4151. DOI: 10.1016/j.asoc.2011.02.032.
  • A. Soler-Dominguez, A. A. Juan, and R. Kizys, A survey on financial applications of meta-heuristic s, ACM Computing Surveys, 50(1). 2017. DOI:10.1145/3054133.
  • D. Jain and V. Singh, Feature selection and classification systems for chronic disease prediction: A review, Egyptian Informatics Journal, 19, 2018, 179-189. DOI: 10.1016/j.eij.2018.03.002.
  • A. Safiyari and R. Javidan, Predicting lung cancer survivability using ensemble learning methods, Ptoc. 2017 Intelligent Systems Conference, IntelliSys 2017, London, UK, 7-8 Sept. 2017. DOI:10.1109/IntelliSys.2017.8324368.
  • N. Meddouri, H. Khoufi, and M. Maddouri, Parallel learning and classification for rules based on formal concepts, Procedia Computer Science, 35, 2014, 358367. DOI: 10.1016/j.procs.2014.08.116.
  • S. Wang, D. Wang, J. Li, T. Huang, and Y. D. Cai, Identification and analysis of the cleavage site in a signal peptide using SMOTE, dagging, and feature selection methods, Molecular Omics, 14(1), 2018, 64–73. DOI: 10.1039/c7mo00030h.
  • D. S. Anyfantis, M. G. Karagiannopoulos, S. B. Kotsiantis, and P. E. Pintelas, Local dagging of decision stumps for regression and classification problems, Proc. 2007 Mediterranean Conference on Control and Automation, MED, 2007. DOI: 10.1109/MED.2007.4433917.
  • S. Salesi and G. Cosma, A novel extended binary cuckoo search algorithm for feature selection, Proc. 2017 2nd International Conference on Knowledge Engineering and Applications, ICKEA 2017, 2017. DOI:10.1109/ICKEA.2017.8169893.
  • A. A. A. Mohamed, S. A. Hassan, A. M. Hemeida, S. Alkhalaf, M. M. M. Mahmoud, and A. M. Baha Eldin, Parasitism – Predation algorithm (PPA): A novel approach for feature selection, Ain Shams Engineering Journal, 2019. DOI:10.1016/j.asej.2019.10.004.
  • A. F. Alia and A. Taweel, Feature Selection based on Hybrid Binary Cuckoo Search and Rough Set Theory in Classification for Nominal Datasets, International Journal of Information Technology and Computer Science, 9, 2017. DOI:10.5815/ijitcs.2017.04.08.
  • A. Kumar, A. Jaiswal, S. Garg, S. Verma, and S. Kumar, Sentiment Analysis Using Cuckoo Search for Optimized Feature Selection on Kaggle Tweets, International Journal of Information Retrieval Research, 9, 2018. DOI:10.4018/ijirr.2019010101.
  • M. A. El Aziz and A. E. Hassanien, Modified cuckoo search algorithm with rough sets for feature selection, Neural Computing and Applications, 29, 2018, 925– 934. DOI:10.1007/s00521-016-2473-7.
  • R. . Babukartik, Hybrid Algorithm using the advantage of ACO and Cuckoo Search for Job Scheduling, International Journal of Information Technology Convergence and Services, 2, 2012. DOI: 10.5121/ijitcs.2012.2403.
  • M. Mareli and B. Twala, An adaptive Cuckoo search algorithm for optimisation, Applied Computing and Informatics, 14, 2018, 107-115. DOI:10.1016/j.aci.2017.09.001.
  • K. Thirugnanasambandam, S. Prakash, V. Subramanian, S. Pothula, and V. Thirumal, Reinforced cuckoo search algorithm-based multimodal optimization, Applied Intelligence, 49, 2019, 2059–2083. DOI: 10.1007/s10489-0181355-3.
  • L. A. M. Pereira et al., A binary cuckoo search and its application for feature selection, Studies in Computational Intelligence, 2014, 141-154. DOI: 10.1007/978-3-319-02141-6_7.
  • C. Gunavathi and K. Premalatha, Cuckoo search optimisation for feature selection in cancer classification: A new approach, International Journal of Data Mining and Bioinformatics, 13(3), 2015, 24865. DOI: 10.1504/IJDMB.2015.072092.
  • W. Yamany, N. El-Bendary, A. E. Hassanien, and E. Emary, Multi-Objective Cuckoo Search Optimization for Dimensionality Reduction, Procedia Computer Science, 96, 2016, 207-215. DOI:10.1016/j.procs.2016.08.130.
  • R. Chi, Y. Su, Z. Qu, and X. Chi, A Hybridization of Cuckoo Search and Differential Evolution for the Logistics Distribution Center Location Problem, Mathematical Problems in Engineering, 2019, 2019. DOI: 10.1155/2019/7051248.
  • M. V. Kotpalliwar and R. Wajgi, Classification of attacks using support vector machine (SVM) on KDDCUP’99 IDS database, Proc. 2015 5th International Conference on Communication Systems and Network Technologies, CSNT 2015, 2015. DOI: 10.1109/CSNT.2015.185.
  • S. Albawi, T. A. Mohammed, and S. Al-Zawi, Understanding of a convolutional neural network, Proc. 2017 International Conference on Engineering and Technology, ICET 2017, 2018. DOI:10.1109/ICEngTechnol.2017.8308186.
  • A. Palacios, L. Sánchez, I. Couso, and S. Destercke, An extension of the FURIA classification algorithm to low quality data through fuzzy rankings and its application to the early diagnosis of dyslexia, Neurocomputing, 2016. DOI:10.1016/j.neucom.2014.11.088.
  • D. P. Pancho, J. M. Alonso, and L. Magdalena, Understanding the Inference Mechanism of FURIA by means of Fingrams, Proc. 2015 Conference of the International Fuzzy Systems Association and the European Society for Fuzzy Logic and Technology, 2015. DOI: 10.2991/ifsa-eusflat-15.2015.44.
  • K. Trawiński, O. Cordón, and A. Quirin, A study on the use of multiobjective genetic algorithms for classifier selection in furia-based fuzzy multiclassifiers, International Journal of Computational Intelligence Systems, 5, 2012, 231253. DOI:: 10.1080/18756891.2012.685272.
  • D. Tien Bui, T. C. Ho, B. Pradhan, B. T. Pham, V. H.Nhu, and I. Revhaug, GIS-based modeling of rainfallinduced landslides using data mining-based functional trees classifier with AdaBoost, Bagging, and MultiBoost ensemble frameworks, Environmental Earth Sciences,75(14), 2016. DOI: 10.1007/s12665016-5919-4.
  • V. R. Avali, G. F. Cooper, and V. Gopalakrishnan, Application of Bayesian logistic regression to mining biomedical data, Proc. AMIA ... Annual Symposium proceedings / AMIA Symposium. AMIA Symposium, 2014.
  • Ioannis I. Spyroglou, Gunter Spöck, Eleni A.Chatzimichail, Alexandros Rigas and Emmanouil Paraskakis, A Bayesian Logistic Regression approach in Asthma Persistence Prediction, Epidemiology Biostatistics and Public Health, 15(1), 2018.
  • Xia Zhao, Wei Chen, GIS-Based Evaluation of Landslide Susceptibility Models Using Certainty Factors and Functional Trees-Based Ensemble Techniques, Appl. Sci., 10(16), 2020,1-20. DOI:10.3390/app10010016
  • MahmoodMoghadasian, SeyedehParvanehHosseini, Binary Cuckoo Optimization Algorithm for Feature Selection in High-Dimensional Datasets, Proc. International conference on Innovative Engineering Technologies (ICIET’2014), Bangkok (Thailand), Dec. 28-29, 2014, 18/21. DOI:org/10.15242/IIE.E1214027
  • Sebastien C. Wong, Adam Gatt, Victor Stamatescu, Understanding data augmentation for classification: when to warp?, Proc. 2016 International Conference on Digital Image Computing: Techniques and Applications (DICTA), Gold Coast, QLD, Australia, 30 Nov.-2 Dec. 2016. DOI: 10.1109/DICTA.2016.7797091
  • Yang, X.-S., & Deb, S., Cuckoo search: State-of-theart and opportunities, Proc. 2017 IEEE 4th International Conference on Soft Computing & Machine Intelligence (ISCMI), 2017. DOI:10.1109/iscmi.2017.8279597

Abstract Views: 212

PDF Views: 0




  • Optimal Feature Subset Selection Using Cuckoo Search on IoT Network

Abstract Views: 212  |  PDF Views: 0

Authors

Samah Osama M. Kamel
Department of Informatics Research, Electronics Research Institute, Cairo, Egypt
SanaaAbou Elhamayed
Department of InformaticsResearch, ElectronicsResearch Institute, Cairo, Egypt

Abstract


The Internet of Things (IoT) became the basic axis in the information and network technology to create a smart environment. To build such an environment; it needs to use some IoT simulators such as Cooja Simulator. Cooja simulator creates an IoT environment and produces an IoT routing dataset that contains normal and malicious motes. The IoT routing dataset may have redundant and noisy features. The feature selection can affect on the performance metrics of the learning model. The feature selection can reduce complexity and over-fitting problem. There are many approaches for feature selection especially meta-heuristic algorithms such as Cuckoo search (CS). This paper presented a proposed model for feature selection that is built on using a standard cuckoo search algorithm to select near-optimal or optimal features. A proposed model may modify the CS algorithm which has implemented using Dagging with base learner Bayesian Logistic Regression (BLR). It increases the speed of the CS algorithm and improves the performance of BLR. Support Vector Machine (SVM), Deep learning, and FURIA algorithms are used as classification techniques used to evaluate the performance metrics. The results have demonstrated that the algorithm proposed is more effective and competitive in terms of performance of classification and dimensionality reduction. It achieved high accuracy that is near to 98 % and low error that is about 1.5%.

Keywords


IoT, Feature Selection, Meta-heuristic, Cuckoo Search Algorithm, Deep Learning.

References