Open Access Open Access  Restricted Access Subscription Access
Open Access Open Access Open Access  Restricted Access Restricted Access Subscription Access

Symbolic Representation of Internet Traffic Data using Multiple Kernel Fuzzy C-Means


Affiliations
1 Department of Information Science and Engineering, Sri Jayachamarajendra College of Engineering, India
2 Department of Information Science and Engineering, JSS Science and Technology University, India
     

   Subscribe/Renew Journal


Network traffic classification is a core part of the network traffic management. Network management is a critical task since the various new applications are emerging every moment and increase in the number of users of an internet. Due to this problem, there is a need of internet traffic classification for smooth management of an internet by the internet service providers (ISP). Network traffic can be classified based on port, payload and statistical approach. In the proposed work, a novel method to represent internet traffic data based on clustering of feature vector using Multiple Kernel Fuzzy C-Means (MKFCM) is proposed. Further, feature vector of each cluster is used to build an interval valued representation (symbolic) using mean and standard deviation. In addition, this interval valued features are stored in knowledge base as a representative of the cluster. Further, to classify the symbolic interval data, we used symbolic classifier. To validate the effectiveness of the proposed model, experimentation is conducted on standard Cambridge University internet traffic dataset. Further, the proposed symbolic classifier compared with other existing classifiers such as Naïve Bayes, KNN and SVM classifier. The experiment outcome infers that; the proposed symbolic representation classifier performs better than other classifiers.

Keywords

Internet Traffic, Representation, Symbolic Feature, Classification.
Subscription Login to verify subscription
User
Notifications
Font Size

  • M. Roughan, S. Sen, O. Spatscheck and N. Duffield, “Class-of-Service Mapping for QoS: A Statistical Signature-based Approach to IP Traffic Classification”, Proceedings of 4th ACM Conference on Internet Measurement, pp. 135-148, 2004.
  • T. Karagiannis, K. Papagiannaki and M. Faloutsos, “BLINC: Multilevel Traffic Classification in the Dark”, Proceedings of ACM International Conference on Computer Communication Review, pp. 229-240, 2005.
  • T.T. Nguyen and G.J. Armitage, “A Survey of Techniques for Internet Traffic Classification using Machine Learning”, IEEE Communications Surveys and Tutorials, Vol. 20, No. 1, pp. 56-76, 2008.
  • A. Finamore, M. Mellia, M. Meo and D. Rossi, “Kiss: Stochastic Packet Inspection Classifier for UDP Traffic”, IEEE/ACM Transactions on Networking, Vol. 18, No. 5, pp. 1505-1515, 2010.
  • Y. Xiang, W. Zhou and M. Guo, “Flexible Deterministic Packet Marking: An IP Traceback System to Find the Real Source of Attacks”, IEEE Transactions on Parallel and Distributed Systems, Vol. 20, No. 4, pp. 567-580, 2009.
  • Z.M. Fadlullah, T. Taleb, A.V. Vasilakos, M. Guizani and N. Kato, “DTRAB: Combating Against Attacks on Encrypted Protocols through Traffic-Feature Analysis”, IEEE/ACM Transactions on Networking, Vol. 18, No. 4, pp. 1234-1247, 2010.
  • R. Buyya, C.S. Yeo, S. Venugopal, J. Broberg and I. Brandic, “Cloud Computing and Emerging IT Platforms: Vision, Hype, and Reality for Delivering Computing as the 5th Utility”, Future Generation Computer Systems, Vol. 25, No. 6, pp. 599-616, 2009.
  • M. Armbrust, A. Fox and R. Griffith, “A View of Cloud Computing”, Communications of the ACM, Vol. 53, No. 4, pp. 50-58, 2010.
  • H. Dreger, A. Feldmann, M. Mai, V. Paxson and R. Sommer, “Dynamic Application-Layer Protocol Analysis for Network Intrusion Detection”, Proceedings of 15th International USENIX Association Symposium on Security, pp. 257-272, 2006.
  • A.W. Moore and K. Papagiannaki, “Toward the Accurate Identification of Network Applications”, Proceedings of International Workshop on Passive and Active Network Measurement, pp. 41-54, 2005.
  • S. Sen, O. Spatscheck and D. Wang, “Accurate, Scalable in-Network Identification of P2P Traffic using Application Signatures”, Proceedings of 13th International Conference on World Wide Web, pp. 512-521, 2004.
  • R.C. Jaiswal and S.D. Lokhande, “Machine Learning based Internet Traffic Recognition with Statistical Approach”, Proceedings of Annual IEEE International Conference on Networking and Security, pp. 1-6, 2013.
  • R.Y. Wang, L.I.U. Zhen and Z. Ling, “Method of Data Cleaning for Network Traffic Classification”, The Journal of China Universities of Posts and Telecommunications, Vol. 21, No. 3, pp. 35-45, 2014.
  • L. Zhen and L. Qiong, “A New Feature Selection Method for Internet Traffic Classification using ML”, Physics Procedia, Vol. 33, pp. 1338-1345, 2012.
  • M. Sun, J. Chen, Y. Zhang and S. Shi, “A New Method of Feature Selection for Flow Classification”, Physics Procedia, Vol. 24, pp. 1729-1736, 2012.
  • H. Zhang, G.Lu, M.T. Qassrawi, Y. Zhang and X. Yu, “Feature Selection for Optimizing Traffic Classification”, Computer Communications, Vol. 35, No. 12, pp. 1457-1471, 2012.
  • A. Fahad, Z. Tari, I. Khalil and I. Habib, “Toward an Efficient and Scalable Feature Selection Approach for Internet Traffic Classification”, Computer Networks, Vol. 57, No. 9, pp. 2040-2057, 2013.
  • S. Wang and X. Yao, “Multiclass Imbalance Problems: Analysis and Potential Solutions”, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics), Vol. 42, No. 4, pp. 1119-1130, 2012.
  • L. Peng, Lizhi, B. Yang, Y. Chen and X. Zhou, “An Under-Sampling Imbalanced Learning of Data Gravitation Based Classification”, Proceedings of 12th International Conference on Natural Computation, Fuzzy Systems and Knowledge Discovery, pp. 1-6, 2016.
  • L. Peng, H. Zhang, B. Yang and Y. Chen, “A New Approach for Imbalanced Data Classification based on Data Gravitation”, Information Sciences, Vol. 288, pp. 347-373, 2014.
  • L. Peng, H. Zhang, B. Yang and Y. Chen, “Imbalanced Traffic Identification using an Imbalanced Data Gravitation-based Classification Model”, Computer Communications, Vol. 102, pp. 177-189, 2017.
  • T.T.T. Nguyen, G. Armitage, P. Branch and S. Zander, “Timely and Continuous Machine-Learning-based Classification for Interactive IP Traffic”, IEEE/ACM Transactions on Networking, Vol. 20, No. 6, pp. 1880-1894, 2012.
  • Neeraj Namdev, Shikha Agrawal and Sanjay Silkari, “Recent Advancement in Machine Learning based Internet Traffic Classification”, Procedia Computer Science, Vol. 60, pp. 784-791, 2015.
  • Y. Wang, Y. Xiang, J. Zhang, W. Zhou, G. Wei and L.T. Yang, “Internet Traffic Classification using Constrained Clustering”, IEEE Transactions on Parallel and Distributed Systems, Vol. 25, No. 11, pp. 2932-2943, 2014.
  • Hardeep Singh, “Performance Analysis of Unsupervised Machine Learning Techniques for Network Traffic Classification”, Proceedings of 5th IEEE International Conference on Advanced Computing and Communication Technologies, pp. 1-6, 2015.
  • J. Erman, M. Arlitt and A. Mahanti, “Traffic Classification using Clustering Algorithms”, Proceedings of SIGCOMM workshop on Mining Network Data, pp. 1-5, 2006.
  • D.S. Guru, B.S. Harish and S. Manjunath. “Symbolic Representation of Text Documents”, Proceedings of 3rd Annual ACM Bangalore Conference, pp. 1-5, 2010.
  • B.S. Harish, B. Prasad and B. Udayasri, “Classification of Text Documents using Adaptive Fuzzy C-Means Clustering”, Proceedings of IEEE International Conference on Recent Advances in Intelligent Informatics, pp. 205-214, 2014.
  • K.R. Muller, S. Mika, G. Ratsch, K. Tsuda and B. Scholkopf, “An Introduction to Kernel-Based Learning Algorithms”, IEEE Transactions on Neural Networks, Vol. 12, No. 2, pp. 181-200, 2001.
  • Hsin Chien Huang, Yung-Yu Chuang and Chu-Song Chen, “Multiple Kernel Fuzzy Clustering”, IEEE Transactions on Fuzzy Systems, Vol. 20, No. 1, pp. 120-134, 2012.
  • D. Cai, X. He, W.V. Zhang and J. Han, “Regularized Locality Preserving Indexing via Spectral Regression”, Proceedings of 16th ACM International Conference on Information and Knowledge Management, pp. 741-750, 2007.
  • A. Moore, Z. Denis Zuev and M. Crogan, “Discriminators for Use in Flow-Based Classification”, Technical Report, Department of Computer Science, Queen Mary University of London, pp. 1-19, 2013.
  • F. Ertam and A. Engin, “A New Approach for Internet Traffic Classification: GA-WK-ELM”, Measurement, Vol. 95, pp. 135-142, 2017.

Abstract Views: 764

PDF Views: 1




  • Symbolic Representation of Internet Traffic Data using Multiple Kernel Fuzzy C-Means

Abstract Views: 764  |  PDF Views: 1

Authors

N. Manju
Department of Information Science and Engineering, Sri Jayachamarajendra College of Engineering, India
B. S. Harish
Department of Information Science and Engineering, JSS Science and Technology University, India

Abstract


Network traffic classification is a core part of the network traffic management. Network management is a critical task since the various new applications are emerging every moment and increase in the number of users of an internet. Due to this problem, there is a need of internet traffic classification for smooth management of an internet by the internet service providers (ISP). Network traffic can be classified based on port, payload and statistical approach. In the proposed work, a novel method to represent internet traffic data based on clustering of feature vector using Multiple Kernel Fuzzy C-Means (MKFCM) is proposed. Further, feature vector of each cluster is used to build an interval valued representation (symbolic) using mean and standard deviation. In addition, this interval valued features are stored in knowledge base as a representative of the cluster. Further, to classify the symbolic interval data, we used symbolic classifier. To validate the effectiveness of the proposed model, experimentation is conducted on standard Cambridge University internet traffic dataset. Further, the proposed symbolic classifier compared with other existing classifiers such as Naïve Bayes, KNN and SVM classifier. The experiment outcome infers that; the proposed symbolic representation classifier performs better than other classifiers.

Keywords


Internet Traffic, Representation, Symbolic Feature, Classification.

References