
Web Link Spam Identification Inspired by Artificial Immune System and the Impact of TPP-FCA Feature Selection on Spam Classification


Affiliations
1 Department of Computer Science, Vellalar College for Women, India
2 Department of Computer Science, K.S.R College of Arts and Science, India
     



Search engines are the gateway for retrieving information from the web. Web spam is a deceptive technique for improving the ranking and visibility of web pages in search engine results. This paper addresses the problem of link spam classification using features of web sites. Link-related features retrieved from each web site are used to discriminate between spam and non-spam sites. Algorithms inspired by artificial immune systems (AIS) are applied to the dataset and the results are evaluated. Artificial immune systems are machine learning systems inspired by the principles of natural immunology. They comprise supervised learning schemes that can be adapted to a wide range of classification problems. The UK-WEBSPAM-2007 dataset [8] is used for the experiments, and WEKA [9] is used to simulate the classifiers. The Artificial Immune Recognition System (AIRS) algorithm appears to perform better than the other algorithms. The best classification accuracy attained is 98.89%, by the AIRS1 algorithm. This appears to compare favorably with the classifier accuracies reported in the existing literature.

Keywords

Web Spam, Search Engine, TPP, FCA, AIRS.

Authors

S. K. Jayanthi
Department of Computer Science, Vellalar College for Women, India
S. Sasikala
Department of Computer Science, K.S.R College of Arts and Science, India
