Open Access Open Access  Restricted Access Subscription Access
Open Access Open Access Open Access  Restricted Access Restricted Access Subscription Access

NLSDF for Boosting the Recital of Web Spamdexing Classification


Affiliations
1 Department of Computer Science, Vellalar College for Women, India
2 Department of Computer Science, Hindusthan College of Arts and Science, India
     

   Subscribe/Renew Journal


Spamdexing is the art of black hat SEO. Features which are more influential for high rank and visibility are manipulated for the SEO task. The motivation behind the work is utilizing the state of the art Website optimization features to enhance the performance of spamdexing detection. Features which play a focal role in current SEO strategies show a significant deviation for spam and non-spam samples. This paper proposes 44 features named as NLSDF (New Link Spamdexing Detection Features). Social media creates an impact in search engine results ranking. Features pertaining to the social media were incorporated with the NLSDF features to boost the recital of the spamdexing classification. The NLSDF features with 44 attributes along with 5 social media features boost the classification performance of the WEBSPAM-UK 2007 dataset. The one tailed paired t-test with 95% confidence, performed on the AUC values of the learning models shows significance of the NLSDF.

Keywords

Web Spam, Search Engine, SVM, Decision Table, HMM.
Subscription Login to verify subscription
User
Notifications
Font Size

Abstract Views: 255

PDF Views: 0




  • NLSDF for Boosting the Recital of Web Spamdexing Classification

Abstract Views: 255  |  PDF Views: 0

Authors

S. K. Jayanthi
Department of Computer Science, Vellalar College for Women, India
S. Sasikala
Department of Computer Science, Hindusthan College of Arts and Science, India

Abstract


Spamdexing is the art of black hat SEO. Features which are more influential for high rank and visibility are manipulated for the SEO task. The motivation behind the work is utilizing the state of the art Website optimization features to enhance the performance of spamdexing detection. Features which play a focal role in current SEO strategies show a significant deviation for spam and non-spam samples. This paper proposes 44 features named as NLSDF (New Link Spamdexing Detection Features). Social media creates an impact in search engine results ranking. Features pertaining to the social media were incorporated with the NLSDF features to boost the recital of the spamdexing classification. The NLSDF features with 44 attributes along with 5 social media features boost the classification performance of the WEBSPAM-UK 2007 dataset. The one tailed paired t-test with 95% confidence, performed on the AUC values of the learning models shows significance of the NLSDF.

Keywords


Web Spam, Search Engine, SVM, Decision Table, HMM.