Open Access Open Access  Restricted Access Subscription Access
Open Access Open Access Open Access  Restricted Access Restricted Access Subscription Access

Fraud News Detection for Online Social Networks


Affiliations
1 Institute of Statistical Studies and Research, Cairo University, Egypt
     

   Subscribe/Renew Journal


Social media plays a vital role in all online aspects now, including personal communication, business and economics. It even affects political aspects seriously. A huge amount of available information, especially micro blogs is considered as a massive growth rate of human users, which is represented in the unprecedented diversity of its participants in terms of backgrounds, reasons and languages a revolution in its possibility of sharing public information, besides there is the way it makes its participants use their devices and perform their mission. 

Twitter, as a most famous used type of online social networking, contains huge data and news that throw the light on the content investigation in the tweets. This paper has discussed a proposed approach for determining the credibility of spread news on such social networks in two phases: The first phase is to detect the fake users enabling to ignore the news given by fake users. The second phase detects the credibility of the news content for the previously checked  

Account users by using the similarity measures and most popular machine learning algorithms such as (Support vector machine, Decision tree, Neural networks, Naive Bayes, Random forest) that enhance the credibility examining. The accuracy of the results of this phase is 99.8 %. In the second phase the news content credibility is detected by using the most popular similarity measures (Jacard, Cosine and Dice), which Jacard ended up with 95.4%percentage of accuracy.


Keywords

Fraud News, Support Vector Machine, Neural Networks, Naive Bayes, Random Forest, Fraud Text.
User
Subscription Login to verify subscription
Notifications
Font Size

  • El azab, A., Mahmood A. Mahmood, El-Aziz, A.,”Effectiveness of web usage mining techniques in business application”, web usage mining techniques and application across industries, p.p.324-350,igi global, 2017.
  • M. Dash, H. Liu,”Feature Selection for Classification”, Intelligent Data Analysis, Vol 1, p.p. 131156, 1997.
  • Kazem Jahanbakhsh, Yumi Moon,”The predictive power of social media: On the predictability of US Presidential Elections using Twitter”, Social and Information Networks, arXiv: 1407. 0622, 2014.
  • Adamic L., ZhangJ., Bakshy E., and Ackerman M., ZhangJ .BakshyE., ”Knowledge sharing and yahoo answers: everyone knows something”. Processed in 17th international conference on World Wide Web, ACM, pp 665-674, 2012.
  • Carlos Castillo, Marcelo Mendoza, Barbara Poblete, ”Information Credibility on Twitter”, the 20th international conference on World wide web ACM, 675-684,2011.
  • Fang Jin, Edward Dougherty, Parang Saraf, Yang Cao, Naren Ramakrishnan,”Epidemiological Modeling of News and Rumors on Twitter”. The 7th SNA-KDD Workshop 13 (SNA-KDD13), August 11, 2013.
  • Aditi Gupta, Hemank Lamba, Ponnurangam Kumaraguru, Anupam Joshi, ”Faking Sandy: characterizing and identifying fake images on Twitter during Hurricane Sandy”, In Proceedings of the 22nd international conference on World Wide Web companion, 729-736,2013.
  • Rajdev, Meet,”Fake and Spam Messages: Detecting Misinformation during Natural Disasters on Social Media”. All Graduate Theses and Dissertations. Paper 4462, 2015.
  • Supraja Gurajala, Joshua S White, Brian Hudson, Brian R Voter, Jeanna N Matthews, ”Profile characteristics of fake Twitter accounts” in SM Society ’15, July 27- 29, Toronto, ON, Canada, 2015.
  • Fabricio Benevenuto, Gabriel Magno, Tiago Rodrigues, and Virgilio Almeida,”Detecting Spammers on Twitter”, CEAS 2010 - Seventh annual Collaboration, Electronic messaging, AntiAbuse and Spam Conference July 13-14, 2010, Redmond, Washington, US.
  • Ahmed El Azab, Amira M. Idrees, Mahmood A. Mahmood, Hesham Hefny ,”Fake Account Detection in Twitter Based on Minimum Weighted Feature set”,World Academy of Science, Engineering and Technology International Journal of Computer, Electrical, Automation, Control and Information Engineering Vol 10, No 1,p.p. 13-18 2016.
  • Karegowda, A. G., Manjunath, A. S., & Jayaram, M. A. (2010). Comparative study of attribute selection using gain ratio and correlation based feature selection. International Journal of Information Technology and Knowledge Management, 2(2), 271-277.
  • Hu, X., & Liu, H. (2012). Text analytics in social media. In Mining text data (pp. 385-414). Springer, Boston
  • Niwattanakul, S., Singthongchai, J., Naenudorn, E., & Wanapu, S. (2013, March). Using of Jaccard coefficient for keywords similarity. In Proceedings of the International MultiConference of Engineers and Computer Scientists (Vol. 1, No. 6).
  • Cresci, S., Di Pietro, R., Petrocchi, M., Spognardi, A., Tesconi, M.,”A Fake Follower Story: improving fake accounts detection on Twitter”, IIT-CNR, Tech. Rep. TR-03, 2014.
  • Vahed Qazvinian Emily Rosengren Dragomir R. Radev Qiaozhu Mei, ”Rumor has it: Identifying Misinformation in Microblogs”, Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, p.p. 15891599, Edinburgh, Scotland, UK, July 2731, Association for Computational Linguistics,2011

Abstract Views: 239

PDF Views: 3




  • Fraud News Detection for Online Social Networks

Abstract Views: 239  |  PDF Views: 3

Authors

Ahmed ELazab
Institute of Statistical Studies and Research, Cairo University, Egypt
Mahmoud A. Mahmoud
Institute of Statistical Studies and Research, Cairo University, Egypt

Abstract


Social media plays a vital role in all online aspects now, including personal communication, business and economics. It even affects political aspects seriously. A huge amount of available information, especially micro blogs is considered as a massive growth rate of human users, which is represented in the unprecedented diversity of its participants in terms of backgrounds, reasons and languages a revolution in its possibility of sharing public information, besides there is the way it makes its participants use their devices and perform their mission. 

Twitter, as a most famous used type of online social networking, contains huge data and news that throw the light on the content investigation in the tweets. This paper has discussed a proposed approach for determining the credibility of spread news on such social networks in two phases: The first phase is to detect the fake users enabling to ignore the news given by fake users. The second phase detects the credibility of the news content for the previously checked  

Account users by using the similarity measures and most popular machine learning algorithms such as (Support vector machine, Decision tree, Neural networks, Naive Bayes, Random forest) that enhance the credibility examining. The accuracy of the results of this phase is 99.8 %. In the second phase the news content credibility is detected by using the most popular similarity measures (Jacard, Cosine and Dice), which Jacard ended up with 95.4%percentage of accuracy.


Keywords


Fraud News, Support Vector Machine, Neural Networks, Naive Bayes, Random Forest, Fraud Text.

References