Open Access Open Access  Restricted Access Subscription Access
Open Access Open Access Open Access  Restricted Access Restricted Access Subscription Access

Detecting Deceptive Reviews: An Integrated Machine Learning Approach


Affiliations
1 College of Information Technology, United Arab Emirates University, United Arab Emirates
2 Department of Information Technology, Noorul Islam University, India
     

   Subscribe/Renew Journal


In recent years, online reviews have become a crucial factor in promoting products and services. However, the rise of fake reviews has posed a significant challenge. Businesses, marketers, and advertisers often resort to embedding fake reviews to attract customers or undermine their competitors. Deceptive reviews have become a common practice, as they serve as a means of promoting one's own business or tarnishing the reputation of rivals. Consequently, the identification of deceptive reviews has emerged as a critical and ongoing research area. This research paper presents a machine learning model approach to detect deceptive reviews. The study focuses on experiments conducted using a deceptive opinion spam corpus dataset, specifically targeting restaurant reviews. An n-gram model combined with max features is developed to identify deceptive content, with a particular emphasis on fake reviews. Additionally, a benchmark study is conducted to explore the performance of two different feature extraction techniques and their application in five machine learning classification techniques. The experimental findings demonstrate that the passive aggressive classifier outperforms other algorithms, achieving the highest accuracy not only in text classification but also in identifying fake reviews. Moreover, the research delves into the identification of deceptive reviews and explores diverse feature extraction and machine learning techniques to improve the model's accuracy.

Keywords

Natural Language Processing, Transformers, Deceptive Reviews.
Subscription Login to verify subscription
User
Notifications
Font Size

  • Jiwel Li, Myle Ott, Claire Cardie and Eduard Hovy, “Towards a General Rule for Identifying Deceptive Opinion Spam”, Proceedings of Annual Meeting of the Association for Computational Linguistics, pp. 1566-1576, 2014.
  • Ahmed Hadeer, Issa Traore and Sherif Saad, “Detecting Opinion Spams and Fake News using Text Classification”, Security and Privacy, Vol. 1, No. 1, pp.1-9, 2018.
  • Mukherjee Arjun, Bing Liu and Natalie Glance, “Spotting Fake Reviewer Groups in Consumer Reviews”, Proceedings of International Conference on World Wide Web, pp. 191-200, 2012.
  • Shojaee Somayeh, Masrah Azrifah Azmi Murad, Azreen Bin Azman, Nurfadhlina Mohd Sharef and Samaneh Nadali, “Detecting Deceptive Reviews using Lexical and Syntactic Features”, Proceedings of International Conference on Intelligent Systems Design and Applications, pp. 53-58, 2013.
  • P. Algur Siddu and S. Shivashankar, “Conceptual Level Similarity Measure based Review Spam Detection”, Proceedings of International Conference on Signal and Image Processing, pp. 416-423, 2010.
  • Y.K. Lau Raymond, Ron Chi-Wai Kwok, Kaiquan Xu, Yunqing Xia and Yuefeng Li, “Text Mining and Probabilistic Language Modeling for Online Review Spam Detection”, ACM Transactions on Management Information Systems, Vol. 2, No. 4, pp. 1-30, 2012.
  • Jindal Nitin and Bing Liu, “Opinion Spam and Analysis”, Proceedings of International Conference on Web Search and Data Mining, pp. 219-230, 2008.
  • Choi Wonil, Kyungmin Nam, Minwoo Park, Seoyi Yang, Sangyoon Hwang and Hayoung Oh, “Fake Review Identification and Utility Evaluation Model using Machine Learning”, Frontiers in Artificial Intelligence, Vol. 5, pp. 1064371-1064378, 2023.
  • Geli Fei, Arjun Mukherjee, Bing Liu, Meichun Hsu, Malu Castellanos and Riddhiman Ghosh, “Exploiting Business in Reviews for Review Spammer Detection”, Proceedings of International AAAI Conference on Weblogs and Social Media, pp. 175-184, 2013.
  • Feng Song, Ritwik Banerjee and Yejin Choi, “Syntactic Stylometry for Deception Detection”, Proceedings of Annual Meeting of the Association for Computational Linguistics, pp. 171-175, 2012.
  • Jindal Nitin and Bing Liu, “Review Spam Detection”, Proceedings of International Conference on World Wide Web, pp. 1189-1190, 2007.
  • Li Fangtao Huang, Minlie Huang, Yi Yang and Xiaoyan Zhu, “Learning to Identify Review Spam”, Proceedings of 22nd International Joint Conference on Artificial Intelligence, pp. 1-13, 2011.
  • Li Jiwei, Claire Cardie and Sujian Li, “Topicspam: A Topic-Model based Approach for Spam Detection”, Proceedings of the Annual Meeting of the Association for Computational Linguistics, pp. 217-221, 2013.
  • Myle Ott, Yejin Choi, Claire Cardie and Jeffrey T. Hancock, “Finding Deceptive Opinion Spam by any Stretch of the Imagination”, Proceedings of International Conference on World Wide Web, pp. 1-8, 2011.
  • Xie Sihong, Guan Wang, Shuyang Lin and Philip S. Yu, “Review Spam Detection via Temporal Pattern Discovery”, Proceedings of ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 823-831, 2012.
  • Mukherjee Arjun, Vivek Venkataraman, Bing Liu and Natalie Glance, “What Yelp Fake Review Filter Might be Doing?”, Proceedings of the International AAAI Conference on Web and Social Media, Vol. 7, No. 1, pp. 409-418, 2013.

Abstract Views: 145

PDF Views: 1




  • Detecting Deceptive Reviews: An Integrated Machine Learning Approach

Abstract Views: 145  |  PDF Views: 1

Authors

Anusuya Krishnan
College of Information Technology, United Arab Emirates University, United Arab Emirates
Kennedyraj
Department of Information Technology, Noorul Islam University, India

Abstract


In recent years, online reviews have become a crucial factor in promoting products and services. However, the rise of fake reviews has posed a significant challenge. Businesses, marketers, and advertisers often resort to embedding fake reviews to attract customers or undermine their competitors. Deceptive reviews have become a common practice, as they serve as a means of promoting one's own business or tarnishing the reputation of rivals. Consequently, the identification of deceptive reviews has emerged as a critical and ongoing research area. This research paper presents a machine learning model approach to detect deceptive reviews. The study focuses on experiments conducted using a deceptive opinion spam corpus dataset, specifically targeting restaurant reviews. An n-gram model combined with max features is developed to identify deceptive content, with a particular emphasis on fake reviews. Additionally, a benchmark study is conducted to explore the performance of two different feature extraction techniques and their application in five machine learning classification techniques. The experimental findings demonstrate that the passive aggressive classifier outperforms other algorithms, achieving the highest accuracy not only in text classification but also in identifying fake reviews. Moreover, the research delves into the identification of deceptive reviews and explores diverse feature extraction and machine learning techniques to improve the model's accuracy.

Keywords


Natural Language Processing, Transformers, Deceptive Reviews.

References