Detecting Product Review Spammers Using Principles of Big Data

IEEE TRANSACTIONS ON ENGINEERING MANAGEMENT(2023)

引用 4|浏览21
暂无评分
摘要
The growing consumerism has led to the importance of online reviews on the Internet. Opinions voiced by these reviews are taken into consideration by many consumers for making financial decisions online. This has led to the development of opinion spamming for profitable motives or otherwise. This work has been done to tackle the challenge of identifying such spammers, but the scale of the real-world review systems demands this problem to be tackled as a big data challenge. So, an effort has been made to detect online review spammers using the principle of big data. In this article, a rating-based model has been studied under the light of large-scale datasets (more than 80 million reviews by 20 million reviewers) using the Hadoop and Spark frameworks. Scale effects have been identified and mitigated to provide better context to large review systems. An improved computational framework has been presented to compute the overall spamcity of reviewers using exponential smoothing. The value of the smoothing factor was set empirically. Finally, future directions have been discussed.
更多
查看译文
关键词
Big Data, Data models, Unsolicited e-mail, Smoothing methods, Context modeling, Computer science, Computational modeling, Big data, e-commerce, review spammer detection, spam reviews
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络