Collective Spammer Detection In Evolving Multi-Relational Social Networks

KDD(2015)

引用 145|浏览92
暂无评分
摘要
Detecting unsolicited content and the spammers who create it is a long-standing challenge that affects all of us on a daily basis. The recent growth of richly-structured social networks has provided new challenges and opportunities in the spam detection landscape. Motivated by the Tagged.com(1) social network, we develop methods to identify spammers in evolving multi-relational social networks. We model a social network as a time-stamped multi-relational graph where vertices represent users, and edges represent different activities between them. To identify spammer accounts, our approach makes use of structural features, sequence modelling, and collective reasoning. We leverage relational sequence information using k-gram features and probabilistic modelling with a mixture of Markov models. Furthermore, in order to perform collective reasoning and improve the predictive power of a noisy abuse reporting system, we develop a statistical relational model using hinge-loss Markov random fields (HL-MRFs), a class of probabilistic graphical models which are highly scalable. We use Graphlab Create (TM) and Probabilistic Soft Logic (PSL)(2) to prototype and experimentally evaluate our solutions on internet-scale data from Tagged.com. Our experiments demonstrate the effectiveness of our approach, and show that models which incorporate the multi-relational nature of the social network significantly gain predictive performance over those that do not.
更多
查看译文
关键词
Social Networks,Spam,Social Spam,Collective Classification,Graph Mining,Multi-relational Networks,Heterogeneous Networks,Sequence Mining,Tree-Augmented Naive Bayes,k-grams,Hinge-loss Markov Random Fields (HL-MRFs),Probabilistic Soft Logic (PSL),Graphlab
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要