Uncovering Social Spammers: Social Honeypots plus Machine Learning

Conference Paper

abstract

Web-based social systems enable new community-based opportunities for participants to engage, share, and interact. This community value and related services like search and advertising are threatened by spammers, content polluters, and malware disseminators. In an effort to preserve community value and ensure long-term success, we propose and evaluate a honeypot-based approach for uncovering social spammers in online social systems. Two of the key components of the proposed approach are: (1) The deployment of social honeypots for harvesting deceptive spam profiles from social networking communities; and (2) Statistical analysis of the properties of these spam profiles for creating spam classifiers to actively filter out existing and new spammers. We describe the conceptual framework and design considerations of the proposed approach, and we present concrete observations from the deployment of social honeypots in MySpace and Twitter. We find that the deployed social honeypots identify social spammers with low false positive rates and that the harvested spam data contains signals that are strongly correlated with observable profile features (e.g., content, friend information, posting patterns, etc.). Based on these profile features, we develop machine learning based classifiers for identifying previously unknown spammers with high precision and a low rate of false positives. © 2010 ACM.

name of conference

Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval

authors

Caverlee, James

published proceedings

SIGIR 2010: PROCEEDINGS OF THE 33RD ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL

author list (cited authors)

Lee, K., Caverlee, J., & Webb, S.

citation count

439

complete list of authors

Lee, Kyumin; Caverlee, James; Webb, Steve

editor list (cited editors)

Crestani, F., Marchand-Maillet, S., Chen, H., Efthimiadis, E. N., & Savoy, J.

publication date

January 2010

publisher

Association for Computing Machinery (ACM)