Detecting Spam URLs in Social Media via Behavioral Analysis Conference Paper

abstract

  • © Springer International Publishing Switzerland 2015. This paper addresses the challenge of detecting spam URLs in social media, which is an important task for shielding users from links associated with phishing, malware, and other low-quality, suspicious content. Rather than rely on traditional blacklist-based filters or content analysis of the landing page for Web URLs, we examine the behavioral factors of both who is posting the URL and who is clicking on the URL. The core intuition is that these behavioral signals may be more difficult to manipulate than traditional signals. Concretely, we propose and evaluate fifteen click and posting-based features. Through extensive experimental evaluation, we find that this purely behavioral approach can achieve high precision (0.86), recall (0.86), and area-under-the-curve (0.92), suggesting the potential for robust behavior-based spam detection.

name of conference

  • Advances in Information Retrieval - 37th European Conference on IR Research, ECIR 2015, Vienna, Austria, March 29 - April 2, 2015. Proceedings

published proceedings

  • Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

author list (cited authors)

  • Cao, C., & Caverlee, J.

citation count

  • 45

complete list of authors

  • Cao, Cheng; Caverlee, James

editor list (cited editors)

  • Hanbury, A., Kazai, G., Rauber, A., & Fuhr, N.

publication date

  • January 2015