Detecting Spam URLs in Social Media via Behavioral Analysis Conference Paper

abstract

  • © Springer International Publishing Switzerland 2015. This paper addresses the challenge of detecting spam URLs in social media, which is an important task for shielding users from links associated with phishing, malware, and other low-quality, suspicious content. Rather than relying on traditional blacklist-based filters or content analysis of the landing page for Web URLs, we examine the behavioral factors of both who is posting the URL and who is clicking on the URL. The core intuition is that these behavioral signals may be more difficult to manipulate than traditional signals. Concretely, we propose and evaluate fifteen click-based and posting-based features. Through extensive experimental evaluation, we find that this purely behavioral approach can achieve high precision (0.86), recall (0.86), and area-under-the-curve (0.92), suggesting the potential for robust behavior-based spam detection.
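    For readers unfamiliar with the three metrics the abstract reports, the following is a minimal sketch of how precision, recall, and area-under-the-curve are commonly computed for a binary spam classifier. It is not the authors' evaluation code. The function names, the 0.5 decision threshold, and the toy labels and scores are all assumptions made up for illustration and do not come from the paper.

```python
from typing import Sequence, Tuple


def precision_recall(y_true: Sequence[int], y_pred: Sequence[int]) -> Tuple[float, float]:
    """Precision and recall for the positive (spam = 1) class."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return precision, recall


def roc_auc(y_true: Sequence[int], scores: Sequence[float]) -> float:
    """ROC AUC as the probability that a randomly chosen spam URL receives
    a higher score than a randomly chosen legitimate URL (ties count 0.5)."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative example")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Hypothetical toy data: 1 = spam URL, 0 = legitimate URL.
labels = [1, 1, 1, 0, 0, 0]
# Hypothetical classifier scores (higher means more spam-like).
scores = [0.9, 0.8, 0.4, 0.6, 0.3, 0.1]
# Assumed decision threshold of 0.5 turns scores into hard predictions.
predictions = [1 if s >= 0.5 else 0 for s in scores]

precision, recall = precision_recall(labels, predictions)
auc = roc_auc(labels, scores)
```

    Precision and recall depend on the chosen threshold, while AUC summarizes ranking quality across all thresholds, which is one reason the three figures are usually reported together.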

author list (cited authors)

  • Cao, C., & Caverlee, J.

citation count

  • 38

editor list (cited editors)

  • Hanbury, A., Kazai, G., Rauber, A., & Fuhr, N.

publication date

  • January 2015