Detecting Spam URLs in Social Media via Behavioral Analysis

abstract

Springer International Publishing Switzerland 2015. This paper addresses the challenge of detecting spam URLs in social media, which is an important task for shielding users from links associated with phishing, malware, and other low-quality, suspicious content. Rather than rely on traditional blacklist-based filters or content analysis of the landing page for Web URLs, we examine the behavioral factors of both who is posting the URL and who is clicking on the URL. The core intuition is that these behavioral signals may be more difficult to manipulate than traditional signals. Concretely, we propose and evaluate fifteen click and posting-based features. Through extensive experimental evaluation, we find that this purely behavioral approach can achieve high precision (0.86), recall (0.86), and area-under-the-curve (0.92), suggesting the potential for robust behavior-based spam detection.

name of conference

Advances in Information Retrieval - 37th European Conference on IR Research, ECIR 2015, Vienna, Austria, March 29 - April 2, 2015. Proceedings

authors

Caverlee, James

published proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

author list (cited authors)

Cao, C., & Caverlee, J.

citation count

45

complete list of authors

Cao, Cheng||Caverlee, James

editor list (cited editors)

Hanbury, A., Kazai, G., Rauber, A., & Fuhr, N.

publication date

January 2015

publisher

Springer Nature Publisher

published in

Lecture Notes in Artificial Intelligence Journal

keywords

Behavioral And Social Science

Digital Object Identifier (DOI)

10.1007/978-3-319-16354-3_77

International Standard Book Number (ISBN) 13

9783319163536

start page

703

end page

714

volume

9022

URL

https://doi.org/10.1007/978-3-319-16354-3

Detecting Spam URLs in Social Media via Behavioral Analysis Conference Paper

Overview

abstract

name of conference

authors

published proceedings

author list (cited authors)

citation count

complete list of authors

editor list (cited editors)

publication date

publisher

published in

Research

keywords

Identity

Digital Object Identifier (DOI)

International Standard Book Number (ISBN) 13

Additional Document Info

start page

end page

volume

Other

URL