Countering Web Spam with Credibility-Based Link Analysis Conference Paper uri icon

abstract

  • We introduce the concept of link credibility, identify the conflation of page quality and link credibility in popular Web link analysis algorithms, and discuss how to decouple link credibility from page quality. Our credibility-based link analysis exhibits three distinct features. First, we develop several techniques for semi-automatically assessing link credibility for all Web pages. Second, our link credibility assignment algorithms allow users to assess credibility in a personalized manner. Third, we develop a novel credibility-based Web ranking algorithm - CredibleRank - which incorporates credibility information directly into the quality assessment of each page on the Web. Our experimental study shows that our approach is significantly and consistently more spam-resilient than both PageRank and TrustRank. Copyright 2007 ACM.

name of conference

  • Proceedings of the twenty-sixth annual ACM symposium on Principles of distributed computing

published proceedings

  • PODC'07: PROCEEDINGS OF THE 26TH ANNUAL ACM SYMPOSIUM ON PRINCIPLES OF DISTRIBUTED COMPUTING

author list (cited authors)

  • Caverlee, J., & Liu, L.

citation count

  • 31

complete list of authors

  • Caverlee, James||Liu, Ling

editor list (cited editors)

  • Gupta, I., & Wattenhofer, R.

publication date

  • January 2007