Review: Spam Double-Funnel: Connecting Web Spammers with Advertisers

May 12, 2008
Authors: Yi-Min Wang, Ming Ma, Yuan Niu, Hao Chen
Year: 2007
Published in: Proceedings of the 16th international conference on World Wide Web
Link: http://www.cs.ucdavis.edu/~hchen/paper/www07.pdf
Importance: High

Abstract

Spammers use questionable search engine optimization (SEO) techniques to promote their spam links into top search results. In this paper, we focus on one prevalent type of spam – redirection spam – where one can identify spam pages by the third-party domains that these pages redirect traffic to. We propose a five-layer, double-funnel model for describing end-to-end redirection spam, present a methodology for analyzing the layers, and identify prominent domains on each layer using two sets of commercial keywords. one targeting spammers and the other targeting advertisers. The methodology and findings are useful for search engines to strengthen their ranking algorithms against spam, for legitimate website owners to locate and remove spam doorway pages, and for legitimate advertisers to identify unscrupulous syndicators who serve ads on spam pages.

My Reiview

In this paper authors intrestingly model end-to-end spamming business with five-layer double funnel. These layer are as follow:

  1. Doorway: Spammer setup
  2. Redirection domain: Spammers set up redirection doamin.
  3. Aggregators: insulate below layers from spam pages.
  4. Sydicator: buy traffic from aggregators.
  5. Advertiser: pay sydicators to dispayer their ads.

Five top spam keywords:

  1. Phentermine
  2. Viagra
  3. Cialis
  4. Tramadol
  5. Xanax

More coming soon…


Review: A Taxonomy of JavaScript Redirection Spam

May 12, 2008
Authors: Kumar Chellapilla, Alexey Maykov
Year: 2007
Published in: Proceedings of the 3rd international workshop on Adversarial information retrieval on the web
Link: http://airweb.cse.lehigh.edu/2007/papers/paper_115.pdf
Importance: High

Abstract

Redirection spam presents a web page with false content to a crawler for indexing, but automatically redirects the browser to a different web page. Redirection is usually immediate (on page load) but may also be triggered by a timer or a harmless user event such as a mouse move. JavaScript redirection is the most notorious of redirection techniques and is hard to detect as many of the prevalent crawlers are script-agnostic. In this paper, we study common JavaScript redirection spam techniques on the web. Our findings indicate that obfuscation techniques are very prevalent among JavaScript redirection spam pages. These obfuscation techniques limit the effectiveness of static analysis and static feature based systems. Based on our findings, we recommend a robust counter measure using a light weight JavaScript parser and engine.

My Review

This paper only demonestrate the problem of redirect spam without any solutions. Authors categoriez redirection spam into 3 categories:

  1. HTTP status code
  2. META refresh
  3. Javascript

They mentioned that finding type 1 and 2 is very simple. But few works has been done on type 3.

They used blog spot as their data set and try to find amount of type 3 redirection spam.

The main problem of type 3 redirection spams back to nature of javascript. by using varioty techinques spammers can hide their redirection page. (e.g. one can encrpyt redirection script).

Suggestion

As paper suggestion, there are other category of redirection. such as server side redirection. so future study in thies area would be intrested.

server side