Spam blogs, or splogs, are blogs featuring plagiarized or auto-generated content. They create link farms to promote affiliates, and are motivated by the profitability of hosting ads. Splogs infiltrate the blogosphere at ping servers, systems that aggregate blog update pings. Over the past year, our work has focused on detecting and eliminating splogs. As techniques used by spammers have evolved, we have learned how splog signatures are tied to tools that create them, that they are beginning to be a problem across languages, and that they require a much quicker assessment. Though we continue to address these specific challenges, we discuss our larger goal in this work, of developing a scalable meta-ping filter that detects and eliminates update pings from splogs. This will considerably reduce computational requirements and manual efforts at downstream services (search engines) and involve the community in detecting spam blogs.
I like to review this paper in a question & answer manner. This paper answer to some questions as below:
Why spammer use blogs?
- Blog are more relevance in web search engines
- Ping servers quickly notify blog new data to search engines
- Hosting blogs available for free
How search engines filter Splog contents?
With two methods: Pre-Indexing and Post-Indexing
What is the proposed method for Splog detection in this paper?
Authors provide new system called Meta-Ping server, in pre-indexing time. This system provide search engins a blacklist of blogs.
What are the approaches for making meta-ping server?
Four filtering method will be used on Meta-Ping server:
- URL based filtering
- Blacklist based filtering
- Blog home-page based filtering,
- Feed based filtering
Cite this article as
Critical Review on “Towards Spam Detection at Ping Servers” by P.Hayati, 2nd Mar, 2008. Available Online – https://pi3ch.wordpress.com/2008/03/02/review-towards-spam-detection-at-ping-servers/