Why We Need Atom Now - The RSS Blog
Blogging about Really Simple Syndication, RDF, FOAF, The Semantic Web and Social Software.
Copyright 2003-5 Randy Charles Morin
Mon, 04 Apr 2005 13:44:24 GMT
Why We Need Atom Now

Tim Bray: We’re getting real, real close to sending the Atom data-format draft off for general IETF review. [cut] Via Technorati and PubSub, I subscribe to a bunch of synthetic feeds based on various keywords searches and URL linkages. They are infested with duplicates.

Randy: I don't see how Atom is gonna solve the Technorati duplicate problem. What Tim seemingly doesn't realize is that Technorati scrapes more than just the RSS feed; it also scrapes the HTML. Most of the hits I get from Technorati are not present in the RSS, but rather in the HTML (blogrolls, sidebar lists, etc.), and the duplicates, from what I see, come from this HTML scraping, not the RSS.
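To make the feed-versus-HTML distinction concrete, here is a rough Python sketch that compares the links found in a feed with the links found on the rendered page. The sample feed, the sample page, and the helper names (LinkCollector, links_in_html, links_in_rss) are all made up for illustration; this says nothing about how Technorati's crawler is actually built.

```python
from html.parser import HTMLParser
import xml.etree.ElementTree as ET

# Hypothetical sample data: a one-item RSS feed and the HTML page that carries it.
RSS = """<rss version="2.0"><channel>
<title>Example Blog</title>
<link>http://example.com/</link>
<description>Sample feed</description>
<item>
<title>A post</title>
<link>http://example.com/post</link>
<description>&lt;a href="http://example.org/cited"&gt;cited&lt;/a&gt;</description>
</item>
</channel></rss>"""

HTML = """<html><body>
<div class="post"><a href="http://example.org/cited">cited</a></div>
<ul class="blogroll">
<li><a href="http://example.net/friend">friend</a></li>
</ul>
</body></html>"""


class LinkCollector(HTMLParser):
    """Collects the href of every <a> tag it sees."""

    def __init__(self):
        super().__init__()
        self.links = set()

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.links.add(href)


def links_in_html(markup):
    """Return the set of outbound links in a chunk of HTML."""
    collector = LinkCollector()
    collector.feed(markup)
    collector.close()
    return collector.links


def links_in_rss(rss_text):
    """Return the links embedded in the feed's <description> elements."""
    root = ET.fromstring(rss_text)
    found = set()
    for description in root.iter("description"):
        found |= links_in_html(description.text or "")
    return found


feed_links = links_in_rss(RSS)
page_links = links_in_html(HTML)

# Links a page scraper would see but a feed reader never would.
html_only = page_links - feed_links
print(sorted(html_only))
```

In this toy case the blogroll link shows up only in the HTML set, which is the kind of hit that a feed format, Atom or otherwise, has no way to describe.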

Reader Comments

You can't capture links in blogrolls and templates from the RSS.


Agreed. I'm exclusively using Bloglines and Feedster now.

