Does Atom Validate? - RSS
Really Simple Syndication
 
Copyright 2003-4 Randy Charles Morin
RSS
<< Previous Main Next >>
Fri, 05 Nov 2004 04:55:26 GMT
Does Atom Validate?

Recently, I wrote the schematron schema to validate Atom version 0.3. I wrote the rules to conform as closely as I could to the actual Atom 0.3 specification. I finished the schematron earlier this week and immediate set out to find how valid the existing Atom feeds are.

Blog Really  Simple Validation FeedValidator
Intertwingly (Sam Ruby)

Invalid

Invalid

evhead (Evan Williams)

Invalid

Invalid

Six Apart

Valid

 Valid

The Real Geek on Blogger (me)

Valid

Valid

.Conform (Philippe Janvier)

Valid

Valid

.Conform Blogmarks

Invalid

Valid

Salad w/ Steve (Steve Jenson)

Valid

Valid

Atom Enabled

Invalid

Invalid

That should be enough to prove my point. About half of the Atom feeds are invalid. I should also point out that the FeedValidator was incorrect more often than not, pointing out validation issues that didn't exist in the spec and missing other validation issues present in the spec. Why is it so difficult to create a valid Atom feed? The problem is that Atom is simply too complex. Examples of this complexity follow:

Update: Here's another example of why Atom is complex. This fragment is from Kevin Mark's feed, blogger extraordinaire at Technorati.

<info mode="xml" type="text/html">
   <
div xmlns="http://www.w3.org/1999/xhtml">...</div>
</
info>

I added a new test case to my validator to flag this.

Reader Comments Subscribe

Perharps would it be better to conduct such a poll on a larger sample, though ;)

--philippe
It would. Maybe I'll put something together.

Randy

Your feed and Six Apart are valid, they just get warnings in the feed validator. That's different.
Agreed, my fault.

Randy

Type "339":
Top Articles
  1. Unblock MySpace
  2. MySpace
  3. FaceParty, the British MySpace
  4. del.icio.us and sex.com
  5. Blocking Facebook and MySpace
  1. Review of RSS Readers
  2. MySpace Layouts
  3. RSS Stock Ticker
  4. RSS Gets an Enema
  5. Google Reader rejects del.icio.us