Hey,
I'm doing a program in Java that takes data from various XML sources and mashes it together using RDF descriptions. In this particular stage I've been grabbing data from Audioscrobbler (last.fm) web services (
http://www.audioscrobbler.net/data/webservices/), but some of it is in RSS form. I tried using the DOM parser to read the RSS feeds, but I couldn't get it to give me anything other than a big pile of nulls. I eventually found RSS Utilities on the Sun website, which kind of works, but there are some non-standard elements that it seems to ignore. For example:
CODE
<item>
<title>An Evening with Dream Theater on 28 Jan 2008</title>
<description>Location: They are south of Greenhill Road and west of Goodwood Road in the suburb of Wayville., Adelaide, Australia
</description>
<link>http://www.last.fm/event/416069</link>
<guid>http://www.last.fm/event/416069</guid>
<pubDate>Wed, 14 Nov 2007 12:15:15 +0000</pubDate>
<xcal:dtstart>2008-01-28T00:00:00Z</xcal:dtstart>
<xcal:dtend>2008-01-28T23:59:59Z</xcal:dtend>
<xcal:location>http://www.last.fm/venue/8780432</xcal:location>
</item>
This is an item for an event, and I want to get the start and end dates out of it as well. However, the Item class that comes with the parser only lets me get the standard stuff like title and pubDate - I've messed around with the data I can get out of it, but as far as I can tell the dtstart, dtend and location data isn't even in there anywhere.
Anyone have any advice as to how I could do this? Is there any trick to just getting the DOM parser to read the RSS file? Or is there a better parser out there?
Thanks,
-Joe