6 Replies - 1363 Views - Last Post: 06 June 2010 - 02:21 PM

#1 Ghlavac  Icon User is offline

  • D.I.C Addict

Reputation: 83
  • View blog
  • Posts: 505
  • Joined: 14-January 09

Good HTML Parser

Posted 01 June 2010 - 05:28 PM

I'm in the process of developing a utility that will need to be able to read from html pages and RSS feeds, both of which require parsing HTML and I've yet to be able to locate a convenient parser.

Could anyone point me towards a useful library for this? Or a page that illustrates how to do it with built in java libraries?
Is This A Good Question/Topic? 0
  • +

Replies To: Good HTML Parser

#2 m-e-g-a-z  Icon User is offline

  • Winning
  • member icon


Reputation: 495
  • View blog
  • Posts: 1,451
  • Joined: 19-October 09

Re: Good HTML Parser

Posted 01 June 2010 - 05:32 PM

You may want to have a look at this Link. Also, you can check out the snippets in my contributions, as I have based some of my snippets on this HTML Parser.
Was This Post Helpful? 1
  • +
  • -

#3 Ghlavac  Icon User is offline

  • D.I.C Addict

Reputation: 83
  • View blog
  • Posts: 505
  • Joined: 14-January 09

Re: Good HTML Parser

Posted 01 June 2010 - 05:34 PM

How well does it work, considering its last update is 2006/09/23?
Was This Post Helpful? 0
  • +
  • -

#4 m-e-g-a-z  Icon User is offline

  • Winning
  • member icon


Reputation: 495
  • View blog
  • Posts: 1,451
  • Joined: 19-October 09

Re: Good HTML Parser

Posted 01 June 2010 - 05:44 PM

Well, they have an API like any other third party which i used if you want to check it out.

Depends on what you want to do, the good thing about a HTML Parser is that its good for complex operations which you may/may not be doing.


I used it to create a mini web crawler.

If its just grabbing source code your looking for, you can use the java.net class.

This post has been edited by m-e-g-a-z: 01 June 2010 - 05:49 PM

Was This Post Helpful? 1
  • +
  • -

#5 Ghlavac  Icon User is offline

  • D.I.C Addict

Reputation: 83
  • View blog
  • Posts: 505
  • Joined: 14-January 09

Re: Good HTML Parser

Posted 01 June 2010 - 05:55 PM

After looking at the parser class, this will suit me just fine.


Thanks very much. :)


Oh and saw your sample, lets me understand it even moreso.

This post has been edited by Ghlavac: 01 June 2010 - 06:06 PM

Was This Post Helpful? 0
  • +
  • -

#6 Programmist  Icon User is offline

  • CTO
  • member icon

Reputation: 249
  • View blog
  • Posts: 1,828
  • Joined: 02-January 06

Re: Good HTML Parser

Posted 03 June 2010 - 12:14 PM

I've used several of these.

http://java-source.n...ce/html-parsers
Was This Post Helpful? 0
  • +
  • -

#7 Guest_Mohamed Shahpoup*


Reputation:

Re: Good HTML Parser

Posted 06 June 2010 - 02:21 PM

1- you can also use JEditorpane Class in Java API .
2- you can use JDIC Lib developed by Java Community.

i think this help you.

Was This Post Helpful? 0

Page 1 of 1