New tool for parsing HTML
Submitted by philip wang on Tue, 09/02/2008 - 08:35
There is a tool called htmlparser. It can parse HTML, but like other parser tools, it can't parse JavaScript.
Just get its jar and put the jar file into your project, and it will work.
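To illustrate the JavaScript limitation, here is a minimal sketch. Note that it does not use htmlparser itself (that would need its jar on the classpath); as a stand-in it uses the HTML parser that ships with the JDK in javax.swing.text.html. The class name LinkExtractor and the sample page are made up for this example. The parser reports the link written as static HTML, but never the link that the script would add at run time, because nothing executes the script.

```java
import java.io.IOException;
import java.io.StringReader;
import java.io.UncheckedIOException;
import java.util.ArrayList;
import java.util.List;
import javax.swing.text.MutableAttributeSet;
import javax.swing.text.html.HTML;
import javax.swing.text.html.HTMLEditorKit;
import javax.swing.text.html.parser.ParserDelegator;

// Hypothetical helper: collects href values of <a> tags found in static HTML.
public class LinkExtractor {

    public static List<String> extractLinks(String html) {
        final List<String> links = new ArrayList<>();

        // The callback is invoked once for every start tag the parser sees.
        HTMLEditorKit.ParserCallback callback = new HTMLEditorKit.ParserCallback() {
            @Override
            public void handleStartTag(HTML.Tag tag, MutableAttributeSet attrs, int pos) {
                if (tag == HTML.Tag.A) {
                    Object href = attrs.getAttribute(HTML.Attribute.HREF);
                    if (href != null) {
                        links.add(href.toString());
                    }
                }
            }
        };

        try {
            // The third argument tells the parser to ignore any charset declaration.
            new ParserDelegator().parse(new StringReader(html), callback, true);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return links;
    }

    public static void main(String[] args) {
        // Made-up sample page: one static link, one link created only by script.
        String page =
            "<html><body>"
            + "<a href=\"page1.html\">static link</a>"
            + "<script>"
            + "var l = document.createElement('a');"
            + "l.href = '/dynamic';"
            + "document.body.appendChild(l);"
            + "</script>"
            + "</body></html>";

        // Only the static link is reported; the script is never run.
        System.out.println(extractLinks(page));
    }
}
```

A spider built on a parser like this only follows links that are present in the page source. Pages that build their links with JavaScript need a different approach, such as a tool that actually runs the script.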
I am studying "Programming Spiders, Bots, and Aggregators in Java". It is a very good book, and its examples are good too. It is a pity that I found it so late: I only came across it on the internet two weeks ago. In 10 days I have studied 7 chapters. It is helpful for my job, so thanks!
Beijing, China
Philip Wang

Thanks for the info
I will look at the HTML Parser. It could be useful to me for future versions of the book and code if it has features that the existing HTML parser lacks.
By the way, the link is here:
http://htmlparser.sourceforge.net/
Jeff