java left logo
java middle logo
java right logo
 

Home arrow Java Libraries
 
 
Main Menu
Home
Java Tutorials
Book Reviews
Java SE Tips
Java ME Tips
Java EE Tips
Other API Tips
Java Applications
Java Libraries
Java Games
Java Network
Java Forums
Java Blog




Most Visited Tips
Java SE Tips
Java ME Tips
Java EE Tips
Other API Tips
Java Applications
Java Libraries
Java Games
Book Reviews
Top Rated Tips
Java SE Tips
Java ME Tips
Java EE Tips
Other API Tips
Java Applications
Java Libraries
Java Games
Book Reviews


Statistics
Registered Users: 3936
Java SE Tips: 614
Java ME Tips: 202
Java EE Tips: 183
Other API Tips: 779
Java Applications: 298
Java Libraries: 209
Java Games: 16
Book Reviews:
 
 
 
HotSAX E-mail
User Rating: / 20
PoorBest 

HotSAX is a small fast SAX2 parser for HTML, XHTML and XML.

SAX is 'Simple API for XML' which was developed by David Meggison and others on the xml-dev mailing list. (http://www.meggison.com/SAX). SAX parsers parse XML by generating events for start tags, text, and end tags which trigger event handlers in your code. They are meant to be faster and use less memory than an equivalent DOM parser. SAX2 adds lexical handling extensions like comments and CDATA.blocks.

Until now, you needed at least well-formed XML as input to a SAX parser. With the introduction of HotSAX, you can parse HTML (even badly formed HTML,) and still generate SAX events.

Why would you want to do this? This tool is designed to help build other useful things like link spiders, page scrapers, HTML to other format converters and scripted web browsers. A quick example would be a simple text only browser like 'lynx'.

You can embed HotSAX in larger projects like a headline grabber for a content management system. Similar to what My Yahoo does when it displays the top stories from CNET, NY Times etc. See the README and the FAQ files for more information.

URL: http://hotsax.sourceforge.net/
Licence: LGPL


 Related Tips

 
Next >

Page 1 of 0 ( 0 comments )

You can share your information about this topic using the form below!

Please do not post your questions with this form! Thanks.


Name (required)


E-Mail (required)

Your email will not be displayed on the site - only to our administrator
Homepage(optional)



Comment Enable HTML code : Yes No



 
       
         
     
 
 
 
   
 
 
java bottom left
java bottom middle
java bottom right
RSS 0.91 FeedRSS 1.0 FeedRSS 2.0 FeedATOM FeedOPML Feed

Home - About Us - Privacy Policy
Copyright 2005 - 2008 www.java-tips.org
Java is a trademark of Sun Microsystems, Inc.