Lately I’ve written about HTML parsing with JSoup. It just happened that I came across a problem for pretty printing or it is better to call it prettier printing of HTML documents with JSoup in another project and I thought this would be another good thing to share it with you through an article.
In this article I’ll do a final comparison of the two libraries — however do not expect anything professional. It will be just a simple run-and-measure-time analysis of the two libraries on my dataset — and on two machines.
Last time I was writing about XMLBeam, a new tool for parsing XML documents based on XPath. At the end of the article I mentioned that I’ll write about parsing an HTML website with XMLBeam and JSoup to compare them which one is better to use.
This article is the first part which is introducing the task and covering the XMLBeam implementation. The next article will tell more about JSoup and a comparison between the two tools.