Part 2 of IBM developerWorks article on Solr
Part 2 of my 2 part series on Apache Solr is now up on IBM developerWorks. You can read it here. This article covers some of the things that makes Solr great for the enterprise, like caching, replication and easy administration.

It’s quite good, I was looking for some articles for Solr when I found it.
it involkes Solr and describes it as so easy to use. it’s pretty useful for me just now, and it must also feed others well who have the interests of Solr.
In the case of Solr handling structured documents, It would be even better that you can recommend some libraries to deal with XML and HTML that’re used to convert some kinds of resource to the format transported to Solr interfaces. If you have any recommandation, please tell me if possible.
Thanks.
Thanks.
I would check out NekoHTML or (J)Tidy for HTML. Solr also comes with some HTML helpers.
As for XML, you can use either SAX or a pull parser. Java 1.5 comes with SAX, but you can also use Xerces. For a pull parser, have a look at XPP.
If you want all kinds of document extraction, take a look at Aperture (http://aperture.sourceforge.net)