Archive for the 'Tika' Category

Tika and Solr

As I mentioned in Congrats to Tika and Welcome to the Lucene Stack, I’ve been working on adding Tika support to Solr. Well, I finally committed it today, with special thanks to Chris Harris and Eric Pugh for helping see it through with me. What does this mean? It is now possible to send [...]

Congrats to Tika and Welcome to the Lucene Stack!

Congratulations to Apache Tika (never mind the incubator address, it’s still in the process of migrating) for graduating from Incubation! And welcome to the Lucene project! Tika is a content extraction framework that wraps many other content extraction libraries such as PDFBox, POI, and others into a single, easy-to-use framework that makes it easy [...]

FeatherCast » Blog Archive » Episode 43: Lucene

I did a FeatherCast today with Rich Bowen. Dang, he is quick at editing…