Archive for the 'Java' Category

Lucid Imagination

Some of you may have noted that I’ve been quieter than usual lately.  Well, the reason is I was preparing for the launch of the new company I helped found: Lucid Imagination.   Now, I don’t blog too often about what I do for work on this site, other than it is Lucene, Solr and [...]

Congrats to Tika and Welcome to the Lucene Stack!

Congratulations to Apache Tika (nevermind the incubator address, it’s still in the process of migrating) for graduating from Incubation!   And welcome to the Lucene project!  Tika is a content extraction framework that wraps many other content extraction libraries such as PDFBox, POI, and others into a single, easy to use framework that makes it easy [...]

“What’s new with Apache Solr” now available at IBM developerWorks

What’s new with Apache Solr. My latest article on Apache Solr, title “What’s New with Apache Solr” is now available over at IBM developerWorks.  It covers some of the new features like spell checking, Data Import Handler, distributed search, editorial results placement (a.k.a. “paid placement”), SolrJ and a variety of other pieces. Hope it is [...]

Charlotte JUG » October Slides Available – Search & Analysis

Charlotte JUG » October Slides Available – Search & Analysis Had a lot of fun at my recent talk at the Charlotte JUG.  They’ve got a good core of people and there was a lot of good discussion about the topic. Even managed to give away some free eBooks of “Taming Text“.  Wish I would [...]

Lucene Boot Camp at ApacheCon US 2008

Just a quick reminder that there is just over one week left before Lucene Boot Camp at this year’s ApacheCon. This year, it is a 2 day training, but for those who want to, they can sign up for the first day of Lucene Boot Camp, and then attend Solr Boot Camp on the second [...]

Some New Features in Solr

I’ve had a chance recently to work on some things in Solr that I think that can, in the right circumstances, really enhance Solr. First off, is SOLR-651, which implements what I am calling a Term Vector Component. The basic gist of it is that Solr can now serve up term vectors from Lucene.  For [...]

Lucene 2.4.0 available

Welcome to Lucene! Boy, I must be slipping, but Lucene 2.4.0 is open.  See the link for more details.

Lucene Boot Camp at ApacheCon US

Lucene Boot Camp (ApacheCon site) Lucene Boot Camp (http://www.lucenebootcamp.com) is scheduled this year for ApacheCon US on November 3 and 4th in New Orleans.  This year, I am doing a two day event, as I felt the one day event was just not enough time to get in all the goodness that is Lucene (not [...]

BarCamp wiki / BarCampRDU

BarCamp wiki / BarCampRDU I’ll be at BarCampRDU tomorrow.  I proposed two sessions, one on Hadoop and Mahout and one on Lucene and Solr.  I don’t think I really want to do both, but I would like to do at least one, so we’ll see what other people are interested in. If you’re around and [...]

HP, Intel and Yahoo To Research Cloud Computing – Yahoo News

HP, Intel and Yahoo To Research Cloud Computing – Yahoo News Boy, this could really come in handy in Open Source, especially projects like Mahout, Nutch and distributed Solr.  I find my biggest personal challenge on Mahout is access to computing resources.  I personally don’t have the financial backing to buy much time on Amazon [...]