In case you haven’t signed up already, Lucene Revolution is returning to Boston in May of this year, albeit at a different venue. You can learn more at www.lucenerevolution.org. I was just reviewing the submitted talks and looks to be another good conference.
March 26th, 2012 | Posted in Lucene, Lucid Imagination | No Comments
I’ll be doing a live panel interview today with DM Radio on The Art of Harnessing Unwieldy Data Big & Small. Click the link to register. Looks to be an interesting discussion on dealing with unstructured content.
March 22nd, 2012 | Posted in Lucene, Lucid Imagination, Mahout | No Comments
I’m looking for a Research Engineer with Hadoop and Solr experience to work on next generation search and big data problems. If you are interested or know someone who is, please take a look at Careers – Research Engineer | Lucid Imagination.
February 6th, 2012 | Posted in Lucene | No Comments
In case you haven’t heard, and are in Europe this June (or want to be), you should check out the Berlin Buzzwords conference. It’s a great conference for all things related to Lucene, Solr, Hadoop, Mahout, NoSQL and generally scaling. The CFP is open now through March 11.
January 18th, 2012 | Posted in Lucene, Mahout, Solr | No Comments

Drew, Tom and I are feverishly working away on finishing up Taming Text. We are currently in the process of addressing the feedback we got from our final review and should have updates up soon. I have also posted all of the book’s source code up on Github under the Taming Text user. The source includes, amongst other things, a simple Question Answering system using Solr and OpenNLP, as well as analyzers for Lucene that use OpenNLP for sentence detection, part of speech tagging and Named Entity Recognition. As with most books, these examples are meant to be just that, examples.
December 27th, 2011 | Posted in Lucene, OpenNLP, Solr, Taming Text | No Comments
I’ve posted my review of “Mahout in Action” on Lucid’s website: Mahout in Action Review.
October 15th, 2011 | Posted in Mahout | No Comments
For those who have wanted other scoring models in Lucene/Solr (Okapi, others) more details can be found on Lucid’s blog: Lucid Imagination » Flexible ranking in Lucene 4.
September 12th, 2011 | Posted in Lucene | No Comments
Just ordered “R in Action” from Manning. Looking forward to learning more about it, as it comes up often when discussing solving smaller problems that what is appropriate for Apache Mahout. Hopefully, I will have time to post a review in the coming weeks.
September 2nd, 2011 | Posted in Lucene | 1 Comment
Triangle Hadoop Users Group, Next Meeting: Sept. 13 @ Bronto Software.
Ted Dunning of Mahout fame will be speaking at the next TriHUG meeting on MapR and it’s relationship with Hadoop, etc.
August 28th, 2011 | Posted in TriHUG | No Comments