Archive for the 'Solr' Category

What I’ve been up to lately: Lucid Imagination

Lucid Imagination
Well, the cat is out of the bag. In case you haven’t heard, a few Lucene/Solr/Mahout committers (Erik Hatcher and Yonik Seeley) and I have teamed up with some other long-time search veterans (Marc Krellenstein from Northern Light and former CTO of Reed Elsevier, amongst others) to build a company around providing product, [...]

Manning: Taming Text

Scary…  I guess it is real!

How Rackspace Now Uses MapReduce and Hadoop to Query Terabytes of Data | High Scalability

Nice article on how the Lucene/Hadoop/Solr stack was used to solve a really big problem.  Someday, I hope (when we have actual code),  they can add Mahout to the equation and do even more interesting things with the data.

Coderspiel / The right tool for the slob

This guy’s comment system wasn’t working at the moment, so I will leave my comment here. This won’t make much sense without reading the post first:
It’s funny you mention Wikipedia as an example, since they are running Lucene. As are Technorati and the Internet Archive. [...]

ApacheCon EU 2008

The schedule is out for ApacheCon Europe. I will be doing my Lucene Boot Camp training and a Lucene Performance talk. Erik Hatcher will also be doing a Solr Boot Camp and a Lucene/Solr talk. There will also be some Hadoop talks.

Triangle Java Users Group talk on Lucene and Solr

Welcome to the Triangle Java Users Group
I will be speaking on November 19, 2007, at the Triangle Java Users Group on Lucene and Solr. The talk will introduce the features and capabilities of both Lucene and Solr and offer a basic comparison of the two.

Lucene and Solr at ApacheCon

Looks like they have put up the ApacheCon Atlanta schedule. As usual, there look to be several very good talks covering Lucene and Solr, including talks by Chris Hostetter, Ken Krugler, Michael Busch and yours truly. My talk is at 3pm on November 16; details are here.
I will also be leading my “Lucene [...]

Part 2 of IBM developerWorks article on Solr

Part 2 of my two-part series on Apache Solr is now up on IBM developerWorks. You can read it here. This article covers some of the things that make Solr great for the enterprise, like caching, replication and easy administration.

Solr Article on IBM developerWorks

The first of my two-part series on Solr appeared yesterday on IBM’s developerWorks, titled “Search Smarter with Apache Solr, Part 1: Essential Features and the Solr Schema”. The first article covers getting started with Solr and contains a simple application demonstrating some of the features of Solr, like XML-based indexing and [...]