Archive for the 'Search' Category

ApacheCon 2007 Europe Talks

I have received official word from ApacheCon that 2 of my proposals have been accepted. I will be giving the “Advanced Lucene” talk on Wednesday, May 2nd, 2007. This talk will focus on advanced querying capabilities, term vectors and Lucene performance. I will also be giving a full day tutorial on Lucene Java on May [...]

IBM OmniFind Yahoo! Edition – Simple Search Just Got Easier

IBM OmniFind Yahoo! Edition – Simple Search Just Got Easier FYI: Uses Lucene under the hood

Query Parser Badly Broken

Interesting discussion on the Lucene User list about the Query Parser being badly broken.  One alternative is available here.  However, as anyone who follows the QP will tell you, there is no perfect solution available for the wide range of Query types that Lucene supports.  I think most users of more advanced apps will say [...]

Ferret (a.k.a Ruby Lucene)

Interesting interview with the creator of Ferret at http://on-ruby.blogspot.com/2006/10/ruby-hacker-interview-dave-balmain.html Talks about some of the performance changes he has made in Ferret C version to make it run a lot faster than Java Lucene.  He says he doubts they can be ported to Java, but I wonder if the Java version might still benefit.

Lucene Benchmarking

There is some effort under way to implement a standard benchmarking contribution for Lucene. It is chronichled at http://issues.apache.org/jira/browse/LUCENE-675. The goal is to provide a way for developers to see whether changes they are making are worthwhile. By running the benchmarks before and after applying a patch, it should become obvious whether the patch adversely [...]

Scoring Documentation

From my java-dev mailing list post this morning: Steve Rowe and I have added scoring.xml (with some contributions from Karl Wettin, Chris Hostetter and others) to the xdocs directory (and scoring.html to the docs directory). Our goals in writing this document were: 1. To better understand scoring 2. To document how scoring works for the [...]