Archive for the 'Java' Category

Part 2 of IBM developerWorks article on Solr

Part 2 of my 2 part series on Apache Solr is now up on IBM developerWorks. You can read it here. This article covers some of the things that makes Solr great for the enterprise, like caching, replication and easy administration.

Solr Article on IBM developerWorks

The first of my two part series on Solr appeared yesterday on IBM’s developerWorks, titled “Search Smarter with Apache Solr, Part 1: Essential Features and the Solr Schema“. The first article covers getting started with Solr and contains a simple application demonstrating some of the features of Solr, like XML based indexing and [...]

Advance Lucene slides from ApacheCon Europe 2007

The latest version of my slides for “Advanced Lucene” are located at http://www.cnlp.org/presentations/present.asp?show=conference
Talk covered term vectors, using various query types and Lucene performance tips and tricks.

Atlassian and Lucene

Nice presentation on Atlassian’s use of Lucene at http://blogs.atlassian.com/rebelutionary/archives/2007/04/my_serverside_java_symposium_2007_presen.html

ApacheCon Europe “Advanced Lucene” slides

My (slightly old) slides for ApacheCon Europe are now available in the conference proceedings available at http://eu.apachecon.com/downloads/materials.zip
I will post the latest version soon, but there is very little difference between this version and the latest.
Topics covered include Lucene performance, term vectors and query tips and tricks.
Feedback is always welcome

Lucene Indexing Performance: Managing RAM while Indexing

https://issues.apache.org/jira/browse/LUCENE-843
This patch, by Michael McCandless pretty much sums up what I love about Lucene and what makes Lucene an extraordinary open source project.
Take Lucene, which already has a pretty strong reputation as being fast, and add in a motivated committer (which Lucene has a high number of, IMO) and out comes a patch that, at [...]

Payloads

Michael Busch recently committed some code that enables Lucene to store payloads at the term level (see https://issues.apache.org/jira/browse/LUCENE-755) and I have started working on enabling these payloads to be incorporated into search and scoring. (see http://wiki.apache.org/lucene-java/Payload_Planning and https://issues.apache.org/jira/browse/LUCENE-834)
So, you might be asking yourself, what exactly are payloads good for?  Naturally, the answer is a lot!  [...]

Lucene 2.1.0 available

Version 2.1.0 of Apache Lucene is now available.  It includes many new performance enhancements and features.  Users should note that it changes the Lucene index file format, so previous versions of Lucene will not be able to read a 2.1.0 version of Lucene.  Lucene 2.1.0 will be able to read older versions, however.
Lucene, new release, [...]

More ApacheCon Info

As I posted earlier, I will be giving a talk and a tutorial at ApacheCon Europe this year on the Apache Lucene Java project. My talk is titled “Advance Lucene”. Here is the abstract:
Lucene Java is a high performance, scalable, cross-platform search engine that contains many advanced features that often are under utilized [...]

ApacheCon 2007 Europe Talks

I have received official word from ApacheCon that 2 of my proposals have been accepted. I will be giving the “Advanced Lucene” talk on Wednesday, May 2nd, 2007. This talk will focus on advanced querying capabilities, term vectors and Lucene performance. I will also be giving a full day tutorial on Lucene [...]