Archive for the 'Indexing' Category

Lucene and Solr at ApacheCon

Looks like they have put up the ApacheCon Atlanta schedule. As usual, there looks to be several very good talks covering Lucene and Solr, including talks by Chris Hostetter, Ken Krugler, Michael Busch and yours truly. My talk is at 3pm on November 16, details are here.
I will also be leading my “Lucene [...]

ImproveIndexingSpeed - Lucene-java Wiki

ImproveIndexingSpeed - Lucene-java Wiki
People might find the indexing speed tips here useful

Lucene Indexing Performance: Managing RAM while Indexing follow up

Nice discussion on tuning the new RAM based indexing in Lucene available here.  And I thought Lucene was already fast…  Beware, though, this fix isn’t officially released, so you will need to use the trunk version.

Lucene Documentation Promotion!

I just posted this to java-user@lucene.a.o and thought I would share here as well:
Calling all Lucene Users!
You know you love Lucene for a whole variety of reasons (fast, friendly, fun, did I say fast?) so how about showing a little love back? 
We (as in the committers and contributors) are trying out a new [...]

Solr Article on IBM developerWorks

The first of my two part series on Solr appeared yesterday on IBM’s developerWorks, titled “Search Smarter with Apache Solr, Part 1: Essential Features and the Solr Schema“. The first article covers getting started with Solr and contains a simple application demonstrating some of the features of Solr, like XML based indexing and [...]

Advance Lucene slides from ApacheCon Europe 2007

The latest version of my slides for “Advanced Lucene” are located at http://www.cnlp.org/presentations/present.asp?show=conference
Talk covered term vectors, using various query types and Lucene performance tips and tricks.

Atlassian and Lucene

Nice presentation on Atlassian’s use of Lucene at http://blogs.atlassian.com/rebelutionary/archives/2007/04/my_serverside_java_symposium_2007_presen.html

ApacheCon Europe “Advanced Lucene” slides

My (slightly old) slides for ApacheCon Europe are now available in the conference proceedings available at http://eu.apachecon.com/downloads/materials.zip
I will post the latest version soon, but there is very little difference between this version and the latest.
Topics covered include Lucene performance, term vectors and query tips and tricks.
Feedback is always welcome

Lucene Indexing Performance: Managing RAM while Indexing

https://issues.apache.org/jira/browse/LUCENE-843
This patch, by Michael McCandless pretty much sums up what I love about Lucene and what makes Lucene an extraordinary open source project.
Take Lucene, which already has a pretty strong reputation as being fast, and add in a motivated committer (which Lucene has a high number of, IMO) and out comes a patch that, at [...]

More ApacheCon Info

As I posted earlier, I will be giving a talk and a tutorial at ApacheCon Europe this year on the Apache Lucene Java project. My talk is titled “Advance Lucene”. Here is the abstract:
Lucene Java is a high performance, scalable, cross-platform search engine that contains many advanced features that often are under utilized [...]