Archive for the 'Lucene' Category

Lucene Indexing Performance: Managing RAM while Indexing follow up

Nice discussion on tuning the new RAM based indexing in Lucene available here.  And I thought Lucene was already fast…  Beware, though, this fix isn’t officially released, so you will need to use the trunk version.

ONJava.com — Using the Lucene Query Parser Without Lucene

ONJava.com — Using the Lucene Query Parser Without Lucene
Interesting use of the Lucene QueryParser.

Lucene 2.2 Available

Lucene 2.2.0 is available.  Lots of goodness, especially performance and documentation wise, make sure you read the CHANGES.txt

Lucene Documentation Promotion!

I just posted this to java-user@lucene.a.o and thought I would share here as well:
Calling all Lucene Users!
You know you love Lucene for a whole variety of reasons (fast, friendly, fun, did I say fast?) so how about showing a little love back? 
We (as in the committers and contributors) are trying out a new [...]

Part 2 of IBM developerWorks article on Solr

Part 2 of my 2 part series on Apache Solr is now up on IBM developerWorks. You can read it here. This article covers some of the things that makes Solr great for the enterprise, like caching, replication and easy administration.

Solr Article on IBM developerWorks

The first of my two part series on Solr appeared yesterday on IBM’s developerWorks, titled “Search Smarter with Apache Solr, Part 1: Essential Features and the Solr Schema“. The first article covers getting started with Solr and contains a simple application demonstrating some of the features of Solr, like XML based indexing and [...]

Advance Lucene slides from ApacheCon Europe 2007

The latest version of my slides for “Advanced Lucene” are located at http://www.cnlp.org/presentations/present.asp?show=conference
Talk covered term vectors, using various query types and Lucene performance tips and tricks.

Atlassian and Lucene

Nice presentation on Atlassian’s use of Lucene at http://blogs.atlassian.com/rebelutionary/archives/2007/04/my_serverside_java_symposium_2007_presen.html

ApacheCon Europe “Advanced Lucene” slides

My (slightly old) slides for ApacheCon Europe are now available in the conference proceedings available at http://eu.apachecon.com/downloads/materials.zip
I will post the latest version soon, but there is very little difference between this version and the latest.
Topics covered include Lucene performance, term vectors and query tips and tricks.
Feedback is always welcome

Lucene Indexing Performance: Managing RAM while Indexing

https://issues.apache.org/jira/browse/LUCENE-843
This patch, by Michael McCandless pretty much sums up what I love about Lucene and what makes Lucene an extraordinary open source project.
Take Lucene, which already has a pretty strong reputation as being fast, and add in a motivated committer (which Lucene has a high number of, IMO) and out comes a patch that, at [...]