Archive for the 'Indexing' Category
Interesting comparison of open source search engines available at http://wrg.upf.edu/WRG/dctos/Middleton-Baeza.pdf. While it reflects OK on Lucene (hey, we can’t be perfect at everything,) I am interested in finding out more details about what settings were used for indexing. If they just used the out of the box settings, then I would argue that they need [...]
December 4th, 2007 | Posted in Indexing, Java, Lucene, Performance, Search | 2 Comments
I have setup a new site to support my Lucene Boot Camp training. Check it out at http://lucenebootcamp.com. From there, you can download training setup information, read the class outline, etc.
November 6th, 2007 | Posted in ApacheCon, Indexing, Java, Lucene, Search | No Comments
Lots of good things happening in Lucene land lately, all of which should benefit users with faster indexing and searching capabilities. Most notably, Lucene 2.3 (hopefully released this quarter) has some major changes in indexing memory management and performance. I have personally clocked indexing using release 2.2 at about 400 rec/s (single threaded, Mac Pro [...]
November 2nd, 2007 | Posted in Indexing, Java, Lucene, Performance, Search, term vectors | No Comments
Just a friendly reminder, I am giving my Lucene Boot Camp training at ApacheCon Atlanta this year (November.) Still plenty of time to sign up. Details on the class are here. Also, feel free to email me with any questions or things you would like to see. My apache.org email is gsingers. I will be [...]
October 3rd, 2007 | Posted in ApacheCon, Indexing, Java, Lucene, Search | No Comments
Welcome to the Triangle Java Users Group I will be speaking November 19, 2007 at the Triangle Java Users Group on Lucene and Solr. The talk will be an introduction to the features and capabilities of both Lucene and Solr, as well as some basic compare and contrast information.
September 7th, 2007 | Posted in Cary, Chapel Hill, Durham, Indexing, Java, Lucene, North Carolina, payloads, Performance, Raleigh, Search, Solr, Triangle | No Comments
Looks like they have put up the ApacheCon Atlanta schedule. As usual, there looks to be several very good talks covering Lucene and Solr, including talks by Chris Hostetter, Ken Krugler, Michael Busch and yours truly. My talk is at 3pm on November 16, details are here. I will also be leading my “Lucene Boot [...]
August 7th, 2007 | Posted in ApacheCon, Indexing, Java, Lucene, Performance, Search, Solr | No Comments
ImproveIndexingSpeed – Lucene-java Wiki People might find the indexing speed tips here useful
July 19th, 2007 | Posted in Indexing, Lucene, Performance | No Comments
Nice discussion on tuning the new RAM based indexing in Lucene available here. And I thought Lucene was already fast… Beware, though, this fix isn’t officially released, so you will need to use the trunk version.
July 13th, 2007 | Posted in Indexing, Java, Lucene, Performance | No Comments
I just posted this to java-user@lucene.a.o and thought I would share here as well: Calling all Lucene Users! You know you love Lucene for a whole variety of reasons (fast, friendly, fun, did I say fast?) so how about showing a little love back? We (as in the committers and contributors) are trying out a [...]
June 7th, 2007 | Posted in Indexing, Java, Lucene, Search | No Comments
The first of my two part series on Solr appeared yesterday on IBM’s developerWorks, titled “Search Smarter with Apache Solr, Part 1: Essential Features and the Solr Schema“. The first article covers getting started with Solr and contains a simple application demonstrating some of the features of Solr, like XML based indexing and faceted browsing. [...]
May 30th, 2007 | Posted in Indexing, Java, Lucene, Search, Solr | No Comments