<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	>
<channel>
	<title>Comments on: Open Source Search Engine Comparison</title>
	<atom:link href="http://lucene.grantingersoll.com/2007/12/04/open-source-search-engine-comparison/feed/" rel="self" type="application/rss+xml" />
	<link>http://lucene.grantingersoll.com/2007/12/04/open-source-search-engine-comparison/</link>
	<description>Thoughts on Apache Lucene, Mahout, Solr, Tika and Nutch</description>
	<pubDate>Fri, 05 Sep 2008 16:23:16 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.6.1</generator>
		<item>
		<title>By: marc</title>
		<link>http://lucene.grantingersoll.com/2007/12/04/open-source-search-engine-comparison/#comment-5340</link>
		<dc:creator>marc</dc:creator>
		<pubDate>Fri, 25 Jan 2008 14:33:23 +0000</pubDate>
		<guid isPermaLink="false">http://lucene.grantingersoll.com/2007/12/04/open-source-search-engine-comparison/#comment-5340</guid>
		<description>I was also surprised to see they neglected to mention any assumptions -- default language for all engines was English.  A very poor assumption for a comparison of this sort, especially with globalization on all our minds.  I grabbed the Zettair package to take a peek.  At first glance it appears to be English only... and its possible romance languages w/stoplist or stemmers could be added.  But that is a serious limitation. 

I wouldn't even touch a search engine toolkit if it was limited to one language or just romance languages these days.

-marc

((Note -- your CAPTCHA has the letter l or i -- could not tell which.))</description>
		<content:encoded><![CDATA[<p>I was also surprised to see they neglected to mention any assumptions &#8212; default language for all engines was English.  A very poor assumption for a comparison of this sort, especially with globalization on all our minds.  I grabbed the Zettair package to take a peek.  At first glance it appears to be English only&#8230; and its possible romance languages w/stoplist or stemmers could be added.  But that is a serious limitation. </p>
<p>I wouldn&#8217;t even touch a search engine toolkit if it was limited to one language or just romance languages these days.</p>
<p>-marc</p>
<p>((Note &#8212; your CAPTCHA has the letter l or i &#8212; could not tell which.))</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: grant_ingersoll</title>
		<link>http://lucene.grantingersoll.com/2007/12/04/open-source-search-engine-comparison/#comment-5202</link>
		<dc:creator>grant_ingersoll</dc:creator>
		<pubDate>Fri, 18 Jan 2008 12:50:57 +0000</pubDate>
		<guid isPermaLink="false">http://lucene.grantingersoll.com/2007/12/04/open-source-search-engine-comparison/#comment-5202</guid>
		<description>I should also follow up on this that I think it is the duty of people publishing works like this to also publish the code used to do the evaluation.  What good is research if you can't replicate it?

I know several others who have been able to do large scale evaluations of Lucene and have not run into some of the issues mentioned in the article.</description>
		<content:encoded><![CDATA[<p>I should also follow up on this that I think it is the duty of people publishing works like this to also publish the code used to do the evaluation.  What good is research if you can&#8217;t replicate it?</p>
<p>I know several others who have been able to do large scale evaluations of Lucene and have not run into some of the issues mentioned in the article.</p>
]]></content:encoded>
	</item>
</channel>
</rss>

<!-- Dynamic Page Served (once) in 0.567 seconds -->
