<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments for Grant's Grunts: Lucene Edition</title>
	<atom:link href="http://lucene.grantingersoll.com/comments/feed/" rel="self" type="application/rss+xml" />
	<link>http://lucene.grantingersoll.com</link>
	<description>Thoughts on Apache Lucene, Mahout, Solr, Tika and Nutch</description>
	<lastBuildDate>Sat, 10 Sep 2011 20:15:34 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.2</generator>
	<item>
		<title>Comment on Complex Fields (aka &#8220;poly&#8221; fields) in Apache Solr by grant_ingersoll</title>
		<link>http://lucene.grantingersoll.com/2009/12/24/complex-fields-aka-poly-fields-in-apache-solr/comment-page-1/#comment-14350</link>
		<dc:creator>grant_ingersoll</dc:creator>
		<pubDate>Sat, 10 Sep 2011 20:15:34 +0000</pubDate>
		<guid isPermaLink="false">http://lucene.grantingersoll.com/?p=307#comment-14350</guid>
		<description>Poly fields do not currently support multiValued fields. You might be able to hack something together using SpanQueries, but that would require custom code.</description>
		<content:encoded><![CDATA[<p>Poly fields do not currently support multiValued fields. You might be able to hack something together using SpanQueries, but that would require custom code.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Complex Fields (aka &#8220;poly&#8221; fields) in Apache Solr by JB</title>
		<link>http://lucene.grantingersoll.com/2009/12/24/complex-fields-aka-poly-fields-in-apache-solr/comment-page-1/#comment-14334</link>
		<dc:creator>JB</dc:creator>
		<pubDate>Fri, 09 Sep 2011 20:17:23 +0000</pubDate>
		<guid isPermaLink="false">http://lucene.grantingersoll.com/?p=307#comment-14334</guid>
		<description>@Greg (from 5/27), I&#039;m having the same issue where a PolyField with multiValued=true will return false positives similar to your car example.

A document will have two addresses:
125 Stark Lane, Apt #24, Beverly Hills, CA 90210-1243
123 Fake Road, Arlington, VA 22201

These will be parsed into 5 subfields.

If I do a search for 
125 Stark Lane, Apt #24, Arlington, VA 90210-1243, the document returns when I believe it shouldn&#039;t.

Any tricks to guarantee the search is ONLY executed across a single PolyField?</description>
		<content:encoded><![CDATA[<p>@Greg (from 5/27), I&#8217;m having the same issue where a PolyField with multiValued=true will return false positives similar to your car example.</p>
<p>A document will have two addresses:<br />
125 Stark Lane, Apt #24, Beverly Hills, CA 90210-1243<br />
123 Fake Road, Arlington, VA 22201</p>
<p>These will be parsed into 5 subfields.</p>
<p>If I do a search for<br />
125 Stark Lane, Apt #24, Arlington, VA 90210-1243, the document returns when I believe it shouldn&#8217;t.</p>
<p>Any tricks to guarantee the search is ONLY executed across a single PolyField?</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on R in Action by Dan Brickley</title>
		<link>http://lucene.grantingersoll.com/2011/09/02/r-in-action/comment-page-1/#comment-14310</link>
		<dc:creator>Dan Brickley</dc:creator>
		<pubDate>Wed, 07 Sep 2011 18:14:28 +0000</pubDate>
		<guid isPermaLink="false">http://lucene.grantingersoll.com/?p=448#comment-14310</guid>
		<description>Yes, &quot;that&#039;s not such a big dataset, ... you could probably get by in R&quot; seems to be heard regularly on the Mahout users list. It would be very nice to have a few shared R utilities that could interact with (and visualize) at least some of the same data formats as Mahout, e.g. Taste-style preference datamodels.</description>
		<content:encoded><![CDATA[<p>Yes, &#8220;that&#8217;s not such a big dataset, &#8230; you could probably get by in R&#8221; seems to be heard regularly on the Mahout users list. It would be very nice to have a few shared R utilities that could interact with (and visualize) at least some of the same data formats as Mahout, e.g. Taste-style preference datamodels.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Complex Fields (aka &#8220;poly&#8221; fields) in Apache Solr by Rob</title>
		<link>http://lucene.grantingersoll.com/2009/12/24/complex-fields-aka-poly-fields-in-apache-solr/comment-page-1/#comment-14309</link>
		<dc:creator>Rob</dc:creator>
		<pubDate>Wed, 07 Sep 2011 18:13:22 +0000</pubDate>
		<guid isPermaLink="false">http://lucene.grantingersoll.com/?p=307#comment-14309</guid>
		<description>@Greg that&#039;s exactly the problem I have, too!</description>
		<content:encoded><![CDATA[<p>@Greg that&#8217;s exactly the problem I have, too!</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Complex Fields (aka &#8220;poly&#8221; fields) in Apache Solr by Steven Hildreth</title>
		<link>http://lucene.grantingersoll.com/2009/12/24/complex-fields-aka-poly-fields-in-apache-solr/comment-page-1/#comment-13859</link>
		<dc:creator>Steven Hildreth</dc:creator>
		<pubDate>Tue, 26 Jul 2011 15:16:07 +0000</pubDate>
		<guid isPermaLink="false">http://lucene.grantingersoll.com/?p=307#comment-13859</guid>
		<description>Any activity on polygons, perhaps in Solr Trunk? Possible to get some idea on how to do a polygon search? I have a collection of points I need to find documents &#039;inside&#039; of. Thanks for any information.</description>
		<content:encoded><![CDATA[<p>Any activity on polygons, perhaps in Solr Trunk? Possible to get some idea on how to do a polygon search? I have a collection of points I need to find documents &#8216;inside&#8217; of. Thanks for any information.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on TriJUG: Intro to Mahout Slides and Demo examples by bin</title>
		<link>http://lucene.grantingersoll.com/2010/02/16/trijug-intro-to-mahout-slides-and-demo-examples/comment-page-1/#comment-13630</link>
		<dc:creator>bin</dc:creator>
		<pubDate>Thu, 23 Jun 2011 07:44:16 +0000</pubDate>
		<guid isPermaLink="false">http://lucene.grantingersoll.com/?p=331#comment-13630</guid>
		<description>Thank you very much! You helped me so much. I can get the clustering results now.</description>
		<content:encoded><![CDATA[<p>Thank you very much! You helped me so much. I can get the clustering results now.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Complex Fields (aka &#8220;poly&#8221; fields) in Apache Solr by Greg</title>
		<link>http://lucene.grantingersoll.com/2009/12/24/complex-fields-aka-poly-fields-in-apache-solr/comment-page-1/#comment-13529</link>
		<dc:creator>Greg</dc:creator>
		<pubDate>Fri, 27 May 2011 16:02:14 +0000</pubDate>
		<guid isPermaLink="false">http://lucene.grantingersoll.com/?p=307#comment-13529</guid>
		<description>Let’s say I create a car PolyField to support a document that looks like this:
field name=id value=123
field name=name value=car picture.jpg
field name=car value=Honda,Red
field name=car value=Toyota,Blue

Now I want to find a picture of a blue Honda, so I run this query:
q=car:Honda,Blue
Behind the scenes Solr will execute a query like this:
car__0:Honda AND car__1:Blue
I believe that the Solr response will include the example doc – id:123 – but doc id:123 does not have a blue Honda, it contains a red Honda and blue Toyota.

How do you avoid &quot;false hits&quot; with PolyFields?
Thx.</description>
		<content:encoded><![CDATA[<p>Let’s say I create a car PolyField to support a document that looks like this:<br />
field name=id value=123<br />
field name=name value=car picture.jpg<br />
field name=car value=Honda,Red<br />
field name=car value=Toyota,Blue</p>
<p>Now I want to find a picture of a blue Honda, so I run this query:<br />
q=car:Honda,Blue<br />
Behind the scenes Solr will execute a query like this:<br />
car__0:Honda AND car__1:Blue<br />
I believe that the Solr response will include the example doc – id:123 – but doc id:123 does not have a blue Honda, it contains a red Honda and blue Toyota.</p>
<p>How do you avoid &#8220;false hits&#8221; with PolyFields?<br />
Thx.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Complex Fields (aka &#8220;poly&#8221; fields) in Apache Solr by Greg</title>
		<link>http://lucene.grantingersoll.com/2009/12/24/complex-fields-aka-poly-fields-in-apache-solr/comment-page-1/#comment-13528</link>
		<dc:creator>Greg</dc:creator>
		<pubDate>Fri, 27 May 2011 15:32:59 +0000</pubDate>
		<guid isPermaLink="false">http://lucene.grantingersoll.com/?p=307#comment-13528</guid>
		<description>Let&#039;s say I create a car PolyField to support a document that looks like this:

field name=id value=123
field name=name value=car picture.jpg
field name=car value=Honda,Red
field name=car value=Toyota,Blue

Now I want to find a picture of a blue Honda, so I run this query:
q=car:Honda,Blue

Behind the scenes Solr will execute a query like this:
car__0:Honda AND car__1:Blue

I believe that the Solr response will include the example doc - id:123 - but doc id:123 does not have a blue Honda, it contains a red Honda and blue Toyota.

Thx.</description>
		<content:encoded><![CDATA[<p>Let&#8217;s say I create a car PolyField to support a document that looks like this:</p>
<p>field name=id value=123<br />
field name=name value=car picture.jpg<br />
field name=car value=Honda,Red<br />
field name=car value=Toyota,Blue</p>
<p>Now I want to find a picture of a blue Honda, so I run this query:<br />
q=car:Honda,Blue</p>
<p>Behind the scenes Solr will execute a query like this:<br />
car__0:Honda AND car__1:Blue</p>
<p>I believe that the Solr response will include the example doc &#8211; id:123 &#8211; but doc id:123 does not have a blue Honda, it contains a red Honda and blue Toyota.</p>
<p>Thx.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on TriJUG: Intro to Mahout Slides and Demo examples by Christoph</title>
		<link>http://lucene.grantingersoll.com/2010/02/16/trijug-intro-to-mahout-slides-and-demo-examples/comment-page-1/#comment-13496</link>
		<dc:creator>Christoph</dc:creator>
		<pubDate>Fri, 20 May 2011 19:39:48 +0000</pubDate>
		<guid isPermaLink="false">http://lucene.grantingersoll.com/?p=331#comment-13496</guid>
		<description>When running the Frequent Pattern Mining test in mahout-0.4, step 3 is:

./mahout seqdumper --seqFile patterns/fpgrowth/part-r-00000</description>
		<content:encoded><![CDATA[<p>When running the Frequent Pattern Mining test in mahout-0.4, step 3 is:</p>
<p>./mahout seqdumper --seqFile patterns/fpgrowth/part-r-00000</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Tika In Action in MEAP by Jack</title>
		<link>http://lucene.grantingersoll.com/2010/09/18/tika-in-action-in-meap/comment-page-1/#comment-13474</link>
		<dc:creator>Jack</dc:creator>
		<pubDate>Mon, 16 May 2011 10:49:56 +0000</pubDate>
		<guid isPermaLink="false">http://lucene.grantingersoll.com/?p=392#comment-13474</guid>
		<description>It&#039;s good for me.</description>
		<content:encoded><![CDATA[<p>It&#8217;s good for me.</p>
]]></content:encoded>
	</item>
</channel>
</rss>

