<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Tika and Solr</title>
	<atom:link href="http://lucene.grantingersoll.com/2008/12/06/tika-and-solr/feed/" rel="self" type="application/rss+xml" />
	<link>http://lucene.grantingersoll.com/2008/12/06/tika-and-solr/</link>
	<description>Thoughts on Apache Lucene, Mahout, Solr, Tika and Nutch</description>
	<lastBuildDate>Sat, 10 Sep 2011 20:15:34 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
	<item>
		<title>By: Just when I thought I had a unique idea&#8230; &#171; Information Processing</title>
		<link>http://lucene.grantingersoll.com/2008/12/06/tika-and-solr/comment-page-1/#comment-8138</link>
		<dc:creator>Just when I thought I had a unique idea&#8230; &#171; Information Processing</dc:creator>
		<pubDate>Mon, 16 Aug 2010 17:30:52 +0000</pubDate>
		<guid isPermaLink="false">http://lucene.grantingersoll.com/?p=134#comment-8138</guid>
		<description>[...] thought I had a unique&#160;idea&#8230; I found this response from Sameer on this &#8220;old&#8221; Tika and SOLR article: A natural enhancement / extension to Metadata extraction and identification toolkit would [...]</description>
		<content:encoded><![CDATA[<p>[...] thought I had a unique&nbsp;idea&#8230; I found this response from Sameer on this &#8220;old&#8221; Tika and SOLR article: A natural enhancement / extension to Metadata extraction and identification toolkit would [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: grant_ingersoll</title>
		<link>http://lucene.grantingersoll.com/2008/12/06/tika-and-solr/comment-page-1/#comment-6285</link>
		<dc:creator>grant_ingersoll</dc:creator>
		<pubDate>Mon, 26 Jan 2009 16:34:45 +0000</pubDate>
		<guid isPermaLink="false">http://lucene.grantingersoll.com/?p=134#comment-6285</guid>
		<description>Thanks, Eric!  Your patch was the catalyst, without a doubt.</description>
		<content:encoded><![CDATA[<p>Thanks, Eric!  Your patch was the catalyst, without a doubt.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Eric Pugh</title>
		<link>http://lucene.grantingersoll.com/2008/12/06/tika-and-solr/comment-page-1/#comment-6252</link>
		<dc:creator>Eric Pugh</dc:creator>
		<pubDate>Tue, 20 Jan 2009 15:10:20 +0000</pubDate>
		<guid isPermaLink="false">http://lucene.grantingersoll.com/?p=134#comment-6252</guid>
		<description>Grant,

Just been digging through Tika, and it looks like it&#039;s come a long ways since I first heard about it!  And the patches you&#039;ve made to Solr to support rich documents is great!

Eric</description>
		<content:encoded><![CDATA[<p>Grant,</p>
<p>Just been digging through Tika, and it looks like it&#8217;s come a long ways since I first heard about it!  And the patches you&#8217;ve made to Solr to support rich documents is great!</p>
<p>Eric</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Sameer</title>
		<link>http://lucene.grantingersoll.com/2008/12/06/tika-and-solr/comment-page-1/#comment-6210</link>
		<dc:creator>Sameer</dc:creator>
		<pubDate>Wed, 31 Dec 2008 01:28:35 +0000</pubDate>
		<guid isPermaLink="false">http://lucene.grantingersoll.com/?p=134#comment-6210</guid>
		<description>Thanks! I&#039;ll check it out.</description>
		<content:encoded><![CDATA[<p>Thanks! I&#8217;ll check it out.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: grant_ingersoll</title>
		<link>http://lucene.grantingersoll.com/2008/12/06/tika-and-solr/comment-page-1/#comment-6201</link>
		<dc:creator>grant_ingersoll</dc:creator>
		<pubDate>Tue, 30 Dec 2008 21:19:45 +0000</pubDate>
		<guid isPermaLink="false">http://lucene.grantingersoll.com/?p=134#comment-6201</guid>
		<description>Hi Sameer,

Tom Morton and I have a written on this in &quot;Taming Text&quot; (http://www.manning.com/ingersoll).  The associated code has integration between Solr and OpenNLP, which can do Named Entity Recognition.  That&#039;s a starting point.  You could also easily plugin other algorithms, I think, but I don&#039;t know if anyone is currently offering that in Solr.</description>
		<content:encoded><![CDATA[<p>Hi Sameer,</p>
<p>Tom Morton and I have a written on this in &#8220;Taming Text&#8221; (<a href="http://www.manning.com/ingersoll" rel="nofollow">http://www.manning.com/ingersoll</a>).  The associated code has integration between Solr and OpenNLP, which can do Named Entity Recognition.  That&#8217;s a starting point.  You could also easily plugin other algorithms, I think, but I don&#8217;t know if anyone is currently offering that in Solr.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Sameer</title>
		<link>http://lucene.grantingersoll.com/2008/12/06/tika-and-solr/comment-page-1/#comment-6200</link>
		<dc:creator>Sameer</dc:creator>
		<pubDate>Tue, 30 Dec 2008 20:41:07 +0000</pubDate>
		<guid isPermaLink="false">http://lucene.grantingersoll.com/?p=134#comment-6200</guid>
		<description>A natural enhancement / extension to Metadata extraction and identification toolkit would be to layer a content analysis framework on top. There is value in extracting named entities out of the content. These named entities can then be used to slice and dice information by People, Company, Places, etc.

Grant, is someone already working on it? Any plans in the pipeline?</description>
		<content:encoded><![CDATA[<p>A natural enhancement / extension to Metadata extraction and identification toolkit would be to layer a content analysis framework on top. There is value in extracting named entities out of the content. These named entities can then be used to slice and dice information by People, Company, Places, etc.</p>
<p>Grant, is someone already working on it? Any plans in the pipeline?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Enterprise Search Study</title>
		<link>http://lucene.grantingersoll.com/2008/12/06/tika-and-solr/comment-page-1/#comment-6179</link>
		<dc:creator>Enterprise Search Study</dc:creator>
		<pubDate>Mon, 08 Dec 2008 09:43:15 +0000</pubDate>
		<guid isPermaLink="false">http://lucene.grantingersoll.com/?p=134#comment-6179</guid>
		<description>[...] * Tika and Solr [...]</description>
		<content:encoded><![CDATA[<p>[...] * Tika and Solr [...]</p>
]]></content:encoded>
	</item>
</channel>
</rss>

