<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Grant's Grunts: Lucene Edition &#187; payloads</title>
	<atom:link href="http://lucene.grantingersoll.com/category/payloads/feed/" rel="self" type="application/rss+xml" />
	<link>http://lucene.grantingersoll.com</link>
	<description>Thoughts on Apache Lucene, Mahout, Solr, Tika and Nutch</description>
	<lastBuildDate>Thu, 08 Jul 2010 17:23:22 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0</generator>
		<item>
		<title>Grant Ingersoll Talks with Monster&#8217;s Peter Keegan &#124; Enterprise Search support for Apache Lucene and Solr by Lucid Imagination</title>
		<link>http://lucene.grantingersoll.com/2009/10/26/grant-ingersoll-talks-with-monsters-peter-keegan-enterprise-search-support-for-apache-lucene-and-solr-by-lucid-imagination/</link>
		<comments>http://lucene.grantingersoll.com/2009/10/26/grant-ingersoll-talks-with-monsters-peter-keegan-enterprise-search-support-for-apache-lucene-and-solr-by-lucid-imagination/#comments</comments>
		<pubDate>Mon, 26 Oct 2009 13:06:34 +0000</pubDate>
		<dc:creator>grant_ingersoll</dc:creator>
				<category><![CDATA[Apache]]></category>
		<category><![CDATA[Lucene]]></category>
		<category><![CDATA[payloads]]></category>

		<guid isPermaLink="false">http://lucene.grantingersoll.com/?p=285</guid>
		<description><![CDATA[For those interested in how Monster uses Lucene to help you find a job, check out my podcast with Peter Keegan of Monster.com: Grant Ingersoll Talks with Peter Keegan &#124; Enterprise Search support for Apache Lucene and Solr by Lucid Imagination. Peter has been a long time contributor to Lucene and is doing some cool [...]]]></description>
			<content:encoded><![CDATA[<p>For those interested in how Monster uses Lucene to help you find a job, check out my podcast with Peter Keegan of Monster.com:</p>
<p><a href="http://www.lucidimagination.com/Community/Hear-from-the-Experts/Podcasts-and-Videos/Grant-Ingersoll-Talks-Peter-Keegan">Grant Ingersoll Talks with Peter Keegan | Enterprise Search support for Apache Lucene and Solr by Lucid Imagination</a>.</p>
<p>Peter has been a long time contributor to Lucene and is doing some cool things with Lucene&#8217;s payload functionality as well as many other Lucene features.</p>
]]></content:encoded>
			<wfw:commentRss>http://lucene.grantingersoll.com/2009/10/26/grant-ingersoll-talks-with-monsters-peter-keegan-enterprise-search-support-for-apache-lucene-and-solr-by-lucid-imagination/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Triangle Java Users Group talk on Lucene and Solr</title>
		<link>http://lucene.grantingersoll.com/2007/09/07/triangle-java-users-group-talk-on-lucene-and-solr/</link>
		<comments>http://lucene.grantingersoll.com/2007/09/07/triangle-java-users-group-talk-on-lucene-and-solr/#comments</comments>
		<pubDate>Fri, 07 Sep 2007 16:54:11 +0000</pubDate>
		<dc:creator>grant_ingersoll</dc:creator>
				<category><![CDATA[Cary]]></category>
		<category><![CDATA[Chapel Hill]]></category>
		<category><![CDATA[Durham]]></category>
		<category><![CDATA[Indexing]]></category>
		<category><![CDATA[Java]]></category>
		<category><![CDATA[Lucene]]></category>
		<category><![CDATA[North Carolina]]></category>
		<category><![CDATA[Performance]]></category>
		<category><![CDATA[Raleigh]]></category>
		<category><![CDATA[Search]]></category>
		<category><![CDATA[Solr]]></category>
		<category><![CDATA[Triangle]]></category>
		<category><![CDATA[payloads]]></category>

		<guid isPermaLink="false">http://lucene.grantingersoll.com/2007/09/07/triangle-java-users-group-talk-on-lucene-and-solr/</guid>
		<description><![CDATA[Welcome to the Triangle Java Users Group I will be speaking November 19, 2007 at the Triangle Java Users Group on Lucene and Solr.   The talk will be an introduction to the features and capabilities of both Lucene and Solr, as well as some basic compare and contrast information.]]></description>
			<content:encoded><![CDATA[<p><a href="http://trijug.org/">Welcome to the Triangle Java Users Group</a></p>
<p>I will be speaking November 19, 2007 at the Triangle Java Users Group on <a href="http://lucene.apache.org/java/docs/index.html">Lucene</a> and <a href="http://lucene.apache.org/solr">Solr</a>.   The talk will be an introduction to the features and capabilities of both Lucene and Solr, as well as some basic compare and contrast information.</p>
]]></content:encoded>
			<wfw:commentRss>http://lucene.grantingersoll.com/2007/09/07/triangle-java-users-group-talk-on-lucene-and-solr/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Advance Lucene slides from ApacheCon Europe 2007</title>
		<link>http://lucene.grantingersoll.com/2007/05/07/advance-lucene-slides-from-apachecon-europe-2007/</link>
		<comments>http://lucene.grantingersoll.com/2007/05/07/advance-lucene-slides-from-apachecon-europe-2007/#comments</comments>
		<pubDate>Mon, 07 May 2007 14:31:12 +0000</pubDate>
		<dc:creator>grant_ingersoll</dc:creator>
				<category><![CDATA[ApacheCon]]></category>
		<category><![CDATA[Europe]]></category>
		<category><![CDATA[Indexing]]></category>
		<category><![CDATA[Java]]></category>
		<category><![CDATA[Lucene]]></category>
		<category><![CDATA[Performance]]></category>
		<category><![CDATA[Search]]></category>
		<category><![CDATA[payloads]]></category>
		<category><![CDATA[queries]]></category>
		<category><![CDATA[term vectors]]></category>

		<guid isPermaLink="false">http://lucene.grantingersoll.com/2007/05/07/advance-lucene-slides-from-apachecon-europe-2007/</guid>
		<description><![CDATA[The latest version of my slides for &#8220;Advanced Lucene&#8221; are located at http://www.cnlp.org/presentations/present.asp?show=conference Talk covered term vectors, using various query types and Lucene performance tips and tricks.]]></description>
			<content:encoded><![CDATA[<p>The latest version of my slides for &#8220;Advanced Lucene&#8221; are located at <a href="http://www.cnlp.org/presentations/present.asp?show=conference">http://www.cnlp.org/presentations/present.asp?show=conference</a></p>
<p>Talk covered term vectors, using various query types and Lucene performance tips and tricks.</p>
]]></content:encoded>
			<wfw:commentRss>http://lucene.grantingersoll.com/2007/05/07/advance-lucene-slides-from-apachecon-europe-2007/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>ApacheCon Europe &#8220;Advanced Lucene&#8221; slides</title>
		<link>http://lucene.grantingersoll.com/2007/05/03/apachecon-europe-advanced-lucene-slides/</link>
		<comments>http://lucene.grantingersoll.com/2007/05/03/apachecon-europe-advanced-lucene-slides/#comments</comments>
		<pubDate>Thu, 03 May 2007 13:18:29 +0000</pubDate>
		<dc:creator>grant_ingersoll</dc:creator>
				<category><![CDATA[ApacheCon]]></category>
		<category><![CDATA[Europe]]></category>
		<category><![CDATA[Indexing]]></category>
		<category><![CDATA[Java]]></category>
		<category><![CDATA[Lucene]]></category>
		<category><![CDATA[Performance]]></category>
		<category><![CDATA[Search]]></category>
		<category><![CDATA[payloads]]></category>
		<category><![CDATA[queries]]></category>
		<category><![CDATA[term vectors]]></category>

		<guid isPermaLink="false">http://lucene.grantingersoll.com/2007/05/03/apachecon-europe-advanced-lucene-slides/</guid>
		<description><![CDATA[My (slightly old) slides for ApacheCon Europe are now available in the conference proceedings available at http://eu.apachecon.com/downloads/materials.zip I will post the latest version soon, but there is very little difference between this version and the latest. Topics covered include Lucene performance, term vectors and query tips and tricks. Feedback is always welcome]]></description>
			<content:encoded><![CDATA[<p>My (slightly old) slides for ApacheCon Europe are now available in the conference proceedings available at <a href="http://eu.apachecon.com/downloads/materials.zip">http://eu.apachecon.com/downloads/materials.zip</a></p>
<p>I will post the latest version soon, but there is very little difference between this version and the latest.</p>
<p>Topics covered include Lucene performance, term vectors and query tips and tricks.</p>
<p>Feedback is always welcome</p>
]]></content:encoded>
			<wfw:commentRss>http://lucene.grantingersoll.com/2007/05/03/apachecon-europe-advanced-lucene-slides/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Payloads</title>
		<link>http://lucene.grantingersoll.com/2007/03/18/payloads/</link>
		<comments>http://lucene.grantingersoll.com/2007/03/18/payloads/#comments</comments>
		<pubDate>Sun, 18 Mar 2007 14:07:11 +0000</pubDate>
		<dc:creator>grant_ingersoll</dc:creator>
				<category><![CDATA[Java]]></category>
		<category><![CDATA[Lucene]]></category>
		<category><![CDATA[Search]]></category>
		<category><![CDATA[payloads]]></category>

		<guid isPermaLink="false">http://lucene.grantingersoll.com/2007/03/18/payloads/</guid>
		<description><![CDATA[Michael Busch recently committed some code that enables Lucene to store payloads at the term level (see https://issues.apache.org/jira/browse/LUCENE-755) and I have started working on enabling these payloads to be incorporated into search and scoring. (see http://wiki.apache.org/lucene-java/Payload_Planning and https://issues.apache.org/jira/browse/LUCENE-834) So, you might be asking yourself, what exactly are payloads good for?  Naturally, the answer is a [...]]]></description>
			<content:encoded><![CDATA[<p>Michael Busch recently committed some code that enables Lucene to store payloads at the term level (see <a href="https://issues.apache.org/jira/browse/LUCENE-755">https://issues.apache.org/jira/browse/LUCENE-755</a>) and I have started working on enabling these payloads to be incorporated into search and scoring. (see <a href="http://wiki.apache.org/lucene-java/Payload_Planning">http://wiki.apache.org/lucene-java/Payload_Planning</a> and <a href="https://issues.apache.org/jira/browse/LUCENE-834">https://issues.apache.org/jira/browse/LUCENE-834</a>)</p>
<p>So, you might be asking yourself, what exactly are payloads good for?  Naturally, the answer is a lot!  For example, in the &#8220;<a href="http://infolab.stanford.edu/~backrub/google.html">Anatomy of a Search Engine</a>&#8221; by Brin and Page (discussed <a href="http://www.paperoftheweek.com/2007/01/22/the-anatomy-of-a-search-engine/">here</a>) see section 4.2.5 on Hit Lists where they discuss how they store information about the term in the index, such as font, capitalization, etc. which then get factored into the scoring algorithm later.  You could also store things like part of speech, or per term weights and then do things like score noun matches higher than verbs or use the encoded weight as part of the score calculation.  Another option would be to store synonyms of the words, or the synset from Wordnet.  In NLP applications, it could be useful for storing co-references or other types of linkage and then use a graph ranking strategy such as PageRank or TextRank (discussed <a href="http://www.paperoftheweek.com/2007/01/29/textrank-by-rada-mihalcea-and-paul-tarau/">here</a>.)  Another option is to store XPath information or other metadata which are often stored in separate fields and require stitching the information back together.</p>
<p>In a sense, payloads open up a lot of new avenues for search in Lucene.  Open questions remain as to how much data should be stored and still have good performance.  Also note, that the current API is still considered experimental and may change, although I doubt it will drastically change.</p>
<p>What ideas do you have for payloads that I missed?  Let me know or update the planning page on the Lucene wiki.</p>
]]></content:encoded>
			<wfw:commentRss>http://lucene.grantingersoll.com/2007/03/18/payloads/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
