<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Tao and the Art of Search: Yin Yang and TF-IDF</title>
	<atom:link href="http://lucene.grantingersoll.com/2008/11/08/tao-and-the-art-of-search-yin-yang-and-tf-idf/feed/" rel="self" type="application/rss+xml" />
	<link>http://lucene.grantingersoll.com/2008/11/08/tao-and-the-art-of-search-yin-yang-and-tf-idf/</link>
	<description>Thoughts on Apache Lucene, Mahout, Solr, Tika and Nutch</description>
	<lastBuildDate>Thu, 20 May 2010 14:31:37 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0</generator>
	<item>
		<title>By: Google-gate and search engines &#171; Techapilla</title>
		<link>http://lucene.grantingersoll.com/2008/11/08/tao-and-the-art-of-search-yin-yang-and-tf-idf/comment-page-1/#comment-7350</link>
		<dc:creator>Google-gate and search engines &#171; Techapilla</dc:creator>
		<pubDate>Mon, 07 Dec 2009 10:29:07 +0000</pubDate>
		<guid isPermaLink="false">http://lucene.grantingersoll.com/2008/11/08/tao-and-the-art-of-search-yin-yang-and-tf-idf/#comment-7350</guid>
		<description>[...] This balancing and counter-balancing involved in search algorithms has been called the &#8220;yin and yang of search&#8221; [...]</description>
		<content:encoded><![CDATA[<p>[...] This balancing and counter-balancing involved in search algorithms has been called the &#8220;yin and yang of search&#8221; [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Enterprise Search in 2009</title>
		<link>http://lucene.grantingersoll.com/2008/11/08/tao-and-the-art-of-search-yin-yang-and-tf-idf/comment-page-1/#comment-6169</link>
		<dc:creator>Enterprise Search in 2009</dc:creator>
		<pubDate>Wed, 12 Nov 2008 09:36:30 +0000</pubDate>
		<guid isPermaLink="false">http://lucene.grantingersoll.com/2008/11/08/tao-and-the-art-of-search-yin-yang-and-tf-idf/#comment-6169</guid>
		<description>[...] * Tao and the Art of Search: Yin Yang and TF-IDF [...]</description>
		<content:encoded><![CDATA[<p>[...] * Tao and the Art of Search: Yin Yang and TF-IDF [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Daniel Tunkelang</title>
		<link>http://lucene.grantingersoll.com/2008/11/08/tao-and-the-art-of-search-yin-yang-and-tf-idf/comment-page-1/#comment-6166</link>
		<dc:creator>Daniel Tunkelang</dc:creator>
		<pubDate>Sat, 08 Nov 2008 15:51:09 +0000</pubDate>
		<guid isPermaLink="false">http://lucene.grantingersoll.com/2008/11/08/tao-and-the-art-of-search-yin-yang-and-tf-idf/#comment-6166</guid>
		<description>Interesting, I might put that out there the next time I explain tf*idf. But, in my experience, people quickly grasp the basic information theory idea that the significance of a signal is a combination of its strength and its distinctiveness. The nice thing about explaining tf*idf this way is that it helps you further explain where it breaks down, e.g., that term frequency is a very crude proxy for signal strength and that inverse document frequency conflates true distinctiveness with noise.</description>
		<content:encoded><![CDATA[<p>Interesting, I might put that out there the next time I explain tf*idf. But, in my experience, people quickly grasp the basic information theory idea that the significance of a signal is a combination of its strength and its distinctiveness. The nice thing about explaining tf*idf this way is that it helps you further explain where it breaks down, e.g., that term frequency is a very crude proxy for signal strength and that inverse document frequency conflates true distinctiveness with noise.</p>
]]></content:encoded>
	</item>
</channel>
</rss>
