<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: TriJUG: Intro to Mahout Slides and Demo examples</title>
	<atom:link href="http://lucene.grantingersoll.com/2010/02/16/trijug-intro-to-mahout-slides-and-demo-examples/feed/" rel="self" type="application/rss+xml" />
	<link>http://lucene.grantingersoll.com/2010/02/16/trijug-intro-to-mahout-slides-and-demo-examples/</link>
	<description>Thoughts on Apache Lucene, Mahout, Solr, Tika and Nutch</description>
	<lastBuildDate>Mon, 16 Aug 2010 17:30:52 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0.1</generator>
	<item>
		<title>By: Exploring mahout &#171; Slense</title>
		<link>http://lucene.grantingersoll.com/2010/02/16/trijug-intro-to-mahout-slides-and-demo-examples/comment-page-1/#comment-8108</link>
		<dc:creator>Exploring mahout &#171; Slense</dc:creator>
		<pubDate>Tue, 20 Jul 2010 15:33:47 +0000</pubDate>
		<guid isPermaLink="false">http://lucene.grantingersoll.com/?p=331#comment-8108</guid>
		<description>[...] here you can find some more code and some background info at metaoptimize   Explore posts in the same [...]</description>
		<content:encoded><![CDATA[<p>[...] here you can find some more code and some background info at metaoptimize   Explore posts in the same [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: yang jie</title>
		<link>http://lucene.grantingersoll.com/2010/02/16/trijug-intro-to-mahout-slides-and-demo-examples/comment-page-1/#comment-8040</link>
		<dc:creator>yang jie</dc:creator>
		<pubDate>Thu, 20 May 2010 14:31:37 +0000</pubDate>
		<guid isPermaLink="false">http://lucene.grantingersoll.com/?p=331#comment-8040</guid>
		<description>Dear Grant,

I got an exception when trying mahout 0.3 locally at step 2 in LDA test:

CMD is:
./mahout org.apache.mahout.clustering.lda.LDAPrintTopics –input /content/reuters/seqfiles-TF/lda-output/state-19 –dict /content/reuters/seqfiles-TF/dictionary.file-0 –words 10 –output /content/reuters/seqfiles-TF/lda-output/topics –dictionaryType sequencefile

And the warning &amp; exception is:

WARNING: No org.apache.mahout.clustering.lda.LDAPrintTopics.props found on classpath, will use command-line arguments only
May 20, 2010 10:13:27 PM org.slf4j.impl.JCLLoggerAdapter error
SEVERE: MahoutDriver failed with args: [--input, tf_sparse_seq/lda/state-0, --dict, tf_sparse_seq/dictionary.file-0, --words, 10, --output, tf_sparse_seq/lda/topics, --dictionaryType, sequencefile, null]
31659
Exception in thread &quot;main&quot; java.lang.ArrayIndexOutOfBoundsException: 31659
	at java.util.Arrays$ArrayList.get(Arrays.java:3393)
	at org.apache.mahout.clustering.lda.LDAPrintTopics.topWordsForTopics(LDAPrintTopics.java:214)
	at org.apache.mahout.clustering.lda.LDAPrintTopics.main(LDAPrintTopics.java:153)
	at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
	at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
	at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
	at java.lang.reflect.Method.invoke(Method.java:616)
	at org.apache.hadoop.util.ProgramDriver$ProgramDescription.invoke(ProgramDriver.java:68)
	at org.apache.hadoop.util.ProgramDriver.driver(ProgramDriver.java:139)
	at org.apache.mahout.driver.MahoutDriver.main(MahoutDriver.java:172)

And I found that *LDAPrintTopics.props* is indeed not in the $MAHOUT_HOME/conf directory. How should I solve the problem?

Thank you!</description>
		<content:encoded><![CDATA[<p>Dear Grant,</p>
<p>I got an exception when trying mahout 0.3 locally at step 2 in LDA test:</p>
<p>CMD is:<br />
./mahout org.apache.mahout.clustering.lda.LDAPrintTopics –input /content/reuters/seqfiles-TF/lda-output/state-19 –dict /content/reuters/seqfiles-TF/dictionary.file-0 –words 10 –output /content/reuters/seqfiles-TF/lda-output/topics –dictionaryType sequencefile</p>
<p>And the warning &amp; exception is:</p>
<p>WARNING: No org.apache.mahout.clustering.lda.LDAPrintTopics.props found on classpath, will use command-line arguments only<br />
May 20, 2010 10:13:27 PM org.slf4j.impl.JCLLoggerAdapter error<br />
SEVERE: MahoutDriver failed with args: [--input, tf_sparse_seq/lda/state-0, --dict, tf_sparse_seq/dictionary.file-0, --words, 10, --output, tf_sparse_seq/lda/topics, --dictionaryType, sequencefile, null]<br />
31659<br />
Exception in thread &#8220;main&#8221; java.lang.ArrayIndexOutOfBoundsException: 31659<br />
	at java.util.Arrays$ArrayList.get(Arrays.java:3393)<br />
	at org.apache.mahout.clustering.lda.LDAPrintTopics.topWordsForTopics(LDAPrintTopics.java:214)<br />
	at org.apache.mahout.clustering.lda.LDAPrintTopics.main(LDAPrintTopics.java:153)<br />
	at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)<br />
	at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)<br />
	at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)<br />
	at java.lang.reflect.Method.invoke(Method.java:616)<br />
	at org.apache.hadoop.util.ProgramDriver$ProgramDescription.invoke(ProgramDriver.java:68)<br />
	at org.apache.hadoop.util.ProgramDriver.driver(ProgramDriver.java:139)<br />
	at org.apache.mahout.driver.MahoutDriver.main(MahoutDriver.java:172)</p>
<p>And I found that *LDAPrintTopics.props* is indeed not in the $MAHOUT_HOME/conf directory. How should I solve the problem?</p>
<p>Thank you!</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Lucid Imagination &#187; Integrating Apache Mahout with Apache Lucene and Solr &#8211; Part I (of 3)</title>
		<link>http://lucene.grantingersoll.com/2010/02/16/trijug-intro-to-mahout-slides-and-demo-examples/comment-page-1/#comment-7812</link>
		<dc:creator>Lucid Imagination &#187; Integrating Apache Mahout with Apache Lucene and Solr &#8211; Part I (of 3)</dc:creator>
		<pubDate>Tue, 16 Mar 2010 15:35:59 +0000</pubDate>
		<guid isPermaLink="false">http://lucene.grantingersoll.com/?p=331#comment-7812</guid>
		<description>[...] ranging from clustering to classification and collaborative filtering.  For more on Mahout, see my TriJUG talk or my developerWorks article.  Instead of going over the litany of things implemented in Mahout, [...]</description>
		<content:encoded><![CDATA[<p>[...] ranging from clustering to classification and collaborative filtering.  For more on Mahout, see my TriJUG talk or my developerWorks article.  Instead of going over the litany of things implemented in Mahout, [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: grant_ingersoll</title>
		<link>http://lucene.grantingersoll.com/2010/02/16/trijug-intro-to-mahout-slides-and-demo-examples/comment-page-1/#comment-7661</link>
		<dc:creator>grant_ingersoll</dc:creator>
		<pubDate>Thu, 18 Feb 2010 02:30:37 +0000</pubDate>
		<guid isPermaLink="false">http://lucene.grantingersoll.com/?p=331#comment-7661</guid>
		<description>It should be --input (two dashes) if that helps at all.</description>
		<content:encoded><![CDATA[<p>It should be &#8211;input (two dashes) if that helps at all.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: wilson</title>
		<link>http://lucene.grantingersoll.com/2010/02/16/trijug-intro-to-mahout-slides-and-demo-examples/comment-page-1/#comment-7652</link>
		<dc:creator>wilson</dc:creator>
		<pubDate>Wed, 17 Feb 2010 06:38:10 +0000</pubDate>
		<guid isPermaLink="false">http://lucene.grantingersoll.com/?p=331#comment-7652</guid>
		<description>When I do step 2:
# Create the sequence files: bin/mahout seqdirectory –input /content/reuters/reuters-out –output /content/reuters/seqfiles –charset UTF-8

I get this error 
$~/src/java/mahout/bin/mahout seqdirectory -input ~/src/java/reuters-mahout/data/reuters-out/ -output ~/src/java/reuters-mahout/data/seqfiles --charset UTF-8
Exception in thread &quot;main&quot; org.apache.commons.cli2.OptionException: Unexpected -input while processing Options
        at org.apache.commons.cli2.commandline.Parser.parse(Parser.java:99)
        at org.apache.mahout.text.SequenceFilesFromDirectory.main(SequenceFilesFromDirectory.java:205)

I also tried using &quot;--&quot;
~/src/java/mahout/bin/mahout seqdirectory --input ~/src/java/reuters-mahout/data/reuters-out/ --output ~/src/java/reuters-mahout/data/seqfiles --charset UTF-8
Exception in thread &quot;main&quot; org.apache.commons.cli2.OptionException: Unexpected --input while processing Options
        at org.apache.commons.cli2.commandline.Parser.parse(Parser.java:99)
        at org.apache.mahout.text.SequenceFilesFromDirectory.main(SequenceFilesFromDirectory.java:205)</description>
		<content:encoded><![CDATA[<p>When I do step 2:<br />
# Create the sequence files: bin/mahout seqdirectory –input /content/reuters/reuters-out –output /content/reuters/seqfiles –charset UTF-8</p>
<p>I get this error<br />
$~/src/java/mahout/bin/mahout seqdirectory -input ~/src/java/reuters-mahout/data/reuters-out/ -output ~/src/java/reuters-mahout/data/seqfiles &#8211;charset UTF-8<br />
Exception in thread &#8220;main&#8221; org.apache.commons.cli2.OptionException: Unexpected -input while processing Options<br />
        at org.apache.commons.cli2.commandline.Parser.parse(Parser.java:99)<br />
        at org.apache.mahout.text.SequenceFilesFromDirectory.main(SequenceFilesFromDirectory.java:205)</p>
<p>I also tried using &#8220;&#8211;&#8221;<br />
~/src/java/mahout/bin/mahout seqdirectory &#8211;input ~/src/java/reuters-mahout/data/reuters-out/ &#8211;output ~/src/java/reuters-mahout/data/seqfiles &#8211;charset UTF-8<br />
Exception in thread &#8220;main&#8221; org.apache.commons.cli2.OptionException: Unexpected &#8211;input while processing Options<br />
        at org.apache.commons.cli2.commandline.Parser.parse(Parser.java:99)<br />
        at org.apache.mahout.text.SequenceFilesFromDirectory.main(SequenceFilesFromDirectory.java:205)</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Lukas Vlcek</title>
		<link>http://lucene.grantingersoll.com/2010/02/16/trijug-intro-to-mahout-slides-and-demo-examples/comment-page-1/#comment-7644</link>
		<dc:creator>Lukas Vlcek</dc:creator>
		<pubDate>Tue, 16 Feb 2010 13:30:24 +0000</pubDate>
		<guid isPermaLink="false">http://lucene.grantingersoll.com/?p=331#comment-7644</guid>
		<description>thnx, cool</description>
		<content:encoded><![CDATA[<p>thnx, cool</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: ian</title>
		<link>http://lucene.grantingersoll.com/2010/02/16/trijug-intro-to-mahout-slides-and-demo-examples/comment-page-1/#comment-7643</link>
		<dc:creator>ian</dc:creator>
		<pubDate>Tue, 16 Feb 2010 13:17:21 +0000</pubDate>
		<guid isPermaLink="false">http://lucene.grantingersoll.com/?p=331#comment-7643</guid>
		<description>ok.. it&#039;s a typo in the link
try http://lucene.grantingersoll.com/wp-content/uploads/2010/02/intro-mahout.pptx</description>
		<content:encoded><![CDATA[<p>ok.. it&#8217;s a typo in the link<br />
try <a href="http://lucene.grantingersoll.com/wp-content/uploads/2010/02/intro-mahout.pptx" rel="nofollow">http://lucene.grantingersoll.com/wp-content/uploads/2010/02/intro-mahout.pptx</a></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: ian</title>
		<link>http://lucene.grantingersoll.com/2010/02/16/trijug-intro-to-mahout-slides-and-demo-examples/comment-page-1/#comment-7642</link>
		<dc:creator>ian</dc:creator>
		<pubDate>Tue, 16 Feb 2010 13:16:42 +0000</pubDate>
		<guid isPermaLink="false">http://lucene.grantingersoll.com/?p=331#comment-7642</guid>
		<description>it&#039;s happening to me as well.</description>
		<content:encoded><![CDATA[<p>it&#8217;s happening to me as well.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Lukas Vlcek</title>
		<link>http://lucene.grantingersoll.com/2010/02/16/trijug-intro-to-mahout-slides-and-demo-examples/comment-page-1/#comment-7641</link>
		<dc:creator>Lukas Vlcek</dc:creator>
		<pubDate>Tue, 16 Feb 2010 13:10:16 +0000</pubDate>
		<guid isPermaLink="false">http://lucene.grantingersoll.com/?p=331#comment-7641</guid>
		<description>Grant, I can not download slides (404), is it just me?</description>
		<content:encoded><![CDATA[<p>Grant, I can not download slides (404), is it just me?</p>
]]></content:encoded>
	</item>
</channel>
</rss>
