Archive for the 'Lucene' Category
Lucid Imagination
Well, the cat is out of the bag. In case you haven’t heard, a few Lucene/Solr/Mahout committers (Erik Hatcher and Yonik Seeley) and I have teamed up with some other long time search veterans (Marc Krellenstein from Northern Light and former CTO of Reed Elsevier, amongst others) to build a company around providing product, [...]
May 2nd, 2008 | Posted in Lucene, Lucid Imagination, Mahout, Solr | No Comments
Manning: Taming Text
Scary… I guess it is real!
April 28th, 2008 | Posted in Hadoop, Lucene, Mahout, Manning, Solr, Taming Text, machine learning | 3 Comments
BarCamp wiki / BarCampRDU
Threw my name in the ring for BarCamp RDU today. Haven’t been to BarCamp before, but Erik Hatcher suggested I go and check it out.
Also put in a Proposed Session of “Apache Mahout and Hadoop - Having fun with Map Reduce and distributed computing”. Figure we talk about the basics of M/R, Hadoop [...]
April 23rd, 2008 | Posted in Apache, BarCampRDU, Hadoop, Java, Lucene, Mahout, Map Reduce, machine learning | No Comments
It’s been an interesting few months over in Mahout land. First off, I am psyched about the response the project has been getting. Seems like there is a pent up demand for large scale machine learning these days. I figured we would do all right in the early months, but I [...]
April 20th, 2008 | Posted in Apache, ApacheCon, Hadoop, Java, Lucene, Mahout, Map Reduce, machine learning | No Comments
Why Lucene Isn’t That Good | Javalobby
Patches welcome… I know that is an old saw, but that is the only way it’s going to get better.
There are some good points in here, and some stuff that is a bit dramatic.
We do try to keep adapting Lucene and make it better, but in some respects we [...]
March 28th, 2008 | Posted in Apache, Indexing, Lucene, Search | No Comments
Jeff Eastman’s Marvelous Cloud Computing Adventure
Mahout’s newest committer, Jeff Eastman, has a new blog on Mahout and Hadoop…
March 28th, 2008 | Posted in Apache, Hadoop, Java, Lucene, Mahout, Map Reduce, clustering, machine learning | No Comments
FeatherCast » Blog Archive » Episode 43: Lucene
I did a FeatherCast today with Rich Bowen. Dang, he is quick at editing…
February 21st, 2008 | Posted in Apache, ApacheCon, Hadoop, Java, Lucene, Mahout, Nutch, Performance, Search, Tika, feathercast, machine learning | No Comments
How Rackspace Now Uses MapReduce and Hadoop to Query Terabytes of Data | High Scalability
Nice article on how the Lucene/Hadoop/Solr stack was used to solve a really big problem. Someday, I hope (when we have actual code), they can add Mahout to the equation and do even more interesting things with the data.
February 1st, 2008 | Posted in Apache, Hadoop, Indexing, Java, Lucene, Mahout, Search, Solr, database | No Comments
Coderspiel / January 2008
I hardly think Lucene is creating an isolationist culture, nor do we think our project is perfect. What we do agree on is that our time is better spent on figuring out how to make Lucene better, not how to spend our time doing UNIX administration in a virtual server environment. As [...]
January 21st, 2008 | Posted in Indexing, Java, Lucene, Search | No Comments
Coderspiel / The right tool for the slob
This guy’s comment system wasn’t working at the moment, so I will leave my comment here. This won’t make much sense without reading the post first:
It’s funny you mention Wikipedia as an example, since they are running Lucene. As is Technorati and the Internet Archive. [...]
January 19th, 2008 | Posted in Apache, Indexing, Java, Lucene, Nutch, Search, Solr | 2 Comments