Archive for the 'Map Reduce' Category

Surprise and Coincidence – musings from the long tail: Real-time decision making using map-reduce

Ted Dunning has a nice blurb on “scale free” development and Mahout/Hadoop/Map Reduce that is worth the quick read:
Surprise and Coincidence – musings from the long tail: Real-time decision making using map-reduce

BarCamp wiki / BarCampRDU

BarCamp wiki / BarCampRDU
I’ll be at BarCampRDU tomorrow.  I proposed two sessions, one on Hadoop and Mahout and one on Lucene and Solr.  I don’t think I really want to do both, but I would like to do at least one, so we’ll see what other people are interested in.
If you’re around and you want [...]

HP, Intel and Yahoo To Research Cloud Computing – Yahoo News

HP, Intel and Yahoo To Research Cloud Computing – Yahoo News
Boy, this could really come in handy in Open Source, especially projects like Mahout, Nutch and distributed Solr.  I find my biggest personal challenge on Mahout is access to computing resources.  I personally don’t have the financial backing to buy much time on Amazon EC2.  [...]

Apache Hadoop Wins Terabyte Sort Benchmark (Hadoop and Distributed Computing at Yahoo!)

Apache Hadoop Wins Terabyte Sort Benchmark (Hadoop and Distributed Computing at Yahoo!)
Congrats to the Hadoop team!  Score one for Open Source!

Taste is now committed

I haven’t tried it yet (pesky day job   ) but I see that Taste is now committed to Mahout.  In fact, I think Sean has already started on some parallelization efforts!  Very cool.

Mahout News

Wow!  Mahout has just got me pumped up.  I feel like we’ve got a lot of positive momentum and that we are starting to get the various pieces of our suite of machine learning libraries in place.  Various news items include:

Ted Dunning is now a committer!  Welcome Ted!
I put up a patch for a map-reduce [...]

BarCampRDU

BarCamp wiki / BarCampRDU
Threw my name in the ring for BarCamp RDU today.  Haven’t been to BarCamp before, but Erik Hatcher suggested I go and check it out.
Also put in a Proposed Session of “Apache Mahout and Hadoop – Having fun with Map Reduce and distributed computing”.  Figure we talk about the basics of M/R, Hadoop [...]

Mahout Machine Learning Fun

It’s been an interesting few months over in Mahout land. First off, I am psyched about the response the project has been getting. Seems like there is a pent up demand for large scale machine learning these days.  I figured we would do all right in the early months, but I [...]

Jeff Eastman’s Marvelous Cloud Computing Adventure

Jeff Eastman’s Marvelous Cloud Computing Adventure
Mahout’s newest committer, Jeff Eastman, has a new blog on Mahout and Hadoop…

SummerOfCode2008 – Looking for a summer project in Machine Learning?

SummerOfCode2008 – General Wiki
Check out the Apache Summer of Code page (link above) to see how you can spend the summer developing large scale machine learning algorithms and help out the Mahout project.  We’d love to have a few students put together a some projects implementing one or more machine learning algorithms using Hadoop.  So, [...]