Archive for the 'Hadoop' Category
For those who live in the Triangle, I’ll be giving an intro talk on Mahout next Monday. See Welcome to the Triangle Java Users Group for more details. Due note the location is no longer in RTP, but at the Red Hat campus at NCSU.
Hope to see you there!
February 9th, 2010 | Posted in Apache, Cary, Chapel Hill, Durham, Hadoop, Java, Mahout, North Carolina, Raleigh, Triangle | No Comments
Just wanted to follow up on last night’s Lucene/Solr Meetup in San Francisco.
First off, special thanks to all the speakers (Jason Rutherglen, Michael Busch, Erik Hatcher and all the lightning talks.) We had a lot of excellent talks ranging from low level Lucene details on payloads and real time search to high level discussions on [...]
June 4th, 2009 | Posted in Droids, Hadoop, Java, Latent Dirichlet Allocation, Lucene, Lucid Imagination, Mahout, Open Relevance, Real Time Search, Solr, Tika, canopy clustering, machine learning, relevance | No Comments
Hadoop, Analytical Software, Finds Uses Beyond Search – NYTimes.com.
Nice writeup on Hadoop in the NYT today. Of course, Hadoop is often used to power machine learning, too, which is the premise behind using it on Apache Mahout.
March 17th, 2009 | Posted in Hadoop, Mahout, machine learning | No Comments
Ted Dunning has a nice blurb on “scale free” development and Mahout/Hadoop/Map Reduce that is worth the quick read:
Surprise and Coincidence – musings from the long tail: Real-time decision making using map-reduce
January 15th, 2009 | Posted in Hadoop, Mahout, Map Reduce | No Comments
Lots of goodness this week at ApacheCon, at least when it comes to Lucene, Solr, Mahout, Tika and Hadoop (i.e. the Lucene eco-system). There’s 2 full days on Hadoop, with lots of coverage of all the pieces that go into Hadoop. There’s also a full day of Lucene related talks, plus Erik and I are [...]
November 1st, 2008 | Posted in ApacheCon, Hadoop, Lucene, Mahout, Solr | No Comments
ZooKeeper/Tao – Hadoop Wiki
I like Zookeeper already, and I just started looking at it… Hopefully the code lives up to the Tao.
September 9th, 2008 | Posted in Hadoop, Zookeeper | No Comments
BarCamp wiki / BarCampRDU
I’ll be at BarCampRDU tomorrow. I proposed two sessions, one on Hadoop and Mahout and one on Lucene and Solr. I don’t think I really want to do both, but I would like to do at least one, so we’ll see what other people are interested in.
If you’re around and you want [...]
August 1st, 2008 | Posted in Apache, BarCampRDU, Hadoop, Java, Lucene, Mahout, Map Reduce, Nutch, Raleigh, Triangle, machine learning | 5 Comments
HP, Intel and Yahoo To Research Cloud Computing – Yahoo News
Boy, this could really come in handy in Open Source, especially projects like Mahout, Nutch and distributed Solr. I find my biggest personal challenge on Mahout is access to computing resources. I personally don’t have the financial backing to buy much time on Amazon EC2. [...]
July 30th, 2008 | Posted in Apache, Hadoop, Java, Lucene, Mahout, Map Reduce, machine learning | 2 Comments
Apache Hadoop Wins Terabyte Sort Benchmark (Hadoop and Distributed Computing at Yahoo!)
Congrats to the Hadoop team! Score one for Open Source!
July 3rd, 2008 | Posted in Apache, Hadoop, Java, Map Reduce, Performance | 1 Comment
Wow! Mahout has just got me pumped up. I feel like we’ve got a lot of positive momentum and that we are starting to get the various pieces of our suite of machine learning libraries in place. Various news items include:
Ted Dunning is now a committer! Welcome Ted!
I put up a patch for a map-reduce [...]
May 6th, 2008 | Posted in Hadoop, Java, Mahout, Map Reduce | No Comments