Archive for the 'Hadoop' Category
Just a few more days until the next Triangle Hadoop User’s Group meeting. Get the details and sign up via Triangle Hadoop Users Group, TriHUG Next Meeting featuring Josh Patterson of Cloudera set for Oct. 11.
October 7th, 2011 | Posted in Cary, Chapel Hill, Durham, Hadoop, Raleigh, TriHUG | No Comments
It’s that time of year again: time to vote for SXSW talks. Last year I did a talk with RC Johnson of BazaarVoice on Solr as NoSQL; this year I thought I would try to fly solo and submitted a talk on Apache Mahout. So, if you are so inclined to do the whole crowdsourcing [...]
August 15th, 2011 | Posted in Hadoop, machine learning, Mahout, Map Reduce | No Comments
After some time away, I’m happy to have had some time recently to work on Mahout again. Lots of goodness all over the place happening there that I’ll leave to others to explain while I focus in on a few recent things I’ve been doing. First off, I was doing a fair amount of work [...]
August 5th, 2011 | Posted in Hadoop, Lucene, Mahout, Solr | No Comments
If you’re interested in working on large scale problems like Apache Hadoop, Lucene, Solr, Mahout, Cassandra, etc. and you live in the Raleigh/Durham/Chapel Hill or greater NC area, then you might be interested in the upcoming Scale-A-Thon event that several of us from the Triangle Hadoop User’s Group are putting on June 18th at Bronto [...]
May 31st, 2011 | Posted in Hadoop | No Comments
The next TriHUG meeting has been announced: Sept. 14. There will be two speakers: Wei Wei on Practical Hadoop Security and me on Hadoop and Lucene and Solr. For more info and to RSVP, see Triangle Hadoop Users Group.
August 19th, 2010 | Posted in Apache, Hadoop, Java, Lucene | No Comments
I’m pleased to announce a few of us Apache Hadoop users in the Triangle (Raleigh, Durham, Chapel Hill, North Carolina) have finally reached critical mass since I sent out an email over a year ago to the Hadoop mailing list asking for interested people. We’ve found a place to meet and discuss the Hadoop ecosystem, [...]
July 8th, 2010 | Posted in Apache, Hadoop, Mahout | No Comments
For those who live in the Triangle, I’ll be giving an intro talk on Mahout next Monday. See Welcome to the Triangle Java Users Group for more details. Do note the location is no longer in RTP, but at the Red Hat campus at NCSU. Hope to see you there!
February 9th, 2010 | Posted in Apache, Cary, Chapel Hill, Durham, Hadoop, Java, Mahout, North Carolina, Raleigh, Triangle | No Comments
Just wanted to follow up on last night’s Lucene/Solr Meetup in San Francisco. First off, special thanks to all the speakers (Jason Rutherglen, Michael Busch, Erik Hatcher and everyone who gave a lightning talk). We had a lot of excellent talks ranging from low level Lucene details on payloads and real time search to high level discussions [...]
June 4th, 2009 | Posted in canopy clustering, Droids, Hadoop, Java, Latent Dirichlet Allocation, Lucene, Lucid Imagination, machine learning, Mahout, Open Relevance, Real Time Search, relevance, Solr, Tika | No Comments
Hadoop, Analytical Software, Finds Uses Beyond Search – NYTimes.com. Nice writeup on Hadoop in the NYT today. Of course, Hadoop is often used to power machine learning, too, which is the premise behind using it in Apache Mahout.
March 17th, 2009 | Posted in Hadoop, machine learning, Mahout | No Comments
Ted Dunning has a nice blurb on “scale free” development and Mahout/Hadoop/Map Reduce that is worth the quick read: Surprise and Coincidence – musings from the long tail: Real-time decision making using map-reduce
January 15th, 2009 | Posted in Hadoop, Mahout, Map Reduce | No Comments