Archive for the 'Hadoop' Category

TriHUG Next Meeting featuring Josh Patterson of Cloudera set for Oct. 11

      Just a few more days until the next Triangle Hadoop User’s Group meeting.  Get the details and sign up via Triangle Hadoop Users Group, TriHUG Next Meeting featuring Josh Patterson of Cloudera set for Oct. 11.

SXSW 2012 – Apache Mahout: Bringing Intelligence to Your App

It’s that time of year again: time to vote for SXSW talks.  Last year I did a talk with RC Johnson of BazaarVoice on Solr as NoSQL, this year I thought I would try to fly solo and submitted a talk on Apache Mahout. So, if you are so inclined to do the whole crowdsourcing [...]

Mahout and Other News

After some time away, I’m happy to have had some time recently to work on Mahout again.  Lots of goodness all over the place happening there that I’ll leave to others to explain while I focus in on a few recent things I’ve been doing. First off, I was doing a fair amount of work [...]

Scale-A-Thon RTP Spring 2011

If you’re interested in working on large scale problems like Apache Hadoop, Lucene, Solr, Mahout, Cassandra, etc. and you live in the Raleigh/Durham/Chapel Hill or greater NC area, then you might be interested in the upcoming Scale-A-Thon event that several of us from the Triangle Hadoop User’s Group are putting on June 18th at Bronto [...]

Next Meeting: Triangle HUG

The next TriHUG meeting has been announced:  Sept. 14.  There will be two speakers: Wei Wei on Practical Hadoop Security and Me on Hadoop and Lucene and Solr. For more info and to RSVP, see Triangle Hadoop Users Group.

Triangle Hadoop Users Group First Meeting

I’m pleased to announce a few of us Apache Hadoop users in the Triangle (Raleigh, Durham, Chapel Hill North Carolina) have finally reached critical mass since I sent out an email over a year ago to the Hadoop mailing list asking for interested people.  We’ve found a place to meet and discuss the Hadoop ecosystem, [...]

Apache Mahout talk at Triangle Java User’s Group

For those who live in the Triangle, I’ll be giving an intro talk on Mahout next Monday.  See Welcome to the Triangle Java Users Group for more details.  Due note the location is no longer in RTP, but at the Red Hat campus at NCSU. Hope to see you there!

SF Bay Area Lucene/Solr Meetup

Just wanted to follow up on last night’s Lucene/Solr Meetup in San Francisco. First off, special thanks to all the speakers (Jason Rutherglen, Michael Busch, Erik Hatcher and all the lightning talks.)  We had a lot of excellent talks ranging from low level Lucene details on payloads and real time search to high level discussions [...]

Hadoop, Analytical Software, Finds Uses Beyond Search – NYTimes.com

Hadoop, Analytical Software, Finds Uses Beyond Search – NYTimes.com. Nice writeup on Hadoop in the NYT today.  Of course, Hadoop is often used to power machine learning, too, which is the premise behind using it on Apache Mahout.

Surprise and Coincidence – musings from the long tail: Real-time decision making using map-reduce

Ted Dunning has a nice blurb on “scale free” development and Mahout/Hadoop/Map Reduce that is worth the quick read: Surprise and Coincidence – musings from the long tail: Real-time decision making using map-reduce