Archive for the 'Tika' Category

SFBay Apache Lucene/Solr Meetup Jan 21st.

Details and RSVP at: SFBay Apache Lucene/Solr Meetup San Mateo, CA – Meetup.com.

SF Bay Area Lucene/Solr Meetup

Just wanted to follow up on last night’s Lucene/Solr Meetup in San Francisco.
First off, special thanks to all the speakers (Jason Rutherglen, Michael Busch, Erik Hatcher and all the lightning talks.)  We had a lot of excellent talks ranging from low level Lucene details on payloads and real time search to high level discussions on [...]

LuceneMeetupMarch2009 – Lucene-java Wiki

LuceneMeetupMarch2009 – Lucene-java Wiki
Looks like an interesting crowd is forming for the Lucene meetup at ApacheCon EU this year.  Even if you aren’t attending the conference, you can still go to the Meetup.  We’ve got a lot of the Lucene ecosystem projects covered by the looks of it: Lucene, Solr, Mahout, Tika, Droids.  Hope to [...]

Lucid Imagination » Add our Lucene Ecosystem Search Engine to Firefox

Lucid Imagination » Add our Lucene Ecosystem Search Engine to Firefox
Mark Miller shows how to add Lucid’s Lucene ecosystem search as a Firefox plugin.  Now you can search all the Lucene project (and subproject) archives, website, wiki from the comfort of your browser plugin.

GSOC 2009 at the ASF: Looking for students interested in Lucene

SummerOfCode2009 – General Wiki
It’s that time of year again.  Time for students to sign up for Google Summer of Code.  Gist of it:  Get paid to work in Open Source for the summer.
I’ve signed up to mentor for Apache Mahout.  We are looking for students interested in implementing cutting-edge machine learning algorithms, optionally using Hadoop [...]

Lucid Imagination

Some of you may have noted that I’ve been quieter than usual lately.  Well, the reason is I was preparing for the launch of the new company I helped found: Lucid Imagination.   Now, I don’t blog too often about what I do for work on this site, other than it is Lucene, Solr and [...]

Tika and Solr

As I mentioned in Congrats to Tika and Welcome to the Lucene Stack
I’ve been working on adding Tika support to Solr.  Well, I finally committed it today, with a special thanks to Chris Harris and Eric Pugh for helping see it through with me.
What does this mean?  It is now possible to send any [...]

Congrats to Tika and Welcome to the Lucene Stack!

Congratulations to Apache Tika (nevermind the incubator address, it’s still in the process of migrating) for graduating from Incubation!   And welcome to the Lucene project!  Tika is a content extraction framework that wraps many other content extraction libraries such as PDFBox, POI, and others into a single, easy to use framework that makes it easy [...]

FeatherCast » Blog Archive » Episode 43: Lucene

FeatherCast » Blog Archive » Episode 43: Lucene
I did a FeatherCast today with Rich Bowen.  Dang, he is quick at editing…