Distributed Lucene : A distributed free text index for Hadoop

HPL-2008-64 Distributed Lucene : A distributed free text index for Hadoop - Butler, Mark H.; Rutherford, James
Keyword(s): distributed, high availability, free text, parallel, search
Abstract: This technical report described a parallel, distributed free text index written at HP Labs called Distributed Lucene. Distributed Lucene is based on two Apache open source projects, Lucene and Hadoop. It was written to gain a better understanding of the Apache Hadoop architecture, which is derived f ...
Full Report

More...