Mike Chelen's Library tagged hadoop

21 Feb 09

SourceForge.net: katta

A search grid built on top of Apache Hadoop, Lucene, and ZooKeeper.

sourceforge.net/katta

hadoop lucene katta open source distributed processing search

29 Oct 08

Hive - Hadoop Wiki

Hive is a data warehouse infrastructure built on top of Hadoop that provides tools for easy data summarization, ad hoc querying, and analysis of large datasets stored in Hadoop files. It provides a mechanism to put structure on this data, along with a simple query language called QL, which is based on SQL and lets users familiar with SQL query this data. The language also lets traditional map/reduce programmers plug in their own custom mappers and reducers for more sophisticated analysis that the language's built-in capabilities may not support.

wiki.apache.org/Hive

hadoop sql hive
