Carlos Santos's Library tagged → View Popular
MapReduce Online! (and some gimmes) « Data Beta
"Introducing HOP: the Hadoop Online Prototype. With modest changes to the structure of Hadoop, we were able to convert it from a batch-processing system to an interactive, online system that can provide features like “early returns” from big jobs, and continuous data stream processing, while preserving the simple MapReduce programming and fault tolerance models popularized by Google and Hadoop. And by the way, it exposes pipeline parallelism that can even make batch jobs finish faster. This is a project led by Tyson Condie, in collaboration with folks at Berkeley and Yahoo! Research."
Hadoop Studio
Hadoop Studio is a map-reduce development environment (IDE) based on Netbeans. It makes it easy to create, understand and debug map-reduce applications based on Hadoop, without requiring development-time access to a map-reduce cluster.
Cloudera Hadoop & Big Data Blog » Blog Archive » Introducing Sqoop
Sqoop (”SQL-to-Hadoop”) is a straightforward command-line tool with the following capabilities:
* Imports individual tables or entire databases to files in HDFS
* Generates Java classes to allow you to interact with your imported data
* Provides the ability to import from SQL databases straight into your Hive data warehouse
Amazon Web Services Developer Community : Any AWS MapReduce examples using the R ...
discussion of R for big data processing
22S:295-HPC Home Page
High Performance Computing in Statistics: course notes; uses R running on a cluster
Hadoop User Group UK: HUGUK #2 - Wrap up
Practical MapReduce - (Tom White, Cloudera) video, slides
Introducing Apache Mahout - (Isabel Drost, ASF) video, slides
Hadoop Training: Virtual Machine | Cloudera
-
In order to make it easy for you to get started with Hadoop and complete our various training exercises, we have created a virtual machine with everything you need. The VM includes Cloudera's Distribution for Hadoop, all of our example code, as well as eclipse and other standard tools
Amazon Elastic MapReduce
Using Hadoop on Amazon's "cloud"
-
Amazon Elastic MapReduce is a web service
-
t utilizes a hosted Hadoop framework running on the web-scale infrastructure of Amazon Elastic Compute Cloud (Amazon EC2) and Amazon Simple Storage Service (Amazon S3)
Selected Tags
Related Tags
Sponsored Links
Top Contributors
Groups interested in mapreduce
-
Parallel Databases
Links related to mapreduce ...
Items: 11 | Visits: 3
Created by: Dmitry Serebrennikov
Highlighter, Sticky notes, Tagging, Groups and Network: integrated suite dramatically boosting research productivity. Learn more »
Join Diigo
