This link has been bookmarked by 226 people . It was first bookmarked on 07 Sep 2008, by someone privately.
-
21 Jan 18
-
02 Jan 17
-
28 Sep 16
-
03 May 16
-
01 Jun 15
-
21 Mar 15
-
15 Dec 14
-
01 Oct 14
-
01 Jun 14
-
28 May 14
-
22 May 14
-
20 Feb 14
-
28 Jan 14
-
08 Dec 13
-
25 Nov 13
-
03 Oct 13
-
30 Sep 13
-
21 Sep 13
-
16 Sep 13
-
Cascading is an application framework for Java developers to simply develop robust Data Analytics and Data Management applications on Apache Hadoop
-
Cascading is an application framework for Java developers to simply develop robust Data Analytics and Data Management applications on Apache Hadoop
-
-
29 Aug 13
-
23 Jul 13
-
Cascading is an application framework for Java developers to simply develop robust Data Analytics and Data Management applications on Apache Hadoop.
-
-
25 Jun 13
-
21 May 13
-
07 Apr 13
-
05 Feb 13
-
06 Dec 12
-
14 Nov 12
-
09 Nov 12
-
27 Sep 12
Rhea Myers"Cascading is an application framework for Java developers to quickly and easily develop robust Data Analytics and Data Management applications on Apache Hadoop."
-
19 Sep 12
-
23 Jul 12
-
19 Jul 12
-
05 Jul 12
-
11 Jun 12
-
20 May 12
-
05 Apr 12
-
01 Apr 12
-
08 Mar 12
-
29 Feb 12
-
03 Jan 12
-
30 Nov 11
Claudio BergaminiCascading is a Data Processing API, Process Planner, and Process Scheduler used for defining and executing complex, scale-free, and fault tolerant data processing workflows on an Apache Hadoop cluster. All without having to 'think' in MapReduce.
-
18 Oct 11
-
13 Oct 11
-
06 Oct 11
-
21 Sep 11
-
31 Aug 11
-
Cascading is a Data Processing API, Process Planner, and Process Scheduler used for defining and executing complex, scale-free, and fault tolerant data processing workflows on an Apache Hadoop cluster. All without having to 'think' in MapReduce.
-
Cascading does support the development of such languages and DSLs like Multitool, Cascalog, and Cascading.JRuby. Multitool allows you to either "grep", "sed", or join large datasets on a Hadoop FileSystem or Amazon S3 from the command line.
-
-
02 Aug 11
-
28 Jul 11
Carlos Veira LorenzoCascading is a Data Processing API, Process Planner, and Process Scheduler used for defining and executing complex, scale-free, and fault tolerant data processing workflows on an Apache Hadoop cluster. All without having to 'think' in MapReduce.
Cascading is a thin Java library and API that sits on top of Hadoop's MapReduce layer and is executed from the command line like any other Hadoop application.
As a library and API that can be driven from any JVM based language (Jython, JRuby, Groovy, Clojure, etc.), developers can create applications and frameworks that are "operationalized". That is, a single deployable Jar can be used to encapsulate a series of complex and dynamic processes all driven from the command line or a shell. Instead of using external schedulers to glue many individual applications together with XML against each individual command line interface.
The Cascading API approach dramatically simplifies development, regression and integration testing, and deployment of business critical applications on both Amazon Web Services (like Elastic MapReduce) or on dedicated hardware.
Cascading is not a new text based query syntax (like Pig) or another complex system that must be installed on a cluster and maintained (like Hive). But Cascading is both complimentary and a valid alternative to either application.
Cascading does support the development of such languages and DSLs like Multitool, Cascalog, and Cascading.JRuby. Multitool allows you to either "grep", "sed", or join large datasets on a Hadoop FileSystem or Amazon S3 from the command line.
Cascading is Open Source and licensed under the GPL. Alternatively Standard or OEM Licenses, and Production and Developer Support can be obtained through Concurrent, Inc.
Cascading has a strong community of users and contributors, see our Cascading modules page for related projects and extensions.
Cascading, extensions, and related libraries are also hosted in the Conjars maven repository maintained by Concurrent, Inc. The repository is open to the public.BigData cloud cloud-computing PaaS hadoop API map-reduce MapReduce distributed-computing clustering software tools programming development java FLOSS OSS
-
30 Jun 11
-
27 Jun 11
-
18 Jun 11
-
13 Jun 11
-
09 Jun 11
Doug DanielsCascading is a data flow API implemented in Java for writing data flows that run on MapReduce.
-
02 May 11
Dmitry Serebrennikov"Cascading is a Data Processing API, Process Planner, and Process Scheduler used for defining and executing complex, scale-free, and fault tolerant data processing workflows on an Apache Hadoop cluster. All without having to 'think' in MapReduce."
-
20 Mar 11
-
24 Feb 11
devrimbarisand API that sits on top of Hadoop's MapReduce layer and is executed from the command line like any other Hadoop application.
As a library and API that can be driven from any JVM based language (Jython, JRuby, Groovy, Clojure, etc.), developers can creat -
02 Feb 11
-
06 Jan 11
-
Cascading is a Query API, Query Planner, and Process Scheduler used for defining and executing complex, scale-free, and fault tolerant data processing workflows on a Hadoop cluster.
Cascading is a thin Java library that sits on top of Hadoop's MapReduce layer and is executed from the command line like any other Hadoop application. It is not a new text based query syntax (like Pig) or another complex system that must be installed on a cluster and maintained (like Hive). Though Cascading is both complimentary to and is a valid alternative to either application.
Cascading lets the developer quickly assemble complex distributed data-processing applications without having to "think" in MapReduce. And to efficiently schedule them based on their dependencies. Obviously simple data processing applications are supported as well, as complex applications tend to start simple.
-
-
29 Dec 10
-
23 Dec 10
-
18 Nov 10
-
31 Oct 10
-
22 Oct 10
-
13 Sep 10
-
31 Aug 10
-
08 Jul 10
-
25 Jun 10
John MitchellCascading is a feature rich API for defining and executing complex, scale-free, and fault tolerant data processing workflows on a Hadoop cluster
hadoop mapreduce opensource cascading distributed aws delicious
-
27 May 10
-
25 May 10
-
27 Apr 10
-
29 Mar 10
-
24 Mar 10
-
23 Mar 10
-
22 Mar 10
-
20 Mar 10
-
18 Mar 10
-
18 Dec 09
Samuel VijaykumarCascading is a feature rich API for defining and executing complex, scale-free, and fault tolerant data processing workflows on a Hadoop cluster
-
16 Dec 09
-
14 Dec 09
-
10 Dec 09
-
13 Nov 09
Andrew GilmartinCascading is a feature rich API for building complex and fault tolerant data processing workflows.
Some of the key features are:
Data Processing API
Topological Scheduler
Event Notification
MapReduce Job Planner
Stream Assertions
Failure Traps
Scriptabl -
03 Nov 09
Alex YakovlevCascading is a feature rich API for defining and executing complex, scale-free, and fault tolerant data processing workflows on a Hadoop cluster.
The processing API lets the developer quickly assemble complex distributed processes without having to "thinhadoop mapreduce java distributed programming opensource cluster api cascading
-
01 Nov 09
-
22 Oct 09
-
19 Oct 09
-
18 Sep 09
-
30 Aug 09
-
24 Aug 09
-
21 Aug 09
-
29 Jul 09
-
25 Jun 09
rawwell"Cascading is a feature rich API for defining and executing complex, scale-free, and fault tolerant data processing workflows on a Hadoop cluster.
The processing API lets the developer quickly assemble complex distributed processes without having to "thi -
16 Jun 09
-
12 Jun 09
-
16 May 09
Drew SudellData processing workflows and job planner on top of hadoop.
Would you like to comment?
Join Diigo for a free account, or sign in if you are already a member.